Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012207-HA-COREI747935

Run Management

Run      Date              Test Duration
1        December 19 2020  11 Hours, 50 Minutes
2        December 19 2020  12 Hours, 43 Minutes
3        December 20 2020  11 Hours, 42 Minutes
Average                    12 Hours, 5 Minutes
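
The last duration in the run table, 12 Hours, 5 Minutes, is consistent with being the mean of the three run durations. A minimal Python sketch of that arithmetic follows; the names `RUN_DURATIONS` and `mean_duration` are illustrative and not part of the Phoronix Test Suite.

```python
# Durations copied from the run table: (hours, minutes) for runs 1, 2 and 3.
RUN_DURATIONS = {1: (11, 50), 2: (12, 43), 3: (11, 42)}


def mean_duration(durations):
    """Return the mean of (hours, minutes) pairs as (hours, minutes),
    rounded to the nearest minute."""
    total_minutes = sum(h * 60 + m for h, m in durations)
    mean_minutes = round(total_minutes / len(durations))
    return divmod(mean_minutes, 60)


if __name__ == "__main__":
    hours, minutes = mean_duration(list(RUN_DURATIONS.values()))
    print(f"{hours} Hours, {minutes} Minutes")
```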

Core i7 4790K 202 - System Details (identical for runs 1, 2, 3; OpenBenchmarking.org)

Processor: Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)
Motherboard: Gigabyte Z97-HD3P (F4 BIOS)
Chipset: Intel 4th Gen Core DRAM
Memory: 16GB
Disk: 120GB OCZ TRION100
Graphics: Gigabyte Intel Haswell Desktop 2GB (1250MHz)
Audio: Intel Xeon E3-1200 v3/4th
Monitor: LG Ultra HD
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc8daily20201009-generic (x86_64) 20201008
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 4.5 Mesa 19.2.8
Vulkan: 1.1.102
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 3840x2160

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_cpufreq ondemand
- CPU Microcode: 0x28
- Thermald 1.9

Java Details
- OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)

Python Details
- Python 2.7.17 + Python 3.7.5

Security Details
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
- srbds: Mitigation of Microcode
- tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): chart of the relative performance of runs 1, 2 and 3 per test, on a scale from 100% to 114%. Tests shown: CLOMP, Redis, Betsy GPU Compressor, NCNN, HPC Challenge, eSpeak-NG Speech Engine, LAMMPS Molecular Dynamics Simulator, oneDNN, Waifu2x-NCNN Vulkan, Sunflow Rendering System, rav1e, Node.js V8 Web Tooling Benchmark, Build2, simdjson, SQLite Speedtest, Coremark, GROMACS, Timed MAFFT Alignment, Stockfish, BRL-CAD, Numpy Benchmark, Timed FFmpeg Compilation, x265, Embree, Libplacebo, LZ4 Compression, WavPack Audio Encoding, yquake2, asmFish, Monkey Audio Encoding, Basis Universal, IndigoBench, ASTC Encoder, Kvazaar, PHPBench, Timed Eigen Compilation, DDraceNetwork, Timed HMMer Search.

Core i7 4790K 202 - Detailed result table for runs 1, 2 and 3 (OpenBenchmarking.org). Test profiles covered: waifu2x-ncnn, libplacebo, betsy, ddnet, yquake2, hpcc, hmmer, mafft, lammps, simdjson, compress-lz4, onednn, embree, kvazaar, rav1e, x265, coremark, stockfish, asmfish, build-ffmpeg, build2, numpy, espeak, node-web-tooling, gromacs, astcenc, basis, sqlite-speedtest, redis, ncnn, indigobench, phpbench, sunflow, brl-cad, clomp, build-eigen, encode-ape, encode-wavpack.

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan 20200818 (Seconds, Fewer Is Better; OpenBenchmarking.org)

Scale: 2x - Denoise: 3 - TAA: No
  Run 3: 25.93 (SE +/- 0.06, N = 3)
  Run 2: 25.94 (SE +/- 0.11, N = 3)
  Run 1: 24.78 (SE +/- 1.00, N = 12)

Scale: 2x - Denoise: 3 - TAA: Yes
  Run 3: 6.824 (SE +/- 0.085, N = 15)
  Run 2: 6.535 (SE +/- 0.096, N = 4)
  Run 1: 6.950 (SE +/- 0.059, N = 12)
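
Each result is reported with a standard error (SE) and a trial count (N). As a rough illustration, the sketch below computes the standard error of the mean as the sample standard deviation divided by the square root of N. It is an assumption that the Phoronix Test Suite uses exactly this estimator, and the trial values are hypothetical, not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-trial timings in seconds (illustrative only).
trials = [25.8, 25.9, 26.1]
print(
    f"mean = {statistics.mean(trials):.2f}, "
    f"SE +/- {standard_error(trials):.2f}, N = {len(trials)}"
)
```

A larger N with a similar spread gives a smaller SE, which is one reason noisy tests are often run more times.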

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

Libplacebo 2.72.2 (FPS, More Is Better; OpenBenchmarking.org)

Test: deband_heavy
  Run 3: 19.04 (SE +/- 0.09, N = 3)
  Run 2: 18.95 (SE +/- 0.00, N = 3)
  Run 1: 18.95 (SE +/- 0.00, N = 3)

Test: polar_nocompute
  Run 3: 13109.55 (SE +/- 44.87, N = 3)
  Run 2: 13223.20 (SE +/- 37.92, N = 3)
  Run 1: 13186.91 (SE +/- 29.01, N = 3)

Test: hdr_peakdetect
  Run 3: 49012.32 (SE +/- 107.39, N = 3)
  Run 2: 49578.62 (SE +/- 90.21, N = 3)
  Run 1: 48607.60 (SE +/- 736.78, N = 3)

Test: av1_grain_lap
  Run 3: 711.76 (SE +/- 0.22, N = 3)
  Run 2: 711.51 (SE +/- 0.36, N = 3)
  Run 1: 711.86 (SE +/- 0.24, N = 3)

(CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Betsy GPU Compressor

Betsy is an open-source GPU compressor that implements various GPU texture compression techniques. It is written in GLSL with Vulkan/OpenGL compute shader support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Betsy GPU Compressor 1.1 Beta (Seconds, Fewer Is Better; OpenBenchmarking.org)

Codec: ETC1 - Quality: Highest
  Run 3: 6.374 (SE +/- 0.143, N = 12)
  Run 2: 6.281 (SE +/- 0.134, N = 12)
  Run 1: 6.975 (SE +/- 0.422, N = 15)

Codec: ETC2 RGB - Quality: Highest
  Run 3: 6.414 (SE +/- 0.127, N = 12)
  Run 2: 6.668 (SE +/- 0.117, N = 15)
  Run 1: 6.711 (SE +/- 0.431, N = 15)

(CXX) g++ options: -O3 -O2 -lpthread -ldl

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

DDraceNetwork 15.2.3 (OpenBenchmarking.org)
All results: Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default

Resolution: 1920 x 1080 - Demo: RaiNyMore2
  Frames Per Second, More Is Better
    Run 3: 45.44 (SE +/- 0.01, N = 3; MIN: 25.99 / MAX: 180.9)
    Run 2: 45.40 (SE +/- 0.01, N = 3; MIN: 33.02 / MAX: 50.63)
    Run 1: 45.46 (SE +/- 0.07, N = 3; MIN: 25.16 / MAX: 257.86)
  Total Frame Time, Milliseconds, Fewer Is Better
    Run 3: Min: 4.01 / Avg: 22.02 / Max: 31.07
    Run 2: Min: 5.55 / Avg: 22.02 / Max: 28.23
    Run 1: Min: 3.55 / Avg: 22.02 / Max: 28.05

Resolution: 3840 x 2160 - Demo: RaiNyMore2
  Frames Per Second, More Is Better
    Run 3: 13.03 (SE +/- 0.02, N = 3; MIN: 11.77 / MAX: 14.01)
    Run 2: 13.05 (SE +/- 0.00, N = 3; MIN: 10.32 / MAX: 14.01)
    Run 1: 13.03 (SE +/- 0.01, N = 3; MIN: 11.75 / MAX: 13.98)
  Total Frame Time, Milliseconds, Fewer Is Better
    Run 3: Min: 72.0 / Avg: 76.7 / Max: 82.4
    Run 2: Min: 71.6 / Avg: 76.6 / Max: 84.9
    Run 1: Min: 72.3 / Avg: 76.8 / Max: 85.1

Resolution: 1920 x 1080 - Demo: Multeasymap
  Frames Per Second, More Is Better
    Run 3: 100.85 (SE +/- 0.07, N = 3; MIN: 37.89 / MAX: 224.47)
    Run 2: 100.65 (SE +/- 0.12, N = 3; MIN: 58.03 / MAX: 227.27)
    Run 1: 101.15 (SE +/- 0.26, N = 3; MIN: 54.32 / MAX: 224.92)
  Total Frame Time, Milliseconds, Fewer Is Better
    Run 3: Min: 4.46 / Avg: 9.94 / Max: 17
    Run 2: Min: 4.4 / Avg: 9.98 / Max: 16.99
    Run 1: Min: 4.45 / Avg: 9.95 / Max: 18.41

Resolution: 3840 x 2160 - Demo: Multeasymap
  Frames Per Second, More Is Better
    Run 3: 29.53 (SE +/- 0.03, N = 3; MIN: 15.15 / MAX: 181.72)
    Run 2: 29.60 (SE +/- 0.01, N = 3; MIN: 14.76 / MAX: 161.92)
    Run 1: 29.58 (SE +/- 0.01, N = 3; MIN: 14.88 / MAX: 175.93)
  Total Frame Time, Milliseconds, Fewer Is Better
    Run 3: Min: 5.77 / Avg: 34.02 / Max: 65.99
    Run 2: Min: 5.76 / Avg: 33.96 / Max: 57.53
    Run 1: Min: 6.24 / Avg: 33.97 / Max: 65.29

(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0
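
The DDraceNetwork results report both frames per second and total frame time. The two are related approximately by frame time (ms) = 1000 / FPS. The sketch below applies that conversion to the run 3 averages quoted above; the function name `frame_time_ms` is illustrative. The match is only approximate, since the mean of per-frame times is not in general equal to the reciprocal of the mean frame rate.

```python
def frame_time_ms(fps):
    """Convert an average frame rate (frames per second) to milliseconds per frame."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


# (description, reported average FPS, reported average frame time in ms), run 3.
RUN3_RESULTS = [
    ("1920 x 1080, RaiNyMore2", 45.44, 22.02),
    ("3840 x 2160, RaiNyMore2", 13.03, 76.7),
    ("1920 x 1080, Multeasymap", 100.85, 9.94),
    ("3840 x 2160, Multeasymap", 29.53, 34.02),
]

for name, fps, reported_ms in RUN3_RESULTS:
    print(f"{name}: 1000 / {fps} = {frame_time_ms(fps):.2f} ms (reported avg: {reported_ms} ms)")
```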

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 (Frames Per Second, More Is Better; OpenBenchmarking.org)

Renderer: OpenGL 1.x - Resolution: 1920 x 1080
  Run 3: 207.6 (SE +/- 0.32, N = 3)
  Run 2: 208.2 (SE +/- 0.55, N = 3)
  Run 1: 206.8 (SE +/- 0.42, N = 3)

Renderer: OpenGL 1.x - Resolution: 2560 x 1440
  Run 3: 131.0 (SE +/- 0.12, N = 3)
  Run 2: 129.8 (SE +/- 0.55, N = 3)
  Run 1: 130.4 (SE +/- 0.37, N = 3)

Renderer: OpenGL 1.x - Resolution: 3840 x 2160
  Run 3: 67.0 (SE +/- 0.06, N = 3)
  Run 2: 67.2 (SE +/- 0.22, N = 3)
  Run 1: 67.1 (SE +/- 0.09, N = 3)

Renderer: OpenGL 3.x - Resolution: 1920 x 1080
  Run 3: 200.5 (SE +/- 0.95, N = 3)
  Run 2: 200.1 (SE +/- 0.23, N = 3)
  Run 1: 200.9 (SE +/- 0.53, N = 3)

Renderer: OpenGL 3.x - Resolution: 2560 x 1440
  Run 3: 125.9 (SE +/- 0.17, N = 3)
  Run 2: 125.8 (SE +/- 0.07, N = 3)
  Run 1: 125.9 (SE +/- 0.12, N = 3)

Renderer: OpenGL 3.x - Resolution: 3840 x 2160
  Run 3: 62.7 (SE +/- 0.00, N = 3)
  Run 2: 62.6 (SE +/- 0.03, N = 3)
  Run 1: 62.8 (SE +/- 0.12, N = 3)

Renderer: Software CPU - Resolution: 1920 x 1080
  Run 3: 97.8 (SE +/- 0.38, N = 3)
  Run 2: 97.6 (SE +/- 0.98, N = 3)
  Run 1: 98.0 (SE +/- 0.87, N = 3)

Renderer: Software CPU - Resolution: 2560 x 1440
  Run 3: 60.8 (SE +/- 0.19, N = 3)
  Run 2: 59.7 (SE +/- 0.43, N = 3)
  Run 1: 60.3 (SE +/- 0.31, N = 3)

Renderer: Software CPU - Resolution: 3840 x 2160
  Run 3: 28.5 (SE +/- 0.03, N = 3)
  Run 2: 28.0 (SE +/- 0.24, N = 3)
  Run 1: 27.8 (SE +/- 0.09, N = 3)

(CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

HPC Challenge 1.5.0 (OpenBenchmarking.org)

Test / Class: G-HPL (GFLOPS, More Is Better)
  Run 3: 92.16 (SE +/- 1.45, N = 3)
  Run 2: 90.14 (SE +/- 1.01, N = 6)
  Run 1: 90.31 (SE +/- 1.21, N = 4)

Test / Class: G-Ffte (GFLOPS, More Is Better)
  Run 3: 1.89310 (SE +/- 0.07276, N = 3)
  Run 2: 1.95684 (SE +/- 0.13431, N = 3)
  Run 1: 2.06526 (SE +/- 0.08108, N = 3)

Test / Class: EP-DGEMM (GFLOPS, More Is Better)
  Run 3: 44.68 (SE +/- 0.66, N = 3)
  Run 2: 42.54 (SE +/- 0.10, N = 3)
  Run 1: 43.79 (SE +/- 0.53, N = 3)

Test / Class: G-Ptrans (GB/s, More Is Better)
  Run 3: 0.47980 (SE +/- 0.01039, N = 3)
  Run 2: 0.49518 (SE +/- 0.03821, N = 3)
  Run 1: 0.49398 (SE +/- 0.01258, N = 3)

Test / Class: EP-STREAM Triad (GB/s, More Is Better)
  Run 3: 4.06038 (SE +/- 0.01163, N = 3)
  Run 2: 4.04885 (SE +/- 0.00333, N = 3)
  Run 1: 4.03790 (SE +/- 0.01197, N = 3)

Test / Class: G-Random Access (GUP/s, More Is Better)
  Run 3: 0.02713 (SE +/- 0.00050, N = 3)
  Run 2: 0.02625 (SE +/- 0.00077, N = 3)
  Run 1: 0.02650 (SE +/- 0.00052, N = 3)

Test / Class: Random Ring Latency (usecs, Fewer Is Better)
  Run 3: 0.27847 (SE +/- 0.04588, N = 3)
  Run 2: 0.32700 (SE +/- 0.00582, N = 3)
  Run 1: 0.27879 (SE +/- 0.04635, N = 3)

Test / Class: Random Ring Bandwidth (GB/s, More Is Better)
  Run 3: 3.21355 (SE +/- 0.24447, N = 3)
  Run 2: 2.82967 (SE +/- 0.10194, N = 3)
  Run 1: 3.39365 (SE +/- 0.09620, N = 3)

Test / Class: Max Ping Pong Bandwidth (MB/s, More Is Better)
  Run 3: 18703.03 (SE +/- 289.88, N = 3)
  Run 2: 19517.86 (SE +/- 639.68, N = 3)
  Run 1: 19281.57 (SE +/- 146.25, N = 3)

1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1 (Seconds, Fewer Is Better; OpenBenchmarking.org)

Pfam Database Search
  Run 3: 123.83 (SE +/- 0.06, N = 3)
  Run 2: 123.81 (SE +/- 0.12, N = 3)
  Run 1: 123.73 (SE +/- 0.12, N = 3)

(CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471 (Seconds, Fewer Is Better; OpenBenchmarking.org)

Multiple Sequence Alignment - LSU RNA
  Run 3: 11.90 (SE +/- 0.04, N = 3)
  Run 2: 11.93 (SE +/- 0.04, N = 3)
  Run 1: 12.05 (SE +/- 0.05, N = 3)

(CC) gcc options: -std=c99 -O3 -lm -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better; OpenBenchmarking.org)

Model: Rhodopsin Protein
  Run 3: 2.258 (SE +/- 0.031, N = 15)
  Run 2: 2.257 (SE +/- 0.052, N = 14)
  Run 1: 2.318 (SE +/- 0.064, N = 12)

(CXX) g++ options: -O3 -pthread -lm

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 (GB/s, More Is Better; OpenBenchmarking.org)

Throughput Test: Kostya
  Run 3: 0.68 (SE +/- 0.00, N = 3)
  Run 2: 0.68 (SE +/- 0.00, N = 3)
  Run 1: 0.68 (SE +/- 0.00, N = 3)

Throughput Test: LargeRandom
  Run 3: 0.45 (SE +/- 0.00, N = 3)
  Run 2: 0.45 (SE +/- 0.00, N = 3)
  Run 1: 0.45 (SE +/- 0.00, N = 3)

Throughput Test: PartialTweets
  Run 3: 0.73 (SE +/- 0.00, N = 3)
  Run 2: 0.71 (SE +/- 0.01, N = 3)
  Run 1: 0.73 (SE +/- 0.01, N = 3)

Throughput Test: DistinctUserID
  Run 3: 0.74 (SE +/- 0.01, N = 3)
  Run 2: 0.74 (SE +/- 0.01, N = 4)
  Run 1: 0.77 (SE +/- 0.01, N = 3)

(CXX) g++ options: -O3 -pthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.
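
The LZ4 results are reported as throughput in MB/s, which is the input size divided by the elapsed time. The sketch below illustrates the measurement idea only. It substitutes Python's standard-library `zlib` for LZ4 and a small synthetic buffer for the Ubuntu ISO, and it assumes 1 MB = 1,000,000 bytes. The helper names are illustrative, and the numbers it prints are not comparable to the results on this page.

```python
import time
import zlib


def throughput_mb_s(num_bytes, seconds):
    """Throughput in MB/s, assuming 1 MB = 1,000,000 bytes."""
    return num_bytes / 1_000_000 / seconds


def time_compression(data, level):
    """Compress data with zlib (a stand-in for LZ4) and return
    (compressed bytes, compression throughput in MB/s)."""
    start = time.perf_counter()
    compressed = zlib.compress(data, level)
    elapsed = max(time.perf_counter() - start, 1e-9)  # guard against a zero interval
    return compressed, throughput_mb_s(len(data), elapsed)


if __name__ == "__main__":
    # Synthetic, highly compressible sample standing in for the real input file.
    sample = b"phoronix-test-suite " * 100_000
    for level in (1, 3, 9):
        compressed, mb_s = time_compression(sample, level)
        print(f"level {level}: {len(compressed)} bytes, {mb_s:.1f} MB/s")
```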

LZ4 Compression 1.9.3 (MB/s, More Is Better; OpenBenchmarking.org)

Compression Level: 1 - Compression Speed
  Run 3: 5697.27 (SE +/- 2.69, N = 3)
  Run 2: 5687.38 (SE +/- 5.97, N = 3)
  Run 1: 5671.51 (SE +/- 7.62, N = 3)

Compression Level: 1 - Decompression Speed
  Run 3: 6648.6 (SE +/- 1.39, N = 3)
  Run 2: 6643.6 (SE +/- 4.53, N = 3)
  Run 1: 6623.3 (SE +/- 10.10, N = 3)

Compression Level: 3 - Compression Speed
  Run 3: 51.92 (SE +/- 0.23, N = 3)
  Run 2: 52.40 (SE +/- 0.02, N = 3)
  Run 1: 52.25 (SE +/- 0.09, N = 3)

Compression Level: 3 - Decompression Speed
  Run 3: 6523.6 (SE +/- 1.37, N = 3)
  Run 2: 6527.5 (SE +/- 1.58, N = 3)
  Run 1: 6530.1 (SE +/- 1.59, N = 3)

Compression Level: 9 - Compression Speed
  Run 3: 50.11 (SE +/- 0.85, N = 3)
  Run 2: 51.28 (SE +/- 0.01, N = 3)
  Run 1: 51.28 (SE +/- 0.01, N = 3)

Compression Level: 9 - Decompression Speed
  Run 3: 6536.6 (SE +/- 1.35, N = 3)
  Run 2: 6537.8 (SE +/- 5.32, N = 3)
  Run 1: 6532.7 (SE +/- 3.54, N = 3)

(CC) gcc options: -O3

oneDNN

oneDNN 2.0 (ms, Fewer Is Better; Engine: CPU; OpenBenchmarking.org)

Harness: IP Shapes 1D - Data Type: f32
  Run 3: 9.18743 (SE +/- 0.04734, N = 3; MIN: 8.82)
  Run 2: 9.09658 (SE +/- 0.14569, N = 3; MIN: 8.69)
  Run 1: 9.14529 (SE +/- 0.03949, N = 3; MIN: 8.74)

Harness: IP Shapes 3D - Data Type: f32
  Run 3: 15.42 (SE +/- 0.15, N = 15; MIN: 14.53)
  Run 2: 15.07 (SE +/- 0.13, N = 3; MIN: 14.72)
  Run 1: 14.25 (SE +/- 0.01, N = 3; MIN: 14.13)

Harness: IP Shapes 1D - Data Type: u8s8f32
  Run 3: 5.18374 (SE +/- 0.01621, N = 3; MIN: 4.84)
  Run 2: 5.17862 (SE +/- 0.00946, N = 3; MIN: 4.94)
  Run 1: 5.21940 (SE +/- 0.00525, N = 3; MIN: 4.85)

Harness: IP Shapes 3D - Data Type: u8s8f32
  Run 3: 5.29626 (SE +/- 0.09775, N = 15; MIN: 4.25)
  Run 2: 4.16433 (SE +/- 0.00978, N = 3; MIN: 4.05)
  Run 1: 4.25914 (SE +/- 0.00331, N = 3; MIN: 4.15)

Harness: Convolution Batch Shapes Auto - Data Type: f32
  Run 3: 27.06 (SE +/- 0.16, N = 3; MIN: 26.33)
  Run 2: 26.92 (SE +/- 0.03, N = 3; MIN: 26.51)
  Run 1: 26.42 (SE +/- 0.02, N = 3; MIN: 26.19)

Harness: Deconvolution Batch shapes_1d - Data Type: f32
  Run 3: 10.78 (SE +/- 0.11, N = 3; MIN: 10.43)
  Run 2: 10.72 (SE +/- 0.01, N = 3; MIN: 9.95)
  Run 1: 10.75 (SE +/- 0.03, N = 3; MIN: 10.34)

Harness: Deconvolution Batch shapes_3d - Data Type: f32
  Run 3: 16.63 (SE +/- 0.12, N = 3; MIN: 16.28)
  Run 2: 17.53 (SE +/- 0.26, N = 3; MIN: 16.55)
  Run 1: 16.54 (SE +/- 0.05, N = 3; MIN: 16.27)

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32
  Run 3: 25.74 (SE +/- 0.09, N = 3; MIN: 25.45)
  Run 2: 26.14 (SE +/- 0.02, N = 3; MIN: 25.95)
  Run 1: 26.07 (SE +/- 0.03, N = 3; MIN: 25.78)

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32
  Run 3: 12.10 (SE +/- 0.06, N = 3; MIN: 10.95)
  Run 2: 12.60 (SE +/- 0.18, N = 3; MIN: 11.17)
  Run 1: 12.39 (SE +/- 0.20, N = 3; MIN: 11.1)

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32
  Run 3: 11.16 (SE +/- 0.05, N = 3; MIN: 10.46)
  Run 2: 11.29 (SE +/- 0.01, N = 3; MIN: 10.69)
  Run 1: 11.33 (SE +/- 0.02, N = 3; MIN: 10.76)

Harness: Recurrent Neural Network Training - Data Type: f32
  Run 3: 8446.75 (SE +/- 15.35, N = 3; MIN: 8413.69)
  Run 2: 8410.76 (SE +/- 20.81, N = 3; MIN: 8366.44)
  Run 1: 8495.75 (SE +/- 17.63, N = 3; MIN: 8463.89)

Harness: Recurrent Neural Network Inference - Data Type: f32
  Run 3: 4743.29 (SE +/- 3.10, N = 3; MIN: 4730.58)
  Run 2: 4693.24 (SE +/- 15.09, N = 3; MIN: 4657.74)
  Run 1: 4730.40 (SE +/- 0.95, N = 3; MIN: 4721.36)

Harness: Recurrent Neural Network Training - Data Type: u8s8f32
  Run 3: 8475.51 (SE +/- 25.14, N = 3; MIN: 8432.62)
  Run 2: 8426.98 (SE +/- 7.59, N = 3; MIN: 8406.64)
  Run 1: 8480.05 (SE +/- 6.10, N = 3; MIN: 8460.68)

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32
  Run 3: 4729.75 (SE +/- 19.23, N = 3; MIN: 4687.53)
  Run 2: 4714.60 (SE +/- 5.23, N = 3; MIN: 4700.85)
  Run 1: 4741.20 (SE +/- 22.26, N = 3; MIN: 4710.31)

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32
  Run 3: 8.22358 (SE +/- 0.11002, N = 15; MIN: 7.36)
  Run 2: 6.94358 (SE +/- 0.09722, N = 3; MIN: 6.67)
  Run 1: 6.86508 (SE +/- 0.00679, N = 3; MIN: 6.69)

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16
  Run 3: 8482.73 (SE +/- 20.61, N = 3; MIN: 8441.98)
  Run 2: 8413.38 (SE +/- 13.64, N = 3; MIN: 8383.33)
  Run 1: 8489.11 (SE +/- 11.29, N = 3; MIN: 8460.3)

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16
  Run 3: 4758.63 (SE +/- 24.18, N = 3; MIN: 4700.29)
  Run 2: 4713.76 (SE +/- 16.47, N = 3; MIN: 4686.24)
  Run 1: 4741.15 (SE +/- 15.27, N = 3; MIN: 4705.91)

(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU321246810SE +/- 0.06654, N = 3SE +/- 0.01547, N = 3SE +/- 0.01016, N = 36.084026.064186.16771MIN: 5.81MIN: 5.69MIN: 5.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
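Each result on this page carries an "SE +/-" figure and a sample count N. As a rough illustration of what such a figure means, here is a minimal Python sketch that assumes SE is the standard error of the mean (sample standard deviation divided by the square root of N). The function name and the sample timings are hypothetical and are not taken from this result file or from the Phoronix Test Suite source.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-trial timings in ms (illustrative only, not from this page).
timings_ms = [6.05, 6.10, 6.15]

mean = statistics.mean(timings_ms)
se = standard_error(timings_ms)
print(f"{mean:.2f} ms, SE +/- {se:.5f}, N = {len(timings_ms)}")
```

Under this assumption, a small SE relative to the mean indicates that the trials within one run agreed closely, while the entries with N = 15 above are the ones where more trials were recorded.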

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 (Frames Per Second, More Is Better)

Binary: Pathtracer - Model: Crown
  Run 3: 4.7225 (SE +/- 0.0074, N = 3, MIN: 4.69 / MAX: 4.82)
  Run 2: 4.7423 (SE +/- 0.0037, N = 3, MIN: 4.71 / MAX: 4.82)
  Run 1: 4.7507 (SE +/- 0.0091, N = 3, MIN: 4.71 / MAX: 4.83)

Binary: Pathtracer ISPC - Model: Crown
  Run 3: 5.3392 (SE +/- 0.0582, N = 3, MIN: 5.19 / MAX: 5.51)
  Run 2: 5.3469 (SE +/- 0.0293, N = 3, MIN: 5.26 / MAX: 5.47)
  Run 1: 5.3888 (SE +/- 0.0366, N = 3, MIN: 5.28 / MAX: 5.51)

Binary: Pathtracer - Model: Asian Dragon
  Run 3: 5.5696 (SE +/- 0.0036, N = 3, MIN: 5.5 / MAX: 5.65)
  Run 2: 5.6139 (SE +/- 0.0161, N = 3, MIN: 5.54 / MAX: 5.72)
  Run 1: 5.5806 (SE +/- 0.0040, N = 3, MIN: 5.53 / MAX: 5.66)

Binary: Pathtracer - Model: Asian Dragon Obj
  Run 3: 5.1860 (SE +/- 0.0051, N = 3, MIN: 5.15 / MAX: 5.26)
  Run 2: 5.2101 (SE +/- 0.0125, N = 3, MIN: 5.17 / MAX: 5.28)
  Run 1: 5.2035 (SE +/- 0.0107, N = 3, MIN: 5.16 / MAX: 5.26)

Binary: Pathtracer ISPC - Model: Asian Dragon
  Run 3: 6.5417 (SE +/- 0.0287, N = 3, MIN: 6.45 / MAX: 6.74)
  Run 2: 6.6290 (SE +/- 0.0431, N = 3, MIN: 6.51 / MAX: 6.81)
  Run 1: 6.5662 (SE +/- 0.0417, N = 3, MIN: 6.46 / MAX: 6.79)

Binary: Pathtracer ISPC - Model: Asian Dragon Obj
  Run 3: 5.8849 (SE +/- 0.0272, N = 3, MIN: 5.81 / MAX: 5.99)
  Run 2: 5.9266 (SE +/- 0.0196, N = 3, MIN: 5.87 / MAX: 6.03)
  Run 1: 5.8902 (SE +/- 0.0176, N = 3, MIN: 5.83 / MAX: 6)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 (Frames Per Second, More Is Better)

Video Input: Bosphorus 4K - Video Preset: Slow
  Run 3: 2.16 (SE +/- 0.00, N = 3)
  Run 2: 2.16 (SE +/- 0.00, N = 3)
  Run 1: 2.16 (SE +/- 0.00, N = 3)

Video Input: Bosphorus 4K - Video Preset: Medium
  Run 3: 2.21 (SE +/- 0.00, N = 3)
  Run 2: 2.22 (SE +/- 0.00, N = 3)
  Run 1: 2.21 (SE +/- 0.00, N = 3)

Video Input: Bosphorus 1080p - Video Preset: Slow
  Run 3: 9.45 (SE +/- 0.01, N = 3)
  Run 2: 9.46 (SE +/- 0.00, N = 3)
  Run 1: 9.46 (SE +/- 0.01, N = 3)

Video Input: Bosphorus 1080p - Video Preset: Medium
  Run 3: 9.73 (SE +/- 0.01, N = 3)
  Run 2: 9.75 (SE +/- 0.00, N = 3)
  Run 1: 9.73 (SE +/- 0.00, N = 3)

Video Input: Bosphorus 4K - Video Preset: Very Fast
  Run 3: 6.07 (SE +/- 0.00, N = 3)
  Run 2: 6.07 (SE +/- 0.00, N = 3)
  Run 1: 6.06 (SE +/- 0.01, N = 3)

Video Input: Bosphorus 4K - Video Preset: Ultra Fast
  Run 3: 11.00 (SE +/- 0.02, N = 3)
  Run 2: 10.96 (SE +/- 0.01, N = 3)
  Run 1: 10.98 (SE +/- 0.01, N = 3)

Video Input: Bosphorus 1080p - Video Preset: Very Fast
  Run 3: 24.60 (SE +/- 0.01, N = 3)
  Run 2: 24.56 (SE +/- 0.01, N = 3)
  Run 1: 24.57 (SE +/- 0.02, N = 3)

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
  Run 3: 43.61 (SE +/- 0.06, N = 3)
  Run 2: 43.68 (SE +/- 0.06, N = 3)
  Run 1: 43.43 (SE +/- 0.06, N = 3)

1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha (Frames Per Second, More Is Better)

Speed: 1
  Run 3: 0.259 (SE +/- 0.002, N = 3)
  Run 2: 0.261 (SE +/- 0.004, N = 3)
  Run 1: 0.255 (SE +/- 0.003, N = 3)

Speed: 5
  Run 3: 0.814 (SE +/- 0.006, N = 3)
  Run 2: 0.823 (SE +/- 0.013, N = 3)
  Run 1: 0.828 (SE +/- 0.005, N = 3)

Speed: 6
  Run 3: 1.079 (SE +/- 0.013, N = 3)
  Run 2: 1.108 (SE +/- 0.004, N = 3)
  Run 1: 1.074 (SE +/- 0.011, N = 3)

Speed: 10
  Run 3: 2.552 (SE +/- 0.015, N = 3)
  Run 2: 2.595 (SE +/- 0.044, N = 3)
  Run 1: 2.514 (SE +/- 0.012, N = 3)

x265

This is a simple test of the x265 encoder run on the CPU, with 1080p and 4K options for measuring H.265 video encode performance. Learn more via the OpenBenchmarking.org test page.

x265 3.4 (Frames Per Second, More Is Better)

Video Input: Bosphorus 4K
  Run 3: 6.56 (SE +/- 0.04, N = 3)
  Run 2: 6.59 (SE +/- 0.03, N = 3)
  Run 1: 6.55 (SE +/- 0.02, N = 3)

Video Input: Bosphorus 1080p
  Run 3: 29.89 (SE +/- 0.07, N = 3)
  Run 2: 30.16 (SE +/- 0.14, N = 3)
  Run 1: 29.94 (SE +/- 0.08, N = 3)

1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Coremark

This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  Run 3: 161328.40 (SE +/- 104.15, N = 3)
  Run 2: 159234.48 (SE +/- 1169.64, N = 3)
  Run 1: 159958.26 (SE +/- 270.17, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12 - Total Time (Nodes Per Second, More Is Better)
  Run 3: 7918799 (SE +/- 100608.19, N = 3)
  Run 2: 7986287 (SE +/- 53934.69, N = 3)
  Run 1: 7990112 (SE +/- 42720.90, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
  Run 3: 12309486 (SE +/- 130255.47, N = 3)
  Run 2: 12254973 (SE +/- 117949.09, N = 3)
  Run 1: 12249161 (SE +/- 93407.88, N = 3)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.2.2 - Time To Compile (Seconds, Fewer Is Better)
  Run 3: 125.68 (SE +/- 0.37, N = 3)
  Run 2: 125.79 (SE +/- 0.58, N = 3)
  Run 1: 124.83 (SE +/- 0.10, N = 3)

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code that offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13 - Time To Compile (Seconds, Fewer Is Better)
  Run 3: 254.65 (SE +/- 2.21, N = 3)
  Run 2: 250.26 (SE +/- 1.16, N = 3)
  Run 1: 252.46 (SE +/- 1.13, N = 3)

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
  Run 3: 316.25 (SE +/- 0.35, N = 3)
  Run 2: 313.69 (SE +/- 0.64, N = 3)
  Run 1: 313.68 (SE +/- 0.12, N = 3)

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
  Run 3: 51.21 (SE +/- 0.37, N = 4)
  Run 2: 51.47 (SE +/- 0.86, N = 17)
  Run 1: 49.73 (SE +/- 1.23, N = 16)
1. (CC) gcc options: -O2 -std=c99

Node.js V8 Web Tooling Benchmark

This test runs the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers, such as Babel, TypeScript, and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
  Run 3: 11.22 (SE +/- 0.04, N = 3)
  Run 2: 11.27 (SE +/- 0.02, N = 3)
  Run 1: 11.06 (SE +/- 0.09, N = 3)
1. Nodejs v10.15.2

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3 - Water Benchmark (Ns Per Day, More Is Better)
  Run 3: 0.408 (SE +/- 0.006, N = 4)
  Run 2: 0.413 (SE +/- 0.003, N = 3)
  Run 1: 0.410 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

ASTC Encoder

ASTC Encoder (astcenc) is an encoder for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile tests both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 (Seconds, Fewer Is Better)

Preset: Fast
  Run 3: 7.70 (SE +/- 0.02, N = 3)
  Run 2: 7.70 (SE +/- 0.02, N = 3)
  Run 1: 7.65 (SE +/- 0.01, N = 3)

Preset: Medium
  Run 3: 10.39 (SE +/- 0.00, N = 3)
  Run 2: 10.39 (SE +/- 0.00, N = 3)
  Run 1: 10.39 (SE +/- 0.00, N = 3)

Preset: Thorough
  Run 3: 69.61 (SE +/- 0.01, N = 3)
  Run 2: 69.61 (SE +/- 0.01, N = 3)
  Run 1: 69.57 (SE +/- 0.01, N = 3)

Preset: Exhaustive
  Run 3: 568.09 (SE +/- 0.25, N = 3)
  Run 2: 567.56 (SE +/- 0.07, N = 3)
  Run 1: 568.11 (SE +/- 0.10, N = 3)

1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 (Seconds, Fewer Is Better)

Settings: ETC1S
  Run 3: 73.28 (SE +/- 0.38, N = 3)
  Run 2: 72.62 (SE +/- 0.68, N = 3)
  Run 1: 73.39 (SE +/- 0.54, N = 3)

Settings: UASTC Level 0
  Run 3: 10.50 (SE +/- 0.02, N = 3)
  Run 2: 10.72 (SE +/- 0.16, N = 3)
  Run 1: 10.58 (SE +/- 0.09, N = 3)

Settings: UASTC Level 2
  Run 3: 74.44 (SE +/- 0.22, N = 3)
  Run 2: 74.25 (SE +/- 0.08, N = 3)
  Run 1: 74.39 (SE +/- 0.02, N = 3)

Settings: UASTC Level 3
  Run 3: 145.87 (SE +/- 0.06, N = 3)
  Run 2: 145.79 (SE +/- 0.28, N = 3)
  Run 1: 145.88 (SE +/- 0.14, N = 3)

Settings: UASTC Level 2 + RDO Post-Processing
  Run 3: 972.13 (SE +/- 12.55, N = 4)
  Run 2: 949.41 (SE +/- 2.85, N = 3)
  Run 1: 945.48 (SE +/- 1.28, N = 3)

1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  Run 3: 70.60 (SE +/- 0.54, N = 3)
  Run 2: 69.86 (SE +/- 0.12, N = 3)
  Run 1: 69.41 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 (Requests Per Second, More Is Better)

Test: LPOP
  Run 3: 1695087.33 (SE +/- 2760.10, N = 3)
  Run 2: 1682927.16 (SE +/- 86194.23, N = 13)
  Run 1: 2687331.83 (SE +/- 37569.90, N = 3)

Test: SADD
  Run 3: 2104918.26 (SE +/- 30741.46, N = 13)
  Run 2: 2140255.00 (SE +/- 8400.74, N = 3)
  Run 1: 2112933.02 (SE +/- 33704.56, N = 12)

Test: LPUSH
  Run 3: 1655968.92 (SE +/- 5741.58, N = 3)
  Run 2: 1622814.52 (SE +/- 20312.72, N = 15)
  Run 1: 1611833.09 (SE +/- 35412.98, N = 12)

Test: GET
  Run 3: 2314175.99 (SE +/- 48510.25, N = 12)
  Run 2: 2339268.78 (SE +/- 23853.74, N = 8)
  Run 1: 2540423.88 (SE +/- 21911.47, N = 13)

Test: SET
  Run 3: 1916548.16 (SE +/- 25199.46, N = 3)
  Run 2: 1808870.97 (SE +/- 49673.53, N = 15)
  Run 1: 1884493.33 (SE +/- 29746.00, N = 3)

1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
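The Redis results vary more between runs than most other tests on this page. One simple way to quantify that is the gap between the best and worst run as a percentage of the worst. The Python sketch below does this for the five Redis tests using the three per-run values reported above; the helper name spread_percent is hypothetical, and this metric is only an illustration, not something the Phoronix Test Suite reports.

```python
def spread_percent(results):
    """Gap between the highest and lowest result, as a percentage of the lowest."""
    lowest, highest = min(results), max(results)
    return (highest - lowest) / lowest * 100.0


# Requests per second for the three runs, copied from the Redis 6.0.9 results.
redis_results = {
    "LPOP": [1695087.33, 1682927.16, 2687331.83],
    "SADD": [2104918.26, 2140255.00, 2112933.02],
    "LPUSH": [1655968.92, 1622814.52, 1611833.09],
    "GET": [2314175.99, 2339268.78, 2540423.88],
    "SET": [1916548.16, 1808870.97, 1884493.33],
}

for test, values in redis_results.items():
    print(f"{test}: {spread_percent(values):.1f}% spread across runs")
```

A large spread on an otherwise unchanged system, as with LPOP here, is a hint to check the SE and N columns before reading much into the difference.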

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 (ms, Fewer Is Better)

Target: CPU - Model: mobilenet
  Run 3: 31.64 (SE +/- 0.15, N = 3, MIN: 31.25 / MAX: 55.14)
  Run 2: 32.22 (SE +/- 0.03, N = 3, MIN: 31.89 / MAX: 45.17)
  Run 1: 31.47 (SE +/- 0.26, N = 3, MIN: 30.7 / MAX: 45.93)

Target: CPU-v2-v2 - Model: mobilenet-v2
  Run 3: 8.54 (SE +/- 0.13, N = 3, MIN: 8.16 / MAX: 12.1)
  Run 2: 9.03 (SE +/- 0.05, N = 3, MIN: 8.75 / MAX: 11.63)
  Run 1: 8.25 (SE +/- 0.10, N = 3, MIN: 8.02 / MAX: 11.6)

Target: CPU-v3-v3 - Model: mobilenet-v3
  Run 3: 7.10 (SE +/- 0.06, N = 3, MIN: 6.88 / MAX: 9.9)
  Run 2: 7.09 (SE +/- 0.04, N = 3, MIN: 6.9 / MAX: 10.15)
  Run 1: 6.91 (SE +/- 0.05, N = 3, MIN: 6.69 / MAX: 12.06)

Target: CPU - Model: shufflenet-v2
  Run 3: 9.48 (SE +/- 0.02, N = 3, MIN: 9.35 / MAX: 13.34)
  Run 2: 9.45 (SE +/- 0.04, N = 3, MIN: 9.32 / MAX: 12.68)
  Run 1: 9.47 (SE +/- 0.04, N = 3, MIN: 9.36 / MAX: 12.12)

Target: CPU - Model: mnasnet
  Run 3: 6.84 (SE +/- 0.08, N = 3, MIN: 6.59 / MAX: 10.33)
  Run 2: 7.04 (SE +/- 0.05, N = 3, MIN: 6.85 / MAX: 9.1)
  Run 1: 6.83 (SE +/- 0.09, N = 3, MIN: 6.61 / MAX: 7.1)

Target: CPU - Model: efficientnet-b0
  Run 3: 11.60 (SE +/- 0.13, N = 3, MIN: 11.37 / MAX: 26.59)
  Run 2: 11.80 (SE +/- 0.10, N = 3, MIN: 11.52 / MAX: 12.14)
  Run 1: 11.28 (SE +/- 0.05, N = 3, MIN: 10.83 / MAX: 19.59)

Target: CPU - Model: blazeface
  Run 3: 2.55 (SE +/- 0.01, N = 3, MIN: 2.49 / MAX: 2.61)
  Run 2: 2.64 (SE +/- 0.00, N = 3, MIN: 2.6 / MAX: 2.69)
  Run 1: 2.57 (SE +/- 0.02, N = 3, MIN: 2.37 / MAX: 2.66)

Target: CPU - Model: googlenet
  Run 3: 23.85 (SE +/- 0.07, N = 3, MIN: 23.58 / MAX: 37.59)
  Run 2: 24.27 (SE +/- 0.12, N = 3, MIN: 23.92 / MAX: 27)
  Run 1: 23.93 (SE +/- 0.14, N = 3, MIN: 23.66 / MAX: 24.5)

Target: CPU - Model: vgg16
  Run 3: 108.96 (SE +/- 0.12, N = 3, MIN: 108.21 / MAX: 115.52)
  Run 2: 110.79 (SE +/- 0.82, N = 3, MIN: 109.24 / MAX: 127.6)
  Run 1: 109.27 (SE +/- 0.57, N = 3, MIN: 108.04 / MAX: 121.43)

Target: CPU - Model: resnet18
  Run 3: 25.78 (SE +/- 0.08, N = 3, MIN: 25.3 / MAX: 26.66)
  Run 2: 26.11 (SE +/- 0.14, N = 3, MIN: 25.75 / MAX: 41.28)
  Run 1: 25.78 (SE +/- 0.07, N = 3, MIN: 25.38 / MAX: 39.91)

Target: CPU - Model: alexnet
  Run 3: 22.25 (SE +/- 0.05, N = 3, MIN: 22 / MAX: 36.1)
  Run 2: 22.43 (SE +/- 0.04, N = 3, MIN: 21.97 / MAX: 35.74)
  Run 1: 22.24 (SE +/- 0.12, N = 3, MIN: 21.89 / MAX: 24.72)

Target: CPU - Model: resnet50
  Run 3: 51.48 (SE +/- 0.47, N = 3, MIN: 50.22 / MAX: 66.07)
  Run 2: 51.56 (SE +/- 0.27, N = 3, MIN: 50.54 / MAX: 64.38)
  Run 1: 50.69 (SE +/- 0.10, N = 3, MIN: 50.36 / MAX: 63.24)

Target: CPU - Model: yolov4-tiny
  Run 3: 44.45 (SE +/- 0.38, N = 3, MIN: 43.34 / MAX: 46.59)
  Run 2: 44.90 (SE +/- 0.29, N = 3, MIN: 44.01 / MAX: 52.5)
  Run 1: 44.08 (SE +/- 0.72, N = 3, MIN: 42.69 / MAX: 58.93)

Target: CPU - Model: squeezenet_ssd
  Run 3: 33.46 (SE +/- 0.27, N = 3, MIN: 32.94 / MAX: 43.71)
  Run 2: 34.57 (SE +/- 0.57, N = 3, MIN: 33.72 / MAX: 36.89)
  Run 1: 33.22 (SE +/- 0.07, N = 3, MIN: 32.95 / MAX: 34.34)

Target: CPU - Model: regnety_400m
  Run 3: 16.45 (SE +/- 0.12, N = 3, MIN: 16.16 / MAX: 17.78)
  Run 2: 16.58 (SE +/- 0.05, N = 3, MIN: 16.23 / MAX: 31.03)
  Run 1: 16.39 (SE +/- 0.05, N = 3, MIN: 16.23 / MAX: 18.48)

Target: Vulkan GPU - Model: mobilenet
  Run 3: 31.53 (SE +/- 0.09, N = 3, MIN: 31.22 / MAX: 33.07)
  Run 2: 31.76 (SE +/- 0.24, N = 3, MIN: 31.13 / MAX: 48.02)
  Run 1: 31.35 (SE +/- 0.06, N = 3, MIN: 31.01 / MAX: 43.41)

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
  Run 3: 8.59 (SE +/- 0.04, N = 3, MIN: 8.33 / MAX: 13.54)
  Run 2: 9.10 (SE +/- 0.01, N = 3, MIN: 8.89 / MAX: 12.03)
  Run 1: 8.35 (SE +/- 0.13, N = 3, MIN: 7.99 / MAX: 21.73)

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
  Run 3: 7.06 (SE +/- 0.02, N = 3, MIN: 6.87 / MAX: 10.89)
  Run 2: 7.14 (SE +/- 0.02, N = 3, MIN: 6.95 / MAX: 21.34)
  Run 1: 7.05 (SE +/- 0.05, N = 3, MIN: 6.84 / MAX: 8.55)

Target: Vulkan GPU - Model: shufflenet-v2
  Run 3: 9.49 (SE +/- 0.00, N = 3, MIN: 9.39 / MAX: 11.92)
  Run 2: 9.45 (SE +/- 0.03, N = 3, MIN: 9.35 / MAX: 12.21)
  Run 1: 9.52 (SE +/- 0.02, N = 3, MIN: 9.42 / MAX: 11.19)

Target: Vulkan GPU - Model: mnasnet
  Run 3: 6.81 (SE +/- 0.10, N = 3, MIN: 6.53 / MAX: 7.12)
  Run 2: 6.96 (SE +/- 0.09, N = 3, MIN: 6.68 / MAX: 21.89)
  Run 1: 6.80 (SE +/- 0.06, N = 3, MIN: 6.62 / MAX: 10.18)

Target: Vulkan GPU - Model: efficientnet-b0
  Run 3: 11.56 (SE +/- 0.03, N = 3, MIN: 11.28 / MAX: 25.04)
  Run 2: 11.71 (SE +/- 0.08, N = 3, MIN: 11.47 / MAX: 14.37)
  Run 1: 11.33 (SE +/- 0.03, N = 3, MIN: 11.18 / MAX: 14.87)

Target: Vulkan GPU - Model: blazeface
  Run 3: 2.61 (SE +/- 0.03, N = 3, MIN: 2.53 / MAX: 2.69)
  Run 2: 7.67 (SE +/- 5.13, N = 3, MIN: 2.41 / MAX: 416.67)
  Run 1: 2.58 (SE +/- 0.02, N = 3, MIN: 2.47 / MAX: 2.74)

Target: Vulkan GPU - Model: googlenet
  Run 3: 23.97 (SE +/- 0.07, N = 3, MIN: 23.72 / MAX: 36.58)
  Run 2: 23.53 (SE +/- 0.06, N = 3, MIN: 23.31 / MAX: 26.84)
  Run 1: 23.63 (SE +/- 0.08, N = 3, MIN: 23.33 / MAX: 36.77)

Target: Vulkan GPU - Model: vgg16
  Run 3: 109.11 (SE +/- 0.21, N = 3, MIN: 108.06 / MAX: 125.36)
  Run 2: 109.70 (SE +/- 0.32, N = 3, MIN: 108.72 / MAX: 136.67)
  Run 1: 109.45 (SE +/- 0.26, N = 3, MIN: 108.54 / MAX: 123.94)

Target: Vulkan GPU - Model: resnet18
  Run 3: 25.77 (SE +/- 0.32, N = 3, MIN: 25 / MAX: 26.95)
  Run 2: 25.81 (SE +/- 0.19, N = 3, MIN: 25.13 / MAX: 40.98)
  Run 1: 25.79 (SE +/- 0.29, N = 3, MIN: 24.99 / MAX: 28.18)

Target: Vulkan GPU - Model: alexnet
  Run 3: 22.28 (SE +/- 0.10, N = 3, MIN: 21.96 / MAX: 22.64)
  Run 2: 22.41 (SE +/- 0.17, N = 3, MIN: 21.96 / MAX: 34.32)
  Run 1: 22.14 (SE +/- 0.13, N = 3, MIN: 21.81 / MAX: 24.68)

Target: Vulkan GPU - Model: resnet50
  Run 3: 52.10 (SE +/- 0.55, N = 3, MIN: 50.52 / MAX: 64.77)
  Run 2: 60.76 (SE +/- 8.74, N = 3, MIN: 51.25 / MAX: 1056.17)
  Run 1: 51.24 (SE +/- 0.54, N = 3, MIN: 50.5 / MAX: 66.36)

Target: Vulkan GPU - Model: yolov4-tiny
  Run 3: 44.16 (SE +/- 0.15, N = 3, MIN: 43.39 / MAX: 57.44)
  Run 2: 45.80 (SE +/- 0.74, N = 3, MIN: 43.83 / MAX: 58.33)
  Run 1: 46.17 (SE +/- 0.68, N = 3, MIN: 44.33 / MAX: 61)

Target: Vulkan GPU - Model: squeezenet_ssd
  Run 3: 34.18 (SE +/- 0.14, N = 3, MIN: 33.24 / MAX: 46.06)
  Run 2: 34.18 (SE +/- 0.27, N = 3, MIN: 32.78 / MAX: 44.2)
  Run 1: 33.77 (SE +/- 0.38, N = 3, MIN: 32.69 / MAX: 43.46)

Target: Vulkan GPU - Model: regnety_400m
  Run 3: 16.59 (SE +/- 0.05, N = 3, MIN: 16.38 / MAX: 20.04)
  Run 2: 17.41 (SE +/- 0.68, N = 3, MIN: 16.52 / MAX: 269.38)
  Run 1: 16.57 (SE +/- 0.10, N = 3, MIN: 16.28 / MAX: 16.95)

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 (M samples/s, More Is Better)

Acceleration: CPU - Scene: Bedroom
  Run 3: 0.752 (SE +/- 0.001, N = 3)
  Run 2: 0.754 (SE +/- 0.001, N = 3)
  Run 1: 0.750 (SE +/- 0.002, N = 3)

Acceleration: CPU - Scene: Supercar
  Run 3: 1.691 (SE +/- 0.002, N = 3)
  Run 2: 1.687 (SE +/- 0.002, N = 3)
  Run 1: 1.684 (SE +/- 0.005, N = 3)

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better)
  Run 3: 709282 (SE +/- 709.37, N = 3)
  Run 2: 709965 (SE +/- 2148.66, N = 3)
  Run 1: 710293 (SE +/- 448.34, N = 3)

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

Sunflow Rendering System 0.07.2 - Global Illumination + Image Synthesis (Seconds, Fewer Is Better)
  Run 3: 2.329 (SE +/- 0.024, N = 3, MIN: 2.21 / MAX: 3.04)
  Run 2: 2.293 (SE +/- 0.019, N = 3, MIN: 2.17 / MAX: 2.99)
  Run 1: 2.280 (SE +/- 0.012, N = 3, MIN: 2.17 / MAX: 2.91)

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

BRL-CAD 7.30.8 - VGR Performance Metric (More Is Better)
  Run 3: 48599
  Run 2: 48237
  Run 1: 48175
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2 - Static OMP Speedup (Speedup, More Is Better)
  Run 3: 1.3 (SE +/- 0.03, N = 12)
  Run 2: 1.1 (SE +/- 0.05, N = 12)
  Run 1: 1.3 (SE +/- 0.05, N = 12)
1. (CC) gcc options: -fopenmp -O3 -lm
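To put a speedup figure such as the one CLOMP reports in context, the sketch below uses the conventional definitions: speedup is serial time divided by parallel time, and parallel efficiency is speedup divided by the thread count. This is an assumption about how to read the number, not a description of CLOMP's internals. The timings are hypothetical, and the thread count of 8 comes from the 4 cores / 8 threads listed for this system.

```python
def speedup(serial_seconds, parallel_seconds):
    """Conventional speedup: how many times faster the parallel run is."""
    return serial_seconds / parallel_seconds


def parallel_efficiency(speedup_value, threads):
    """Fraction of ideal linear scaling achieved with the given thread count."""
    return speedup_value / threads


# Hypothetical timings chosen to give a speedup of about 1.3 (not measured data).
serial_seconds = 2.6
parallel_seconds = 2.0
threads = 8  # Core i7-4790K: 4 cores / 8 threads

s = speedup(serial_seconds, parallel_seconds)
e = parallel_efficiency(s, threads)
print(f"speedup {s:.1f}x, efficiency {e:.1%} on {threads} threads")
```

Read this way, a speedup near 1.3 on 8 threads would mean the static schedule recovers only a small share of ideal scaling for this workload, which is consistent with the benchmark's stated aim of exposing OpenMP overheads.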

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

Timed Eigen Compilation 3.3.9 - Time To Compile (Seconds, Fewer Is Better)
  Run 3: 70.60 (SE +/- 0.04, N = 3)
  Run 2: 70.73 (SE +/- 0.20, N = 3)
  Run 1: 70.66 (SE +/- 0.05, N = 3)

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

Monkey Audio Encoding 3.99.6 - WAV To APE (Seconds, Fewer Is Better)
  Run 3: 12.22 (SE +/- 0.06, N = 5)
  Run 2: 12.24 (SE +/- 0.04, N = 5)
  Run 1: 12.19 (SE +/- 0.02, N = 5)
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

WavPack Audio Encoding

WavPack Audio Encoding 5.3 - WAV To WavPack (Seconds, Fewer Is Better)
  Run 3: 13.72 (SE +/- 0.00, N = 5)
  Run 2: 13.73 (SE +/- 0.00, N = 5)
  Run 1: 13.79 (SE +/- 0.04, N = 5)
1. (CXX) g++ options: -rdynamic

148 Results Shown

Waifu2x-NCNN Vulkan:
  2x - 3 - No
  2x - 3 - Yes
Libplacebo:
  deband_heavy
  polar_nocompute
  hdr_peakdetect
  av1_grain_lap
Betsy GPU Compressor:
  ETC1 - Highest
  ETC2 RGB - Highest
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
yquake2:
  OpenGL 1.x - 1920 x 1080
  OpenGL 1.x - 2560 x 1440
  OpenGL 1.x - 3840 x 2160
  OpenGL 3.x - 1920 x 1080
  OpenGL 3.x - 2560 x 1440
  OpenGL 3.x - 3840 x 2160
  Software CPU - 1920 x 1080
  Software CPU - 2560 x 1440
  Software CPU - 3840 x 2160
HPC Challenge:
  G-HPL
  G-Ffte
  EP-DGEMM
  G-Ptrans
  EP-STREAM Triad
  G-Rand Access
  Rand Ring Latency
  Rand Ring Bandwidth
  Max Ping Pong Bandwidth
Timed HMMer Search
Timed MAFFT Alignment
LAMMPS Molecular Dynamics Simulator
simdjson:
  Kostya
  LargeRand
  PartialTweets
  DistinctUserID
LZ4 Compression:
  1 - Compression Speed
  1 - Decompression Speed
  3 - Compression Speed
  3 - Decompression Speed
  9 - Compression Speed
  9 - Decompression Speed
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
Embree:
  Pathtracer - Crown
  Pathtracer ISPC - Crown
  Pathtracer - Asian Dragon
  Pathtracer - Asian Dragon Obj
  Pathtracer ISPC - Asian Dragon
  Pathtracer ISPC - Asian Dragon Obj
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Ultra Fast
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Ultra Fast
rav1e:
  1
  5
  6
  10
x265:
  Bosphorus 4K
  Bosphorus 1080p
Coremark
Stockfish
asmFish
Timed FFmpeg Compilation
Build2
Numpy Benchmark
eSpeak-NG Speech Engine
Node.js V8 Web Tooling Benchmark
GROMACS
ASTC Encoder:
  Fast
  Medium
  Thorough
  Exhaustive
Basis Universal:
  ETC1S
  UASTC Level 0
  UASTC Level 2
  UASTC Level 3
  UASTC Level 2 + RDO Post-Processing
SQLite Speedtest
Redis:
  LPOP
  SADD
  LPUSH
  GET
  SET
NCNN:
  CPU - mobilenet
  CPU-v2-v2 - mobilenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU - shufflenet-v2
  CPU - mnasnet
  CPU - efficientnet-b0
  CPU - blazeface
  CPU - googlenet
  CPU - vgg16
  CPU - resnet18
  CPU - alexnet
  CPU - resnet50
  CPU - yolov4-tiny
  CPU - squeezenet_ssd
  CPU - regnety_400m
  Vulkan GPU - mobilenet
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU - shufflenet-v2
  Vulkan GPU - mnasnet
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - blazeface
  Vulkan GPU - googlenet
  Vulkan GPU - vgg16
  Vulkan GPU - resnet18
  Vulkan GPU - alexnet
  Vulkan GPU - resnet50
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - regnety_400m
IndigoBench:
  CPU - Bedroom
  CPU - Supercar
PHPBench
Sunflow Rendering System
BRL-CAD
CLOMP
Timed Eigen Compilation
Monkey Audio Encoding
WavPack Audio Encoding