Intel Xeon E5-1680 v3 testing with a ASUS X99-A (3902 BIOS) and eVGA NVIDIA NVE7 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012271-HA-XEONE516824 Xeon E5 1680 v3 Xmas - Phoronix Test Suite Xeon E5 1680 v3 Xmas Intel Xeon E5-1680 v3 testing with a ASUS X99-A (3902 BIOS) and eVGA NVIDIA NVE7 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012271-HA-XEONE516824&sro .
Xeon E5 1680 v3 Xmas Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 2 3 Intel Xeon E5-1680 v3 @ 3.80GHz (8 Cores / 16 Threads) ASUS X99-A (3902 BIOS) Intel Xeon E7 v3/Xeon 16GB PNY CS900 240GB eVGA NVIDIA NVE7 1GB Realtek ALC1150 G237HL Intel I218-V Ubuntu 20.04 5.4.0-47-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.3 Mesa 20.0.8 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x43 Python Details - Python 3.8.5 Security Details - itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Xeon E5 1680 v3 Xmas compilebench: Compile compilebench: Initial Create compilebench: Read Compiled Tree yquake2: Software CPU - 1920 x 1080 sockperf: Throughput sockperf: Latency Ping Pong sockperf: Latency Under Load lczero: BLAS lczero: Eigen clomp: Static OMP Speedup dolfyn: Computational Fluid Dynamics ffte: N=256, 3D Complex FFT Routine hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA lammps: Rhodopsin Protein byte: Dhrystone 2 compress-lz4: 1 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed crafty: Elapsed Time graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 1080p - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Very Fast kvazaar: Bosphorus 1080p - Ultra Fast rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 x265: Bosphorus 4K x265: Bosphorus 1080p coremark: CoreMark Size 666 - Iterations Per Second stockfish: Total Time asmfish: 1024 Hash Memory, 26 Depth build-ffmpeg: Time To Compile build2: Time To Compile numpy: build-eigen: Time To Compile encode-ape: WAV To APE encode-ogg: WAV To Ogg encode-opus: WAV To Opus Encode espeak: Text-To-Speech Synthesis rnnoise: node-web-tooling: couchdb: 100 - 1000 - 24 keydb: gromacs: Water Benchmark astcenc: Fast astcenc: Medium astcenc: Thorough astcenc: Exhaustive basis: ETC1S basis: UASTC Level 0 basis: UASTC Level 2 basis: UASTC Level 3 sqlite-speedtest: Timed Time - Size 1,000 redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET caffe: AlexNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: GoogleNet - CPU - 100 caffe: GoogleNet - CPU - 200 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m openvino: Face Detection 0106 FP16 - CPU openvino: Face Detection 0106 FP16 - CPU openvino: Face Detection 0106 FP32 - CPU openvino: Face Detection 0106 FP32 - CPU openvino: Person Detection 0106 FP16 - CPU openvino: Person Detection 0106 FP16 - CPU openvino: Person Detection 0106 FP32 - CPU openvino: Person Detection 0106 FP32 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP32 - CPU openvino: Age Gender Recognition Retail 0013 FP32 - CPU indigobench: CPU - Bedroom indigobench: CPU - Supercar hint: FLOAT ai-benchmark: Device Inference Score ai-benchmark: Device Training Score ai-benchmark: Device AI Score phpbench: PHP Benchmark Suite encode-wavpack: WAV To WavPack mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression unpack-firefox: firefox-84.0.source.tar.xz brl-cad: VGR Performance Metric 1 2 3 760.84 357.55 2243.77 75.4 265543 4.499 25.468 1105 1046 9.9 23.207 30109.610789668 162.575 11.358 5.055 37390438.7 6098.31 7499.0 42.97 7267.3 42.06 7283.6 7002968 262 598 90 101 648 175 565 4.53012 6.87752 3.13341 2.55204 12.5519 6.33479 9.17727 10.8160 7.87695 6.20319 4531.11 2474.88 4529.12 2472.67 3.01373 4533.01 2480.10 3.50993 8.6804 9.6807 10.0146 9.4002 12.0319 10.8120 3.62 15.65 10.28 18.56 36.78 69.70 0.272 0.815 1.088 2.415 8.85 35.51 270945.863096 14124404 21707918 81.843 173.720 285.15 97.316 14.157 24.755 9.678 35.721 25.711 9.77 134.602 470540.93 0.754 7.00 6.59 41.52 334.10 65.822 10.238 47.383 88.874 79.022 2000353.86 1639601.21 1289264.92 1834428.00 1481853.31 59307 118086 150994 301960 20.50 6.19 5.40 6.88 5.45 8.17 2.55 15.39 47.95 14.02 11.16 27.37 29.91 24.81 20.68 1.88 2114.40 1.88 2113.87 1.3 3070.21 1.3 3074.79 5069.47 0.77 5077.11 0.77 1.297 2.966 348188152.87162 1019 1060 2079 626480 15.977 67.17 61.46 22.30 3.18 23.160 88817 773.58 339.32 2297.29 73.6 266097 4.487 25.961 1104 1050 10.2 23.397 30542.463681196 163.454 11.317 5.026 37461981.3 6163.86 7494.0 42.99 7256.9 42.11 7259.4 6926721 264 612 90 101 651 176 575 4.58350 6.78791 3.15874 2.53457 12.4980 6.40014 9.23463 10.7546 7.90354 6.21295 4525.42 2468.97 4523.86 2470.37 2.98814 4523.92 2469.55 3.49680 8.6170 9.7353 10.0540 9.4191 12.1373 10.7986 3.61 15.67 10.28 18.52 36.78 69.21 0.272 0.816 1.093 2.425 8.75 36.22 273924.335703 14141342 21577603 81.860 172.592 289.20 97.179 14.170 24.749 9.643 37.479 25.669 9.71 134.085 475933.28 0.753 7.11 6.58 41.51 334.17 66.204 10.202 47.418 88.986 79.662 1685556.72 1625787.13 1306966.33 1750688.54 1456546.48 59205 118507 150853 301686 20.23 6.23 5.42 6.95 5.63 8.34 2.55 15.58 48.14 14.07 11.19 27.46 28.20 24.72 20.87 1.88 2115.29 1.88 2115.31 1.27 3078.70 1.3 3069.32 5044.33 0.77 5066.12 0.77 1.291 2.963 349172823.11238 1022 1060 2082 626218 15.980 67.93 63.56 22.33 3.19 22.837 88995 653.68 376.15 2286.43 74.8 265622 4.510 24.475 1091 1038 10.2 23.323 30708.448763254 163.313 11.256 5.072 37177008.1 6152.72 7503.5 43.04 7267.9 41.60 7278.7 7021270 264 613 90 101 656 176 578 4.55778 6.81139 3.14475 2.52494 12.4781 6.37270 9.19921 10.8074 7.66474 6.21070 4533.64 2468.80 4536.47 2468.25 2.97447 4526.33 2466.09 3.46034 8.6675 9.7629 10.0477 9.3499 12.0984 10.7868 3.55 15.43 10.10 18.29 36.02 68.68 0.272 0.818 1.086 2.419 8.87 35.60 273946.970904 13900575 21768662 81.743 178.600 283.88 97.807 14.139 24.777 9.616 38.497 25.521 9.50 135.298 475766.15 0.755 7.04 6.59 41.50 334.21 66.223 10.204 47.506 88.933 79.316 1279010.17 1635637.53 1243644.16 1852979.38 1464928.24 59221 118149 151020 301760 20.44 6.32 5.45 6.97 5.60 8.67 2.61 15.47 48.18 14.00 11.19 27.47 28.92 24.73 20.95 1.88 2116.99 1.88 2116.84 1.30 3073.95 1.3 3073.87 5066.61 0.77 5060.17 0.77 1.298 2.946 348871675.89817 1022 1065 2087 624167 16.012 71.22 67.82 22.33 3.19 23.217 88885 OpenBenchmarking.org
Compile Bench Test: Compile OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile 1 2 3 170 340 510 680 850 SE +/- 8.09, N = 3 SE +/- 5.81, N = 3 SE +/- 52.39, N = 12 760.84 773.58 653.68
Compile Bench Test: Initial Create OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Initial Create 1 2 3 80 160 240 320 400 SE +/- 5.18, N = 3 SE +/- 25.84, N = 3 SE +/- 7.94, N = 3 357.55 339.32 376.15
Compile Bench Test: Read Compiled Tree OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Read Compiled Tree 1 2 3 500 1000 1500 2000 2500 SE +/- 25.31, N = 3 SE +/- 12.51, N = 3 SE +/- 10.79, N = 3 2243.77 2297.29 2286.43
yquake2 Renderer: Software CPU - Resolution: 1920 x 1080 OpenBenchmarking.org Frames Per Second, More Is Better yquake2 7.45 Renderer: Software CPU - Resolution: 1920 x 1080 1 2 3 20 40 60 80 100 SE +/- 0.98, N = 5 SE +/- 0.35, N = 3 SE +/- 0.26, N = 3 75.4 73.6 74.8 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
Sockperf Test: Throughput OpenBenchmarking.org Messages Per Second, More Is Better Sockperf 3.4 Test: Throughput 1 2 3 60K 120K 180K 240K 300K SE +/- 2778.72, N = 25 SE +/- 2697.01, N = 25 SE +/- 2962.76, N = 5 265543 266097 265622 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Ping Pong OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong 1 2 3 1.0148 2.0296 3.0444 4.0592 5.074 SE +/- 0.037, N = 25 SE +/- 0.032, N = 5 SE +/- 0.055, N = 6 4.499 4.487 4.510 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load 1 2 3 6 12 18 24 30 SE +/- 0.84, N = 25 SE +/- 1.08, N = 20 SE +/- 1.22, N = 25 25.47 25.96 24.48 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS 1 2 3 200 400 600 800 1000 SE +/- 8.29, N = 3 SE +/- 4.93, N = 3 SE +/- 8.14, N = 3 1105 1104 1091 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen 1 2 3 200 400 600 800 1000 SE +/- 2.89, N = 3 SE +/- 5.03, N = 3 SE +/- 11.79, N = 3 1046 1050 1038 1. (CXX) g++ options: -flto -pthread
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup 1 2 3 3 6 9 12 15 SE +/- 0.14, N = 4 SE +/- 0.15, N = 4 SE +/- 0.09, N = 3 9.9 10.2 10.2 1. (CC) gcc options: -fopenmp -O3 -lm
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 23.21 23.40 23.32
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 7K 14K 21K 28K 35K SE +/- 340.92, N = 3 SE +/- 209.55, N = 3 SE +/- 89.40, N = 3 30109.61 30542.46 30708.45 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 40 80 120 160 200 SE +/- 0.21, N = 3 SE +/- 0.36, N = 3 SE +/- 0.13, N = 3 162.58 163.45 163.31 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 11.36 11.32 11.26 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 1.1412 2.2824 3.4236 4.5648 5.706 SE +/- 0.005, N = 3 SE +/- 0.072, N = 3 SE +/- 0.007, N = 3 5.055 5.026 5.072 1. (CXX) g++ options: -O3 -pthread -lm
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 8M 16M 24M 32M 40M SE +/- 165452.15, N = 3 SE +/- 46692.87, N = 3 SE +/- 122181.12, N = 3 37390438.7 37461981.3 37177008.1
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed 1 2 3 1300 2600 3900 5200 6500 SE +/- 1.20, N = 3 SE +/- 5.37, N = 3 SE +/- 8.03, N = 3 6098.31 6163.86 6152.72 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 8.20, N = 3 SE +/- 10.99, N = 3 SE +/- 10.24, N = 3 7499.0 7494.0 7503.5 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed 1 2 3 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 42.97 42.99 43.04 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 4.72, N = 3 SE +/- 6.48, N = 3 SE +/- 4.32, N = 3 7267.3 7256.9 7267.9 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed 1 2 3 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.56, N = 3 42.06 42.11 41.60 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed 1 2 3 1600 3200 4800 6400 8000 SE +/- 5.97, N = 3 SE +/- 4.05, N = 3 SE +/- 14.52, N = 3 7283.6 7259.4 7278.7 1. (CC) gcc options: -O3
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time 1 2 3 1.5M 3M 4.5M 6M 7.5M SE +/- 8272.92, N = 3 SE +/- 19512.27, N = 3 SE +/- 33788.32, N = 3 7002968 6926721 7021270 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl 1 2 3 60 120 180 240 300 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 262 264 264 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate 1 2 3 130 260 390 520 650 SE +/- 0.33, N = 3 598 612 613 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen 1 2 3 20 40 60 80 100 90 90 90 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced 1 2 3 20 40 60 80 100 101 101 101 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing 1 2 3 140 280 420 560 700 SE +/- 1.15, N = 3 SE +/- 1.15, N = 3 SE +/- 0.67, N = 3 648 651 656 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian 1 2 3 40 80 120 160 200 SE +/- 0.33, N = 3 175 176 176 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space 1 2 3 120 240 360 480 600 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 565 575 578 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 1.0313 2.0626 3.0939 4.1252 5.1565 SE +/- 0.01478, N = 3 SE +/- 0.03023, N = 3 SE +/- 0.02200, N = 3 4.53012 4.58350 4.55778 MIN: 4.43 MIN: 4.48 MIN: 4.45 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01639, N = 3 SE +/- 0.02409, N = 3 SE +/- 0.03424, N = 3 6.87752 6.78791 6.81139 MIN: 6.77 MIN: 6.64 MIN: 6.68 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.7107 1.4214 2.1321 2.8428 3.5535 SE +/- 0.01227, N = 3 SE +/- 0.01228, N = 3 SE +/- 0.01108, N = 3 3.13341 3.15874 3.14475 MIN: 3.07 MIN: 3.1 MIN: 3.07 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.5742 1.1484 1.7226 2.2968 2.871 SE +/- 0.01421, N = 3 SE +/- 0.01518, N = 3 SE +/- 0.01253, N = 3 2.55204 2.53457 2.52494 MIN: 2.5 MIN: 2.49 MIN: 2.48 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 12.55 12.50 12.48 MIN: 12.41 MIN: 12.37 MIN: 12.34 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00863, N = 3 SE +/- 0.01093, N = 3 SE +/- 0.03639, N = 3 6.33479 6.40014 6.37270 MIN: 6.26 MIN: 6.3 MIN: 6.26 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.01399, N = 3 SE +/- 0.02794, N = 3 SE +/- 0.00829, N = 3 9.17727 9.23463 9.19921 MIN: 9.08 MIN: 9.09 MIN: 9.09 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 10.82 10.75 10.81 MIN: 10.52 MIN: 10.48 MIN: 10.47 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.10075, N = 5 SE +/- 0.12917, N = 3 SE +/- 0.01396, N = 3 7.87695 7.90354 7.66474 MIN: 7.57 MIN: 7.65 MIN: 7.58 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01219, N = 3 SE +/- 0.02072, N = 3 SE +/- 0.01601, N = 3 6.20319 6.21295 6.21070 MIN: 6.12 MIN: 6.11 MIN: 6.1 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 1.69, N = 3 SE +/- 1.21, N = 3 SE +/- 2.17, N = 3 4531.11 4525.42 4533.64 MIN: 4524.03 MIN: 4516.35 MIN: 4525.25 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 2.75, N = 3 SE +/- 3.28, N = 3 SE +/- 1.82, N = 3 2474.88 2468.97 2468.80 MIN: 2467.18 MIN: 2460.79 MIN: 2463.37 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 0.35, N = 3 SE +/- 3.64, N = 3 SE +/- 7.29, N = 3 4529.12 4523.86 4536.47 MIN: 4522.47 MIN: 4513.41 MIN: 4520.13 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.91, N = 3 SE +/- 0.84, N = 3 SE +/- 1.99, N = 3 2472.67 2470.37 2468.25 MIN: 2467.6 MIN: 2466.62 MIN: 2462.29 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 0.6781 1.3562 2.0343 2.7124 3.3905 SE +/- 0.00785, N = 3 SE +/- 0.00758, N = 3 SE +/- 0.00748, N = 3 3.01373 2.98814 2.97447 MIN: 2.93 MIN: 2.91 MIN: 2.9 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 1000 2000 3000 4000 5000 SE +/- 2.65, N = 3 SE +/- 1.37, N = 3 SE +/- 3.38, N = 3 4533.01 4523.92 4526.33 MIN: 4522.75 MIN: 4517.94 MIN: 4516.82 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 10.28, N = 3 SE +/- 2.48, N = 3 SE +/- 0.45, N = 3 2480.10 2469.55 2466.09 MIN: 2465 MIN: 2462.7 MIN: 2461.63 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.7897 1.5794 2.3691 3.1588 3.9485 SE +/- 0.00979, N = 3 SE +/- 0.02742, N = 3 SE +/- 0.01650, N = 3 3.50993 3.49680 3.46034 MIN: 3.44 MIN: 3.33 MIN: 3.32 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Crown 1 2 3 2 4 6 8 10 SE +/- 0.0232, N = 3 SE +/- 0.0079, N = 3 SE +/- 0.0281, N = 3 8.6804 8.6170 8.6675 MIN: 8.59 / MAX: 8.82 MIN: 8.56 / MAX: 8.76 MIN: 8.57 / MAX: 8.83
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown 1 2 3 3 6 9 12 15 SE +/- 0.0256, N = 3 SE +/- 0.0397, N = 3 SE +/- 0.0193, N = 3 9.6807 9.7353 9.7629 MIN: 9.6 / MAX: 9.86 MIN: 9.62 / MAX: 9.95 MIN: 9.68 / MAX: 9.94
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 10.01 10.05 10.05 MIN: 9.97 / MAX: 10.12 MIN: 9.94 / MAX: 10.21 MIN: 9.86 / MAX: 10.24
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj 1 2 3 3 6 9 12 15 SE +/- 0.0214, N = 3 SE +/- 0.0255, N = 3 SE +/- 0.0092, N = 3 9.4002 9.4191 9.3499 MIN: 9.33 / MAX: 9.52 MIN: 9.36 / MAX: 9.54 MIN: 9.3 / MAX: 9.45
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 12.03 12.14 12.10 MIN: 11.97 / MAX: 12.2 MIN: 11.91 / MAX: 12.45 MIN: 11.87 / MAX: 12.37
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.81 10.80 10.79 MIN: 10.7 / MAX: 11.01 MIN: 10.71 / MAX: 11.01 MIN: 10.7 / MAX: 10.96
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium 1 2 3 0.8145 1.629 2.4435 3.258 4.0725 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.62 3.61 3.55 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 15.65 15.67 15.43 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 10.28 10.28 10.10 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 18.56 18.52 18.29 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast 1 2 3 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 36.78 36.78 36.02 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 1 2 3 16 32 48 64 80 SE +/- 0.11, N = 3 SE +/- 0.21, N = 3 SE +/- 0.22, N = 3 69.70 69.21 68.68 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 1 1 2 3 0.0612 0.1224 0.1836 0.2448 0.306 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.272 0.272 0.272
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 1 2 3 0.1841 0.3682 0.5523 0.7364 0.9205 SE +/- 0.002, N = 3 SE +/- 0.005, N = 3 SE +/- 0.005, N = 3 0.815 0.816 0.818
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 1 2 3 0.2459 0.4918 0.7377 0.9836 1.2295 SE +/- 0.001, N = 3 SE +/- 0.006, N = 3 SE +/- 0.003, N = 3 1.088 1.093 1.086
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 1 2 3 0.5456 1.0912 1.6368 2.1824 2.728 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.007, N = 3 2.415 2.425 2.419
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K 1 2 3 2 4 6 8 10 SE +/- 0.09, N = 8 SE +/- 0.12, N = 4 SE +/- 0.08, N = 3 8.85 8.75 8.87 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p 1 2 3 8 16 24 32 40 SE +/- 0.53, N = 3 SE +/- 0.16, N = 3 SE +/- 0.18, N = 3 35.51 36.22 35.60 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 3 60K 120K 180K 240K 300K SE +/- 1053.72, N = 3 SE +/- 590.16, N = 3 SE +/- 486.71, N = 3 270945.86 273924.34 273946.97 1. (CC) gcc options: -O2 -lrt" -lrt
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 1 2 3 3M 6M 9M 12M 15M SE +/- 166711.74, N = 3 SE +/- 71363.03, N = 3 SE +/- 194479.46, N = 4 14124404 14141342 13900575 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 1 2 3 5M 10M 15M 20M 25M SE +/- 260870.39, N = 3 SE +/- 222609.71, N = 3 SE +/- 288716.85, N = 3 21707918 21577603 21768662
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 3 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 81.84 81.86 81.74
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 2.07, N = 5 SE +/- 2.37, N = 4 SE +/- 1.61, N = 3 173.72 172.59 178.60
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark 1 2 3 60 120 180 240 300 SE +/- 0.88, N = 3 SE +/- 0.38, N = 3 SE +/- 0.75, N = 3 285.15 289.20 283.88
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile 1 2 3 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 97.32 97.18 97.81
Monkey Audio Encoding WAV To APE OpenBenchmarking.org Seconds, Fewer Is Better Monkey Audio Encoding 3.99.6 WAV To APE 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 5 SE +/- 0.06, N = 5 SE +/- 0.05, N = 5 14.16 14.17 14.14 1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
Ogg Audio Encoding WAV To Ogg OpenBenchmarking.org Seconds, Fewer Is Better Ogg Audio Encoding 1.3.4 WAV To Ogg 1 2 3 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 24.76 24.75 24.78 1. (CC) gcc options: -O2 -ffast-math -fsigned-char
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode 1 2 3 3 6 9 12 15 SE +/- 0.024, N = 5 SE +/- 0.033, N = 5 SE +/- 0.008, N = 5 9.678 9.643 9.616 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 2 3 9 18 27 36 45 SE +/- 0.32, N = 4 SE +/- 0.44, N = 20 SE +/- 0.55, N = 20 35.72 37.48 38.50 1. (CC) gcc options: -O2 -std=c99
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 25.71 25.67 25.52 1. (CC) gcc options: -O2 -pedantic -fvisibility=hidden
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 9.77 9.71 9.50 1. Nodejs
v10.19.0
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 30 60 90 120 150 SE +/- 0.34, N = 3 SE +/- 0.62, N = 3 SE +/- 2.21, N = 3 134.60 134.09 135.30 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 100K 200K 300K 400K 500K SE +/- 1362.98, N = 3 SE +/- 111.39, N = 3 SE +/- 531.93, N = 3 470540.93 475933.28 475766.15 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.1699 0.3398 0.5097 0.6796 0.8495 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.754 0.753 0.755 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
ASTC Encoder Preset: Fast OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Fast 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 7.00 7.11 7.04 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Medium 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 6.59 6.58 6.59 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Thorough 1 2 3 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 41.52 41.51 41.50 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.0 Preset: Exhaustive 1 2 3 70 140 210 280 350 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 334.10 334.17 334.21 1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: ETC1S 1 2 3 15 30 45 60 75 SE +/- 0.44, N = 3 SE +/- 0.19, N = 3 SE +/- 0.05, N = 3 65.82 66.20 66.22 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 0 1 2 3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 10.24 10.20 10.20 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 2 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 47.38 47.42 47.51 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.12 Settings: UASTC Level 3 1 2 3 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 88.87 88.99 88.93 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 1 2 3 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.41, N = 3 SE +/- 0.06, N = 3 79.02 79.66 79.32 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 33266.07, N = 15 SE +/- 113389.94, N = 12 SE +/- 28277.44, N = 15 2000353.86 1685556.72 1279010.17 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 11131.75, N = 3 SE +/- 31572.89, N = 15 SE +/- 24311.83, N = 15 1639601.21 1625787.13 1635637.53 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 16275.71, N = 5 SE +/- 10203.98, N = 15 SE +/- 34112.41, N = 12 1289264.92 1306966.33 1243644.16 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 22966.71, N = 3 SE +/- 26369.12, N = 15 SE +/- 28672.73, N = 3 1834428.00 1750688.54 1852979.38 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 34189.05, N = 15 SE +/- 26544.26, N = 15 SE +/- 32485.29, N = 15 1481853.31 1456546.48 1464928.24 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 13K 26K 39K 52K 65K SE +/- 170.12, N = 3 SE +/- 51.35, N = 3 SE +/- 27.33, N = 3 59307 59205 59221 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 30K 60K 90K 120K 150K SE +/- 18.73, N = 3 SE +/- 244.34, N = 3 SE +/- 86.87, N = 3 118086 118507 118149 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 30K 60K 90K 120K 150K SE +/- 103.00, N = 3 SE +/- 78.70, N = 3 SE +/- 108.38, N = 3 150994 150853 151020 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 60K 120K 180K 240K 300K SE +/- 189.29, N = 3 SE +/- 155.18, N = 3 SE +/- 180.52, N = 3 301960 301686 301760 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mobilenet 1 2 3 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.01, N = 3 SE +/- 0.20, N = 3 20.50 20.23 20.44 MIN: 20.12 / MAX: 38.51 MIN: 20.1 / MAX: 22.33 MIN: 20.12 / MAX: 27.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 6.19 6.23 6.32 MIN: 6 / MAX: 15.89 MIN: 6.01 / MAX: 16.48 MIN: 6.04 / MAX: 17.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 3 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 5.40 5.42 5.45 MIN: 5.25 / MAX: 17.21 MIN: 5.27 / MAX: 16.73 MIN: 5.3 / MAX: 16.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: shufflenet-v2 1 2 3 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 6.88 6.95 6.97 MIN: 6.77 / MAX: 17.73 MIN: 6.77 / MAX: 18.58 MIN: 6.8 / MAX: 18.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: mnasnet 1 2 3 1.2668 2.5336 3.8004 5.0672 6.334 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 5.45 5.63 5.60 MIN: 5.28 / MAX: 16.44 MIN: 5.37 / MAX: 17.72 MIN: 5.39 / MAX: 16.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: efficientnet-b0 1 2 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.35, N = 3 8.17 8.34 8.67 MIN: 8.07 / MAX: 8.37 MIN: 8.06 / MAX: 16.21 MIN: 8.06 / MAX: 89.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: blazeface 1 2 3 0.5873 1.1746 1.7619 2.3492 2.9365 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 2.55 2.55 2.61 MIN: 2.48 / MAX: 2.62 MIN: 2.51 / MAX: 2.59 MIN: 2.55 / MAX: 3.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: googlenet 1 2 3 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 15.39 15.58 15.47 MIN: 15.11 / MAX: 19.35 MIN: 15.19 / MAX: 16.78 MIN: 15.2 / MAX: 16.43 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: vgg16 1 2 3 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 47.95 48.14 48.18 MIN: 47.64 / MAX: 50.16 MIN: 47.78 / MAX: 64.77 MIN: 47.69 / MAX: 57.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet18 1 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.02 14.07 14.00 MIN: 13.88 / MAX: 14.41 MIN: 13.87 / MAX: 19.55 MIN: 13.86 / MAX: 15.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: alexnet 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 11.16 11.19 11.19 MIN: 11.09 / MAX: 11.53 MIN: 11.11 / MAX: 11.28 MIN: 11.11 / MAX: 20.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: resnet50 1 2 3 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 27.37 27.46 27.47 MIN: 26.85 / MAX: 35.85 MIN: 27.01 / MAX: 44.87 MIN: 27.05 / MAX: 45.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: yolov4-tiny 1 2 3 7 14 21 28 35 SE +/- 0.25, N = 3 SE +/- 0.07, N = 3 SE +/- 0.76, N = 3 29.91 28.20 28.92 MIN: 27.91 / MAX: 34.95 MIN: 27.88 / MAX: 28.62 MIN: 27.88 / MAX: 35.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: squeezenet_ssd 1 2 3 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 24.81 24.72 24.73 MIN: 24.62 / MAX: 48.23 MIN: 24.64 / MAX: 26.71 MIN: 24.64 / MAX: 27.27 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: CPU - Model: regnety_400m 1 2 3 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 20.68 20.87 20.95 MIN: 20.41 / MAX: 37.64 MIN: 20.52 / MAX: 52.3 MIN: 20.63 / MAX: 34.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 1 2 3 0.423 0.846 1.269 1.692 2.115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.88 1.88 1.88
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.54, N = 3 SE +/- 0.91, N = 3 SE +/- 1.10, N = 3 2114.40 2115.29 2116.99
OpenVINO Model: Face Detection 0106 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 1 2 3 0.423 0.846 1.269 1.692 2.115 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.88 1.88 1.88
OpenVINO Model: Face Detection 0106 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 0.83, N = 3 SE +/- 0.51, N = 3 SE +/- 1.55, N = 3 2113.87 2115.31 2116.84
OpenVINO Model: Person Detection 0106 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 1 2 3 0.2925 0.585 0.8775 1.17 1.4625 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.30 1.27 1.30
OpenVINO Model: Person Detection 0106 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 1 2 3 700 1400 2100 2800 3500 SE +/- 2.92, N = 3 SE +/- 2.89, N = 3 SE +/- 0.75, N = 3 3070.21 3078.70 3073.95
OpenVINO Model: Person Detection 0106 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 1 2 3 0.2925 0.585 0.8775 1.17 1.4625 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.3 1.3 1.3
OpenVINO Model: Person Detection 0106 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 1 2 3 700 1400 2100 2800 3500 SE +/- 1.72, N = 3 SE +/- 1.16, N = 3 SE +/- 5.64, N = 3 3074.79 3069.32 3073.87
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 13.21, N = 3 SE +/- 8.75, N = 3 SE +/- 7.38, N = 3 5069.47 5044.33 5066.61
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 1 2 3 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.77 0.77 0.77
OpenVINO Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 9.42, N = 3 SE +/- 9.51, N = 3 SE +/- 7.20, N = 3 5077.11 5066.12 5060.17
OpenVINO Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 1 2 3 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.77 0.77 0.77
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom 1 2 3 0.2921 0.5842 0.8763 1.1684 1.4605 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 1.297 1.291 1.298
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar 1 2 3 0.6674 1.3348 2.0022 2.6696 3.337 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 2.966 2.963 2.946
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 70M 140M 210M 280M 350M SE +/- 671953.04, N = 3 SE +/- 348495.09, N = 3 SE +/- 110846.40, N = 3 348188152.87 349172823.11 348871675.90 1. (CC) gcc options: -O3 -march=native -lm
AI Benchmark Alpha Device Inference Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Inference Score 1 2 3 200 400 600 800 1000 1019 1022 1022
AI Benchmark Alpha Device Training Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Training Score 1 2 3 200 400 600 800 1000 1060 1060 1065
AI Benchmark Alpha Device AI Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device AI Score 1 2 3 400 800 1200 1600 2000 2079 2082 2087
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 130K 260K 390K 520K 650K SE +/- 676.23, N = 3 SE +/- 872.14, N = 3 SE +/- 1083.96, N = 3 626480 626218 624167
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 5 SE +/- 0.05, N = 5 SE +/- 0.05, N = 5 15.98 15.98 16.01 1. (CXX) g++ options: -rdynamic
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 16 32 48 64 80 SE +/- 0.62, N = 15 SE +/- 0.78, N = 6 SE +/- 1.19, N = 3 67.17 67.93 71.22
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 15 30 45 60 75 SE +/- 0.21, N = 3 SE +/- 1.18, N = 12 SE +/- 2.69, N = 12 61.46 63.56 67.82
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 22.30 22.33 22.33
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.7178 1.4356 2.1534 2.8712 3.589 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.18 3.19 3.19
Unpacking Firefox Extracting: firefox-84.0.source.tar.xz OpenBenchmarking.org Seconds, Fewer Is Better Unpacking Firefox 84.0 Extracting: firefox-84.0.source.tar.xz 1 2 3 6 12 18 24 30 SE +/- 0.31, N = 4 SE +/- 0.28, N = 6 SE +/- 0.33, N = 4 23.16 22.84 23.22
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 3 20K 40K 60K 80K 100K 88817 88995 88885 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
Phoronix Test Suite v10.8.5