1245 2021
Tests for a future article. Intel Xeon E3-1245 v5 testing with an MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) and MSI Intel HD P530 SKL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106018-IB-12452021428&rdt&grs
System details (results 1, 2, 3):
Processor: Intel Xeon E3-1245 v5 @ 3.90GHz (4 Cores / 8 Threads)
Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS)
Chipset: Intel Xeon E3-1200 v5/E3-1500
Memory: 32GB
Disk: 120GB Samsung SSD 850
Graphics: MSI Intel HD P530 SKL GT2 3GB (1150MHz)
Audio: Realtek ALC1150
Monitor: LG Ultra HD
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc7daily20201003-generic (x86_64) 20201002
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 20.2.6
Vulkan: 1.2.145
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 1.9.1
Python Details: Python 3.8.5
Security Details:
itlb_multihit: KVM: Mitigation of VMX disabled
l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
mds: Mitigation of Clear buffers; SMT vulnerable
meltdown: Mitigation of PTI
spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
srbds: Mitigation of Microcode
tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
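For readers who want to compare their own machine against the kernel, processor, and security details listed above, the following is a minimal sketch that reads similar values from Linux sysfs. The paths are standard Linux locations, but the export does not say how the Phoronix Test Suite gathers these values, so treat the mapping as an assumption. The names `SYSFS_SOURCES`, `read_text_or_none`, and `collect_system_details` are hypothetical helpers for illustration.

```python
from pathlib import Path

# Standard Linux sysfs locations (assumption: the machine runs a Linux kernel
# that exposes them; missing files are reported as None).
SYSFS_SOURCES = {
    "transparent_hugepages": Path("/sys/kernel/mm/transparent_hugepage/enabled"),
    "scaling_governor": Path("/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor"),
}
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_text_or_none(path):
    """Return the stripped file contents, or None if the file cannot be read."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def collect_system_details():
    """Gather settings comparable to the detail lines in the result export."""
    details = {name: read_text_or_none(path) for name, path in SYSFS_SOURCES.items()}
    mitigations = {}
    if VULN_DIR.is_dir():
        # One file per vulnerability, e.g. meltdown, spectre_v2, mds.
        for entry in sorted(VULN_DIR.iterdir()):
            mitigations[entry.name] = read_text_or_none(entry)
    details["mitigations"] = mitigations
    return details


if __name__ == "__main__":
    for key, value in collect_system_details().items():
        print(key, "=", value)
```

On Linux the transparent huge pages file lists every mode and brackets the active one, so a system configured like the one above would show `madvise` in brackets.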
1245 2021 cp2k: Fayalite-FIST namd: ATPase Simulation - 327,506 Atoms luaradio: Complex Phase mnn: resnet-v2-50 srsran: PHY_DL_Test openscad: Leonardo Phone Case Slim srsran: PHY_DL_Test mnn: SqueezeNetV1.0 compress-zstd: 19, Long Mode - Compression Speed yquake2: Software CPU - 1920 x 1080 srslte: PHY_DL_Test chia-vdf: Square Plain C++ mnn: MobileNetV2_224 compress-zstd: 8, Long Mode - Compression Speed build-wasmer: Time To Compile compress-zstd: 3 - Compression Speed aom-av1: Speed 9 Realtime - Bosphorus 1080p basis: UASTC Level 0 srsran: OFDM_Test npb: CG.C aom-av1: Speed 8 Realtime - Bosphorus 1080p stockfish: Total Time luaradio: Five Back to Back FIR Filters avifenc: 0 npb: SP.C draco: Lion yquake2: OpenGL 3.x - 1920 x 1080 libgav1: Chimera 1080p mnn: mobilenet-v1-1.0 liquid-dsp: 1 - 256 - 57 mnn: inception-v3 liquid-dsp: 4 - 256 - 57 gnuradio: Five Back to Back FIR Filters chia-vdf: Square Assembly Optimized yquake2: OpenGL 1.x - 1920 x 1080 compress-zstd: 3, Long Mode - Compression Speed botan: ChaCha20Poly1305 npb: IS.D pjsip: INVITE yquake2: Software CPU - 3840 x 2160 aom-av1: Speed 6 Realtime - Bosphorus 1080p yquake2: Software CPU - 2560 x 1440 astcenc: Medium svt-vp9: VMAF Optimized - Bosphorus 1080p npb: FT.C srslte: OFDM_Test embree: Pathtracer ISPC - Crown ngspice: C7552 luaradio: FM Deemphasis Filter gnuradio: FIR Filter onednn: Convolution Batch Shapes Auto - f32 - CPU compress-zstd: 8 - Compression Speed botan: ChaCha20Poly1305 - Decrypt embree: Pathtracer ISPC - Asian Dragon avifenc: 2 embree: Pathtracer - Asian Dragon Obj gnuradio: FM Deemphasis Filter srslte: PHY_DL_Test svt-av1: Preset 8 - Bosphorus 1080p svt-av1: Preset 4 - Bosphorus 1080p libgav1: Summer Nature 1080p draco: Church Facade dav1d: Summer Nature 1080p npb: EP.C gnuradio: Hilbert Transform npb: BT.C aom-av1: Speed 8 Realtime - Bosphorus 4K viennacl: CPU BLAS - dGEMM-NT v-ray: CPU tjbench: Decompression Throughput yquake2: OpenGL 1.x - 3840 x 2160 onednn: IP Shapes 3D - f32 - 
CPU botan: Blowfish avifenc: 6 openscad: Pistol yquake2: OpenGL 1.x - 2560 x 1440 svt-hevc: 10 - Bosphorus 1080p botan: AES-256 svt-hevc: 7 - Bosphorus 1080p gromacs: MPI CPU - water_GMX50_bare basis: ETC1S sysbench: RAM / Memory helsing: 12 digit onednn: Deconvolution Batch shapes_3d - f32 - CPU gnuradio: Signal Source (Cosine) svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p onednn: IP Shapes 1D - f32 - CPU botan: Twofish botan: Twofish - Decrypt compress-zstd: 19 - Compression Speed botan: AES-256 - Decrypt aom-av1: Speed 4 Two-Pass - Bosphorus 1080p avifenc: 10, Lossless pjsip: OPTIONS, Stateless toktx: Zstd Compression 9 svt-vp9: Visual Quality Optimized - Bosphorus 1080p embree: Pathtracer - Asian Dragon build-llvm: Unix Makefiles avifenc: 10 gnuradio: IIR Filter compress-zstd: 8 - Decompression Speed compress-zstd: 3, Long Mode - Decompression Speed openscad: Mini-ITX Case luaradio: Hilbert Transform openscad: Retro Car compress-zstd: 19, Long Mode - Decompression Speed avifenc: 6, Lossless hmmer: Pfam Database Search compress-zstd: 3 - Decompression Speed onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU ngspice: C2670 viennacl: CPU BLAS - sAXPY onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU toybrot: OpenMP xmrig: Monero - 1M dav1d: Chimera 1080p onednn: Deconvolution Batch shapes_1d - f32 - CPU incompact3d: input.i3d 129 Cells Per Direction yquake2: OpenGL 3.x - 3840 x 2160 aom-av1: Speed 6 Realtime - Bosphorus 4K compress-zstd: 8, Long Mode - Decompression Speed xmrig: Wownero - 1M securemark: SecureMark-TLS yquake2: OpenGL 3.x - 2560 x 1440 mrbayes: Primate Phylogeny Analysis dav1d: Summer Nature 4K brl-cad: VGR Performance Metric npb: SP.B npb: MG.C embree: Pathtracer - Crown libgav1: Chimera 1080p 10-bit build-erlang: Time To Compile aom-av1: Speed 9 Realtime - Bosphorus 4K build-gdb: Time To Compile onednn: Recurrent Neural Network Inference - f32 - CPU compress-zstd: 19 - Decompression Speed 
onednn: Recurrent Neural Network Training - f32 - CPU svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 4 - Bosphorus 4K build-linux-kernel: Time To Compile build-mesa: Time To Compile liquid-dsp: 2 - 256 - 57 libgav1: Summer Nature 4K astcenc: Exhaustive toktx: UASTC 3 toktx: UASTC 3 + Zstd Compression 19 botan: Blowfish - Decrypt npb: EP.D botan: KASUMI build-llvm: Ninja onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU embree: Pathtracer ISPC - Asian Dragon Obj npb: LU.C onednn: Recurrent Neural Network Inference - u8s8f32 - CPU aom-av1: Speed 6 Two-Pass - Bosphorus 1080p gmpbench: Total Time vosk: onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU build-nodejs: Time To Compile helsing: 14 digit toybrot: C++ Threads onednn: IP Shapes 1D - u8s8f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU pjsip: OPTIONS, Stateful dav1d: Chimera 1080p 10-bit basis: UASTC Level 2 toktx: Zstd Compression 19 basis: UASTC Level 3 toybrot: C++ Tasks liquid-dsp: 8 - 256 - 57 onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU botan: CAST-256 - Decrypt botan: KASUMI - Decrypt onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU build-ffmpeg: Time To Compile botan: CAST-256 toktx: UASTC 4 + Zstd Compression 19 astcenc: Thorough openscad: Projector Mount Swivel sysbench: CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU viennacl: CPU BLAS - dGEMM-TT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - sCOPY oidn: RTLightmap.hdr.4096x4096 oidn: RT.ldr_alb_nrm.3840x2160 oidn: RT.hdr_alb_nrm.3840x2160 svt-hevc: 1 - Bosphorus 1080p aom-av1: Speed 0 Two-Pass - Bosphorus 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 4K aom-av1: Speed 4 Two-Pass - Bosphorus 4K aom-av1: Speed 0 Two-Pass - Bosphorus 4K 1 2 3 1128.871 3.89873 459.1 43.828 82.3 
22.683 212 7.427 16.5 91.2 85.6 116100 3.935 182.6 208.224 1353.2 88.52 11.498 95100000 3491.89 72.64 10048442 565.4 140.047 4789.81 5936 305.3 134.43 4.513 51576000 55.45 179850000 558.8 178400 326.6 780.2 702.824 530.05 3865 28.5 12.41 56.9 9.2336 83.12 7402.12 101600000 4.7632 153.712 353.5 583.2 20.1566 165.7 698.792 5.9147 70.492 4.7135 594.1 224.2 25.054 2.138 159.15 9043 293.12 528.54 416.4 14115.43 21.97 13.8 4056 173.593651 89.4 12.1832 420.115 24.113 131.54 191.5 100.79 3725.527 47.01 0.506 37.711 16196.99 12.393 14.7556 2213.1 87.14 8.04538 336.448 338.124 18.6 3720.585 3.85 9.322 52692 3.321 69.11 5.0308 1754.395 5.298 439.9 3280.3 3401.3 54.719 67.2 22.723 2854.1 140.193 157.514 3191 8.28836 153.739 28.4 3.10885 5.16444 111218 1513.7 319.05 14.8412 58.1343498 97.4 6.73 3500.7 2163.5 222244 207.5 144.562 86.29 48966 4692.51 7618.61 4.2076 36.47 206.774 33.04 130.595 3953.04 2820.6 7508.26 7.183 0.691 221.828 126.952 98985000 42.39 263.3219 29.691 36.141 416.538 527.59 87.205 1737.26 4.89271 5.2945 15136.11 3954.76 10.36 4831.8 23.685 19.7534 1105.62 1268.619 111077 3.73302 7489.27 6895 241.75 74.856 23.315 148.009 111176 212370000 7493.28 132.999 83.415 6.05256 137.306 132.908 865.344 33.9441 122.174 7633.23 3951.94 14.9 14.3 28.1 28.7 28.2 27.4 18.4 28.5 19 0.08 0.17 0.17 3 0.2 3.32 1.76 0.06 1262.536 4.16104 480.8 45.727 85.7 21.917 217.8 7.608 16.9 93.4 83.9 118700 4.017 181.3 204.283 1343 90.18 11.296 96000000 3464.75 71.52 9945139 565.5 139.356 4781.62 5853 309.5 133.37 4.574 52258000 56.179 181980000 557.7 176400 327.6 773.4 709.948 530.71 3854 28.2 12.37 57.5 9.1377 82.46 7465.46 100600000 4.7791 155.235 357 577.9 19.9605 166 704.67 5.8603 69.865 4.7257 593.8 222.9 24.901 2.12 160.48 8969 295.46 524.4 415.5 14184.14 22.04 13.9 4085 172.407981 89.6 12.2653 419.696 23.953 130.684 191.4 100.18 3703.085 46.78 0.503 37.492 16103.81 12.464 14.8397 2219.5 86.66 8.08952 338.287 339.962 18.7 3701.131 3.86 9.274 52963 3.338 68.76 5.0558 1763.005 5.324 
441.5 3287.2 3417 54.472 67.5 22.626 2866 139.611 156.943 3192 8.32126 153.161 28.3 3.11921 5.14749 110853 1518.6 320.08 14.8887 58.287674 97.5 6.72 3499.1 2164.5 221587 206.9 144.775 86.41 49086 4682.65 7632.76 4.2145 36.4 206.385 33.01 130.823 3959.67 2816.2 7496.69 7.181 0.691 221.512 126.791 99104000 42.34 263.0161 29.657 36.181 416.317 528.05 87.115 1739.033 4.88774 5.2899 15121.32 3958.61 10.35 4827.6 23.663 19.7353 1106.599 1267.564 111155 3.736 7494.03 6891 241.86 74.899 23.302 147.94 111144 212440000 7490.84 133.04 83.423 6.05425 137.331 132.893 865.459 33.9472 122.182 7632.97 3952.06 14.4 14.9 14.3 28.1 28.7 28.2 27.4 18.4 28.5 19 0.08 0.17 0.17 3 0.2 3.32 1.76 0.06 1199.072 3.89068 458.2 85.7 216.9 16.6 93.4 85.8 118700 184.8 1368.4 88.88 96700000 3437.52 72.32 9897277 573.7 138.023 4722.14 308.4 132.61 552.4 176600 330.2 781.9 702.258 535.75 3824 28.4 12.28 56.9 82.28 7476 100600000 4.8104 355.9 577.5 167.3 697.934 5.8941 4.6845 589 222.3 25.113 2.122 159.57 295.07 526.55 418.7 14079.59 22.13 89 417.311 192.6 100.77 3723.347 46.73 2225.7 86.78 336.911 338.152 18.6 3720.277 3.84 52859 68.82 5.0421 439.4 3295.6 3412 67.3 2859.4 157.576 3179.2 111078 1516.6 319.85 58.3146057 97.7 6.74 3509.5 2158.1 207.1 144.391 86.51 4683.5 7634.37 4.216 36.4 32.98 2818.6 7.172 0.692 42.38 416.771 527.5 87.153 5.2952 15135.59 10.36 4827.2 111167 6893 241.89 111135 133.039 83.439 132.915 0.08 0.17 0.17 3 0.2 3.32 1.76 0.06 OpenBenchmarking.org
CP2K Molecular Dynamics 8.1, Input: Fayalite-FIST (Seconds, Fewer Is Better): Run 1 = 1128.87; Run 2 = 1262.54; Run 3 = 1199.07
NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better): Run 1 = 3.89873; Run 2 = 4.16104; Run 3 = 3.89068
LuaRadio 0.9.1, Test: Complex Phase (MiB/s, More Is Better): Run 1 = 459.1; Run 2 = 480.8; Run 3 = 458.2
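Because some results are "Fewer Is Better" and others "More Is Better", comparing runs means normalizing against the best value in the right direction. The sketch below illustrates that arithmetic using the CP2K, NAMD, and LuaRadio figures listed above. The function name `percent_behind_best` and the `RESULTS` table layout are hypothetical; this is not how OpenBenchmarking.org computes its own summaries.

```python
def percent_behind_best(values, lower_is_better):
    """Return, for each run, how far it trails the best run, in percent.

    The best run gets 0.0. For lower-is-better metrics the best value is the
    minimum; for higher-is-better metrics it is the maximum.
    """
    best = min(values) if lower_is_better else max(values)
    if lower_is_better:
        return [(v - best) / best * 100.0 for v in values]
    return [(best - v) / best * 100.0 for v in values]


# (label, [run 1, run 2, run 3], lower_is_better), values taken from the results above.
RESULTS = [
    ("CP2K Fayalite-FIST (Seconds)", [1128.87, 1262.54, 1199.07], True),
    ("NAMD ATPase (days/ns)", [3.89873, 4.16104, 3.89068], True),
    ("LuaRadio Complex Phase (MiB/s)", [459.1, 480.8, 458.2], False),
]

if __name__ == "__main__":
    for label, values, lower_is_better in RESULTS:
        gaps = percent_behind_best(values, lower_is_better)
        formatted = ", ".join(f"run {i + 1}: {g:.2f}%" for i, g in enumerate(gaps))
        print(f"{label}: {formatted}")
```

With these inputs, run 2 trails the best CP2K time by roughly 12 percent, while in the LuaRadio test run 2 is the best and the other two runs trail it by about 4.5 percent.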
Mobile Neural Network 1.1.3, Model: resnet-v2-50 (ms, Fewer Is Better): Run 1 = 43.83 (MIN: 43.51 / MAX: 65.42); Run 2 = 45.73 (MIN: 45.58 / MAX: 62.15). Notes: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
srsRAN 21.04, Test: PHY_DL_Test (UE Mb/s, More Is Better): Run 1 = 82.3; Run 2 = 85.7; Run 3 = 85.7. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
OpenSCAD, Render: Leonardo Phone Case Slim (Seconds, Fewer Is Better): Run 1 = 22.68; Run 2 = 21.92. Notes: OpenSCAD version 2019.05
srsRAN 21.04, Test: PHY_DL_Test (eNb Mb/s, More Is Better): Run 1 = 212.0; Run 2 = 217.8; Run 3 = 216.9. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Mobile Neural Network 1.1.3, Model: SqueezeNetV1.0 (ms, Fewer Is Better): Run 1 = 7.427 (MIN: 7.35 / MAX: 9.49); Run 2 = 7.608 (MIN: 7.55 / MAX: 8.11). Notes: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better): Run 1 = 16.5; Run 2 = 16.9; Run 3 = 16.6. Notes: (CC) gcc options: -O3 -pthread -lz -llzma
yquake2 7.45, Renderer: Software CPU - Resolution: 1920 x 1080 (Frames Per Second, More Is Better): Run 1 = 91.2; Run 2 = 93.4; Run 3 = 93.4. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
srsLTE 20.10.1, Test: PHY_DL_Test (UE Mb/s, More Is Better): Run 1 = 85.6; Run 2 = 83.9; Run 3 = 85.8. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Chia Blockchain VDF 1.0.1, Test: Square Plain C++ (IPS, More Is Better): Run 1 = 116100; Run 2 = 118700; Run 3 = 118700. Notes: (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Mobile Neural Network 1.1.3, Model: MobileNetV2_224 (ms, Fewer Is Better): Run 1 = 3.935 (MIN: 3.88 / MAX: 6.43); Run 2 = 4.017 (MIN: 3.97 / MAX: 6.51). Notes: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better): Run 1 = 182.6; Run 2 = 181.3; Run 3 = 184.8. Notes: (CC) gcc options: -O3 -pthread -lz -llzma
Timed Wasmer Compilation 1.0.2, Time To Compile (Seconds, Fewer Is Better): Run 1 = 208.22; Run 2 = 204.28. Notes: (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
Zstd Compression 1.5.0, Compression Level: 3 - Compression Speed (MB/s, More Is Better): Run 1 = 1353.2; Run 2 = 1343.0; Run 3 = 1368.4. Notes: (CC) gcc options: -O3 -pthread -lz -llzma
AOM AV1 3.1, Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 88.52; Run 2 = 90.18; Run 3 = 88.88. Notes: (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Basis Universal 1.13, Settings: UASTC Level 0 (Seconds, Fewer Is Better): Run 1 = 11.50; Run 2 = 11.30. Notes: (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
srsRAN 21.04, Test: OFDM_Test (Samples / Second, More Is Better): Run 1 = 95100000; Run 2 = 96000000; Run 3 = 96700000. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better): Run 1 = 3491.89; Run 2 = 3464.75; Run 3 = 3437.52. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
AOM AV1 3.1, Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 72.64; Run 2 = 71.52; Run 3 = 72.32. Notes: (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Stockfish 13, Total Time (Nodes Per Second, More Is Better): Run 1 = 10048442; Run 2 = 9945139; Run 3 = 9897277. Notes: (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
LuaRadio 0.9.1, Test: Five Back to Back FIR Filters (MiB/s, More Is Better): Run 1 = 565.4; Run 2 = 565.5; Run 3 = 573.7
libavif avifenc 0.9.0, Encoder Speed: 0 (Seconds, Fewer Is Better): Run 1 = 140.05; Run 2 = 139.36; Run 3 = 138.02. Notes: (CXX) g++ options: -O3 -fPIC -lm
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better): Run 1 = 4789.81; Run 2 = 4781.62; Run 3 = 4722.14. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Google Draco 1.4.1, Model: Lion (ms, Fewer Is Better): Run 1 = 5936; Run 2 = 5853. Notes: (CXX) g++ options: -O3
yquake2 7.45, Renderer: OpenGL 3.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better): Run 1 = 305.3; Run 2 = 309.5; Run 3 = 308.4. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
libgav1 0.16.3, Video Input: Chimera 1080p (FPS, More Is Better): Run 1 = 134.43; Run 2 = 133.37; Run 3 = 132.61. Notes: (CXX) g++ options: -O3 -lpthread -lrt
Mobile Neural Network 1.1.3, Model: mobilenet-v1-1.0 (ms, Fewer Is Better): Run 1 = 4.513 (MIN: 4.48 / MAX: 5.77); Run 2 = 4.574 (MIN: 4.53 / MAX: 7.74). Notes: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Liquid-DSP 2021.01.31, Threads: 1 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better): Run 1 = 51576000; Run 2 = 52258000. Notes: (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Mobile Neural Network 1.1.3, Model: inception-v3 (ms, Fewer Is Better): Run 1 = 55.45 (MIN: 55.22 / MAX: 75.48); Run 2 = 56.18 (MIN: 55.99 / MAX: 75.85). Notes: (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Liquid-DSP 2021.01.31, Threads: 4 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better): Run 1 = 179850000; Run 2 = 181980000. Notes: (CC) gcc options: -O3 -pthread -lm -lc -lliquid
GNU Radio, Test: Five Back to Back FIR Filters (MiB/s, More Is Better): Run 1 = 558.8; Run 2 = 557.7; Run 3 = 552.4. Notes: 3.8.1.0
Chia Blockchain VDF 1.0.1, Test: Square Assembly Optimized (IPS, More Is Better): Run 1 = 178400; Run 2 = 176400; Run 3 = 176600. Notes: (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
yquake2 7.45, Renderer: OpenGL 1.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better): Run 1 = 326.6; Run 2 = 327.6; Run 3 = 330.2. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
Zstd Compression 1.5.0, Compression Level: 3, Long Mode - Compression Speed (MB/s, More Is Better): Run 1 = 780.2; Run 2 = 773.4; Run 3 = 781.9. Notes: (CC) gcc options: -O3 -pthread -lz -llzma
Botan 2.17.3, Test: ChaCha20Poly1305 (MiB/s, More Is Better): Run 1 = 702.82; Run 2 = 709.95; Run 3 = 702.26. Notes: (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better): Run 1 = 530.05; Run 2 = 530.71; Run 3 = 535.75. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
PJSIP 2.11, Method: INVITE (Responses Per Second, More Is Better): Run 1 = 3865; Run 2 = 3854; Run 3 = 3824. Notes: (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lopus -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
yquake2 7.45, Renderer: Software CPU - Resolution: 3840 x 2160 (Frames Per Second, More Is Better): Run 1 = 28.5; Run 2 = 28.2; Run 3 = 28.4. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
AOM AV1 3.1, Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 12.41; Run 2 = 12.37; Run 3 = 12.28. Notes: (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
yquake2 7.45, Renderer: Software CPU - Resolution: 2560 x 1440 (Frames Per Second, More Is Better): Run 1 = 56.9; Run 2 = 57.5; Run 3 = 56.9. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
ASTC Encoder 2.4, Preset: Medium (Seconds, Fewer Is Better): Run 1 = 9.2336; Run 2 = 9.1377. Notes: (CXX) g++ options: -O3 -flto -pthread
SVT-VP9 0.3, Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 83.12; Run 2 = 82.46; Run 3 = 82.28. Notes: (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better): Run 1 = 7402.12; Run 2 = 7465.46; Run 3 = 7476.00. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
srsLTE 20.10.1, Test: OFDM_Test (Samples / Second, More Is Better): Run 1 = 101600000; Run 2 = 100600000; Run 3 = 100600000. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Embree 3.13, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better): Run 1 = 4.7632 (MIN: 4.74 / MAX: 4.84); Run 2 = 4.7791 (MIN: 4.75 / MAX: 4.85); Run 3 = 4.8104 (MIN: 4.78 / MAX: 4.89)
Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better): Run 1 = 153.71; Run 2 = 155.24. Notes: (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
LuaRadio 0.9.1, Test: FM Deemphasis Filter (MiB/s, More Is Better): Run 1 = 353.5; Run 2 = 357.0; Run 3 = 355.9
GNU Radio, Test: FIR Filter (MiB/s, More Is Better): Run 1 = 583.2; Run 2 = 577.9; Run 3 = 577.5. Notes: 3.8.1.0
oneDNN 2.1.2, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): Run 1 = 20.16 (MIN: 19.87); Run 2 = 19.96 (MIN: 19.89). Notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Zstd Compression 1.5.0, Compression Level: 8 - Compression Speed (MB/s, More Is Better): Run 1 = 165.7; Run 2 = 166.0; Run 3 = 167.3. Notes: (CC) gcc options: -O3 -pthread -lz -llzma
Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt (MiB/s, More Is Better): Run 1 = 698.79; Run 2 = 704.67; Run 3 = 697.93. Notes: (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Embree 3.13, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better): Run 1 = 5.9147 (MIN: 5.88 / MAX: 6.02); Run 2 = 5.8603 (MIN: 5.82 / MAX: 5.96); Run 3 = 5.8941 (MIN: 5.85 / MAX: 5.99)
libavif avifenc 0.9.0, Encoder Speed: 2 (Seconds, Fewer Is Better): Run 1 = 70.49; Run 2 = 69.87. Notes: (CXX) g++ options: -O3 -fPIC -lm
Embree 3.13, Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better): Run 1 = 4.7135 (MIN: 4.7 / MAX: 4.75); Run 2 = 4.7257 (MIN: 4.71 / MAX: 4.77); Run 3 = 4.6845 (MIN: 4.67 / MAX: 4.72)
GNU Radio, Test: FM Deemphasis Filter (MiB/s, More Is Better): Run 1 = 594.1; Run 2 = 593.8; Run 3 = 589.0. Notes: 3.8.1.0
srsLTE 20.10.1, Test: PHY_DL_Test (eNb Mb/s, More Is Better): Run 1 = 224.2; Run 2 = 222.9; Run 3 = 222.3. Notes: (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
SVT-AV1 0.8.7, Encoder Mode: Preset 8 - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 25.05; Run 2 = 24.90; Run 3 = 25.11. Notes: (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 0.8.7, Encoder Mode: Preset 4 - Input: Bosphorus 1080p (Frames Per Second, More Is Better): Run 1 = 2.138; Run 2 = 2.120; Run 3 = 2.122. Notes: (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
libgav1 0.16.3, Video Input: Summer Nature 1080p (FPS, More Is Better): Run 1 = 159.15; Run 2 = 160.48; Run 3 = 159.57. Notes: (CXX) g++ options: -O3 -lpthread -lrt
Google Draco 1.4.1, Model: Church Facade (ms, Fewer Is Better): Run 1 = 9043; Run 2 = 8969. Notes: (CXX) g++ options: -O3
dav1d 0.9.0, Video Input: Summer Nature 1080p (FPS, More Is Better): Run 1 = 293.12 (MIN: 269.46 / MAX: 319.44); Run 2 = 295.46 (MIN: 275.68 / MAX: 322.9); Run 3 = 295.07 (MIN: 275.14 / MAX: 322.49). Notes: (CC) gcc options: -pthread -lm
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better): Run 1 = 528.54; Run 2 = 524.40; Run 3 = 526.55. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
GNU Radio, Test: Hilbert Transform (MiB/s, More Is Better): Run 1 = 416.4; Run 2 = 415.5; Run 3 = 418.7. Notes: 3.8.1.0
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better): Run 1 = 14115.43; Run 2 = 14184.14; Run 3 = 14079.59. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
AOM AV1 3.1, Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better): Run 1 = 21.97; Run 2 = 22.04; Run 3 = 22.13. Notes: (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better): Run 1 = 13.8; Run 2 = 13.9. Notes: (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Chaos Group V-RAY 5, Mode: CPU (vsamples, More Is Better): Run 1 = 4056; Run 2 = 4085
libjpeg-turbo tjbench 2.1.0, Test: Decompression Throughput (Megapixels/sec, More Is Better): Run 1 = 173.59; Run 2 = 172.41. Notes: (CC) gcc options: -O3 -rdynamic -lm
yquake2 7.45, Renderer: OpenGL 1.x - Resolution: 3840 x 2160 (Frames Per Second, More Is Better): Run 1 = 89.4; Run 2 = 89.6; Run 3 = 89.0. Notes: (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
oneDNN 2.1.2, Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): Run 1 = 12.18 (MIN: 12.08); Run 2 = 12.27 (MIN: 12.07). Notes: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan 2.17.3, Test: Blowfish (MiB/s, More Is Better). Results: 1 = 420.12, 2 = 419.70, 3 = 417.31. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
libavif avifenc 0.9.0, Encoder Speed: 6 (Seconds, Fewer Is Better). Results: 1 = 24.11, 2 = 23.95. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
OpenSCAD, Render: Pistol (Seconds, Fewer Is Better). Results: 1 = 131.54, 2 = 130.68. Notes: 1. OpenSCAD version 2019.05
yquake2 7.45, Renderer: OpenGL 1.x - Resolution: 2560 x 1440 (Frames Per Second, More Is Better). Results: 1 = 191.5, 2 = 191.4, 3 = 192.6. Notes: 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
SVT-HEVC 1.5.0, Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 100.79, 2 = 100.18, 3 = 100.77. Notes: 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Botan 2.17.3, Test: AES-256 (MiB/s, More Is Better). Results: 1 = 3725.53, 2 = 3703.09, 3 = 3723.35. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
SVT-HEVC 1.5.0, Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 47.01, 2 = 46.78, 3 = 46.73. Notes: 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
GROMACS 2021.2, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better). Results: 1 = 0.506, 2 = 0.503. Notes: 1. (CXX) g++ options: -O3 -pthread
Basis Universal 1.13, Settings: ETC1S (Seconds, Fewer Is Better). Results: 1 = 37.71, 2 = 37.49. Notes: 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Sysbench 1.0.20, Test: RAM / Memory (MiB/sec, More Is Better). Results: 1 = 16196.99, 2 = 16103.81. Notes: 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
Helsing 1.0-beta, Digit Range: 12 digit (Seconds, Fewer Is Better). Results: 1 = 12.39, 2 = 12.46. Notes: 1. (CC) gcc options: -O2 -pthread
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 14.76 (MIN: 14.64), 2 = 14.84 (MIN: 14.65). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
GNU Radio, Test: Signal Source (Cosine) (MiB/s, More Is Better). Results: 1 = 2213.1, 2 = 2219.5, 3 = 2225.7. Notes: 1. 3.8.1.0
SVT-VP9 0.3, Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 87.14, 2 = 86.66, 3 = 86.78. Notes: 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 8.04538 (MIN: 7.79), 2 = 8.08952 (MIN: 7.82). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan 2.17.3, Test: Twofish (MiB/s, More Is Better). Results: 1 = 336.45, 2 = 338.29, 3 = 336.91. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Twofish - Decrypt (MiB/s, More Is Better). Results: 1 = 338.12, 2 = 339.96, 3 = 338.15. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better). Results: 1 = 18.6, 2 = 18.7, 3 = 18.6. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
Botan 2.17.3, Test: AES-256 - Decrypt (MiB/s, More Is Better). Results: 1 = 3720.59, 2 = 3701.13, 3 = 3720.28. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
AOM AV1 3.1, Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 3.85, 2 = 3.86, 3 = 3.84. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libavif avifenc 0.9.0, Encoder Speed: 10, Lossless (Seconds, Fewer Is Better). Results: 1 = 9.322, 2 = 9.274. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
PJSIP 2.11, Method: OPTIONS, Stateless (Responses Per Second, More Is Better). Results: 1 = 52692, 2 = 52963, 3 = 52859. Notes: 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lopus -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
KTX-Software toktx 4.0, Settings: Zstd Compression 9 (Seconds, Fewer Is Better). Results: 1 = 3.321, 2 = 3.338.
SVT-VP9 0.3, Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 69.11, 2 = 68.76, 3 = 68.82. Notes: 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Embree 3.13, Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better). Results: 1 = 5.0308 (MIN: 4.97 / MAX: 5.13), 2 = 5.0558 (MIN: 5.02 / MAX: 5.13), 3 = 5.0421 (MIN: 4.97 / MAX: 5.16).
Timed LLVM Compilation 12.0, Build System: Unix Makefiles (Seconds, Fewer Is Better). Results: 1 = 1754.40, 2 = 1763.01.
libavif avifenc 0.9.0, Encoder Speed: 10 (Seconds, Fewer Is Better). Results: 1 = 5.298, 2 = 5.324. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
GNU Radio, Test: IIR Filter (MiB/s, More Is Better). Results: 1 = 439.9, 2 = 441.5, 3 = 439.4. Notes: 1. 3.8.1.0
Zstd Compression 1.5.0, Compression Level: 8 - Decompression Speed (MB/s, More Is Better). Results: 1 = 3280.3, 2 = 3287.2, 3 = 3295.6. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better). Results: 1 = 3401.3, 2 = 3417.0, 3 = 3412.0. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenSCAD, Render: Mini-ITX Case (Seconds, Fewer Is Better). Results: 1 = 54.72, 2 = 54.47. Notes: 1. OpenSCAD version 2019.05
LuaRadio 0.9.1, Test: Hilbert Transform (MiB/s, More Is Better). Results: 1 = 67.2, 2 = 67.5, 3 = 67.3.
OpenSCAD, Render: Retro Car (Seconds, Fewer Is Better). Results: 1 = 22.72, 2 = 22.63. Notes: 1. OpenSCAD version 2019.05
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better). Results: 1 = 2854.1, 2 = 2866.0, 3 = 2859.4. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
libavif avifenc 0.9.0, Encoder Speed: 6, Lossless (Seconds, Fewer Is Better). Results: 1 = 140.19, 2 = 139.61. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
Timed HMMer Search 3.3.2, Pfam Database Search (Seconds, Fewer Is Better). Results: 1 = 157.51, 2 = 156.94, 3 = 157.58. Notes: 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm -lmpi
Zstd Compression 1.5.0, Compression Level: 3 - Decompression Speed (MB/s, More Is Better). Results: 1 = 3191.0, 2 = 3192.0, 3 = 3179.2. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 8.28836 (MIN: 8.25), 2 = 8.32126 (MIN: 8.29). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better). Results: 1 = 153.74, 2 = 153.16. Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
ViennaCL 1.7.1, Test: CPU BLAS - sAXPY (GB/s, More Is Better). Results: 1 = 28.4, 2 = 28.3. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
oneDNN 2.1.2, Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 3.10885 (MIN: 3.05), 2 = 3.11921 (MIN: 3.06). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 5.16444 (MIN: 5.08), 2 = 5.14749 (MIN: 5.08). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
toyBrot Fractal Generator 2020-11-18, Implementation: OpenMP (ms, Fewer Is Better). Results: 1 = 111218, 2 = 110853, 3 = 111078. Notes: 1. (CXX) g++ options: -O3 -lpthread
Xmrig 6.12.1, Variant: Monero - Hash Count: 1M (H/s, More Is Better). Results: 1 = 1513.7, 2 = 1518.6, 3 = 1516.6. Notes: 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
dav1d 0.9.0, Video Input: Chimera 1080p (FPS, More Is Better). Results: 1 = 319.05 (MIN: 233.68 / MAX: 511.52), 2 = 320.08 (MIN: 233.93 / MAX: 535.81), 3 = 319.85 (MIN: 234.06 / MAX: 525.41). Notes: 1. (CC) gcc options: -pthread -lm
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 14.84 (MIN: 10.92), 2 = 14.89 (MIN: 12.01). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better). Results: 1 = 58.13, 2 = 58.29, 3 = 58.31. Notes: 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
yquake2 7.45, Renderer: OpenGL 3.x - Resolution: 3840 x 2160 (Frames Per Second, More Is Better). Results: 1 = 97.4, 2 = 97.5, 3 = 97.7. Notes: 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
AOM AV1 3.1, Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 6.73, 2 = 6.72, 3 = 6.74. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better). Results: 1 = 3500.7, 2 = 3499.1, 3 = 3509.5. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
Xmrig 6.12.1, Variant: Wownero - Hash Count: 1M (H/s, More Is Better). Results: 1 = 2163.5, 2 = 2164.5, 3 = 2158.1. Notes: 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
SecureMark 1.0.4, Benchmark: SecureMark-TLS (marks, More Is Better). Results: 1 = 222244, 2 = 221587. Notes: 1. (CC) gcc options: -pedantic -O3
yquake2 7.45, Renderer: OpenGL 3.x - Resolution: 2560 x 1440 (Frames Per Second, More Is Better). Results: 1 = 207.5, 2 = 206.9, 3 = 207.1. Notes: 1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better). Results: 1 = 144.56, 2 = 144.78, 3 = 144.39. Notes: 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm
dav1d 0.9.0, Video Input: Summer Nature 4K (FPS, More Is Better). Results: 1 = 86.29 (MIN: 80.76 / MAX: 98.28), 2 = 86.41 (MIN: 80.82 / MAX: 98.22), 3 = 86.51 (MIN: 80.86 / MAX: 98.67). Notes: 1. (CC) gcc options: -pthread -lm
BRL-CAD 7.32.2, VGR Performance Metric (VGR Performance Metric, More Is Better). Results: 1 = 48966, 2 = 49086. Notes: 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -lm -pthread -ldl
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better). Results: 1 = 4692.51, 2 = 4682.65, 3 = 4683.50. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better). Results: 1 = 7618.61, 2 = 7632.76, 3 = 7634.37. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Embree 3.13, Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better). Results: 1 = 4.2076 (MIN: 4.19 / MAX: 4.26), 2 = 4.2145 (MIN: 4.19 / MAX: 4.26), 3 = 4.2160 (MIN: 4.2 / MAX: 4.27).
libgav1 0.16.3, Video Input: Chimera 1080p 10-bit (FPS, More Is Better). Results: 1 = 36.47, 2 = 36.40, 3 = 36.40. Notes: 1. (CXX) g++ options: -O3 -lpthread -lrt
Timed Erlang/OTP Compilation 23.2, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 206.77, 2 = 206.39.
AOM AV1 3.1, Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 33.04, 2 = 33.01, 3 = 32.98. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed GDB GNU Debugger Compilation 10.2, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 130.60, 2 = 130.82.
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 3953.04 (MIN: 3950.85), 2 = 3959.67 (MIN: 3956.18). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better). Results: 1 = 2820.6, 2 = 2816.2, 3 = 2818.6. Notes: 1. (CC) gcc options: -O3 -pthread -lz -llzma
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 7508.26 (MIN: 7496.63), 2 = 7496.69 (MIN: 7494.1). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-AV1 0.8.7, Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 7.183, 2 = 7.181, 3 = 7.172. Notes: 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 0.8.7, Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 0.691, 2 = 0.691, 3 = 0.692. Notes: 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Timed Linux Kernel Compilation 5.10.20, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 221.83, 2 = 221.51.
Timed Mesa Compilation 21.0, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 126.95, 2 = 126.79.
Liquid-DSP 2021.01.31, Threads: 2 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Results: 1 = 98985000, 2 = 99104000. Notes: 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
libgav1 0.16.3, Video Input: Summer Nature 4K (FPS, More Is Better). Results: 1 = 42.39, 2 = 42.34, 3 = 42.38. Notes: 1. (CXX) g++ options: -O3 -lpthread -lrt
ASTC Encoder 2.4, Preset: Exhaustive (Seconds, Fewer Is Better). Results: 1 = 263.32, 2 = 263.02. Notes: 1. (CXX) g++ options: -O3 -flto -pthread
KTX-Software toktx 4.0, Settings: UASTC 3 (Seconds, Fewer Is Better). Results: 1 = 29.69, 2 = 29.66.
KTX-Software toktx 4.0, Settings: UASTC 3 + Zstd Compression 19 (Seconds, Fewer Is Better). Results: 1 = 36.14, 2 = 36.18.
Botan 2.17.3, Test: Blowfish - Decrypt (MiB/s, More Is Better). Results: 1 = 416.54, 2 = 416.32, 3 = 416.77. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better). Results: 1 = 527.59, 2 = 528.05, 3 = 527.50. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Botan 2.17.3, Test: KASUMI (MiB/s, More Is Better). Results: 1 = 87.21, 2 = 87.12, 3 = 87.15. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Timed LLVM Compilation 12.0, Build System: Ninja (Seconds, Fewer Is Better). Results: 1 = 1737.26, 2 = 1739.03.
oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 4.89271 (MIN: 4.87), 2 = 4.88774 (MIN: 4.87). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Embree 3.13, Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better). Results: 1 = 5.2945 (MIN: 5.27 / MAX: 5.35), 2 = 5.2899 (MIN: 5.27 / MAX: 5.34), 3 = 5.2952 (MIN: 5.28 / MAX: 5.35).
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better). Results: 1 = 15136.11, 2 = 15121.32, 3 = 15135.59. Notes: 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 3954.76 (MIN: 3952.97), 2 = 3958.61 (MIN: 3954.55). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
AOM AV1 3.1, Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 10.36, 2 = 10.35, 3 = 10.36. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
GNU GMP GMPbench 6.2.1, Total Time (GMPbench Score, More Is Better). Results: 1 = 4831.8, 2 = 4827.6, 3 = 4827.2. Notes: 1. (CC) gcc options: -O3 -fomit-frame-pointer -lm
VOSK Speech Recognition Toolkit 0.3.21 (Seconds, Fewer Is Better). Results: 1 = 23.69, 2 = 23.66.
oneDNN 2.1.2, Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 19.75 (MIN: 19.66), 2 = 19.74 (MIN: 19.64). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed Node.js Compilation 15.11, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 1105.62, 2 = 1106.60.
Helsing 1.0-beta, Digit Range: 14 digit (Seconds, Fewer Is Better). Results: 1 = 1268.62, 2 = 1267.56. Notes: 1. (CC) gcc options: -O2 -pthread
toyBrot Fractal Generator 2020-11-18, Implementation: C++ Threads (ms, Fewer Is Better). Results: 1 = 111077, 2 = 111155, 3 = 111167. Notes: 1. (CXX) g++ options: -O3 -lpthread
oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 3.73302 (MIN: 3.7), 2 = 3.73600 (MIN: 3.69). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 7489.27 (MIN: 7484.39), 2 = 7494.03 (MIN: 7491.07). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
PJSIP 2.11, Method: OPTIONS, Stateful (Responses Per Second, More Is Better). Results: 1 = 6895, 2 = 6891, 3 = 6893. Notes: 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lopus -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O2
dav1d 0.9.0, Video Input: Chimera 1080p 10-bit (FPS, More Is Better). Results: 1 = 241.75 (MIN: 183.88 / MAX: 387.38), 2 = 241.86 (MIN: 184.2 / MAX: 389.96), 3 = 241.89 (MIN: 184.12 / MAX: 392.71). Notes: 1. (CC) gcc options: -pthread -lm
Basis Universal 1.13, Settings: UASTC Level 2 (Seconds, Fewer Is Better). Results: 1 = 74.86, 2 = 74.90. Notes: 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
KTX-Software toktx 4.0, Settings: Zstd Compression 19 (Seconds, Fewer Is Better). Results: 1 = 23.32, 2 = 23.30.
Basis Universal 1.13, Settings: UASTC Level 3 (Seconds, Fewer Is Better). Results: 1 = 148.01, 2 = 147.94. Notes: 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
toyBrot Fractal Generator 2020-11-18, Implementation: C++ Tasks (ms, Fewer Is Better). Results: 1 = 111176, 2 = 111144, 3 = 111135. Notes: 1. (CXX) g++ options: -O3 -lpthread
Liquid-DSP 2021.01.31, Threads: 8 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Results: 1 = 212370000, 2 = 212440000. Notes: 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
oneDNN 2.1.2, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 7493.28 (MIN: 7488.85), 2 = 7490.84 (MIN: 7484.98). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Botan 2.17.3, Test: CAST-256 - Decrypt (MiB/s, More Is Better). Results: 1 = 133.00, 2 = 133.04, 3 = 133.04. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: KASUMI - Decrypt (MiB/s, More Is Better). Results: 1 = 83.42, 2 = 83.42, 3 = 83.44. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 6.05256 (MIN: 6.01), 2 = 6.05425 (MIN: 6.01). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed FFmpeg Compilation 4.4, Time To Compile (Seconds, Fewer Is Better). Results: 1 = 137.31, 2 = 137.33.
Botan 2.17.3, Test: CAST-256 (MiB/s, More Is Better). Results: 1 = 132.91, 2 = 132.89, 3 = 132.92. Notes: 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
KTX-Software toktx 4.0, Settings: UASTC 4 + Zstd Compression 19 (Seconds, Fewer Is Better). Results: 1 = 865.34, 2 = 865.46.
ASTC Encoder 2.4, Preset: Thorough (Seconds, Fewer Is Better). Results: 1 = 33.94, 2 = 33.95. Notes: 1. (CXX) g++ options: -O3 -flto -pthread
OpenSCAD, Render: Projector Mount Swivel (Seconds, Fewer Is Better). Results: 1 = 122.17, 2 = 122.18. Notes: 1. OpenSCAD version 2019.05
Sysbench 1.0.20, Test: CPU (Events Per Second, More Is Better). Results: 1 = 7633.23, 2 = 7632.97. Notes: 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). Results: 1 = 3951.94 (MIN: 3949.03), 2 = 3952.06 (MIN: 3948.57). Notes: 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better). Results: 2 = 14.4. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better). Results: 1 = 14.9, 2 = 14.9. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better). Results: 1 = 14.3, 2 = 14.3. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-T (GB/s, More Is Better). Results: 1 = 28.1, 2 = 28.1. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dGEMV-N (GB/s, More Is Better). Results: 1 = 28.7, 2 = 28.7. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dDOT (GB/s, More Is Better). Results: 1 = 28.2, 2 = 28.2. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dAXPY (GB/s, More Is Better). Results: 1 = 27.4, 2 = 27.4. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - dCOPY (GB/s, More Is Better). Results: 1 = 18.4, 2 = 18.4. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - sDOT (GB/s, More Is Better). Results: 1 = 28.5, 2 = 28.5. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1, Test: CPU BLAS - sCOPY (GB/s, More Is Better). Results: 1 = 19, 2 = 19. Notes: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Intel Open Image Denoise 1.4.0, Run: RTLightmap.hdr.4096x4096 (Images / Sec, More Is Better). Results: 1 = 0.08, 2 = 0.08, 3 = 0.08.
Intel Open Image Denoise 1.4.0, Run: RT.ldr_alb_nrm.3840x2160 (Images / Sec, More Is Better). Results: 1 = 0.17, 2 = 0.17, 3 = 0.17.
Intel Open Image Denoise 1.4.0, Run: RT.hdr_alb_nrm.3840x2160 (Images / Sec, More Is Better). Results: 1 = 0.17, 2 = 0.17, 3 = 0.17.
SVT-HEVC 1.5.0, Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 3, 2 = 3, 3 = 3. Notes: 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
AOM AV1 3.1, Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Results: 1 = 0.2, 2 = 0.2, 3 = 0.2. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 3.1, Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 3.32, 2 = 3.32, 3 = 3.32. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 3.1, Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 1.76, 2 = 1.76, 3 = 1.76. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 3.1, Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better). Results: 1 = 0.06, 2 = 0.06, 3 = 0.06. Notes: 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Phoronix Test Suite v10.8.5