Intel Core i7-4960X testing with a MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) and AMD FirePro V4800 1GB on Ubuntu 20.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2102057-HA-4960X202146 4960x-2021 - Phoronix Test Suite 4960x-2021 Intel Core i7-4960X testing with a MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) and AMD FirePro V4800 1GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102057-HA-4960X202146&grs&sor .
4960x-2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 1a 2 3 Intel Core i7-4960X @ 4.00GHz (6 Cores / 12 Threads) MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) Intel Xeon E7 v2/Xeon 8GB VisionTek 240GB AMD FirePro V4800 1GB Realtek ALC892 DELL S2409W Realtek RTL8111/8168/8411 Ubuntu 20.10 5.8.0-33-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 3.3 Mesa 20.2.1 (LLVM 11.0.0) GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - 1, 2, 3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42e Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Graphics Details - 1a, 2, 3: GLAMOR
4960x-2021 redis: LPOP mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 askap: tConvolve MT - Gridding askap: tConvolve MPI - Degridding redis: LPUSH lzbench: XZ 0 - Compression askap: tConvolve MPI - Gridding kripke: lzbench: Brotli 2 - Compression qmcpack: simple-H2O askap: Hogbom Clean OpenMP redis: GET onnx: shufflenet-v2-10 - OpenMP CPU npb: EP.D redis: SET cloverleaf: Lagrangian-Eulerian Hydrodynamics lzbench: Brotli 2 - Decompression lulesh: npb: LU.C onnx: yolov4 - OpenMP CPU quantlib: mnn: inception-v3 paraview: Wavelet Volume - 1280 x 1024 pennant: sedovbig paraview: Wavelet Volume - 1280 x 1024 mnn: resnet-v2-50 rav1e: 10 rav1e: 6 dav1d: Summer Nature 4K redis: SADD cp2k: Fayalite-FIST Data pennant: leblancbig paraview: Wavelet Volume - 1920 x 1080 rav1e: 1 lzbench: Zstd 1 - Compression onnx: bertsquad-10 - OpenMP CPU paraview: Wavelet Volume - 1920 x 1080 askap: tConvolve MT - Degridding gnupg: 2.7GB Sample File Encryption tnn: CPU - SqueezeNet v1.1 amg: dav1d: Summer Nature 1080p tnn: CPU - MobileNet v2 build-godot: Time To Compile lzbench: Zstd 8 - Decompression webp2: Quality 75, Compression Effort 7 synthmark: VoiceMark_100 webp2: Default lzbench: Crush 0 - Decompression webp2: Quality 95, Compression Effort 7 rav1e: 5 webp2: Quality 100, Compression Effort 5 lzbench: Brotli 0 - Decompression askap: tConvolve OpenMP - Degridding gcrypt: openfoam: Motorbike 30M dav1d: Chimera 1080p dav1d: Chimera 1080p 10-bit onnx: super-resolution-10 - OpenMP CPU lzbench: Zstd 1 - Decompression paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1280 x 1024 paraview: Wavelet Contour - 1280 x 1024 webp2: Quality 100, Lossless Compression paraview: Wavelet Contour - 1920 x 1080 onnx: fcn-resnet101-11 - OpenMP CPU lzbench: Libdeflate 1 - Compression lzbench: Brotli 0 - Compression lzbench: Crush 0 - Compression lzbench: Zstd 8 - Compression lzbench: XZ 0 - Decompression mnn: SqueezeNetV1.0 askap: tConvolve OpenMP - Gridding lammps: Rhodopsin Protein npb: EP.C 1 1a 2 3 421116.24 16.874 11.214 471.256 979.569 352737.01 33 1101.04 20695700 158 53.256 83.6658 415048.13 7911 578.24 381871.14 341.50 634 945.56350 10854.09 128 1564.8 101.724 219.6609 97.548 1.778 0.651 72.72 398623.91 1518.815 159.6349 0.198 408 219 785.172 79.291 314.380 126186033 272.48 322.186 318.343 1431 436.407 548.479 7.632 474 791.188 0.523 22.671 545 903.892 268.853 612.98 293.74 72.90 1781 1362 1387.515 21 200 415 82 73 106 16.227 475.089 2.935 552.07 14.08 225.321 13.78 220.405 14.58 14.68 152.989 151.870 358938.94 15.146 10.799 483.420 1014.54 344615.09 33 1072.23 20275343 159 52.471 84.6029 407879.64 7776 588.19 376442.55 344.10 634 951.21757 10883.31 128 1563.9 102.443 14.16 220.7694 226.566 96.945 1.782 0.655 73.09 400109.75 1510.857 159.0102 13.75 0.199 410 220 220.081 786.565 79.061 313.238 126339000 273.14 323.197 317.350 1435 435.691 548.206 7.629 473 791.017 0.524 22.651 544 905.474 268.424 612.76 293.88 73.01 1780 1363 14.57 14.69 153.054 1386.947 151.848 21 200 415 82 73 106 16.901 483.634 2.879 578.92 357535.78 15.124 11.476 488.906 990.445 355245.38 34 1088.36 20478857 156 52.293 85.1549 410152.44 7911 579.26 377130.07 340.22 628 953.28322 10939.94 129 1552.8 101.691 14.17 221.0630 226.756 97.179 1.789 0.654 73.15 400763.15 1517.97 158.8181 13.71 0.199 410 220 219.405 788.586 78.954 313.335 125923000 273.34 322.574 318.316 1432 436.859 547.240 7.615 473 789.525 0.524 22.629 544 904.838 268.594 612.01 294.20 72.96 1782 1362 14.57 14.69 153.050 1387.329 151.852 21 200 415 82 73 106 16.466 475.962 2.924 547.09 OpenBenchmarking.org
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 90K 180K 270K 360K 450K SE +/- 2411.53, N = 3 SE +/- 458.28, N = 3 SE +/- 1639.52, N = 3 421116.24 358938.94 357535.78 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 3 2 1 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.08, N = 9 SE +/- 0.56, N = 3 15.12 15.15 16.87 MIN: 10.68 / MAX: 32.84 MIN: 10.68 / MAX: 37.52 MIN: 12.35 / MAX: 32.6 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 2 1 3 3 6 9 12 15 SE +/- 0.18, N = 9 SE +/- 0.23, N = 3 SE +/- 0.19, N = 3 10.80 11.21 11.48 MIN: 7.12 / MAX: 30.62 MIN: 7.61 / MAX: 20.61 MIN: 7.15 / MAX: 28.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 3 2 1 110 220 330 440 550 SE +/- 5.01, N = 12 SE +/- 7.25, N = 12 SE +/- 7.74, N = 12 488.91 483.42 471.26 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 2 3 1 200 400 600 800 1000 SE +/- 8.04, N = 3 SE +/- 7.70, N = 14 SE +/- 12.60, N = 4 1014.54 990.45 979.57 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 3 1 2 80K 160K 240K 320K 400K SE +/- 681.76, N = 3 SE +/- 1522.91, N = 3 SE +/- 2058.89, N = 3 355245.38 352737.01 344615.09 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 3 2 1 8 16 24 32 40 SE +/- 0.33, N = 3 34 33 33 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 3 2 200 400 600 800 1000 SE +/- 5.24, N = 4 SE +/- 7.29, N = 14 SE +/- 20.07, N = 3 1101.04 1088.36 1072.23 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 3 2 4M 8M 12M 16M 20M SE +/- 146047.06, N = 3 SE +/- 112722.75, N = 3 SE +/- 347619.09, N = 3 20695700 20478857 20275343 1. (CXX) g++ options: -O3 -fopenmp
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 2 1 3 40 80 120 160 200 SE +/- 1.20, N = 3 159 158 156 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 3 2 1 12 24 36 48 60 SE +/- 0.28, N = 3 SE +/- 0.49, N = 15 SE +/- 0.51, N = 15 52.29 52.47 53.26 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 3 2 1 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.55, N = 3 85.15 84.60 83.67 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 3 2 90K 180K 270K 360K 450K SE +/- 1046.63, N = 3 SE +/- 419.84, N = 3 SE +/- 1892.58, N = 3 415048.13 410152.44 407879.64 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 3 1 2 2K 4K 6K 8K 10K SE +/- 25.26, N = 3 SE +/- 5.25, N = 3 SE +/- 101.36, N = 12 7911 7911 7776 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 3 1 130 260 390 520 650 SE +/- 2.93, N = 3 SE +/- 2.91, N = 3 SE +/- 9.17, N = 3 588.19 579.26 578.24 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 3 2 80K 160K 240K 320K 400K SE +/- 2101.11, N = 3 SE +/- 2849.61, N = 3 SE +/- 3451.68, N = 3 381871.14 377130.07 376442.55 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 3 1 2 70 140 210 280 350 SE +/- 0.18, N = 3 SE +/- 0.31, N = 3 SE +/- 0.81, N = 3 340.22 341.50 344.10 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 2 1 3 140 280 420 560 700 SE +/- 0.88, N = 3 SE +/- 6.51, N = 3 634 634 628 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 3 2 1 200 400 600 800 1000 SE +/- 6.65, N = 3 SE +/- 3.12, N = 3 SE +/- 1.25, N = 3 953.28 951.22 945.56 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 3 2 1 2K 4K 6K 8K 10K SE +/- 57.65, N = 3 SE +/- 19.24, N = 3 SE +/- 15.44, N = 3 10939.94 10883.31 10854.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 3 2 1 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 129 128 128 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 300 600 900 1200 1500 SE +/- 10.15, N = 3 SE +/- 8.53, N = 3 SE +/- 14.16, N = 3 1564.8 1563.9 1552.8 1. (CXX) g++ options: -O3 -march=native -rdynamic
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 3 1 2 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.28, N = 3 SE +/- 0.22, N = 9 101.69 101.72 102.44 MIN: 95.45 / MAX: 204.2 MIN: 95.14 / MAX: 163.22 MIN: 94.11 / MAX: 167.26 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ParaView Test: Wavelet Volume - Resolution: 1280 x 1024 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 3 2 1a 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 14.17 14.16 14.08
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 50 100 150 200 250 SE +/- 0.18, N = 3 SE +/- 0.78, N = 3 SE +/- 0.25, N = 3 219.66 220.77 221.06 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
ParaView Test: Wavelet Volume - Resolution: 1280 x 1024 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 3 2 1a 50 100 150 200 250 SE +/- 0.06, N = 3 SE +/- 0.21, N = 3 SE +/- 0.64, N = 3 226.76 226.57 225.32
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 2 3 1 20 40 60 80 100 SE +/- 0.11, N = 9 SE +/- 0.10, N = 3 SE +/- 0.79, N = 3 96.95 97.18 97.55 MIN: 81.72 / MAX: 187.63 MIN: 83.52 / MAX: 127.58 MIN: 87.88 / MAX: 155.84 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 3 2 1 0.4025 0.805 1.2075 1.61 2.0125 SE +/- 0.008, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 1.789 1.782 1.778
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 2 3 1 0.1474 0.2948 0.4422 0.5896 0.737 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.655 0.654 0.651
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 3 2 1 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 73.15 73.09 72.72 MIN: 69.24 / MAX: 78.42 MIN: 68.74 / MAX: 78.64 MIN: 68.17 / MAX: 78.13 1. (CC) gcc options: -pthread
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 3 2 1 90K 180K 270K 360K 450K SE +/- 1728.25, N = 3 SE +/- 1118.19, N = 3 SE +/- 227.22, N = 3 400763.15 400109.75 398623.91 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
CP2K Molecular Dynamics Fayalite-FIST Data OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.1 Fayalite-FIST Data 2 3 1 300 600 900 1200 1500 1510.86 1517.97 1518.82
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 3 2 1 40 80 120 160 200 SE +/- 0.72, N = 3 SE +/- 0.35, N = 3 SE +/- 1.03, N = 3 158.82 159.01 159.63 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 13.78 13.75 13.71
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 3 2 1 0.0448 0.0896 0.1344 0.1792 0.224 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.199 0.199 0.198
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 3 2 1 90 180 270 360 450 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 410 410 408 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 3 2 1 50 100 150 200 250 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 220 220 219 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 50 100 150 200 250 SE +/- 0.03, N = 3 SE +/- 0.37, N = 3 SE +/- 0.05, N = 3 220.41 220.08 219.41
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 3 2 1 200 400 600 800 1000 SE +/- 0.52, N = 12 SE +/- 1.22, N = 12 SE +/- 0.97, N = 12 788.59 786.57 785.17 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 3 2 1 20 40 60 80 100 SE +/- 0.52, N = 3 SE +/- 0.63, N = 3 SE +/- 0.77, N = 3 78.95 79.06 79.29 1. (CC) gcc options: -O2
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 2 3 1 70 140 210 280 350 SE +/- 0.29, N = 3 SE +/- 0.25, N = 3 SE +/- 0.24, N = 3 313.24 313.34 314.38 MIN: 310.16 / MAX: 315.44 MIN: 311.05 / MAX: 316.02 MIN: 311.58 / MAX: 317.04 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 2 1 3 30M 60M 90M 120M 150M SE +/- 61978.95, N = 3 SE +/- 41688.42, N = 3 SE +/- 528736.77, N = 3 126339000 126186033 125923000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 3 2 1 60 120 180 240 300 SE +/- 0.43, N = 3 SE +/- 0.40, N = 3 SE +/- 0.64, N = 3 273.34 273.14 272.48 MIN: 243.22 / MAX: 296.72 MIN: 240.8 / MAX: 297.76 MIN: 235.99 / MAX: 296.83 1. (CC) gcc options: -pthread
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 3 2 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.22, N = 3 SE +/- 0.22, N = 3 322.19 322.57 323.20 MIN: 318.2 / MAX: 334.81 MIN: 319.05 / MAX: 330.16 MIN: 319.57 / MAX: 330.9 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 2 3 1 70 140 210 280 350 SE +/- 0.14, N = 3 SE +/- 0.54, N = 3 SE +/- 0.68, N = 3 317.35 318.32 318.34
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 2 3 1 300 600 900 1200 1500 SE +/- 2.96, N = 3 SE +/- 2.33, N = 3 1435 1432 1431 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 2 1 3 90 180 270 360 450 SE +/- 0.40, N = 3 SE +/- 0.19, N = 3 SE +/- 0.25, N = 3 435.69 436.41 436.86 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 120 240 360 480 600 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 548.48 548.21 547.24 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 3 2 1 2 4 6 8 10 SE +/- 0.031, N = 3 SE +/- 0.019, N = 3 SE +/- 0.055, N = 3 7.615 7.629 7.632 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 3 2 100 200 300 400 500 SE +/- 0.67, N = 3 474 473 473 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 3 2 1 200 400 600 800 1000 SE +/- 0.73, N = 3 SE +/- 1.63, N = 3 SE +/- 0.46, N = 3 789.53 791.02 791.19 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 3 2 1 0.1179 0.2358 0.3537 0.4716 0.5895 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.524 0.524 0.523
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 3 2 1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 22.63 22.65 22.67 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 3 2 120 240 360 480 600 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 545 544 544 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 2 3 1 200 400 600 800 1000 SE +/- 1.73, N = 15 SE +/- 1.28, N = 15 SE +/- 2.07, N = 12 905.47 904.84 903.89 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 2 3 1 60 120 180 240 300 SE +/- 0.73, N = 3 SE +/- 0.62, N = 3 SE +/- 0.30, N = 3 268.42 268.59 268.85 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 3 2 1 130 260 390 520 650 SE +/- 0.43, N = 3 SE +/- 0.29, N = 3 SE +/- 0.30, N = 3 612.01 612.76 612.98 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lspecie -lfiniteVolume -lfvOptions -lgenericPatchFields -lmeshTools -lsampling -lOpenFOAM -ldl -lm
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 3 2 1 60 120 180 240 300 SE +/- 0.39, N = 3 SE +/- 0.63, N = 3 SE +/- 0.39, N = 3 294.20 293.88 293.74 MIN: 220.72 / MAX: 482.19 MIN: 220.85 / MAX: 496.89 MIN: 220.61 / MAX: 495.08 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 2 3 1 16 32 48 64 80 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 73.01 72.96 72.90 MIN: 47.57 / MAX: 170.84 MIN: 47.59 / MAX: 170.26 MIN: 47.55 / MAX: 170.43 1. (CC) gcc options: -pthread
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 3 1 2 400 800 1200 1600 2000 SE +/- 0.73, N = 3 SE +/- 2.19, N = 3 SE +/- 2.18, N = 3 1782 1781 1780 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 2 3 1 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 3.51, N = 3 SE +/- 1.53, N = 3 1363 1362 1362 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 3 2 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.58 14.57 14.57
ParaView Test: Wavelet Contour - Resolution: 1280 x 1024 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 3 2 1a 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.69 14.69 14.68
ParaView Test: Wavelet Contour - Resolution: 1280 x 1024 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 2 3 1a 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 153.05 153.05 152.99
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 2 3 1 300 600 900 1200 1500 SE +/- 0.43, N = 3 SE +/- 1.52, N = 3 SE +/- 0.59, N = 3 1386.95 1387.33 1387.52 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 3 2 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 151.87 151.85 151.85
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 3 2 1 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 21 21 21 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 3 2 1 40 80 120 160 200 SE +/- 0.33, N = 3 200 200 200 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 3 2 1 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 415 415 415 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 3 2 1 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 82 82 82 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 3 2 1 16 32 48 64 80 73 73 73 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 3 2 1 20 40 60 80 100 106 106 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 3 2 4 8 12 16 20 SE +/- 0.24, N = 3 SE +/- 0.13, N = 3 SE +/- 0.44, N = 9 16.23 16.47 16.90 MIN: 13.32 / MAX: 35.03 MIN: 12.74 / MAX: 36.16 MIN: 12.9 / MAX: 36.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 2 3 1 100 200 300 400 500 SE +/- 8.96, N = 15 SE +/- 6.61, N = 15 SE +/- 8.75, N = 12 483.63 475.96 475.09 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 3 2 0.6604 1.3208 1.9812 2.6416 3.302 SE +/- 0.037, N = 3 SE +/- 0.052, N = 15 SE +/- 0.065, N = 12 2.935 2.924 2.879 1. (CXX) g++ options: -O3 -pthread -lm
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 1 3 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 7.64, N = 3 SE +/- 9.19, N = 15 578.92 552.07 547.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Phoronix Test Suite v10.8.4