Intel Core i7-4960X testing with a MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) and AMD FirePro V4800 1GB on Ubuntu 20.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2102057-HA-4960X202146 4960x-2021 - Phoronix Test Suite 4960x-2021 Intel Core i7-4960X testing with a MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) and AMD FirePro V4800 1GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102057-HA-4960X202146&gru&sro .
4960x-2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 1a 2 3 Intel Core i7-4960X @ 4.00GHz (6 Cores / 12 Threads) MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) Intel Xeon E7 v2/Xeon 8GB VisionTek 240GB AMD FirePro V4800 1GB Realtek ALC892 DELL S2409W Realtek RTL8111/8168/8411 Ubuntu 20.10 5.8.0-33-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 3.3 Mesa 20.2.1 (LLVM 11.0.0) GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - 1, 2, 3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42e Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Graphics Details - 1a, 2, 3: GLAMOR
4960x-2021 amg: dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit paraview: Wavelet Volume - 1280 x 1024 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Contour - 1280 x 1024 paraview: Wavelet Contour - 1920 x 1080 rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU askap: Hogbom Clean OpenMP lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression quantlib: askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding paraview: Wavelet Contour - 1280 x 1024 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Volume - 1280 x 1024 paraview: Wavelet Volume - 1920 x 1080 askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding lammps: Rhodopsin Protein redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET kripke: npb: EP.C npb: EP.D npb: LU.C synthmark: VoiceMark_100 lulesh: pennant: sedovbig pennant: leblancbig mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 cloverleaf: Lagrangian-Eulerian Hydrodynamics cp2k: Fayalite-FIST Data openfoam: Motorbike 30M build-godot: Time To Compile gcrypt: webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression gnupg: 2.7GB Sample File Encryption qmcpack: simple-H2O 1 1a 2 3 126186033 293.74 72.72 272.48 72.90 0.198 0.523 0.651 1.778 128 219 21 7911 1781 83.6658 33 106 408 1362 73 1431 82 474 415 545 158 634 200 1564.8 471.256 785.172 475.089 903.892 979.569 1101.04 2.935 421116.24 398623.91 352737.01 415048.13 381871.14 20695700 552.07 578.24 10854.09 548.479 945.56350 219.6609 159.6349 16.227 97.548 11.214 16.874 101.724 322.186 314.380 341.50 1518.815 612.98 318.343 268.853 7.632 436.407 791.188 22.671 1387.515 79.291 53.256 14.08 13.78 14.68 14.58 152.989 151.870 225.321 220.405 126339000 293.88 73.09 273.14 73.01 14.16 13.75 14.69 14.57 0.199 0.524 0.655 1.782 128 220 21 7776 1780 84.6029 33 106 410 1363 73 1435 82 473 415 544 159 634 200 1563.9 483.420 786.565 483.634 905.474 153.054 151.848 226.566 220.081 1014.54 1072.23 2.879 358938.94 400109.75 344615.09 407879.64 376442.55 20275343 578.92 588.19 10883.31 548.206 951.21757 220.7694 159.0102 16.901 96.945 10.799 15.146 102.443 323.197 313.238 344.10 1510.857 612.76 317.350 268.424 7.629 435.691 791.017 22.651 1386.947 79.061 52.471 125923000 294.20 73.15 273.34 72.96 14.17 13.71 14.69 14.57 0.199 0.524 0.654 1.789 129 220 21 7911 1782 85.1549 34 106 410 1362 73 1432 82 473 415 544 156 628 200 1552.8 488.906 788.586 475.962 904.838 153.050 151.852 226.756 219.405 990.445 1088.36 2.924 357535.78 400763.15 355245.38 410152.44 377130.07 20478857 547.09 579.26 10939.94 547.240 953.28322 221.0630 158.8181 16.466 97.179 11.476 15.124 101.691 322.574 313.335 340.22 1517.97 612.01 318.316 268.594 7.615 436.859 789.525 22.629 1387.329 78.954 52.293 OpenBenchmarking.org
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 2 3 30M 60M 90M 120M 150M SE +/- 41688.42, N = 3 SE +/- 61978.95, N = 3 SE +/- 528736.77, N = 3 126186033 126339000 125923000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 1 2 3 60 120 180 240 300 SE +/- 0.39, N = 3 SE +/- 0.63, N = 3 SE +/- 0.39, N = 3 293.74 293.88 294.20 MIN: 220.61 / MAX: 495.08 MIN: 220.85 / MAX: 496.89 MIN: 220.72 / MAX: 482.19 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 1 2 3 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 72.72 73.09 73.15 MIN: 68.17 / MAX: 78.13 MIN: 68.74 / MAX: 78.64 MIN: 69.24 / MAX: 78.42 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 3 60 120 180 240 300 SE +/- 0.64, N = 3 SE +/- 0.40, N = 3 SE +/- 0.43, N = 3 272.48 273.14 273.34 MIN: 235.99 / MAX: 296.83 MIN: 240.8 / MAX: 297.76 MIN: 243.22 / MAX: 296.72 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 1 2 3 16 32 48 64 80 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 72.90 73.01 72.96 MIN: 47.55 / MAX: 170.43 MIN: 47.57 / MAX: 170.84 MIN: 47.59 / MAX: 170.26 1. (CC) gcc options: -pthread
ParaView Test: Wavelet Volume - Resolution: 1280 x 1024 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 1a 2 3 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 14.08 14.16 14.17
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 13.78 13.75 13.71
ParaView Test: Wavelet Contour - Resolution: 1280 x 1024 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 1a 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.68 14.69 14.69
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.58 14.57 14.57
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 0.0448 0.0896 0.1344 0.1792 0.224 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.198 0.199 0.199
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 0.1179 0.2358 0.3537 0.4716 0.5895 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.523 0.524 0.524
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 0.1474 0.2948 0.4422 0.5896 0.737 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 0.651 0.655 0.654
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 0.4025 0.805 1.2075 1.61 2.0125 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 SE +/- 0.008, N = 3 1.778 1.782 1.789
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 30 60 90 120 150 SE +/- 0.17, N = 3 SE +/- 0.00, N = 3 128 128 129 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 50 100 150 200 250 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 SE +/- 0.00, N = 3 219 220 220 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 21 21 21 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 5.25, N = 3 SE +/- 101.36, N = 12 SE +/- 25.26, N = 3 7911 7776 7911 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 400 800 1200 1600 2000 SE +/- 2.19, N = 3 SE +/- 2.18, N = 3 SE +/- 0.73, N = 3 1781 1780 1782 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 20 40 60 80 100 SE +/- 0.55, N = 3 SE +/- 0.15, N = 3 SE +/- 0.09, N = 3 83.67 84.60 85.15 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 8 16 24 32 40 SE +/- 0.33, N = 3 33 33 34 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 20 40 60 80 100 106 106 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 90 180 270 360 450 SE +/- 0.67, N = 3 SE +/- 1.00, N = 3 408 410 410 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 300 600 900 1200 1500 SE +/- 1.53, N = 3 SE +/- 0.33, N = 3 SE +/- 3.51, N = 3 1362 1363 1362 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 16 32 48 64 80 73 73 73 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 300 600 900 1200 1500 SE +/- 2.33, N = 3 SE +/- 2.96, N = 3 1431 1435 1432 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 82 82 82 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 100 200 300 400 500 SE +/- 0.67, N = 3 474 473 473 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 415 415 415 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 120 240 360 480 600 SE +/- 1.20, N = 3 SE +/- 1.67, N = 3 545 544 544 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 1.20, N = 3 158 159 156 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 140 280 420 560 700 SE +/- 0.88, N = 3 SE +/- 6.51, N = 3 634 634 628 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 0.33, N = 3 200 200 200 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 300 600 900 1200 1500 SE +/- 10.15, N = 3 SE +/- 8.53, N = 3 SE +/- 14.16, N = 3 1564.8 1563.9 1552.8 1. (CXX) g++ options: -O3 -march=native -rdynamic
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 110 220 330 440 550 SE +/- 7.74, N = 12 SE +/- 7.25, N = 12 SE +/- 5.01, N = 12 471.26 483.42 488.91 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 200 400 600 800 1000 SE +/- 0.97, N = 12 SE +/- 1.22, N = 12 SE +/- 0.52, N = 12 785.17 786.57 788.59 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 100 200 300 400 500 SE +/- 8.75, N = 12 SE +/- 8.96, N = 15 SE +/- 6.61, N = 15 475.09 483.63 475.96 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 200 400 600 800 1000 SE +/- 2.07, N = 12 SE +/- 1.73, N = 15 SE +/- 1.28, N = 15 903.89 905.47 904.84 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ParaView Test: Wavelet Contour - Resolution: 1280 x 1024 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 1a 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 152.99 153.05 153.05
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 2 3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 151.87 151.85 151.85
ParaView Test: Wavelet Volume - Resolution: 1280 x 1024 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 1a 2 3 50 100 150 200 250 SE +/- 0.64, N = 3 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 225.32 226.57 226.76
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 50 100 150 200 250 SE +/- 0.03, N = 3 SE +/- 0.37, N = 3 SE +/- 0.05, N = 3 220.41 220.08 219.41
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 2 3 200 400 600 800 1000 SE +/- 12.60, N = 4 SE +/- 8.04, N = 3 SE +/- 7.70, N = 14 979.57 1014.54 990.45 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 2 3 200 400 600 800 1000 SE +/- 5.24, N = 4 SE +/- 20.07, N = 3 SE +/- 7.29, N = 14 1101.04 1072.23 1088.36 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 0.6604 1.3208 1.9812 2.6416 3.302 SE +/- 0.037, N = 3 SE +/- 0.065, N = 12 SE +/- 0.052, N = 15 2.935 2.879 2.924 1. (CXX) g++ options: -O3 -pthread -lm
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 90K 180K 270K 360K 450K SE +/- 2411.53, N = 3 SE +/- 458.28, N = 3 SE +/- 1639.52, N = 3 421116.24 358938.94 357535.78 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 90K 180K 270K 360K 450K SE +/- 227.22, N = 3 SE +/- 1118.19, N = 3 SE +/- 1728.25, N = 3 398623.91 400109.75 400763.15 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 80K 160K 240K 320K 400K SE +/- 1522.91, N = 3 SE +/- 2058.89, N = 3 SE +/- 681.76, N = 3 352737.01 344615.09 355245.38 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 90K 180K 270K 360K 450K SE +/- 1046.63, N = 3 SE +/- 1892.58, N = 3 SE +/- 419.84, N = 3 415048.13 407879.64 410152.44 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 80K 160K 240K 320K 400K SE +/- 2101.11, N = 3 SE +/- 3451.68, N = 3 SE +/- 2849.61, N = 3 381871.14 376442.55 377130.07 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 2 3 4M 8M 12M 16M 20M SE +/- 146047.06, N = 3 SE +/- 347619.09, N = 3 SE +/- 112722.75, N = 3 20695700 20275343 20478857 1. (CXX) g++ options: -O3 -fopenmp
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 130 260 390 520 650 SE +/- 7.64, N = 3 SE +/- 1.66, N = 3 SE +/- 9.19, N = 15 552.07 578.92 547.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 130 260 390 520 650 SE +/- 9.17, N = 3 SE +/- 2.93, N = 3 SE +/- 2.91, N = 3 578.24 588.19 579.26 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 2K 4K 6K 8K 10K SE +/- 15.44, N = 3 SE +/- 19.24, N = 3 SE +/- 57.65, N = 3 10854.09 10883.31 10939.94 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 120 240 360 480 600 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 548.48 548.21 547.24 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 1 2 3 200 400 600 800 1000 SE +/- 1.25, N = 3 SE +/- 3.12, N = 3 SE +/- 6.65, N = 3 945.56 951.22 953.28 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 50 100 150 200 250 SE +/- 0.18, N = 3 SE +/- 0.78, N = 3 SE +/- 0.25, N = 3 219.66 220.77 221.06 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 40 80 120 160 200 SE +/- 1.03, N = 3 SE +/- 0.35, N = 3 SE +/- 0.72, N = 3 159.63 159.01 158.82 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 4 8 12 16 20 SE +/- 0.24, N = 3 SE +/- 0.44, N = 9 SE +/- 0.13, N = 3 16.23 16.90 16.47 MIN: 13.32 / MAX: 35.03 MIN: 12.9 / MAX: 36.5 MIN: 12.74 / MAX: 36.16 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 20 40 60 80 100 SE +/- 0.79, N = 3 SE +/- 0.11, N = 9 SE +/- 0.10, N = 3 97.55 96.95 97.18 MIN: 87.88 / MAX: 155.84 MIN: 81.72 / MAX: 187.63 MIN: 83.52 / MAX: 127.58 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 3 6 9 12 15 SE +/- 0.23, N = 3 SE +/- 0.18, N = 9 SE +/- 0.19, N = 3 11.21 10.80 11.48 MIN: 7.61 / MAX: 20.61 MIN: 7.12 / MAX: 30.62 MIN: 7.15 / MAX: 28.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 4 8 12 16 20 SE +/- 0.56, N = 3 SE +/- 0.08, N = 9 SE +/- 0.09, N = 3 16.87 15.15 15.12 MIN: 12.35 / MAX: 32.6 MIN: 10.68 / MAX: 37.52 MIN: 10.68 / MAX: 32.84 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 20 40 60 80 100 SE +/- 0.28, N = 3 SE +/- 0.22, N = 9 SE +/- 0.15, N = 3 101.72 102.44 101.69 MIN: 95.14 / MAX: 163.22 MIN: 94.11 / MAX: 167.26 MIN: 95.45 / MAX: 204.2 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.22, N = 3 SE +/- 0.22, N = 3 322.19 323.20 322.57 MIN: 318.2 / MAX: 334.81 MIN: 319.57 / MAX: 330.9 MIN: 319.05 / MAX: 330.16 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 70 140 210 280 350 SE +/- 0.24, N = 3 SE +/- 0.29, N = 3 SE +/- 0.25, N = 3 314.38 313.24 313.34 MIN: 311.58 / MAX: 317.04 MIN: 310.16 / MAX: 315.44 MIN: 311.05 / MAX: 316.02 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 70 140 210 280 350 SE +/- 0.31, N = 3 SE +/- 0.81, N = 3 SE +/- 0.18, N = 3 341.50 344.10 340.22 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CP2K Molecular Dynamics Fayalite-FIST Data OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.1 Fayalite-FIST Data 1 2 3 300 600 900 1200 1500 1518.82 1510.86 1517.97
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 1 2 3 130 260 390 520 650 SE +/- 0.30, N = 3 SE +/- 0.29, N = 3 SE +/- 0.43, N = 3 612.98 612.76 612.01 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lspecie -lfiniteVolume -lfvOptions -lgenericPatchFields -lmeshTools -lsampling -lOpenFOAM -ldl -lm
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 70 140 210 280 350 SE +/- 0.68, N = 3 SE +/- 0.14, N = 3 SE +/- 0.54, N = 3 318.34 317.35 318.32
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 60 120 180 240 300 SE +/- 0.30, N = 3 SE +/- 0.73, N = 3 SE +/- 0.62, N = 3 268.85 268.42 268.59 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 2 4 6 8 10 SE +/- 0.055, N = 3 SE +/- 0.019, N = 3 SE +/- 0.031, N = 3 7.632 7.629 7.615 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 90 180 270 360 450 SE +/- 0.19, N = 3 SE +/- 0.40, N = 3 SE +/- 0.25, N = 3 436.41 435.69 436.86 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 200 400 600 800 1000 SE +/- 0.46, N = 3 SE +/- 1.63, N = 3 SE +/- 0.73, N = 3 791.19 791.02 789.53 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 22.67 22.65 22.63 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 300 600 900 1200 1500 SE +/- 0.59, N = 3 SE +/- 0.43, N = 3 SE +/- 1.52, N = 3 1387.52 1386.95 1387.33 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 20 40 60 80 100 SE +/- 0.77, N = 3 SE +/- 0.63, N = 3 SE +/- 0.52, N = 3 79.29 79.06 78.95 1. (CC) gcc options: -O2
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 12 24 36 48 60 SE +/- 0.51, N = 15 SE +/- 0.49, N = 15 SE +/- 0.28, N = 3 53.26 52.47 52.29 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Phoronix Test Suite v10.8.4