Intel Core i7-4960X testing with a MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) and AMD FirePro V4800 1GB on Ubuntu 20.10 via the Phoronix Test Suite.
1 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42ePython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
1a Processor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42eGraphics Notes: GLAMORPython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
2 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42eGraphics Notes: GLAMORPython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3 Processor: Intel Core i7-4960X @ 4.00GHz (6 Cores / 12 Threads), Motherboard: MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 8GB, Disk: VisionTek 240GB, Graphics: AMD FirePro V4800 1GB, Audio: Realtek ALC892, Monitor: DELL S2409W, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, OpenGL: 3.3 Mesa 20.2.1 (LLVM 11.0.0), Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
4960x-2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 1a 2 3 Intel Core i7-4960X @ 4.00GHz (6 Cores / 12 Threads) MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS) Intel Xeon E7 v2/Xeon 8GB VisionTek 240GB AMD FirePro V4800 1GB Realtek ALC892 DELL S2409W Realtek RTL8111/8168/8411 Ubuntu 20.10 5.8.0-33-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 3.3 Mesa 20.2.1 (LLVM 11.0.0) GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - 1, 2, 3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42e Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Graphics Details - 1a, 2, 3: GLAMOR
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 3 2 1 20 40 60 80 100 106 106 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 3 2 1 90 180 270 360 450 SE +/- 1.00, N = 3 SE +/- 0.67, N = 3 410 410 408 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 2 3 1 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 3.51, N = 3 SE +/- 1.53, N = 3 1363 1362 1362 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 3 2 1 16 32 48 64 80 73 73 73 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 2 3 1 300 600 900 1200 1500 SE +/- 2.96, N = 3 SE +/- 2.33, N = 3 1435 1432 1431 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 3 2 1 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 82 82 82 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 3 2 100 200 300 400 500 SE +/- 0.67, N = 3 474 473 473 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 3 2 1 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 415 415 415 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 3 2 120 240 360 480 600 SE +/- 1.67, N = 3 SE +/- 1.20, N = 3 545 544 544 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 2 1 3 40 80 120 160 200 SE +/- 1.20, N = 3 159 158 156 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 2 1 3 140 280 420 560 700 SE +/- 0.88, N = 3 SE +/- 6.51, N = 3 634 634 628 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 3 2 1 40 80 120 160 200 SE +/- 0.33, N = 3 200 200 200 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 3 2 1 2 4 6 8 10 SE +/- 0.031, N = 3 SE +/- 0.019, N = 3 SE +/- 0.055, N = 3 7.615 7.629 7.632 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 2 1 3 90 180 270 360 450 SE +/- 0.40, N = 3 SE +/- 0.19, N = 3 SE +/- 0.25, N = 3 435.69 436.41 436.86 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 3 2 1 200 400 600 800 1000 SE +/- 0.73, N = 3 SE +/- 1.63, N = 3 SE +/- 0.46, N = 3 789.53 791.02 791.19 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 3 2 1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 22.63 22.65 22.67 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 2 3 1 300 600 900 1200 1500 SE +/- 0.43, N = 3 SE +/- 1.52, N = 3 SE +/- 0.59, N = 3 1386.95 1387.33 1387.52 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 120 240 360 480 600 SE +/- 0.15, N = 3 SE +/- 0.33, N = 3 SE +/- 0.34, N = 3 548.48 548.21 547.24 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Gcrypt Library Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 2 3 1 60 120 180 240 300 SE +/- 0.73, N = 3 SE +/- 0.62, N = 3 SE +/- 0.30, N = 3 268.42 268.59 268.85 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 300 600 900 1200 1500 SE +/- 10.15, N = 3 SE +/- 8.53, N = 3 SE +/- 14.16, N = 3 1564.8 1563.9 1552.8 1. (CXX) g++ options: -O3 -march=native -rdynamic
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 3 1 2 70 140 210 280 350 SE +/- 0.18, N = 3 SE +/- 0.31, N = 3 SE +/- 0.81, N = 3 340.22 341.50 344.10 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Mobile Neural Network MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 3 2 4 8 12 16 20 SE +/- 0.24, N = 3 SE +/- 0.13, N = 3 SE +/- 0.44, N = 9 16.23 16.47 16.90 MIN: 13.32 / MAX: 35.03 MIN: 12.74 / MAX: 36.16 MIN: 12.9 / MAX: 36.5 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 2 3 1 20 40 60 80 100 SE +/- 0.11, N = 9 SE +/- 0.10, N = 3 SE +/- 0.79, N = 3 96.95 97.18 97.55 MIN: 81.72 / MAX: 187.63 MIN: 83.52 / MAX: 127.58 MIN: 87.88 / MAX: 155.84 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 2 1 3 3 6 9 12 15 SE +/- 0.18, N = 9 SE +/- 0.23, N = 3 SE +/- 0.19, N = 3 10.80 11.21 11.48 MIN: 7.12 / MAX: 30.62 MIN: 7.61 / MAX: 20.61 MIN: 7.15 / MAX: 28.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 3 2 1 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.08, N = 9 SE +/- 0.56, N = 3 15.12 15.15 16.87 MIN: 10.68 / MAX: 32.84 MIN: 10.68 / MAX: 37.52 MIN: 12.35 / MAX: 32.6 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 3 1 2 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.28, N = 3 SE +/- 0.22, N = 9 101.69 101.72 102.44 MIN: 95.45 / MAX: 204.2 MIN: 95.14 / MAX: 163.22 MIN: 94.11 / MAX: 167.26 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 3 2 1 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 129 128 128 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 3 2 1 50 100 150 200 250 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 220 220 219 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 3 2 1 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 21 21 21 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 3 1 2 2K 4K 6K 8K 10K SE +/- 25.26, N = 3 SE +/- 5.25, N = 3 SE +/- 101.36, N = 12 7911 7911 7776 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 3 1 2 400 800 1200 1600 2000 SE +/- 0.73, N = 3 SE +/- 2.19, N = 3 SE +/- 2.18, N = 3 1782 1781 1780 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 3 2 70 140 210 280 350 SE +/- 0.46, N = 3 SE +/- 0.22, N = 3 SE +/- 0.22, N = 3 322.19 322.57 323.20 MIN: 318.2 / MAX: 334.81 MIN: 319.05 / MAX: 330.16 MIN: 319.57 / MAX: 330.9 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 2 3 1 70 140 210 280 350 SE +/- 0.29, N = 3 SE +/- 0.25, N = 3 SE +/- 0.24, N = 3 313.24 313.34 314.38 MIN: 310.16 / MAX: 315.44 MIN: 311.05 / MAX: 316.02 MIN: 311.58 / MAX: 317.04 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 1 3 130 260 390 520 650 SE +/- 1.66, N = 3 SE +/- 7.64, N = 3 SE +/- 9.19, N = 15 578.92 552.07 547.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 3 1 130 260 390 520 650 SE +/- 2.93, N = 3 SE +/- 2.91, N = 3 SE +/- 9.17, N = 3 588.19 579.26 578.24 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 3 2 1 2K 4K 6K 8K 10K SE +/- 57.65, N = 3 SE +/- 19.24, N = 3 SE +/- 15.44, N = 3 10939.94 10883.31 10854.09 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 3 2 1 110 220 330 440 550 SE +/- 5.01, N = 12 SE +/- 7.25, N = 12 SE +/- 7.74, N = 12 488.91 483.42 471.26 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 3 2 1 200 400 600 800 1000 SE +/- 0.52, N = 12 SE +/- 1.22, N = 12 SE +/- 0.97, N = 12 788.59 786.57 785.17 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 2 3 1 200 400 600 800 1000 SE +/- 8.04, N = 3 SE +/- 7.70, N = 14 SE +/- 12.60, N = 4 1014.54 990.45 979.57 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 3 2 200 400 600 800 1000 SE +/- 5.24, N = 4 SE +/- 7.29, N = 14 SE +/- 20.07, N = 3 1101.04 1088.36 1072.23 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 2 3 1 100 200 300 400 500 SE +/- 8.96, N = 15 SE +/- 6.61, N = 15 SE +/- 8.75, N = 12 483.63 475.96 475.09 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 2 3 1 200 400 600 800 1000 SE +/- 1.73, N = 15 SE +/- 1.28, N = 15 SE +/- 2.07, N = 12 905.47 904.84 903.89 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 3 2 1 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.55, N = 3 85.15 84.60 83.67 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 3 2 1 40 80 120 160 200 SE +/- 0.72, N = 3 SE +/- 0.35, N = 3 SE +/- 1.03, N = 3 158.82 159.01 159.63 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 2 1 3 30M 60M 90M 120M 150M SE +/- 61978.95, N = 3 SE +/- 41688.42, N = 3 SE +/- 528736.77, N = 3 126339000 126186033 125923000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 3 2 4M 8M 12M 16M 20M SE +/- 146047.06, N = 3 SE +/- 112722.75, N = 3 SE +/- 347619.09, N = 3 20695700 20478857 20275343 1. (CXX) g++ options: -O3 -fopenmp
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 3 2 1 130 260 390 520 650 SE +/- 0.43, N = 3 SE +/- 0.29, N = 3 SE +/- 0.30, N = 3 612.01 612.76 612.98 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lspecie -lfiniteVolume -lfvOptions -lgenericPatchFields -lmeshTools -lsampling -lOpenFOAM -ldl -lm
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 3 2 1 12 24 36 48 60 SE +/- 0.28, N = 3 SE +/- 0.49, N = 15 SE +/- 0.51, N = 15 52.29 52.47 53.26 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 3 2 1 60 120 180 240 300 SE +/- 0.39, N = 3 SE +/- 0.63, N = 3 SE +/- 0.39, N = 3 294.20 293.88 293.74 MIN: 220.72 / MAX: 482.19 MIN: 220.85 / MAX: 496.89 MIN: 220.61 / MAX: 495.08 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 3 2 1 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 73.15 73.09 72.72 MIN: 69.24 / MAX: 78.42 MIN: 68.74 / MAX: 78.64 MIN: 68.17 / MAX: 78.13 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 3 2 1 60 120 180 240 300 SE +/- 0.43, N = 3 SE +/- 0.40, N = 3 SE +/- 0.64, N = 3 273.34 273.14 272.48 MIN: 243.22 / MAX: 296.72 MIN: 240.8 / MAX: 297.76 MIN: 235.99 / MAX: 296.83 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 2 3 1 16 32 48 64 80 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 73.01 72.96 72.90 MIN: 47.57 / MAX: 170.84 MIN: 47.59 / MAX: 170.26 MIN: 47.55 / MAX: 170.43 1. (CC) gcc options: -pthread
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 3 2 1 0.1179 0.2358 0.3537 0.4716 0.5895 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.524 0.524 0.523
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 2 3 1 0.1474 0.2948 0.4422 0.5896 0.737 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.655 0.654 0.651
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 3 2 1 0.4025 0.805 1.2075 1.61 2.0125 SE +/- 0.008, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 1.789 1.782 1.778
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 3 2 1 90K 180K 270K 360K 450K SE +/- 1728.25, N = 3 SE +/- 1118.19, N = 3 SE +/- 227.22, N = 3 400763.15 400109.75 398623.91 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 3 1 2 80K 160K 240K 320K 400K SE +/- 681.76, N = 3 SE +/- 1522.91, N = 3 SE +/- 2058.89, N = 3 355245.38 352737.01 344615.09 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 3 2 90K 180K 270K 360K 450K SE +/- 1046.63, N = 3 SE +/- 419.84, N = 3 SE +/- 1892.58, N = 3 415048.13 410152.44 407879.64 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 3 2 80K 160K 240K 320K 400K SE +/- 2101.11, N = 3 SE +/- 2849.61, N = 3 SE +/- 3451.68, N = 3 381871.14 377130.07 376442.55 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ParaView This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 3 2 1a 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 14.17 14.16 14.08
OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1280 x 1024 3 2 1a 50 100 150 200 250 SE +/- 0.06, N = 3 SE +/- 0.21, N = 3 SE +/- 0.64, N = 3 226.76 226.57 225.32
OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 13.78 13.75 13.71
OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1a 2 3 50 100 150 200 250 SE +/- 0.03, N = 3 SE +/- 0.37, N = 3 SE +/- 0.05, N = 3 220.41 220.08 219.41
OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 3 2 1a 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.69 14.69 14.68
OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1280 x 1024 2 3 1a 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 153.05 153.05 152.99
OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 3 2 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 14.58 14.57 14.57
OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1a 3 2 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 151.87 151.85 151.85
1 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42ePython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 4 February 2021 08:43 by user phoronix.
1a Processor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42eGraphics Notes: GLAMORPython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 4 February 2021 17:44 by user phoronix.
2 Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42eGraphics Notes: GLAMORPython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 4 February 2021 18:09 by user phoronix.
3 Processor: Intel Core i7-4960X @ 4.00GHz (6 Cores / 12 Threads), Motherboard: MSI X79MA-GD45 (MS-7738) v1.0 (V3.8 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 8GB, Disk: VisionTek 240GB, Graphics: AMD FirePro V4800 1GB, Audio: Realtek ALC892, Monitor: DELL S2409W, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: modesetting 1.20.9, OpenGL: 3.3 Mesa 20.2.1 (LLVM 11.0.0), Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x42eGraphics Notes: GLAMORPython Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 5 February 2021 05:20 by user phoronix.