Intel Core i3-7100 testing with a Gigabyte B250M-DS3H-CF (F9 BIOS) and Gigabyte Intel HD 630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.
1 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
2 3 Processor: Intel Core i3-7100 @ 3.90GHz (2 Cores / 4 Threads), Motherboard: Gigabyte B250M-DS3H-CF (F9 BIOS), Chipset: Intel Xeon E3-1200 v6/7th + B250, Memory: 8GB, Disk: 250GB Western Digital WDS250G1B0A-, Graphics: Gigabyte Intel HD 630 3GB (1100MHz), Audio: Realtek ALC887-VD, Monitor: VA2431, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: gigabyte, OpenGL: 4.6 Mesa 20.2.1, Vulkan: 1.2.145, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Core i3 7100 EO January Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Core i3-7100 @ 3.90GHz (2 Cores / 4 Threads) Gigabyte B250M-DS3H-CF (F9 BIOS) Intel Xeon E3-1200 v6/7th + B250 8GB 250GB Western Digital WDS250G1B0A- Gigabyte Intel HD 630 3GB (1100MHz) Realtek ALC887-VD VA2431 Realtek RTL8111/8168/8411 Ubuntu 20.10 5.8.0-33-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 gigabyte 4.6 Mesa 20.2.1 1.2.145 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3 Python Details - Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
1 2 3 Result Overview Phoronix Test Suite 100% 102% 104% 106% Mobile Neural Network OpenFOAM WebP2 Image Encode QMCPACK CloverLeaf Redis Algebraic Multi-Grid Benchmark ONNX Runtime rav1e Gcrypt Library Timed Godot Game Engine Compilation Kripke GnuPG lzbench Etcpak QuantLib ASKAP TNN Google SynthMark LAMMPS Molecular Dynamics Simulator dav1d LULESH CP2K Molecular Dynamics
Core i3 7100 EO January lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression synthmark: VoiceMark_100 gcrypt: quantlib: cloverleaf: Lagrangian-Eulerian Hydrodynamics mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 lammps: Rhodopsin Protein askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP amg: kripke: lulesh: openfoam: Motorbike 30M qmcpack: simple-H2O cp2k: Fayalite-FIST Data dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 build-godot: Time To Compile redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET gnupg: 2.7GB Sample File Encryption 1 2 3 37 105 451 1595 61 1680 77 445 420 557 164 640 214 1195.626 297.424 164.375 279.313 17.463 1100.373 2039.060 65.028 4160.509 609.070 230.077 2150.8 242.63 11.976 71.809 6.650 9.900 77.011 137 198 22 11621 1454 355.868 346.345 1.499 813.829 880.670 562.285 669.514 664.589 1020.14 84.4596 180021600 18212867 1172.1813 507.18 29.498 1182.577 187.04 47.53 168.12 40.55 0.249 0.766 1.015 2.413 634.394 2186502.30 1730998.80 1337301.15 2146914.81 1512907.54 73.623 37 105 453 1602 61 1686 77 443 426 557 165 642 214 1208.166 298.058 164.646 279.967 17.343 1064.783 1955.616 63.658 4148.686 610.056 231.610 2153.1 237.84 14.285 72.550 8.051 9.744 77.422 140 198 21 11674 1432 356.742 347.077 1.503 815.277 882.871 563.952 669.423 660.151 1016.24 86.8836 182353300 18325130 1171.3985 494.10 28.893 1180.896 186.69 47.57 168.09 40.51 0.254 0.770 1.021 2.408 635.128 1818307.16 1959116.33 1368893.38 2006701.24 1550391.52 73.723 37 105 453 1605 61 1694 78 445 425 559 165 642 214 1205.829 297.511 164.905 278.366 17.401 1093.172 2033.118 63.592 4158.912 608.000 230.844 2158.5 240.38 12.140 66.186 8.700 8.655 77.488 139 198 22 11571 1465 357.188 347.130 1.502 809.499 880.749 555.949 664.869 667.867 1014.95 86.9571 181513400 18267857 1171.2538 500.63 29.174 1182.485 186.84 47.51 167.55 40.52 0.253 0.767 1.019 2.407 635.350 2080148.81 1812276.32 1362865.22 2098750.81 1510099.25 74.001 OpenBenchmarking.org
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 20 40 60 80 100 105 105 105 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 100 200 300 400 500 SE +/- 0.67, N = 3 451 453 453 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 300 600 900 1200 1500 SE +/- 9.87, N = 3 SE +/- 1.67, N = 3 SE +/- 2.03, N = 3 1595 1602 1605 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 14 28 42 56 70 61 61 61 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 400 800 1200 1600 2000 SE +/- 0.88, N = 3 SE +/- 10.04, N = 3 SE +/- 0.67, N = 3 1680 1686 1694 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 77 77 78 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 100 200 300 400 500 SE +/- 1.76, N = 3 SE +/- 0.88, N = 3 445 443 445 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 90 180 270 360 450 SE +/- 1.53, N = 3 420 426 425 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 120 240 360 480 600 SE +/- 2.52, N = 3 SE +/- 1.67, N = 3 SE +/- 1.86, N = 3 557 557 559 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 40 80 120 160 200 SE +/- 0.58, N = 3 164 165 165 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 140 280 420 560 700 SE +/- 1.15, N = 3 SE +/- 1.00, N = 3 SE +/- 0.58, N = 3 640 642 642 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 50 100 150 200 250 SE +/- 0.33, N = 3 214 214 214 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 300 600 900 1200 1500 SE +/- 10.02, N = 3 SE +/- 2.64, N = 3 SE +/- 0.41, N = 3 1195.63 1208.17 1205.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 60 120 180 240 300 SE +/- 0.15, N = 3 SE +/- 0.79, N = 3 SE +/- 0.82, N = 3 297.42 298.06 297.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 40 80 120 160 200 SE +/- 0.31, N = 3 SE +/- 0.27, N = 3 SE +/- 0.01, N = 3 164.38 164.65 164.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 60 120 180 240 300 SE +/- 0.60, N = 3 SE +/- 0.09, N = 3 SE +/- 1.05, N = 3 279.31 279.97 278.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 17.46 17.34 17.40 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 200 400 600 800 1000 SE +/- 5.31, N = 3 SE +/- 1.36, N = 3 SE +/- 8.76, N = 3 1100.37 1064.78 1093.17 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 400 800 1200 1600 2000 SE +/- 2.45, N = 3 SE +/- 1.52, N = 3 SE +/- 1.58, N = 3 2039.06 1955.62 2033.12 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 65.03 63.66 63.59 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 900 1800 2700 3600 4500 SE +/- 11.19, N = 3 SE +/- 2.05, N = 3 SE +/- 11.92, N = 3 4160.51 4148.69 4158.91 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 130 260 390 520 650 SE +/- 1.05, N = 3 SE +/- 1.44, N = 3 SE +/- 1.37, N = 3 609.07 610.06 608.00 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Gcrypt Library Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 50 100 150 200 250 SE +/- 0.12, N = 3 SE +/- 0.16, N = 3 SE +/- 0.50, N = 3 230.08 231.61 230.84 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 500 1000 1500 2000 2500 SE +/- 27.76, N = 4 SE +/- 20.97, N = 12 SE +/- 21.95, N = 8 2150.8 2153.1 2158.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 50 100 150 200 250 SE +/- 0.80, N = 3 SE +/- 0.86, N = 3 SE +/- 0.20, N = 3 242.63 237.84 240.38 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Mobile Neural Network MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.96, N = 12 SE +/- 0.11, N = 3 11.98 14.29 12.14 MIN: 10.47 / MAX: 30.48 MIN: 10.49 / MAX: 29.25 MIN: 10.58 / MAX: 45.53 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 16 32 48 64 80 SE +/- 4.92, N = 3 SE +/- 4.14, N = 12 SE +/- 4.13, N = 3 71.81 72.55 66.19 MIN: 54.72 / MAX: 91.49 MIN: 53.89 / MAX: 122.59 MIN: 55.21 / MAX: 109.41 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 2 4 6 8 10 SE +/- 0.200, N = 3 SE +/- 0.584, N = 12 SE +/- 1.186, N = 3 6.650 8.051 8.700 MIN: 5.66 / MAX: 25.56 MIN: 5.61 / MAX: 25.34 MIN: 5.63 / MAX: 22.77 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 3 6 9 12 15 SE +/- 1.541, N = 3 SE +/- 0.705, N = 12 SE +/- 0.188, N = 3 9.900 9.744 8.655 MIN: 7.35 / MAX: 27.04 MIN: 7.33 / MAX: 27.36 MIN: 7.36 / MAX: 24 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 20 40 60 80 100 SE +/- 0.36, N = 3 SE +/- 0.21, N = 12 SE +/- 0.63, N = 3 77.01 77.42 77.49 MIN: 72.2 / MAX: 114.06 MIN: 73.59 / MAX: 98.43 MIN: 72.83 / MAX: 164.73 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 30 60 90 120 150 SE +/- 2.03, N = 12 SE +/- 0.50, N = 3 SE +/- 0.67, N = 3 137 140 139 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 40 80 120 160 200 SE +/- 1.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.44, N = 3 198 198 198 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 SE +/- 0.00, N = 3 22 21 22 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 3K 6K 9K 12K 15K SE +/- 38.03, N = 3 SE +/- 63.88, N = 3 SE +/- 60.19, N = 3 11621 11674 11571 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 300 600 900 1200 1500 SE +/- 10.11, N = 3 SE +/- 4.37, N = 3 SE +/- 8.08, N = 3 1454 1432 1465 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 80 160 240 320 400 SE +/- 0.20, N = 3 SE +/- 0.56, N = 3 SE +/- 0.14, N = 3 355.87 356.74 357.19 MIN: 353.61 / MAX: 362.38 MIN: 353.95 / MAX: 365.88 MIN: 353.91 / MAX: 381.51 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 80 160 240 320 400 SE +/- 0.37, N = 3 SE +/- 0.65, N = 3 SE +/- 0.65, N = 3 346.35 347.08 347.13 MIN: 344.52 / MAX: 349.53 MIN: 344.56 / MAX: 350.9 MIN: 344.71 / MAX: 353.26 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 200 400 600 800 1000 SE +/- 1.45, N = 3 SE +/- 0.21, N = 3 SE +/- 1.43, N = 3 813.83 815.28 809.50 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 200 400 600 800 1000 SE +/- 0.24, N = 3 SE +/- 2.13, N = 3 SE +/- 5.90, N = 3 880.67 882.87 880.75 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 2 3 120 240 360 480 600 SE +/- 1.61, N = 3 SE +/- 4.26, N = 3 SE +/- 2.72, N = 3 562.29 563.95 555.95 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 2 3 140 280 420 560 700 SE +/- 6.76, N = 3 SE +/- 3.94, N = 3 SE +/- 2.25, N = 3 669.51 669.42 664.87 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 140 280 420 560 700 SE +/- 4.32, N = 3 SE +/- 1.97, N = 3 SE +/- 0.56, N = 3 664.59 660.15 667.87 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 200 400 600 800 1000 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 1.29, N = 3 1020.14 1016.24 1014.95 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 20 40 60 80 100 SE +/- 1.01, N = 3 SE +/- 0.34, N = 3 SE +/- 0.16, N = 3 84.46 86.88 86.96 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 2 3 40M 80M 120M 160M 200M SE +/- 190232.24, N = 3 SE +/- 640522.07, N = 3 SE +/- 675452.60, N = 3 180021600 182353300 181513400 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 2 3 4M 8M 12M 16M 20M SE +/- 128997.10, N = 3 SE +/- 15070.00, N = 3 SE +/- 23942.83, N = 3 18212867 18325130 18267857 1. (CXX) g++ options: -O3 -fopenmp
OpenFOAM OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 1 2 3 110 220 330 440 550 SE +/- 4.15, N = 3 SE +/- 0.21, N = 3 SE +/- 3.29, N = 3 507.18 494.10 500.63 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 7 14 21 28 35 SE +/- 0.39, N = 3 SE +/- 0.43, N = 3 SE +/- 0.24, N = 3 29.50 28.89 29.17 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 1 2 3 40 80 120 160 200 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 187.04 186.69 186.84 MIN: 135.46 / MAX: 314.37 MIN: 135.58 / MAX: 309.56 MIN: 135.49 / MAX: 309.18 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 1 2 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 47.53 47.57 47.51 MIN: 43.97 / MAX: 56.03 MIN: 43.95 / MAX: 56.22 MIN: 43.96 / MAX: 55.8 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 3 40 80 120 160 200 SE +/- 0.19, N = 3 SE +/- 0.24, N = 3 SE +/- 0.22, N = 3 168.12 168.09 167.55 MIN: 156.56 / MAX: 184.1 MIN: 156.61 / MAX: 184.61 MIN: 155.69 / MAX: 183.28 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 1 2 3 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 40.55 40.51 40.52 MIN: 28.2 / MAX: 88.05 MIN: 28.18 / MAX: 87.92 MIN: 28.17 / MAX: 88.07 1. (CC) gcc options: -pthread
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 0.1733 0.3466 0.5199 0.6932 0.8665 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.002, N = 3 0.766 0.770 0.767
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 0.2297 0.4594 0.6891 0.9188 1.1485 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 1.015 1.021 1.019
OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 0.5429 1.0858 1.6287 2.1716 2.7145 SE +/- 0.006, N = 3 SE +/- 0.009, N = 3 SE +/- 0.010, N = 3 2.413 2.408 2.407
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 27204.80, N = 5 SE +/- 124811.32, N = 12 SE +/- 123334.01, N = 12 2186502.30 1818307.16 2080148.81 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 31108.95, N = 15 SE +/- 26878.67, N = 3 SE +/- 30678.66, N = 15 1730998.80 1959116.33 1812276.32 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 15303.51, N = 6 SE +/- 9380.25, N = 3 SE +/- 26624.46, N = 12 1337301.15 1368893.38 1362865.22 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 51755.37, N = 15 SE +/- 49295.20, N = 15 SE +/- 69605.36, N = 12 2146914.81 2006701.24 2098750.81 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 23347.52, N = 3 SE +/- 19135.63, N = 15 SE +/- 23030.14, N = 3 1512907.54 1550391.52 1510099.25 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
1 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
Testing initiated at 31 January 2021 20:08 by user phoronix.
2 Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
Testing initiated at 1 February 2021 08:38 by user phoronix.
3 Processor: Intel Core i3-7100 @ 3.90GHz (2 Cores / 4 Threads), Motherboard: Gigabyte B250M-DS3H-CF (F9 BIOS), Chipset: Intel Xeon E3-1200 v6/7th + B250, Memory: 8GB, Disk: 250GB Western Digital WDS250G1B0A-, Graphics: Gigabyte Intel HD 630 3GB (1100MHz), Audio: Realtek ALC887-VD, Monitor: VA2431, Network: Realtek RTL8111/8168/8411
OS: Ubuntu 20.10, Kernel: 5.8.0-33-generic (x86_64), Desktop: GNOME Shell 3.38.1, Display Server: X Server 1.20.9, Display Driver: gigabyte, OpenGL: 4.6 Mesa 20.2.1, Vulkan: 1.2.145, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Notes: Python 3.8.6Security Notes: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
Testing initiated at 1 February 2021 20:58 by user phoronix.