Ryzen 7 1800X 2021 AMD Ryzen 7 1800X Eight-Core testing with a MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.F0 BIOS) and AMD Radeon RX 460/560D / Pro 450/455/460/555/555X/560/560X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102082-HA-RYZEN718012 .
Ryzen 7 1800X 2021 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 4 AMD Ryzen 7 1800X Eight-Core @ 3.60GHz (8 Cores / 16 Threads) MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.F0 BIOS) AMD 17h 8GB Samsung SSD 950 PRO 256GB AMD Radeon RX 460/560D / Pro 450/455/460/555/555X/560/560X 2GB (1212/1750MHz) AMD Baffin HDMI/DP LG Ultra HD Intel I211 Ubuntu 20.10 5.8.0-33-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.8 4.6 Mesa 20.2.1 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8001137 Python Details - Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Ryzen 7 1800X 2021 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 1920 x 1080 paraview: Wavelet Volume - 3840 x 2160 paraview: Wavelet Volume - 3840 x 2160 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 1920 x 1080 paraview: Wavelet Contour - 3840 x 2160 paraview: Wavelet Contour - 3840 x 2160 quantlib: etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: LU.C npb: MG.C cloverleaf: Lagrangian-Eulerian Hydrodynamics lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression amg: pennant: sedovbig pennant: leblancbig qmcpack: simple-H2O openfoam: Motorbike 30M openfoam: Motorbike 60M lammps: Rhodopsin Protein dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 build-godot: Time To Compile gcrypt: ngspice: C2670 ngspice: C7552 webp2: Default webp2: Quality 75, Compression Effort 7 webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression synthmark: VoiceMark_100 askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU gnupg: 2.7GB Sample File Encryption kripke: v-ray: CPU 1 2 3 4 107.40 1718.366 37.98 607.714 99.41 1035.933 50.56 526.843 2178.7 1516.527 234.167 145.625 217.087 6432.80 473.31 473.04 12539.68 23908.10 11967.26 180.75 36 106 489 1542 88 1708 86 469 455 561 185 649 196 258316567 94.04673 69.13010 29.921 268.05 1200.87 5.014 402.56 121.24 364.60 90.77 0.347 1.022 1.350 3.042 194.714 227.421 178.751 173.163 4.876 265.752 480.645 14.286 808.666 605.855 970.336 1590.19 2026.46 1729.13 882.684 1786.95 178.893 2221045.25 1707590.29 1202540.45 1990382.29 1475057.50 9.983 48.009 5.334 4.908 54.302 285.614 266.932 199 301 35 9350 2483 75.403 25764743 7959 106.25 1700.023 37.95 607.272 99.48 1036.652 50.52 526.495 2188.2 1511.247 233.479 145.742 216.273 6381.07 470.87 477.00 12532.15 23864.31 11991.06 180.67 36 107 492 1544 88 1713 91 474 452 556 186 647 189 259302600 93.24272 69.82235 30.048 267.82 1200.86 5.186 402.27 121.13 365.28 90.72 0.347 1.021 1.349 3.035 194.188 227.129 176.666 172.267 4.894 265.156 475.515 14.296 808.727 600.120 969.968 1590.38 2050.55 1734.72 888.615 1786.95 178.891 1269611.04 1698053.33 1213171.58 1888790.08 1494117.2 9.941 47.143 5.249 4.888 54.916 282.246 264.702 199 302 35 9440 2480 74.985 25870857 7896 106.77 1708.301 37.97 607.498 99.58 1037.823 50.59 527.225 2185.5 1509.919 234.046 146.771 218.152 6365.99 474.06 472.27 12524.98 23818.47 11979.01 183.50 35 106 490 1543 87 1709 89 470 450 560 186 646 185 258939133 94.06175 69.44954 30.666 268.19 1200.73 5.193 402.74 121.22 365.16 91.09 0.349 1.020 1.347 3.039 194.318 226.890 174.199 179.433 4.904 264.343 478.893 14.306 810.389 603.023 970.336 1592.57 2024.35 1727.66 884.656 1786.95 178.785 1280999.75 1719430.79 1221884.04 1901106.00 1471631.25 9.907 47.168 5.200 4.879 54.774 284.310 266.059 199 302 35 9391 2488 74.765 25843527 7982 107.68 1722.838 37.96 607.384 99.35 1035.323 50.51 526.374 2179.1 1522.765 234.235 145.553 218.704 6370.26 474.74 473.01 12497.06 23755.41 11985.17 180.13 35 107 490 1542 87 1713 92 472 454 557 187 654 187 259056033 OpenBenchmarking.org
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.93, N = 3 SE +/- 0.42, N = 3 SE +/- 0.15, N = 3 107.40 106.25 106.77 107.68
ParaView Test: Wavelet Volume - Resolution: 1920 x 1080 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 1920 x 1080 1 2 3 4 400 800 1200 1600 2000 SE +/- 4.30, N = 3 SE +/- 14.82, N = 3 SE +/- 6.65, N = 3 SE +/- 2.47, N = 3 1718.37 1700.02 1708.30 1722.84
ParaView Test: Wavelet Volume - Resolution: 3840 x 2160 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 3840 x 2160 1 2 3 4 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 37.98 37.95 37.97 37.96
ParaView Test: Wavelet Volume - Resolution: 3840 x 2160 OpenBenchmarking.org MiVoxels / Sec, More Is Better ParaView 5.9 Test: Wavelet Volume - Resolution: 3840 x 2160 1 2 3 4 130 260 390 520 650 SE +/- 0.26, N = 3 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 607.71 607.27 607.50 607.38
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 20 40 60 80 100 SE +/- 0.19, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 99.41 99.48 99.58 99.35
ParaView Test: Wavelet Contour - Resolution: 1920 x 1080 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 1920 x 1080 1 2 3 4 200 400 600 800 1000 SE +/- 1.98, N = 3 SE +/- 0.70, N = 3 SE +/- 0.56, N = 3 SE +/- 0.66, N = 3 1035.93 1036.65 1037.82 1035.32
ParaView Test: Wavelet Contour - Resolution: 3840 x 2160 OpenBenchmarking.org Frames / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 3840 x 2160 1 2 3 4 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 50.56 50.52 50.59 50.51
ParaView Test: Wavelet Contour - Resolution: 3840 x 2160 OpenBenchmarking.org MiPolys / Sec, More Is Better ParaView 5.9 Test: Wavelet Contour - Resolution: 3840 x 2160 1 2 3 4 110 220 330 440 550 SE +/- 0.34, N = 3 SE +/- 0.22, N = 3 SE +/- 0.67, N = 3 SE +/- 0.50, N = 3 526.84 526.50 527.23 526.37
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1 2 3 4 500 1000 1500 2000 2500 SE +/- 11.48, N = 3 SE +/- 9.88, N = 3 SE +/- 12.56, N = 3 SE +/- 8.41, N = 3 2178.7 2188.2 2185.5 2179.1 1. (CXX) g++ options: -O3 -march=native -rdynamic
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 300 600 900 1200 1500 SE +/- 1.47, N = 3 SE +/- 2.26, N = 3 SE +/- 5.58, N = 3 SE +/- 3.80, N = 3 1516.53 1511.25 1509.92 1522.77 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 50 100 150 200 250 SE +/- 0.84, N = 3 SE +/- 0.89, N = 3 SE +/- 1.11, N = 3 SE +/- 0.52, N = 3 234.17 233.48 234.05 234.24 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 30 60 90 120 150 SE +/- 0.80, N = 3 SE +/- 0.67, N = 3 SE +/- 0.62, N = 3 SE +/- 0.39, N = 3 145.63 145.74 146.77 145.55 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 50 100 150 200 250 SE +/- 0.43, N = 3 SE +/- 0.90, N = 3 SE +/- 0.32, N = 3 SE +/- 0.38, N = 3 217.09 216.27 218.15 218.70 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1 2 3 4 1400 2800 4200 5600 7000 SE +/- 19.15, N = 3 SE +/- 30.02, N = 3 SE +/- 16.52, N = 3 SE +/- 39.30, N = 3 6432.80 6381.07 6365.99 6370.26 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 4 100 200 300 400 500 SE +/- 0.60, N = 3 SE +/- 2.50, N = 3 SE +/- 0.52, N = 3 SE +/- 0.66, N = 3 473.31 470.87 474.06 474.74 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1 2 3 4 100 200 300 400 500 SE +/- 0.98, N = 3 SE +/- 1.38, N = 3 SE +/- 1.56, N = 3 SE +/- 0.67, N = 3 473.04 477.00 472.27 473.01 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1 2 3 4 3K 6K 9K 12K 15K SE +/- 11.42, N = 3 SE +/- 34.75, N = 3 SE +/- 17.36, N = 3 SE +/- 37.65, N = 3 12539.68 12532.15 12524.98 12497.06 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1 2 3 4 5K 10K 15K 20K 25K SE +/- 32.91, N = 3 SE +/- 84.98, N = 3 SE +/- 62.81, N = 3 SE +/- 101.47, N = 3 23908.10 23864.31 23818.47 23755.41 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1 2 3 4 3K 6K 9K 12K 15K SE +/- 26.31, N = 3 SE +/- 16.53, N = 3 SE +/- 16.36, N = 3 SE +/- 17.62, N = 3 11967.26 11991.06 11979.01 11985.17 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 40 80 120 160 200 SE +/- 0.05, N = 3 SE +/- 0.21, N = 3 SE +/- 2.02, N = 12 SE +/- 0.18, N = 3 180.75 180.67 183.50 180.13 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1 2 3 4 8 16 24 32 40 SE +/- 0.37, N = 15 36 36 35 35 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1 2 3 4 20 40 60 80 100 SE +/- 0.15, N = 15 106 107 106 107 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 2 3 4 110 220 330 440 550 SE +/- 1.45, N = 3 SE +/- 2.08, N = 3 489 492 490 490 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1 2 3 4 300 600 900 1200 1500 SE +/- 1.20, N = 3 SE +/- 1.53, N = 3 SE +/- 3.51, N = 3 1542 1544 1543 1542 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1 2 3 4 20 40 60 80 100 SE +/- 0.88, N = 3 SE +/- 1.15, N = 3 SE +/- 0.88, N = 3 88 88 87 87 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 4 400 800 1200 1600 2000 SE +/- 5.03, N = 3 SE +/- 1.00, N = 3 SE +/- 2.96, N = 3 SE +/- 3.76, N = 3 1708 1713 1709 1713 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1 2 3 4 20 40 60 80 100 SE +/- 1.19, N = 4 SE +/- 1.33, N = 3 SE +/- 1.22, N = 4 86 91 89 92 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1 2 3 4 100 200 300 400 500 SE +/- 1.11, N = 4 SE +/- 1.84, N = 4 SE +/- 1.45, N = 3 469 474 470 472 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1 2 3 4 100 200 300 400 500 SE +/- 2.40, N = 3 455 452 450 454 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1 2 3 4 120 240 360 480 600 SE +/- 1.76, N = 3 SE +/- 5.61, N = 3 SE +/- 2.40, N = 3 SE +/- 3.06, N = 3 561 556 560 557 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 2 3 4 40 80 120 160 200 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 185 186 186 187 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1 2 3 4 140 280 420 560 700 SE +/- 1.86, N = 3 SE +/- 5.24, N = 3 SE +/- 3.00, N = 3 649 647 646 654 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 2 3 4 40 80 120 160 200 SE +/- 2.29, N = 15 SE +/- 2.17, N = 15 SE +/- 2.45, N = 4 196 189 185 187 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 2 3 4 60M 120M 180M 240M 300M SE +/- 739452.92, N = 3 SE +/- 141594.40, N = 3 SE +/- 132091.40, N = 3 SE +/- 122140.74, N = 3 258316567 259302600 258939133 259056033 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 1 2 3 20 40 60 80 100 SE +/- 0.28, N = 3 SE +/- 0.19, N = 3 SE +/- 0.18, N = 3 94.05 93.24 94.06 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 1 2 3 16 32 48 64 80 SE +/- 0.15, N = 3 SE +/- 0.42, N = 3 SE +/- 0.09, N = 3 69.13 69.82 69.45 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 7 14 21 28 35 SE +/- 0.16, N = 3 SE +/- 0.49, N = 3 SE +/- 0.38, N = 3 29.92 30.05 30.67 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 1 2 3 60 120 180 240 300 SE +/- 0.20, N = 3 SE +/- 0.35, N = 3 SE +/- 0.10, N = 3 268.05 267.82 268.19 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M 1 2 3 300 600 900 1200 1500 SE +/- 0.91, N = 3 SE +/- 0.65, N = 3 SE +/- 0.38, N = 3 1200.87 1200.86 1200.73 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 1.1684 2.3368 3.5052 4.6736 5.842 SE +/- 0.071, N = 3 SE +/- 0.031, N = 3 SE +/- 0.037, N = 3 5.014 5.186 5.193 1. (CXX) g++ options: -O3 -pthread -lm
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 1 2 3 90 180 270 360 450 SE +/- 0.45, N = 3 SE +/- 0.21, N = 3 SE +/- 0.72, N = 3 402.56 402.27 402.74 MIN: 304.58 / MAX: 640.97 MIN: 305.21 / MAX: 631.59 MIN: 305.44 / MAX: 655.92 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 1 2 3 30 60 90 120 150 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 121.24 121.13 121.22 MIN: 107.47 / MAX: 127.94 MIN: 107.36 / MAX: 127.87 MIN: 106.39 / MAX: 127.87 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 3 80 160 240 320 400 SE +/- 0.79, N = 3 SE +/- 0.66, N = 3 SE +/- 0.96, N = 3 364.60 365.28 365.16 MIN: 303.28 / MAX: 394.61 MIN: 305.19 / MAX: 394.29 MIN: 305 / MAX: 396.19 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 1 2 3 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 SE +/- 0.02, N = 3 90.77 90.72 91.09 MIN: 59.08 / MAX: 211.05 MIN: 59.07 / MAX: 208.49 MIN: 59.24 / MAX: 217.19 1. (CC) gcc options: -pthread
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 0.0785 0.157 0.2355 0.314 0.3925 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.347 0.347 0.349
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 0.23 0.46 0.69 0.92 1.15 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.002, N = 3 1.022 1.021 1.020
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 0.3038 0.6076 0.9114 1.2152 1.519 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.350 1.349 1.347
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 0.6845 1.369 2.0535 2.738 3.4225 SE +/- 0.001, N = 3 SE +/- 0.006, N = 3 SE +/- 0.008, N = 3 3.042 3.035 3.039
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 3 40 80 120 160 200 SE +/- 0.31, N = 3 SE +/- 0.38, N = 3 SE +/- 0.23, N = 3 194.71 194.19 194.32
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1 2 3 50 100 150 200 250 SE +/- 0.56, N = 3 SE +/- 0.39, N = 3 SE +/- 0.43, N = 3 227.42 227.13 226.89 1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 1 2 3 40 80 120 160 200 SE +/- 2.40, N = 12 SE +/- 3.17, N = 12 SE +/- 2.28, N = 12 178.75 176.67 174.20 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 1 2 3 40 80 120 160 200 SE +/- 2.20, N = 3 SE +/- 1.85, N = 3 SE +/- 1.80, N = 12 173.16 172.27 179.43 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
WebP2 Image Encode Encode Settings: Default OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Default 1 2 3 1.1034 2.2068 3.3102 4.4136 5.517 SE +/- 0.047, N = 3 SE +/- 0.015, N = 3 SE +/- 0.015, N = 3 4.876 4.894 4.904 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 1 2 3 60 120 180 240 300 SE +/- 0.31, N = 3 SE +/- 0.35, N = 3 SE +/- 0.39, N = 3 265.75 265.16 264.34 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 1 2 3 100 200 300 400 500 SE +/- 0.94, N = 3 SE +/- 1.37, N = 3 SE +/- 2.78, N = 3 480.65 475.52 478.89 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 1 2 3 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 14.29 14.30 14.31 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 1 2 3 200 400 600 800 1000 SE +/- 0.28, N = 3 SE +/- 0.75, N = 3 SE +/- 1.55, N = 3 808.67 808.73 810.39 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 130 260 390 520 650 SE +/- 0.30, N = 3 SE +/- 0.86, N = 3 SE +/- 2.11, N = 3 605.86 600.12 603.02 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding 1 2 3 200 400 600 800 1000 SE +/- 0.37, N = 3 SE +/- 0.56, N = 3 SE +/- 0.70, N = 3 970.34 969.97 970.34 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 1 2 3 300 600 900 1200 1500 SE +/- 2.06, N = 3 SE +/- 0.71, N = 3 SE +/- 1.79, N = 3 1590.19 1590.38 1592.57 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 1 2 3 400 800 1200 1600 2000 SE +/- 15.90, N = 15 SE +/- 24.23, N = 3 SE +/- 15.71, N = 15 2026.46 2050.55 2024.35 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 1 2 3 400 800 1200 1600 2000 SE +/- 12.59, N = 15 SE +/- 26.92, N = 3 SE +/- 24.67, N = 15 1729.13 1734.72 1727.66 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 1 2 3 200 400 600 800 1000 SE +/- 5.46, N = 3 SE +/- 6.90, N = 3 SE +/- 6.15, N = 3 882.68 888.62 884.66 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 1 2 3 400 800 1200 1600 2000 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1786.95 1786.95 1786.95 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP 1 2 3 40 80 120 160 200 SE +/- 0.49, N = 3 SE +/- 0.18, N = 3 SE +/- 0.28, N = 3 178.89 178.89 178.79 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 500K 1000K 1500K 2000K 2500K SE +/- 36001.96, N = 3 SE +/- 17870.62, N = 3 SE +/- 4745.07, N = 3 2221045.25 1269611.04 1280999.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 22483.33, N = 3 SE +/- 21383.59, N = 3 SE +/- 20980.51, N = 3 1707590.29 1698053.33 1719430.79 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 300K 600K 900K 1200K 1500K SE +/- 15762.44, N = 5 SE +/- 15499.45, N = 3 SE +/- 12818.41, N = 3 1202540.45 1213171.58 1221884.04 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 30280.52, N = 3 SE +/- 8508.36, N = 3 SE +/- 6838.26, N = 3 1990382.29 1888790.08 1901106.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 13166.17, N = 3 SE +/- 2632.96, N = 3 SE +/- 10338.33, N = 3 1475057.50 1494117.20 1471631.25 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 3 3 6 9 12 15 SE +/- 0.109, N = 3 SE +/- 0.052, N = 3 SE +/- 0.077, N = 3 9.983 9.941 9.907 MIN: 9.38 / MAX: 22.68 MIN: 9.38 / MAX: 19.97 MIN: 9.29 / MAX: 20.07 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 3 11 22 33 44 55 SE +/- 0.49, N = 3 SE +/- 0.15, N = 3 SE +/- 0.20, N = 3 48.01 47.14 47.17 MIN: 43.15 / MAX: 66.92 MIN: 45.43 / MAX: 68.46 MIN: 42.9 / MAX: 63.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 3 1.2002 2.4004 3.6006 4.8008 6.001 SE +/- 0.064, N = 3 SE +/- 0.011, N = 3 SE +/- 0.040, N = 3 5.334 5.249 5.200 MIN: 4.98 / MAX: 16.79 MIN: 4.98 / MAX: 7.08 MIN: 4.91 / MAX: 15.47 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 3 1.1043 2.2086 3.3129 4.4172 5.5215 SE +/- 0.031, N = 3 SE +/- 0.013, N = 3 SE +/- 0.041, N = 3 4.908 4.888 4.879 MIN: 4.61 / MAX: 16.02 MIN: 4.63 / MAX: 15.1 MIN: 4.59 / MAX: 15.34 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 3 12 24 36 48 60 SE +/- 0.09, N = 3 SE +/- 0.41, N = 3 SE +/- 0.45, N = 3 54.30 54.92 54.77 MIN: 51.89 / MAX: 74.3 MIN: 52.67 / MAX: 80.56 MIN: 52.18 / MAX: 80.59 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 3 60 120 180 240 300 SE +/- 1.64, N = 3 SE +/- 1.31, N = 3 SE +/- 0.51, N = 3 285.61 282.25 284.31 MIN: 270.34 / MAX: 305.03 MIN: 267.66 / MAX: 302.96 MIN: 268.98 / MAX: 308.23 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 3 60 120 180 240 300 SE +/- 0.70, N = 3 SE +/- 0.63, N = 3 SE +/- 0.72, N = 3 266.93 264.70 266.06 MIN: 262.73 / MAX: 270.41 MIN: 261.5 / MAX: 268.89 MIN: 261.73 / MAX: 269.78 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 3 40 80 120 160 200 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 199 199 199 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 70 140 210 280 350 SE +/- 1.45, N = 3 SE +/- 1.32, N = 3 SE +/- 1.69, N = 3 301 302 302 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 8 16 24 32 40 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 35 35 35 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 28.43, N = 3 SE +/- 15.33, N = 3 SE +/- 22.53, N = 3 9350 9440 9391 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 500 1000 1500 2000 2500 SE +/- 2.75, N = 3 SE +/- 2.00, N = 3 SE +/- 1.20, N = 3 2483 2480 2488 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 1 2 3 20 40 60 80 100 SE +/- 1.18, N = 3 SE +/- 0.32, N = 3 SE +/- 0.40, N = 3 75.40 74.99 74.77 1. (CC) gcc options: -O2
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 2 3 6M 12M 18M 24M 30M SE +/- 71491.62, N = 3 SE +/- 98842.93, N = 3 SE +/- 180481.01, N = 3 25764743 25870857 25843527 1. (CXX) g++ options: -O3 -fopenmp
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 67.23, N = 3 SE +/- 38.42, N = 3 SE +/- 73.27, N = 3 7959 7896 7982
Phoronix Test Suite v10.8.4