2990wx-2021-amd AMD Ryzen Threadripper 2990WX 32-Core testing with a ASUS ROG ZENITH EXTREME (1701 BIOS) and Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101259-HA-2990WX20208&grw&sro .
2990wx-2021-amd Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 1a 2 3 4 5 AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads) ASUS ROG ZENITH EXTREME (1701 BIOS) AMD 17h 32GB Samsung SSD 970 EVO 500GB + 250GB Western Digital WDS250G2X0C-00L350 Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1244/1750MHz) Realtek ALC1220 LG Ultra HD Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad Ubuntu 20.10 5.8.0-34-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 modesetting 1.20.9 4.6 Mesa 20.2.1 (LLVM 11.0.0) 1.2.131 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - 1, 2, 3, 4, 5: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820d Python Details - Python 3.8.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected Kernel Details - 4, 5: Transparent Huge Pages: madvise
2990wx-2021-amd cython-bench: N-Queens lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 8 - Compression lzbench: Zstd 8 - Decompression lzbench: Crush 0 - Compression ior: 32MB - Default Test Directory ior: 16MB - Default Test Directory ior: 8MB - Default Test Directory lzbench: Crush 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 2 - Compression lzbench: Brotli 2 - Decompression lzbench: Libdeflate 1 - Compression etcpak: DXT1 etcpak: ETC1 etcpak: ETC2 etcpak: ETC1 + Dithering synthmark: VoiceMark_100 gcrypt: quantlib: relion: Basic - CPU cloverleaf: Lagrangian-Eulerian Hydrodynamics mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU ior: 4MB - Default Test Directory onnx: super-resolution-10 - OpenMP CPU tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 lammps: 20k Atoms lammps: Rhodopsin Protein npb: CG.C npb: EP.C npb: EP.D npb: FT.C npb: IS.D npb: LU.C npb: MG.C amg: kripke: openfoam: Motorbike 30M openfoam: Motorbike 60M qmcpack: simple-H2O qe: AUSURF112 cp2k: Fayalite-FIST Data cpuminer-opt: Magi cpuminer-opt: x25x cpuminer-opt: Deepcoin cpuminer-opt: Ringcoin cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Myriad-Groestl cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 build-godot: Time To Compile financebench: Repo OpenMP financebench: Bonds OpenMP ior: 2MB - Default Test Directory 1 1a 2 3 4 5 492.81 492.78 524.14 1599.218 247.713 154.297 227.802 594.561 1823.688 119.54 9.459 38.329 5.672 4.339 48.514 155 122 54 5011 623.83 2211 281.081 251.689 15.352 13.008 401359967 26877810 76.57 726.90 36.305 1705.65 1474.831 533.91 206.02 555.39 117.16 0.340 1.002 1.322 2.957 83.675 958.85 26.033 37 108 526 1583 94 1765 94 481 498 577 196 662 203 216.058 2320.7 7685.88 1733.78 1740.43 21668.10 751.00 40721.51 17335.82 1172.19 819.47 17059 2936.88 577410 4590.08 137040 10123 45567 166940 221130 42858.381510 58307.904948 25.862 37 109 526 1572 94 1786 94 480.75 490.50 535.92 481 489 569 196 664 204 1615.224 248.786 154.896 230.993 593.545 217.833 2286.0 1793.741 125.43 9.208 37.812 5.670 4.336 48.290 157 128 54 5487 592.02 2223 279.843 251.540 15.437 12.766 7203.14 1741.25 1740.55 21182.51 770.10 39921.13 16587.15 391813467 27358503 40.81 37.346 1710.86 1149.83 839.46 16883 2940.54 571013 4639.97 135643 10130 45570 165833 218553 540.87 204.27 548.65 117.18 0.337 0.993 1.307 2.885 82.317 42328.953125 58878.983073 781.51 471.83 496.04 529.36 1607.366 247.557 154.923 229.770 1798.427 132.07 582.93 14.799 12.475 396403133 37.455 1687.65 542.26 202.23 552.84 117.72 0.338 0.995 1.308 2.911 753.94 26.152 37 109 525 1572 94 1787 95 813.49 808.18 594.69 480 492 572 197 666 203 1617.869 248.611 155.118 230.942 587.965 216.246 2288.6 1767.925 86.58 9.044 37.935 5.621 4.408 47.444 183 213 59 6400 687.41 2362 289.279 251.417 15.013 12.589 7217.86 1750.21 1732.07 22314.71 793.83 39758.79 17192.72 398053633 25710838 35.900 1677.48 1447.579 1172.74 830.66 16920 2929.72 575833 4521.83 131437 10210 46943 165943 220147 540.92 208.11 554.60 117.85 0.343 1.014 1.329 2.924 81.894 41316.536458 56415.078125 791.04 25.821 37 108 525 1533 94 1784 94 506.83 527.67 536.25 481 494 572 196 640 206 1619.436 248.733 154.303 229.607 589.778 216.768 2296.2 1781.073 86.15 9.126 37.835 5.632 4.340 48.246 181 216 61 6498 654.19 2341 288.426 251.339 15.061 12.832 7441.03 1743.29 1737.33 22032.23 811.98 41730.74 16629.67 386077433 25857623 35.624 1673.03 1171.44 830.89 16947 2950.68 565767 4520.30 132140 10127 45451 166993 220797 539.62 209.54 552.65 118.30 0.344 1.008 1.320 2.908 82.105 41135.450521 56829.992187 961.66 OpenBenchmarking.org
Cython Benchmark Test: N-Queens OpenBenchmarking.org Seconds, Fewer Is Better Cython Benchmark 0.29.21 Test: N-Queens 1a 2 4 5 6 12 18 24 30 SE +/- 0.13, N = 3 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 SE +/- 0.08, N = 3 26.03 25.86 26.15 25.82
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 1a 2 4 5 9 18 27 36 45 SE +/- 0.33, N = 3 37 37 37 37 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 1a 2 4 5 20 40 60 80 100 108 109 109 108 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1a 2 4 5 110 220 330 440 550 SE +/- 1.15, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 526 526 525 525 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 1a 2 4 5 300 600 900 1200 1500 SE +/- 1.15, N = 3 SE +/- 0.33, N = 3 SE +/- 2.65, N = 3 SE +/- 40.67, N = 3 1583 1572 1572 1533 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 1a 2 4 5 20 40 60 80 100 94 94 94 94 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1a 2 4 5 400 800 1200 1600 2000 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 2.33, N = 3 1765 1786 1787 1784 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 1a 2 4 5 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 94 94 95 94 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
IOR Block Size: 32MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 4.96, N = 12 SE +/- 5.02, N = 7 SE +/- 4.61, N = 3 SE +/- 6.78, N = 3 SE +/- 6.72, N = 5 492.81 480.75 471.83 813.49 506.83 MIN: 391.29 / MAX: 1073.97 MIN: 396.65 / MAX: 1002.46 MIN: 411.76 / MAX: 928.83 MIN: 299.5 / MAX: 1128.35 MIN: 420 / MAX: 1027.38 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 16MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 3.19, N = 3 SE +/- 6.50, N = 4 SE +/- 1.33, N = 3 SE +/- 3.29, N = 3 SE +/- 2.84, N = 3 492.78 490.50 496.04 808.18 527.67 MIN: 391.54 / MAX: 1031.18 MIN: 367.13 / MAX: 1024.21 MIN: 422.66 / MAX: 988.54 MIN: 297.46 / MAX: 1133.39 MIN: 422.53 / MAX: 1035.04 1. (CC) gcc options: -O2 -lm -pthread -lmpi
IOR Block Size: 8MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory 1 2 3 4 5 130 260 390 520 650 SE +/- 8.56, N = 3 SE +/- 8.80, N = 3 SE +/- 6.43, N = 6 SE +/- 7.50, N = 3 SE +/- 4.89, N = 15 524.14 535.92 529.36 594.69 536.25 MIN: 399.31 / MAX: 1031.12 MIN: 424.77 / MAX: 1080.17 MIN: 393.92 / MAX: 1033.73 MIN: 330.92 / MAX: 1124.59 MIN: 376.61 / MAX: 1090.13 1. (CC) gcc options: -O2 -lm -pthread -lmpi
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 1a 2 4 5 100 200 300 400 500 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 481 481 480 481 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 1a 2 4 5 110 220 330 440 550 SE +/- 3.18, N = 3 SE +/- 3.00, N = 3 SE +/- 1.20, N = 3 SE +/- 1.15, N = 3 498 489 492 494 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 1a 2 4 5 120 240 360 480 600 SE +/- 4.33, N = 3 SE +/- 2.00, N = 2 577 569 572 572 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1a 2 4 5 40 80 120 160 200 SE +/- 0.33, N = 3 196 196 197 196 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 1a 2 4 5 140 280 420 560 700 SE +/- 4.26, N = 3 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 SE +/- 23.67, N = 3 662 664 666 640 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1a 2 4 5 50 100 150 200 250 SE +/- 3.06, N = 3 SE +/- 3.06, N = 3 SE +/- 2.87, N = 4 SE +/- 2.49, N = 15 203 204 203 206 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 1 2 3 4 5 300 600 900 1200 1500 SE +/- 0.37, N = 3 SE +/- 2.00, N = 3 SE +/- 9.61, N = 3 SE +/- 0.57, N = 3 SE +/- 1.41, N = 3 1599.22 1615.22 1607.37 1617.87 1619.44 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 1 2 3 4 5 50 100 150 200 250 SE +/- 1.00, N = 3 SE +/- 0.06, N = 3 SE +/- 1.16, N = 3 SE +/- 0.57, N = 3 SE +/- 0.45, N = 3 247.71 248.79 247.56 248.61 248.73 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 4 5 30 60 90 120 150 SE +/- 0.63, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 154.30 154.90 154.92 155.12 154.30 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 4 5 50 100 150 200 250 SE +/- 2.59, N = 6 SE +/- 0.07, N = 3 SE +/- 1.28, N = 3 SE +/- 0.34, N = 3 SE +/- 0.32, N = 3 227.80 230.99 229.77 230.94 229.61 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 4 5 130 260 390 520 650 SE +/- 1.64, N = 3 SE +/- 1.23, N = 3 SE +/- 1.86, N = 3 SE +/- 1.79, N = 3 594.56 593.55 587.97 589.78 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 1a 2 4 5 50 100 150 200 250 SE +/- 0.45, N = 3 SE +/- 2.05, N = 3 SE +/- 0.72, N = 3 SE +/- 1.12, N = 3 216.06 217.83 216.25 216.77 1. (CC) gcc options: -O2 -fvisibility=hidden
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 1a 2 4 5 500 1000 1500 2000 2500 SE +/- 4.31, N = 3 SE +/- 23.91, N = 8 SE +/- 27.83, N = 6 SE +/- 27.47, N = 6 2320.7 2286.0 2288.6 2296.2 1. (CXX) g++ options: -O3 -march=native -rdynamic
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 15.01, N = 3 SE +/- 2.64, N = 3 SE +/- 5.61, N = 3 SE +/- 9.74, N = 3 SE +/- 5.46, N = 3 1823.69 1793.74 1798.43 1767.93 1781.07 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1 2 3 4 5 30 60 90 120 150 SE +/- 0.72, N = 3 SE +/- 4.21, N = 9 SE +/- 3.21, N = 9 SE +/- 2.71, N = 12 SE +/- 1.40, N = 15 119.54 125.43 132.07 86.58 86.15 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 1 2 4 5 3 6 9 12 15 SE +/- 0.119, N = 15 SE +/- 0.104, N = 15 SE +/- 0.149, N = 3 SE +/- 0.086, N = 15 9.459 9.208 9.044 9.126 MIN: 8.23 / MAX: 19.92 MIN: 8.24 / MAX: 22.52 MIN: 8.26 / MAX: 12.92 MIN: 8.22 / MAX: 19.08 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 1 2 4 5 9 18 27 36 45 SE +/- 0.25, N = 15 SE +/- 0.26, N = 15 SE +/- 0.29, N = 3 SE +/- 0.30, N = 15 38.33 37.81 37.94 37.84 MIN: 35.25 / MAX: 98.22 MIN: 34.87 / MAX: 128.92 MIN: 36.11 / MAX: 89.49 MIN: 35 / MAX: 102.86 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 1 2 4 5 1.2762 2.5524 3.8286 5.1048 6.381 SE +/- 0.062, N = 15 SE +/- 0.073, N = 15 SE +/- 0.138, N = 3 SE +/- 0.055, N = 15 5.672 5.670 5.621 5.632 MIN: 5.22 / MAX: 6.84 MIN: 5.12 / MAX: 13.18 MIN: 5.33 / MAX: 6.27 MIN: 5.17 / MAX: 6.36 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 1 2 4 5 0.9918 1.9836 2.9754 3.9672 4.959 SE +/- 0.050, N = 15 SE +/- 0.067, N = 15 SE +/- 0.099, N = 3 SE +/- 0.036, N = 15 4.339 4.336 4.408 4.340 MIN: 3.45 / MAX: 35.08 MIN: 3.43 / MAX: 36.21 MIN: 3.88 / MAX: 23.42 MIN: 3.82 / MAX: 35.09 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 1 2 4 5 11 22 33 44 55 SE +/- 0.32, N = 15 SE +/- 0.35, N = 15 SE +/- 0.12, N = 3 SE +/- 0.45, N = 15 48.51 48.29 47.44 48.25 MIN: 44.81 / MAX: 103.24 MIN: 44.15 / MAX: 138.3 MIN: 45.7 / MAX: 117.61 MIN: 43.95 / MAX: 109.39 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 1 2 4 5 40 80 120 160 200 SE +/- 1.78, N = 12 SE +/- 2.09, N = 4 SE +/- 2.10, N = 12 SE +/- 1.96, N = 3 155 157 183 181 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 1 2 4 5 50 100 150 200 250 SE +/- 2.08, N = 12 SE +/- 1.50, N = 3 SE +/- 1.92, N = 3 SE +/- 2.59, N = 12 122 128 213 216 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 4 5 14 28 42 56 70 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 SE +/- 0.58, N = 3 54 54 59 61 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 4 5 1400 2800 4200 5600 7000 SE +/- 167.82, N = 12 SE +/- 55.38, N = 3 SE +/- 69.07, N = 12 SE +/- 65.22, N = 3 5011 5487 6400 6498 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
IOR Block Size: 4MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory 1 2 3 4 5 150 300 450 600 750 SE +/- 37.33, N = 15 SE +/- 6.62, N = 12 SE +/- 4.85, N = 15 SE +/- 3.96, N = 3 SE +/- 35.87, N = 15 623.83 592.02 582.93 687.41 654.19 MIN: 310.99 / MAX: 1067.3 MIN: 332.2 / MAX: 1089.17 MIN: 366.23 / MAX: 1049.9 MIN: 279.64 / MAX: 1117.59 MIN: 328.97 / MAX: 1093.06 1. (CC) gcc options: -O2 -lm -pthread -lmpi
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 1 2 4 5 500 1000 1500 2000 2500 SE +/- 4.69, N = 3 SE +/- 8.92, N = 3 SE +/- 7.60, N = 3 SE +/- 22.29, N = 3 2211 2223 2362 2341 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 1 2 4 5 60 120 180 240 300 SE +/- 0.22, N = 3 SE +/- 0.81, N = 3 SE +/- 0.81, N = 3 SE +/- 0.44, N = 3 281.08 279.84 289.28 288.43 MIN: 263.7 / MAX: 327.87 MIN: 264.8 / MAX: 315.98 MIN: 265.62 / MAX: 312.79 MIN: 263.56 / MAX: 332.62 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 1 2 4 5 50 100 150 200 250 SE +/- 0.03, N = 3 SE +/- 0.15, N = 3 SE +/- 0.31, N = 3 SE +/- 0.07, N = 3 251.69 251.54 251.42 251.34 MIN: 250.86 / MAX: 254.16 MIN: 250.62 / MAX: 254.15 MIN: 250.45 / MAX: 260.46 MIN: 250.59 / MAX: 254.17 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 1 2 3 4 5 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.16, N = 3 15.35 15.44 14.80 15.01 15.06 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 2 3 4 5 3 6 9 12 15 SE +/- 0.22, N = 13 SE +/- 0.25, N = 15 SE +/- 0.16, N = 15 SE +/- 0.22, N = 15 SE +/- 0.22, N = 15 13.01 12.77 12.48 12.59 12.83 1. (CXX) g++ options: -O3 -pthread -lm
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 1a 2 4 5 1600 3200 4800 6400 8000 SE +/- 214.27, N = 15 SE +/- 230.46, N = 15 SE +/- 203.10, N = 15 SE +/- 198.36, N = 15 7685.88 7203.14 7217.86 7441.03 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1a 2 4 5 400 800 1200 1600 2000 SE +/- 7.09, N = 3 SE +/- 0.68, N = 3 SE +/- 6.67, N = 3 SE +/- 4.65, N = 3 1733.78 1741.25 1750.21 1743.29 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 1a 2 4 5 400 800 1200 1600 2000 SE +/- 0.38, N = 3 SE +/- 0.69, N = 3 SE +/- 2.98, N = 3 SE +/- 2.63, N = 3 1740.43 1740.55 1732.07 1737.33 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 1a 2 4 5 5K 10K 15K 20K 25K SE +/- 397.10, N = 15 SE +/- 443.87, N = 15 SE +/- 137.92, N = 3 SE +/- 47.70, N = 3 21668.10 21182.51 22314.71 22032.23 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 1a 2 4 5 200 400 600 800 1000 SE +/- 17.71, N = 15 SE +/- 14.63, N = 12 SE +/- 8.96, N = 15 SE +/- 9.93, N = 3 751.00 770.10 793.83 811.98 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 1a 2 4 5 9K 18K 27K 36K 45K SE +/- 705.58, N = 15 SE +/- 824.34, N = 15 SE +/- 904.51, N = 15 SE +/- 555.68, N = 15 40721.51 39921.13 39758.79 41730.74 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 1a 2 4 5 4K 8K 12K 16K 20K SE +/- 398.98, N = 15 SE +/- 500.68, N = 15 SE +/- 312.61, N = 15 SE +/- 531.61, N = 15 17335.82 16587.15 17192.72 16629.67 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1 2 3 4 5 90M 180M 270M 360M 450M SE +/- 4525834.83, N = 3 SE +/- 1988092.57, N = 3 SE +/- 2489918.59, N = 3 SE +/- 277098.08, N = 3 SE +/- 3176655.21, N = 3 401359967 391813467 396403133 398053633 386077433 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 1 2 4 5 6M 12M 18M 24M 30M SE +/- 46576.08, N = 3 SE +/- 496569.70, N = 12 SE +/- 432230.86, N = 12 SE +/- 621600.77, N = 12 26877810 27358503 25710838 25857623 1. (CXX) g++ options: -O3 -fopenmp
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 1 2 20 40 60 80 100 SE +/- 10.16, N = 8 SE +/- 18.67, N = 3 76.57 40.81 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M 1 160 320 480 640 800 SE +/- 10.63, N = 9 726.90 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 1 2 3 4 5 9 18 27 36 45 SE +/- 0.57, N = 3 SE +/- 0.53, N = 3 SE +/- 0.49, N = 15 SE +/- 0.44, N = 15 SE +/- 0.39, N = 7 36.31 37.35 37.46 35.90 35.62 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 1 2 3 4 5 400 800 1200 1600 2000 SE +/- 19.15, N = 7 SE +/- 26.20, N = 3 SE +/- 3.23, N = 3 SE +/- 23.00, N = 4 SE +/- 5.57, N = 3 1705.65 1710.86 1687.65 1677.48 1673.03 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
CP2K Molecular Dynamics Fayalite-FIST Data OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.1 Fayalite-FIST Data 1 4 300 600 900 1200 1500 1474.83 1447.58
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 1a 2 4 5 300 600 900 1200 1500 SE +/- 0.84, N = 3 SE +/- 14.98, N = 4 SE +/- 1.16, N = 3 SE +/- 3.50, N = 3 1172.19 1149.83 1172.74 1171.44 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 1a 2 4 5 200 400 600 800 1000 SE +/- 7.85, N = 3 SE +/- 5.74, N = 3 SE +/- 1.18, N = 3 SE +/- 1.08, N = 3 819.47 839.46 830.66 830.89 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 1a 2 4 5 4K 8K 12K 16K 20K SE +/- 119.53, N = 14 SE +/- 27.28, N = 3 SE +/- 11.55, N = 3 SE +/- 49.78, N = 3 17059 16883 16920 16947 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin 1a 2 4 5 600 1200 1800 2400 3000 SE +/- 6.53, N = 3 SE +/- 8.81, N = 3 SE +/- 8.05, N = 3 SE +/- 11.03, N = 3 2936.88 2940.54 2929.72 2950.68 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 1a 2 4 5 120K 240K 360K 480K 600K SE +/- 3940.00, N = 3 SE +/- 3628.00, N = 3 SE +/- 3934.48, N = 3 SE +/- 5261.67, N = 3 577410 571013 575833 565767 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 1a 2 4 5 1000 2000 3000 4000 5000 SE +/- 68.57, N = 4 SE +/- 65.81, N = 3 SE +/- 2.46, N = 3 SE +/- 0.83, N = 3 4590.08 4639.97 4521.83 4520.30 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 1a 2 4 5 30K 60K 90K 120K 150K SE +/- 918.82, N = 3 SE +/- 1055.21, N = 3 SE +/- 1707.45, N = 3 SE +/- 441.63, N = 3 137040 135643 131437 132140 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl 1a 2 4 5 2K 4K 6K 8K 10K SE +/- 8.82, N = 3 SE +/- 17.32, N = 3 SE +/- 66.58, N = 3 SE +/- 12.02, N = 3 10123 10130 10210 10127 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 1a 2 4 5 10K 20K 30K 40K 50K SE +/- 668.84, N = 3 SE +/- 502.13, N = 3 SE +/- 536.95, N = 3 SE +/- 381.65, N = 15 45567 45570 46943 45451 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite 1a 2 4 5 40K 80K 120K 160K 200K SE +/- 222.71, N = 3 SE +/- 1021.96, N = 3 SE +/- 1373.44, N = 3 SE +/- 1729.38, N = 3 166940 165833 165943 166993 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin 1a 2 4 5 50K 100K 150K 200K 250K SE +/- 469.18, N = 3 SE +/- 612.27, N = 3 SE +/- 568.46, N = 3 SE +/- 877.54, N = 3 221130 218553 220147 220797 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 1 2 3 4 5 120 240 360 480 600 SE +/- 8.93, N = 3 SE +/- 1.71, N = 3 SE +/- 1.05, N = 3 SE +/- 2.60, N = 3 SE +/- 2.78, N = 3 533.91 540.87 542.26 540.92 539.62 MIN: 411.82 / MAX: 669.31 MIN: 423.28 / MAX: 669.12 MIN: 422.86 / MAX: 672.41 MIN: 423.15 / MAX: 672.98 MIN: 421.29 / MAX: 668.77 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 1 2 3 4 5 50 100 150 200 250 SE +/- 2.02, N = 15 SE +/- 2.03, N = 15 SE +/- 1.90, N = 15 SE +/- 2.27, N = 7 SE +/- 0.49, N = 3 206.02 204.27 202.23 208.11 209.54 MIN: 129.35 / MAX: 225.85 MIN: 127.4 / MAX: 222.46 MIN: 126.91 / MAX: 221.77 MIN: 133.38 / MAX: 225.17 MIN: 140.28 / MAX: 222.38 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 1 2 3 4 5 120 240 360 480 600 SE +/- 2.78, N = 3 SE +/- 2.92, N = 3 SE +/- 2.25, N = 3 SE +/- 1.32, N = 3 SE +/- 2.21, N = 3 555.39 548.65 552.84 554.60 552.65 MIN: 320.77 / MAX: 613.37 MIN: 328.88 / MAX: 602.05 MIN: 322.61 / MAX: 608.3 MIN: 342.81 / MAX: 607.82 MIN: 338.43 / MAX: 607.43 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 1 2 3 4 5 30 60 90 120 150 SE +/- 0.47, N = 3 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.13, N = 3 SE +/- 0.22, N = 3 117.16 117.18 117.72 117.85 118.30 MIN: 80.9 / MAX: 196.36 MIN: 80.97 / MAX: 191.46 MIN: 81.13 / MAX: 195.69 MIN: 81.72 / MAX: 196.43 MIN: 81.84 / MAX: 191.83 1. (CC) gcc options: -pthread
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 1 2 3 4 5 0.0774 0.1548 0.2322 0.3096 0.387 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 0.340 0.337 0.338 0.343 0.344
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 1 2 3 4 5 0.2282 0.4564 0.6846 0.9128 1.141 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 1.002 0.993 0.995 1.014 1.008
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 1 2 3 4 5 0.299 0.598 0.897 1.196 1.495 SE +/- 0.003, N = 3 SE +/- 0.006, N = 3 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.003, N = 3 1.322 1.307 1.308 1.329 1.320
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 1 2 3 4 5 0.6653 1.3306 1.9959 2.6612 3.3265 SE +/- 0.011, N = 3 SE +/- 0.009, N = 3 SE +/- 0.005, N = 3 SE +/- 0.010, N = 3 SE +/- 0.006, N = 3 2.957 2.885 2.911 2.924 2.908
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 1 2 4 5 20 40 60 80 100 SE +/- 0.87, N = 3 SE +/- 0.43, N = 2 SE +/- 0.19, N = 3 SE +/- 0.38, N = 3 83.68 82.32 81.89 82.11
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 1a 2 4 5 9K 18K 27K 36K 45K SE +/- 192.21, N = 3 SE +/- 566.70, N = 3 SE +/- 57.95, N = 3 SE +/- 260.65, N = 3 42858.38 42328.95 41316.54 41135.45 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 1a 2 4 5 13K 26K 39K 52K 65K SE +/- 196.31, N = 3 SE +/- 203.39, N = 3 SE +/- 106.37, N = 3 SE +/- 43.63, N = 3 58307.90 58878.98 56415.08 56829.99 1. (CXX) g++ options: -O3 -march=native -fopenmp
IOR Block Size: 2MB - Disk Target: Default Test Directory OpenBenchmarking.org MB/s, More Is Better IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory 1 2 3 4 5 200 400 600 800 1000 SE +/- 3.76, N = 3 SE +/- 8.48, N = 3 SE +/- 10.17, N = 3 SE +/- 6.02, N = 3 SE +/- 3.23, N = 3 958.85 781.51 753.94 791.04 961.66 MIN: 837.5 / MAX: 1071.89 MIN: 347.79 / MAX: 1068.81 MIN: 291.39 / MAX: 1030.85 MIN: 324.26 / MAX: 1057.74 MIN: 792.79 / MAX: 1076.37 1. (CC) gcc options: -O2 -lm -pthread -lmpi
Phoronix Test Suite v10.8.4