AMD EPYC 7F52 16-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2101232-HA-EPYC7F52W33 EPYC 7F52 Weekend - Phoronix Test Suite EPYC 7F52 Weekend AMD EPYC 7F52 16-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101232-HA-EPYC7F52W33&grr&sor .
EPYC 7F52 Weekend Processor Motherboard Chipset Memory Disk Graphics Monitor OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 2 3 AMD EPYC 7F52 16-Core @ 3.50GHz (16 Cores / 32 Threads) Supermicro H11DSi-NT v2.00 (2.1 BIOS) AMD Starship/Matisse 64GB 280GB INTEL SSDPE21D280GA llvmpipe VE228 Ubuntu 20.04 5.10.3-051003-generic (x86_64) GNOME Shell 3.36.1 X Server 1.20.8 modesetting 1.20.8 3.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits) GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034 Python Details - Python 2.7.18rc1 + Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
EPYC 7F52 Weekend onnx: super-resolution-10 - OpenMP CPU onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU gcrypt: onnx: fcn-resnet101-11 - OpenMP CPU cpuminer-opt: Myriad-Groestl onnx: shufflenet-v2-10 - OpenMP CPU npb: EP.D financebench: Bonds OpenMP gnupg: 2.7GB Sample File Encryption qmcpack: simple-H2O npb: LU.C financebench: Repo OpenMP cpuminer-opt: LBC, LBRY Credits lzbench: XZ 0 - Decompression lzbench: XZ 0 - Compression npb: BT.C redis: SADD quantlib: cpuminer-opt: Triple SHA-256, Onecoin lzbench: Zstd 8 - Decompression lzbench: Zstd 8 - Compression cpuminer-opt: Blake-2 S cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: x25x cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Deepcoin cpuminer-opt: Magi cpuminer-opt: Ringcoin etcpak: ETC2 lzbench: Crush 0 - Decompression lzbench: Crush 0 - Compression npb: IS.D lzbench: Brotli 2 - Decompression lzbench: Brotli 2 - Compression lzbench: Libdeflate 1 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Brotli 0 - Decompression lzbench: Brotli 0 - Compression lzbench: Zstd 1 - Decompression lzbench: Zstd 1 - Compression redis: LPOP etcpak: ETC1 + Dithering redis: LPUSH etcpak: ETC1 redis: SET redis: GET npb: FT.C npb: SP.B npb: CG.C npb: EP.C etcpak: DXT1 npb: MG.C 1 2 3 4919 269 476 224.157 79 19561.50 9782 1412.71 75330.263021 73.171 26.938 66210.72 44212.686198 46290 117 38 74244.18 1485937.90 2322.3 133183 1740 97 555593 100563 529.63 3561.12 125147 14710 834.50 2226.54 160.122 454 106 1596.44 670 194 1126 240 563 486 1556 513 1762023.78 257.211 1161192.96 271.927 1314077.04 1614743.50 36835.15 60930.18 17362.66 1406.82 1186.398 46933.86 4945 278 555 223.603 79 14408.49 9790 1412.62 74841.869792 72.903 27.312 67598.46 44469.527344 46673 116 38 74409.14 1472079.42 2314.0 133213 1739 97 553680 100697 532.07 3581.84 126070 14533 834.73 2224.47 160.115 455 106 1595.17 669 193 1126 239 563 488 1557 512 1189182.29 257.124 1145456.96 271.776 1293523.87 1593488.17 36845.54 60521.16 17290.04 1406.37 1185.717 46981.16 5103 282 545 224.011 79 15840.18 9775 1413.20 75991.294271 72.887 26.854 67655.54 44145.303385 47293 117 38 74392.62 1495629.17 2324.0 132467 1738 98 532650 99417 531.57 3557.62 125807 14343 834.13 2232.93 160.071 454 106 1595.74 670 193 1125 239 564 487 1555 512 1165576.55 257.050 1134796.29 272.004 1296106.71 1563673.46 36810.96 59747.78 17678.30 1398.91 1189.084 46951.91 OpenBenchmarking.org
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 3 2 1 1100 2200 3300 4400 5500 SE +/- 104.09, N = 12 SE +/- 108.07, N = 12 SE +/- 125.40, N = 12 5103 4945 4919 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 3 2 1 60 120 180 240 300 SE +/- 2.45, N = 8 SE +/- 2.62, N = 12 SE +/- 2.50, N = 7 282 278 269 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 2 3 1 120 240 360 480 600 SE +/- 15.02, N = 12 SE +/- 17.07, N = 9 SE +/- 1.64, N = 3 555 545 476 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 2 3 1 50 100 150 200 250 SE +/- 0.38, N = 3 SE +/- 0.58, N = 3 SE +/- 0.85, N = 3 223.60 224.01 224.16 1. (CC) gcc options: -O2 -fvisibility=hidden
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 3 2 1 20 40 60 80 100 SE +/- 0.58, N = 3 SE +/- 0.60, N = 3 SE +/- 0.71, N = 7 79 79 79 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl 1 3 2 4K 8K 12K 16K 20K SE +/- 2318.71, N = 15 SE +/- 2673.90, N = 15 SE +/- 2627.82, N = 15 19561.50 15840.18 14408.49 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 2 1 3 2K 4K 6K 8K 10K SE +/- 13.35, N = 3 SE +/- 14.47, N = 3 SE +/- 14.88, N = 3 9790 9782 9775 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 3 1 2 300 600 900 1200 1500 SE +/- 0.08, N = 3 SE +/- 0.28, N = 3 SE +/- 0.36, N = 3 1413.20 1412.71 1412.62 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP 2 1 3 16K 32K 48K 64K 80K SE +/- 61.57, N = 3 SE +/- 108.46, N = 3 SE +/- 639.42, N = 3 74841.87 75330.26 75991.29 1. (CXX) g++ options: -O3 -march=native -fopenmp
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption 3 2 1 16 32 48 64 80 SE +/- 0.48, N = 3 SE +/- 0.50, N = 3 SE +/- 0.63, N = 3 72.89 72.90 73.17 1. (CC) gcc options: -O2
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.10 Input: simple-H2O 3 1 2 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.28, N = 3 SE +/- 0.23, N = 15 26.85 26.94 27.31 1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 3 2 1 14K 28K 42K 56K 70K SE +/- 56.64, N = 3 SE +/- 8.67, N = 3 SE +/- 998.67, N = 12 67655.54 67598.46 66210.72 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP 3 1 2 10K 20K 30K 40K 50K SE +/- 155.29, N = 3 SE +/- 122.58, N = 3 SE +/- 431.36, N = 3 44145.30 44212.69 44469.53 1. (CXX) g++ options: -O3 -march=native -fopenmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 3 2 1 10K 20K 30K 40K 50K SE +/- 166.97, N = 3 SE +/- 326.92, N = 3 SE +/- 403.51, N = 7 47293 46673 46290 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Decompression 3 1 2 30 60 90 120 150 117 117 116 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: XZ 0 - Process: Compression 3 2 1 9 18 27 36 45 38 38 38 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 3 1 16K 32K 48K 64K 80K SE +/- 42.15, N = 3 SE +/- 35.11, N = 3 SE +/- 60.49, N = 3 74409.14 74392.62 74244.18 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SADD 3 1 2 300K 600K 900K 1200K 1500K SE +/- 5317.31, N = 3 SE +/- 9643.77, N = 15 SE +/- 8791.08, N = 3 1495629.17 1485937.90 1472079.42 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 3 1 2 500 1000 1500 2000 2500 SE +/- 1.27, N = 3 SE +/- 0.15, N = 3 SE +/- 8.90, N = 3 2324.0 2322.3 2314.0 1. (CXX) g++ options: -O3 -march=native -rdynamic
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Triple SHA-256, Onecoin 2 1 3 30K 60K 90K 120K 150K SE +/- 583.36, N = 3 SE +/- 639.83, N = 3 SE +/- 145.18, N = 3 133213 133183 132467 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
lzbench Test: Zstd 8 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Decompression 1 2 3 400 800 1200 1600 2000 SE +/- 2.08, N = 3 SE +/- 1.76, N = 3 SE +/- 0.33, N = 3 1740 1739 1738 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 8 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 8 - Process: Compression 3 2 1 20 40 60 80 100 SE +/- 0.67, N = 3 98 97 97 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 1 2 3 120K 240K 360K 480K 600K SE +/- 5754.38, N = 3 SE +/- 5801.01, N = 3 SE +/- 2759.15, N = 3 555593 553680 532650 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Quad SHA-256, Pyrite 2 1 3 20K 40K 60K 80K 100K SE +/- 816.95, N = 3 SE +/- 517.21, N = 3 SE +/- 320.49, N = 3 100697 100563 99417 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 2 3 1 120 240 360 480 600 SE +/- 2.10, N = 3 SE +/- 1.80, N = 3 SE +/- 2.09, N = 3 532.07 531.57 529.63 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 2 1 3 800 1600 2400 3200 4000 SE +/- 16.67, N = 3 SE +/- 5.71, N = 3 SE +/- 8.36, N = 3 3581.84 3561.12 3557.62 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 2 3 1 30K 60K 90K 120K 150K SE +/- 181.48, N = 3 SE +/- 400.10, N = 3 SE +/- 924.52, N = 3 126070 125807 125147 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 1 2 3 3K 6K 9K 12K 15K SE +/- 79.37, N = 3 SE +/- 106.82, N = 3 SE +/- 184.78, N = 3 14710 14533 14343 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 2 1 3 200 400 600 800 1000 SE +/- 1.25, N = 3 SE +/- 1.34, N = 3 SE +/- 0.63, N = 3 834.73 834.50 834.13 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Ringcoin 3 1 2 500 1000 1500 2000 2500 SE +/- 8.43, N = 3 SE +/- 9.57, N = 3 SE +/- 8.06, N = 3 2232.93 2226.54 2224.47 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 1 2 3 40 80 120 160 200 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 160.12 160.12 160.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
lzbench Test: Crush 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Decompression 2 3 1 100 200 300 400 500 SE +/- 0.33, N = 3 455 454 454 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Crush 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Crush 0 - Process: Compression 3 2 1 20 40 60 80 100 SE +/- 0.88, N = 3 SE +/- 1.00, N = 3 106 106 106 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 1 3 2 300 600 900 1200 1500 SE +/- 0.98, N = 3 SE +/- 0.81, N = 3 SE +/- 1.41, N = 3 1596.44 1595.74 1595.17 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
lzbench Test: Brotli 2 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Decompression 3 1 2 140 280 420 560 700 SE +/- 1.76, N = 3 SE +/- 1.15, N = 3 SE +/- 1.20, N = 3 670 670 669 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 2 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 2 - Process: Compression 1 3 2 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 194 193 193 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Decompression 2 1 3 200 400 600 800 1000 1126 1126 1125 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Libdeflate 1 - Process: Compression 1 3 2 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 240 239 239 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Decompression 3 2 1 120 240 360 480 600 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 564 563 563 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Brotli 0 - Process: Compression 2 3 1 110 220 330 440 550 SE +/- 1.20, N = 3 488 487 486 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Decompression 2 1 3 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.45, N = 3 1557 1556 1555 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 1.8 Test: Zstd 1 - Process: Compression 1 3 2 110 220 330 440 550 SE +/- 0.67, N = 3 513 512 512 1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPOP 1 2 3 400K 800K 1200K 1600K 2000K SE +/- 20791.58, N = 4 SE +/- 16258.91, N = 3 SE +/- 9222.57, N = 3 1762023.78 1189182.29 1165576.55 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering 1 2 3 60 120 180 240 300 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 257.21 257.12 257.05 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: LPUSH 1 2 3 200K 400K 600K 800K 1000K SE +/- 10576.58, N = 3 SE +/- 9055.81, N = 3 SE +/- 14115.16, N = 3 1161192.96 1145456.96 1134796.29 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 3 1 2 60 120 180 240 300 SE +/- 0.13, N = 3 SE +/- 0.14, N = 3 SE +/- 0.02, N = 3 272.00 271.93 271.78 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET 1 3 2 300K 600K 900K 1200K 1500K SE +/- 12683.70, N = 3 SE +/- 13808.18, N = 3 SE +/- 17411.45, N = 3 1314077.04 1296106.71 1293523.87 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET 1 2 3 300K 600K 900K 1200K 1500K SE +/- 11267.75, N = 3 SE +/- 14298.06, N = 3 SE +/- 14427.11, N = 3 1614743.50 1593488.17 1563673.46 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 1 3 8K 16K 24K 32K 40K SE +/- 18.44, N = 3 SE +/- 5.54, N = 3 SE +/- 29.47, N = 3 36845.54 36835.15 36810.96 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 1 2 3 13K 26K 39K 52K 65K SE +/- 367.38, N = 3 SE +/- 500.34, N = 9 SE +/- 808.65, N = 3 60930.18 60521.16 59747.78 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 3 1 2 4K 8K 12K 16K 20K SE +/- 137.26, N = 3 SE +/- 139.46, N = 3 SE +/- 122.11, N = 3 17678.30 17362.66 17290.04 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 1 2 3 300 600 900 1200 1500 SE +/- 2.08, N = 3 SE +/- 2.66, N = 3 SE +/- 6.60, N = 3 1406.82 1406.37 1398.91 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 3 1 2 300 600 900 1200 1500 SE +/- 3.77, N = 3 SE +/- 3.55, N = 3 SE +/- 3.83, N = 3 1189.08 1186.40 1185.72 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 3 1 10K 20K 30K 40K 50K SE +/- 9.06, N = 3 SE +/- 55.85, N = 3 SE +/- 33.55, N = 3 46981.16 46951.91 46933.86 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 4.0.3
Phoronix Test Suite v10.8.4