AMD EPYC 72F3 Retpoline benchmarks by Michael Larabel.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2203111-NE-EPYC72F3R09

EPYC 72F3 Retpolines Change In Linux 5.17

HTML result view exported from: https://openbenchmarking.org/result/2203111-NE-EPYC72F3R09&export=pdf&sor&grw
Configurations compared: New Default - Generic Retpolines, AMD Retpolines Former Default

Processor: AMD EPYC 72F3 8-Core @ 3.70GHz (8 Cores / 16 Threads)
Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP + 1000GB Corsair Force MP600
Graphics: ASPEED
Monitor: VE228
Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe
OS: Ubuntu 21.04
Kernel: 5.17.0-051700rc7daily20220310-generic (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server
Compiler: GCC 10.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa001119
Java Details - OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.04)
Python Details - Python 3.9.5
Security Details - New Default - Generic Retpolines: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Security Details - AMD Retpolines Former Default: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Vulnerable: LFENCE + srbds: Not affected + tsx_async_abort: Not affected
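The two Security Details strings above differ in a single entry. A minimal Python sketch, assuming the " + "-separated "name: status" layout shown in this report, can split each string into a dictionary and list the entries that changed. The function name `parse_security_details` and the `COMMON` template are illustrative only and are not part of the Phoronix Test Suite.

```python
# Shared part of both Security Details strings, copied from the report;
# only the spectre_v2 status differs between the two configurations.
COMMON = (
    "itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + "
    "meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + "
    "spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + "
    "spectre_v2: {v2} + srbds: Not affected + tsx_async_abort: Not affected"
)
new_default = COMMON.format(
    v2="Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling"
)
former_default = COMMON.format(v2="Vulnerable: LFENCE")


def parse_security_details(text):
    """Split a ' + '-separated status string into {vulnerability: status}."""
    details = {}
    for item in text.split(" + "):
        # Split on the first ': ' only, since a status may itself contain colons.
        name, _, status = item.partition(": ")
        details[name.strip()] = status.strip()
    return details


new = parse_security_details(new_default)
old = parse_security_details(former_default)

# Keep only the entries whose status differs between the two configurations.
changed = {key: (new[key], old[key]) for key in new if new[key] != old.get(key)}
for key, (new_status, old_status) in changed.items():
    print(f"{key}: new default = {new_status!r}, former default = {old_status!r}")
```

On a running Linux system the same status strings are exposed under /sys/devices/system/cpu/vulnerabilities/, one file per entry.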
Test | New Default - Generic Retpolines | AMD Retpolines Former Default
compress-lz4: 3 - Compression Speed | 61.24 | 61.36
compress-lz4: 3 - Decompression Speed | 14455.1 | 14817.3
compress-lz4: 9 - Compression Speed | 59.57 | 59.69
compress-lz4: 9 - Decompression Speed | 14586.2 | 14807.4
hackbench: 8 - Thread | 31.275 | 31.046
hackbench: 16 - Thread | 62.203 | 62.029
hackbench: 8 - Process | 29.950 | 28.828
hackbench: 16 - Process | 58.782 | 58.356
hackbench: 32 - Process | 119.831 | 117.984
stress-ng: SENDFILE | 200771.29 | 193652.98
stress-ng: Semaphores | 1326767.92 | 1342170.84
stress-ng: Socket Activity | 10751.13 | 10567.09
stress-ng: Context Switching | 4296558.49 | 4326244.50
stress-ng: System V Message Passing | 4521841.84 | 4530010.92
sockperf: Throughput | 539724 | 587523
sockperf: Latency Under Load | 3.666 | 3.641
dacapobench: H2 | 4268 | 4228
dacapobench: Jython | 3761 | 3697
dacapobench: Tradesoap | 3660 | 3628
dacapobench: Tradebeans | 5762 | 5634
renaissance: Finagle HTTP Requests | 2378.7 | 2366.9
renaissance: In-Memory Database Shootout | 4788.2 | 4878.7
draco: Lion | 5088 | 5070
draco: Church Facade | 6697 | 6649
fio: Rand Read - IO_uring - No - Yes - 4KB - Default Test Directory | 239667 | 243000
fio: Rand Write - IO_uring - No - Yes - 4KB - Default Test Directory | 223933 | 229500
osbench: Create Files | 14.706538 | 13.723014
osbench: Create Threads | 17.376741 | 17.703374
osbench: Launch Programs | 49.362977 | 48.627853
osbench: Create Processes | 43.140253 | 43.362777
wireguard | 257.104 | 248.756
postmark: Disk Transaction Performance | 6944 | 7282
build-gdb: Time To Compile | 54.955 | 55.032
build-mplayer: Time To Compile | 30.709 | 30.802
compress-7zip: Compression Rating | 86357 | 87210
compress-7zip: Decompression Rating | 69393 | 69308
compress-zstd: 8 - Compression Speed | 1343.6 | 1338.6
compress-zstd: 8 - Decompression Speed | 4255.1 | 4160.1
compress-zstd: 19 - Compression Speed | 51.6 | 51.9
compress-zstd: 19 - Decompression Speed | 3694.5 | 3668.5
compress-pbzip2: FreeBSD-13.0-RELEASE-amd64-memstick.img Compression | 10.784 | 10.664
build-linux-kernel: defconfig | 78.618 | 78.591
build-godot: Time To Compile | 111.507 | 111.580
build-eigen: Time To Compile | 61.685 | 61.098
nginx: 20 | 215287.79 | 210605.86
nginx: 100 | 215190.56 | 207615.65
nginx: 200 | 213684.40 | 206147.39
apache: 100 | 82500.75 | 83397.57
apache: 200 | 86362.18 | 86408.48
apache: 500 | 80298.15 | 80970.15
openssl: SHA256 | 13238722860 | 13234833737
openssl: RSA4096 (sign/s) | 2372.3 | 2374.1
openssl: RSA4096 (verify/s) | 155003.1 | 155031.8
node-express-loadtest | 6692 | 6910
influxdb: 4 - 10000 - 2,5000,1 - 10000 | 1326859.1 | 1323460.2
influxdb: 64 - 10000 - 2,5000,1 - 10000 | 1638195.7 | 1639910.0
memtier-benchmark: Redis | 1324327.32 | 1313054.11
keydb | 136793.59 | 142573.34
redis: GET | 2506720.25 | 2438721.71
redis: SET | 1998825.92 | 1995692.89
rocksdb: Rand Fill | 605572 | 617250
rocksdb: Rand Read | 61331721 | 61586232
rocksdb: Update Rand | 393101 | 398526
rocksdb: Seq Fill | 657523 | 670760
rocksdb: Read While Writing | 1734878 | 1746524
rocksdb: Read Rand Write Rand | 1391000 | 1386586
leveldb: Hot Read | 10.537 | 10.502
leveldb: Fill Sync | 1820.488 | 1806.759
leveldb: Overwrite (MB/s) | 17.1 | 16.9
leveldb: Overwrite (Microseconds Per Op) | 103.365 | 104.608
leveldb: Rand Fill (MB/s) | 17.2 | 17.1
leveldb: Rand Fill (Microseconds Per Op) | 103.217 | 103.680
leveldb: Rand Read | 10.577 | 10.545
leveldb: Seek Rand | 17.139 | 17.011
leveldb: Rand Delete | 99.876 | 98.953
leveldb: Seq Fill (MB/s) | 16.9 | 16.5
leveldb: Seq Fill (Microseconds Per Op) | 105.116 | 106.946
sqlite-speedtest: Timed Time - Size 1,000 | 50.483 | 49.007
cassandra: Writes | 86420 | 85956
pgbench: 100 - 50 - Read Only | 295179 | 295730
pgbench: 100 - 50 - Read Only - Average Latency | 0.170 | 0.169
pgbench: 100 - 100 - Read Only | 302523 | 302399
pgbench: 100 - 100 - Read Only - Average Latency | 0.331 | 0.330
pgbench: 100 - 100 - Read Write | 10383 | 10440
pgbench: 100 - 100 - Read Write - Average Latency | 9.641 | 9.590
ebizzy | 619271 | 596554
node-web-tooling | 11.26 | 11.31
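The gap between the two configurations is easier to read as a percentage. The following Python sketch works it out for three of the result values listed above. The helper `relative_change` and its `higher_is_better` flag are illustrative names, not part of the Phoronix Test Suite, and the sign convention (positive means the new generic-retpolines default did better) is an assumption made for this example.

```python
def relative_change(new, old, higher_is_better=True):
    """Percent change of `new` relative to `old`; positive means `new` is better."""
    change = (new - old) / old * 100.0
    # For "fewer is better" metrics a larger number is a regression, so flip the sign.
    return change if higher_is_better else -change


# (test, New Default - Generic Retpolines, AMD Retpolines Former Default, higher_is_better)
results = [
    ("sockperf: Throughput", 539724, 587523, True),
    ("nginx: 100", 215190.56, 207615.65, True),
    ("wireguard", 257.104, 248.756, False),
]

for name, new, old, higher_is_better in results:
    print(f"{name}: {relative_change(new, old, higher_is_better):+.2f}%")
```

Treating the former default as the baseline is a choice of this sketch; swapping the arguments gives the change in the other direction.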
LZ4 Compression 1.9.3, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  AMD Retpolines Former Default: 61.36 (SE +/- 0.30, N = 3)
  New Default - Generic Retpolines: 61.24 (SE +/- 0.47, N = 15)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  AMD Retpolines Former Default: 14817.3 (SE +/- 22.12, N = 3)
  New Default - Generic Retpolines: 14455.1 (SE +/- 24.84, N = 15)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
  AMD Retpolines Former Default: 59.69 (SE +/- 0.04, N = 3)
  New Default - Generic Retpolines: 59.57 (SE +/- 0.63, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
  AMD Retpolines Former Default: 14807.4 (SE +/- 61.43, N = 3)
  New Default - Generic Retpolines: 14586.2 (SE +/- 17.39, N = 3)
  1. (CC) gcc options: -O3
Hackbench, Count: 8 - Type: Thread (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 31.05 (SE +/- 0.14, N = 3)
  New Default - Generic Retpolines: 31.28 (SE +/- 0.16, N = 3)
  1. (CC) gcc options: -lpthread
Hackbench, Count: 16 - Type: Thread (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 62.03 (SE +/- 0.11, N = 3)
  New Default - Generic Retpolines: 62.20 (SE +/- 0.19, N = 3)
  1. (CC) gcc options: -lpthread
Hackbench, Count: 8 - Type: Process (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 28.83 (SE +/- 0.17, N = 3)
  New Default - Generic Retpolines: 29.95 (SE +/- 0.17, N = 3)
  1. (CC) gcc options: -lpthread
Hackbench, Count: 16 - Type: Process (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 58.36 (SE +/- 0.23, N = 3)
  New Default - Generic Retpolines: 58.78 (SE +/- 0.16, N = 3)
  1. (CC) gcc options: -lpthread
Hackbench, Count: 32 - Type: Process (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 117.98 (SE +/- 0.59, N = 3)
  New Default - Generic Retpolines: 119.83 (SE +/- 0.30, N = 3)
  1. (CC) gcc options: -lpthread
Stress-NG 0.13.02, Test: SENDFILE (Bogo Ops/s, More Is Better)
  New Default - Generic Retpolines: 200771.29 (SE +/- 510.20, N = 3)
  AMD Retpolines Former Default: 193652.98 (SE +/- 443.51, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG 0.13.02, Test: Semaphores (Bogo Ops/s, More Is Better)
  AMD Retpolines Former Default: 1342170.84 (SE +/- 10886.78, N = 3)
  New Default - Generic Retpolines: 1326767.92 (SE +/- 4542.73, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG 0.13.02, Test: Socket Activity (Bogo Ops/s, More Is Better)
  New Default - Generic Retpolines: 10751.13 (SE +/- 12.31, N = 3)
  AMD Retpolines Former Default: 10567.09 (SE +/- 13.84, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG 0.13.02, Test: Context Switching (Bogo Ops/s, More Is Better)
  AMD Retpolines Former Default: 4326244.50 (SE +/- 26201.07, N = 3)
  New Default - Generic Retpolines: 4296558.49 (SE +/- 18653.91, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG 0.13.02, Test: System V Message Passing (Bogo Ops/s, More Is Better)
  AMD Retpolines Former Default: 4530010.92 (SE +/- 10539.69, N = 3)
  New Default - Generic Retpolines: 4521841.84 (SE +/- 16005.42, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Sockperf 3.7, Test: Throughput (Messages Per Second, More Is Better)
  AMD Retpolines Former Default: 587523 (SE +/- 5255.38, N = 25)
  New Default - Generic Retpolines: 539724 (SE +/- 4906.55, N = 25)
  1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf 3.7, Test: Latency Under Load (usec, Fewer Is Better)
  AMD Retpolines Former Default: 3.641 (SE +/- 0.033, N = 25)
  New Default - Generic Retpolines: 3.666 (SE +/- 0.035, N = 5)
  1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
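Each result in this report carries a standard error (SE), which gives a rough way to judge whether a gap between the two configurations exceeds run-to-run noise. The Python sketch below expresses the difference of two means in units of their combined standard error, using the Sockperf figures above. It assumes the two sets of runs are independent and is only a rule of thumb, not a method used by the Phoronix Test Suite; the function name `gap_in_standard_errors` is illustrative.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    # For independent samples the variances add, so the SEs combine in quadrature.
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Sockperf Throughput: former default vs. new default (Messages Per Second).
throughput = gap_in_standard_errors(587523, 5255.38, 539724, 4906.55)
# Sockperf Latency Under Load: former default vs. new default (usec).
latency = gap_in_standard_errors(3.641, 0.033, 3.666, 0.035)

print(f"Throughput gap: {throughput:.1f} combined standard errors")
print(f"Latency gap: {latency:.1f} combined standard errors")
```

Under these assumptions the throughput gap is several combined standard errors wide, while the latency gap is less than one, so only the former stands out clearly from the measurement noise.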
DaCapo Benchmark 9.12-MR1, Java Test: H2 (msec, Fewer Is Better)
  AMD Retpolines Former Default: 4228 (SE +/- 25.85, N = 16)
  New Default - Generic Retpolines: 4268 (SE +/- 26.89, N = 4)
DaCapo Benchmark 9.12-MR1, Java Test: Jython (msec, Fewer Is Better)
  AMD Retpolines Former Default: 3697 (SE +/- 24.35, N = 20)
  New Default - Generic Retpolines: 3761 (SE +/- 28.58, N = 4)
DaCapo Benchmark 9.12-MR1, Java Test: Tradesoap (msec, Fewer Is Better)
  AMD Retpolines Former Default: 3628 (SE +/- 33.50, N = 4)
  New Default - Generic Retpolines: 3660 (SE +/- 40.14, N = 4)
DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
  AMD Retpolines Former Default: 5634 (SE +/- 38.99, N = 4)
  New Default - Generic Retpolines: 5762 (SE +/- 34.00, N = 4)
Renaissance 0.12, Test: Finagle HTTP Requests (ms, Fewer Is Better)
  AMD Retpolines Former Default: 2366.9 (SE +/- 15.84, N = 3; MIN: 2178.29 / MAX: 2434.21)
  New Default - Generic Retpolines: 2378.7 (SE +/- 24.88, N = 3; MIN: 2177.37 / MAX: 2449.07)
Renaissance 0.12, Test: In-Memory Database Shootout (ms, Fewer Is Better)
  New Default - Generic Retpolines: 4788.2 (SE +/- 22.81, N = 3; MIN: 4452.21 / MAX: 5160.31)
  AMD Retpolines Former Default: 4878.7 (SE +/- 64.75, N = 3; MIN: 4512.55 / MAX: 5542.48)
Google Draco 1.5.0, Model: Lion (ms, Fewer Is Better)
  AMD Retpolines Former Default: 5070 (SE +/- 0.88, N = 3)
  New Default - Generic Retpolines: 5088 (SE +/- 7.37, N = 3)
  1. (CXX) g++ options: -O3
Google Draco 1.5.0, Model: Church Facade (ms, Fewer Is Better)
  AMD Retpolines Former Default: 6649 (SE +/- 5.21, N = 3)
  New Default - Generic Retpolines: 6697 (SE +/- 17.74, N = 3)
  1. (CXX) g++ options: -O3
Flexible IO Tester 3.29, Type: Random Read - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (IOPS, More Is Better)
  AMD Retpolines Former Default: 243000 (SE +/- 577.35, N = 3)
  New Default - Generic Retpolines: 239667 (SE +/- 333.33, N = 3)
  1. (CC) gcc options: -rdynamic -ltcmalloc -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
Flexible IO Tester 3.29, Type: Random Write - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (IOPS, More Is Better)
  AMD Retpolines Former Default: 229500 (SE +/- 2958.04, N = 4)
  New Default - Generic Retpolines: 223933 (SE +/- 3027.60, N = 15)
  1. (CC) gcc options: -rdynamic -ltcmalloc -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
OSBench, Test: Create Files (us Per Event, Fewer Is Better)
  AMD Retpolines Former Default: 13.72 (SE +/- 0.01, N = 3)
  New Default - Generic Retpolines: 14.71 (SE +/- 0.04, N = 3)
  1. (CC) gcc options: -lm
OSBench, Test: Create Threads (us Per Event, Fewer Is Better)
  New Default - Generic Retpolines: 17.38 (SE +/- 0.09, N = 3)
  AMD Retpolines Former Default: 17.70 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -lm
OSBench, Test: Launch Programs (us Per Event, Fewer Is Better)
  AMD Retpolines Former Default: 48.63 (SE +/- 0.53, N = 5)
  New Default - Generic Retpolines: 49.36 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -lm
OSBench, Test: Create Processes (us Per Event, Fewer Is Better)
  New Default - Generic Retpolines: 43.14 (SE +/- 0.23, N = 3)
  AMD Retpolines Former Default: 43.36 (SE +/- 0.54, N = 3)
  1. (CC) gcc options: -lm
WireGuard + Linux Networking Stack Stress Test (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 248.76 (SE +/- 0.99, N = 3)
  New Default - Generic Retpolines: 257.10 (SE +/- 1.39, N = 3)
PostMark 1.51, Disk Transaction Performance (TPS, More Is Better)
  AMD Retpolines Former Default: 7282
  New Default - Generic Retpolines: 6944
  SE +/- 70.00, N = 3 (the chart gives only one SE value)
  1. (CC) gcc options: -O3
Timed GDB GNU Debugger Compilation 10.2, Time To Compile (Seconds, Fewer Is Better)
  New Default - Generic Retpolines: 54.96 (SE +/- 0.13, N = 3)
  AMD Retpolines Former Default: 55.03 (SE +/- 0.16, N = 3)
Timed MPlayer Compilation 1.5, Time To Compile (Seconds, Fewer Is Better)
  New Default - Generic Retpolines: 30.71 (SE +/- 0.02, N = 3)
  AMD Retpolines Former Default: 30.80 (SE +/- 0.04, N = 3)
7-Zip Compression 21.06, Test: Compression Rating (MIPS, More Is Better)
  AMD Retpolines Former Default: 87210 (SE +/- 870.19, N = 5)
  New Default - Generic Retpolines: 86357 (SE +/- 438.70, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression 21.06, Test: Decompression Rating (MIPS, More Is Better)
  New Default - Generic Retpolines: 69393 (SE +/- 8.76, N = 3)
  AMD Retpolines Former Default: 69308 (SE +/- 59.62, N = 5)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Zstd Compression 1.5.0, Compression Level: 8 - Compression Speed (MB/s, More Is Better)
  New Default - Generic Retpolines: 1343.6 (SE +/- 3.86, N = 3)
  AMD Retpolines Former Default: 1338.6 (SE +/- 10.10, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
  New Default - Generic Retpolines: 4255.1 (SE +/- 54.59, N = 3)
  AMD Retpolines Former Default: 4160.1 (SE +/- 4.29, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  AMD Retpolines Former Default: 51.9 (SE +/- 0.45, N = 15)
  New Default - Generic Retpolines: 51.6 (SE +/- 0.47, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  New Default - Generic Retpolines: 3694.5 (SE +/- 18.20, N = 15)
  AMD Retpolines Former Default: 3668.5 (SE +/- 14.48, N = 15)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
Parallel BZIP2 Compression 1.1.13, FreeBSD-13.0-RELEASE-amd64-memstick.img Compression (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 10.66 (SE +/- 0.15, N = 3)
  New Default - Generic Retpolines: 10.78 (SE +/- 0.11, N = 6)
  1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread
Timed Linux Kernel Compilation 5.16, Build: defconfig (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 78.59 (SE +/- 0.03, N = 3)
  New Default - Generic Retpolines: 78.62 (SE +/- 0.06, N = 3)
Timed Godot Game Engine Compilation 3.2.3, Time To Compile (Seconds, Fewer Is Better)
  New Default - Generic Retpolines: 111.51 (SE +/- 0.22, N = 3)
  AMD Retpolines Former Default: 111.58 (SE +/- 0.09, N = 3)
Timed Eigen Compilation 3.3.9, Time To Compile (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 61.10 (SE +/- 0.02, N = 3)
  New Default - Generic Retpolines: 61.69 (SE +/- 0.02, N = 3)
nginx 1.21.1, Concurrent Requests: 20 (Requests Per Second, More Is Better)
  New Default - Generic Retpolines: 215287.79 (SE +/- 627.93, N = 3)
  AMD Retpolines Former Default: 210605.86 (SE +/- 247.73, N = 3)
  1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx 1.21.1, Concurrent Requests: 100 (Requests Per Second, More Is Better)
  New Default - Generic Retpolines: 215190.56 (SE +/- 1627.81, N = 3)
  AMD Retpolines Former Default: 207615.65 (SE +/- 369.20, N = 3)
  1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx 1.21.1, Concurrent Requests: 200 (Requests Per Second, More Is Better)
  New Default - Generic Retpolines: 213684.40 (SE +/- 275.98, N = 3)
  AMD Retpolines Former Default: 206147.39 (SE +/- 466.89, N = 3)
  1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Apache HTTP Server 2.4.48, Concurrent Requests: 100 (Requests Per Second, More Is Better)
  AMD Retpolines Former Default: 83397.57 (SE +/- 311.80, N = 3)
  New Default - Generic Retpolines: 82500.75 (SE +/- 56.43, N = 3)
  1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server 2.4.48, Concurrent Requests: 200 (Requests Per Second, More Is Better)
  AMD Retpolines Former Default: 86408.48 (SE +/- 123.84, N = 3)
  New Default - Generic Retpolines: 86362.18 (SE +/- 557.73, N = 3)
  1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server 2.4.48, Concurrent Requests: 500 (Requests Per Second, More Is Better)
  AMD Retpolines Former Default: 80970.15 (SE +/- 149.43, N = 3)
  New Default - Generic Retpolines: 80298.15 (SE +/- 264.49, N = 3)
  1. (CC) gcc options: -shared -fPIC -O2 -pthread
OpenSSL 3.0, Algorithm: SHA256 (byte/s, More Is Better)
  New Default - Generic Retpolines: 13238722860 (SE +/- 6459514.72, N = 3)
  AMD Retpolines Former Default: 13234833737 (SE +/- 9115647.24, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL 3.0, Algorithm: RSA4096 (sign/s, More Is Better)
  AMD Retpolines Former Default: 2374.1 (SE +/- 0.15, N = 3)
  New Default - Generic Retpolines: 2372.3 (SE +/- 2.17, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL 3.0, Algorithm: RSA4096 (verify/s, More Is Better)
  AMD Retpolines Former Default: 155031.8 (SE +/- 8.58, N = 3)
  New Default - Generic Retpolines: 155003.1 (SE +/- 7.94, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Node.js Express HTTP Load Test (Requests Per Second, More Is Better)
  AMD Retpolines Former Default: 6910 (SE +/- 43.97, N = 3)
  New Default - Generic Retpolines: 6692 (SE +/- 34.82, N = 3)
InfluxDB 1.8.2, Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
  New Default - Generic Retpolines: 1326859.1 (SE +/- 1561.10, N = 3)
  AMD Retpolines Former Default: 1323460.2 (SE +/- 2326.89, N = 3)
InfluxDB 1.8.2, Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
  AMD Retpolines Former Default: 1639910.0 (SE +/- 1145.84, N = 3)
  New Default - Generic Retpolines: 1638195.7 (SE +/- 3111.87, N = 3)
Memtier_benchmark 1.3, Protocol: Redis (Ops/sec, More Is Better)
  New Default - Generic Retpolines: 1324327.32 (SE +/- 3316.11, N = 3)
  AMD Retpolines Former Default: 1313054.11 (SE +/- 1363.28, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
KeyDB 6.2.0 (Ops/sec, More Is Better)
  AMD Retpolines Former Default: 142573.34 (SE +/- 1055.00, N = 15)
  New Default - Generic Retpolines: 136793.59 (SE +/- 1054.95, N = 10)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
  New Default - Generic Retpolines: 2506720.25 (SE +/- 15897.48, N = 3)
  AMD Retpolines Former Default: 2438721.71 (SE +/- 22769.87, N = 6)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
  New Default - Generic Retpolines: 1998825.92 (SE +/- 2446.32, N = 3)
  AMD Retpolines Former Default: 1995692.89 (SE +/- 18125.41, N = 7)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Facebook RocksDB 6.22.1, Test: Random Fill (Op/s, More Is Better)
  AMD Retpolines Former Default: 617250 (SE +/- 3544.93, N = 3)
  New Default - Generic Retpolines: 605572 (SE +/- 5871.64, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB 6.22.1, Test: Random Read (Op/s, More Is Better)
  AMD Retpolines Former Default: 61586232 (SE +/- 78029.55, N = 3)
  New Default - Generic Retpolines: 61331721 (SE +/- 195189.61, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB 6.22.1, Test: Update Random (Op/s, More Is Better)
  AMD Retpolines Former Default: 398526 (SE +/- 1232.53, N = 3)
  New Default - Generic Retpolines: 393101 (SE +/- 2036.18, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB 6.22.1, Test: Sequential Fill (Op/s, More Is Better)
  AMD Retpolines Former Default: 670760 (SE +/- 3478.74, N = 3)
  New Default - Generic Retpolines: 657523 (SE +/- 1947.52, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB 6.22.1, Test: Read While Writing (Op/s, More Is Better)
  AMD Retpolines Former Default: 1746524 (SE +/- 20451.18, N = 4)
  New Default - Generic Retpolines: 1734878 (SE +/- 24226.58, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB 6.22.1, Test: Read Random Write Random (Op/s, More Is Better)
  New Default - Generic Retpolines: 1391000 (SE +/- 2292.69, N = 3)
  AMD Retpolines Former Default: 1386586 (SE +/- 6491.63, N = 3)
  1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
LevelDB 1.22, Benchmark: Hot Read (Microseconds Per Op, Fewer Is Better)
  AMD Retpolines Former Default: 10.50 (SE +/- 0.05, N = 3)
  New Default - Generic Retpolines: 10.54 (SE +/- 0.11, N = 6)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Fill Sync (Microseconds Per Op, Fewer Is Better)
  AMD Retpolines Former Default: 1806.76 (SE +/- 3.85, N = 3)
  New Default - Generic Retpolines: 1820.49 (SE +/- 8.19, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Overwrite (MB/s, More Is Better)
  New Default - Generic Retpolines: 17.1 (SE +/- 0.15, N = 15)
  AMD Retpolines Former Default: 16.9 (SE +/- 0.11, N = 14)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Overwrite (Microseconds Per Op, Fewer Is Better)
  New Default - Generic Retpolines: 103.37 (SE +/- 0.87, N = 15)
  AMD Retpolines Former Default: 104.61 (SE +/- 0.70, N = 14)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Random Fill (MB/s, More Is Better)
  New Default - Generic Retpolines: 17.2 (SE +/- 0.15, N = 15)
  AMD Retpolines Former Default: 17.1 (SE +/- 0.22, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Random Fill (Microseconds Per Op, Fewer Is Better)
  New Default - Generic Retpolines: 103.22 (SE +/- 0.90, N = 15)
  AMD Retpolines Former Default: 103.68 (SE +/- 1.34, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Random Read (Microseconds Per Op, Fewer Is Better)
  AMD Retpolines Former Default: 10.55 (SE +/- 0.13, N = 3)
  New Default - Generic Retpolines: 10.58 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Seek Random (Microseconds Per Op, Fewer Is Better)
  AMD Retpolines Former Default: 17.01 (SE +/- 0.09, N = 3)
  New Default - Generic Retpolines: 17.14 (SE +/- 0.05, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Random Delete (Microseconds Per Op, Fewer Is Better)
  AMD Retpolines Former Default: 98.95 (SE +/- 1.08, N = 4)
  New Default - Generic Retpolines: 99.88 (SE +/- 0.42, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Sequential Fill (MB/s, More Is Better)
  New Default - Generic Retpolines: 16.9 (SE +/- 0.07, N = 3)
  AMD Retpolines Former Default: 16.5 (SE +/- 0.09, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB 1.22, Benchmark: Sequential Fill (Microseconds Per Op, Fewer Is Better)
  New Default - Generic Retpolines: 105.12 (SE +/- 0.43, N = 3)
  AMD Retpolines Former Default: 106.95 (SE +/- 0.61, N = 3)
  1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
SQLite Speedtest 3.30, Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  AMD Retpolines Former Default: 49.01 (SE +/- 0.05, N = 3)
  New Default - Generic Retpolines: 50.48 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -O2 -ldl -lz -lpthread
Apache Cassandra 4.0, Test: Writes (Op/s, More Is Better)
  New Default - Generic Retpolines: 86420 (SE +/- 31.48, N = 3)
  AMD Retpolines Former Default: 85956 (SE +/- 518.75, N = 3)
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 50 - Mode: Read Only (TPS, More Is Better)
  AMD Retpolines Former Default: 295730 (SE +/- 3223.57, N = 12)
  New Default - Generic Retpolines: 295179 (SE +/- 2460.01, N = 12)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
  AMD Retpolines Former Default: 0.169 (SE +/- 0.002, N = 12)
  New Default - Generic Retpolines: 0.170 (SE +/- 0.001, N = 12)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 100 - Mode: Read Only (TPS, More Is Better)
  New Default - Generic Retpolines: 302523 (SE +/- 3587.18, N = 4)
  AMD Retpolines Former Default: 302399 (SE +/- 303.18, N = 3)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
  AMD Retpolines Former Default: 0.330 (SE +/- 0.000, N = 3)
  New Default - Generic Retpolines: 0.331 (SE +/- 0.004, N = 4)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 100 - Mode: Read Write (TPS, More Is Better)
  AMD Retpolines Former Default: 10440 (SE +/- 108.62, N = 12)
  New Default - Generic Retpolines: 10383 (SE +/- 102.24, N = 12)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench 14.0, Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
  AMD Retpolines Former Default: 9.590 (SE +/- 0.099, N = 12)
  New Default - Generic Retpolines: 9.641 (SE +/- 0.094, N = 12)
  1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
ebizzy 0.3 (Records/s, More Is Better)
  New Default - Generic Retpolines: 619271 (SE +/- 5150.50, N = 15)
  AMD Retpolines Former Default: 596554 (SE +/- 6095.03, N = 15)
  1. (CC) gcc options: -pthread -lpthread -O3 -march=native
Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
  AMD Retpolines Former Default: 11.31 (SE +/- 0.03, N = 3)
  New Default - Generic Retpolines: 11.26 (SE +/- 0.05, N = 3)
Phoronix Test Suite v10.8.4