AMD EPYC 72F3 Retpoline benchmarks by Michael Larabel.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2203111-NE-EPYC72F3R09 EPYC 72F3 Retpolines Change In Linux 5.17 - Phoronix Test Suite EPYC 72F3 Retpolines Change In Linux 5.17 AMD EPYC 72F3 Retpoline benchmarks by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/2203111-NE-EPYC72F3R09&export=pdf&grt&rdt&rro .
EPYC 72F3 Retpolines Change In Linux 5.17 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution New Default - Generic Retpolines AMD Retpolines Former Default AMD EPYC 72F3 8-Core @ 3.70GHz (8 Cores / 16 Threads) Supermicro H12SSL-i v1.01 (2.0 BIOS) AMD Starship/Matisse 128GB 3841GB Micron_9300_MTFDHAL3T8TDP + 1000GB Corsair Force MP600 ASPEED VE228 2 x Broadcom NetXtreme BCM5720 2-port PCIe Ubuntu 21.04 5.17.0-051700rc7daily20220310-generic (x86_64) GNOME Shell 3.38.4 X Server GCC 10.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa001119 Java Details - OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.04) Python Details - Python 3.9.5 Security Details - New Default - Generic Retpolines: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - AMD Retpolines Former Default: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Vulnerable: LFENCE + srbds: Not affected + tsx_async_abort: Not affected
EPYC 72F3 Retpolines Change In Linux 5.17 compress-7zip: Compression Rating compress-7zip: Decompression Rating cassandra: Writes apache: 100 apache: 200 apache: 500 dacapobench: H2 dacapobench: Jython dacapobench: Tradesoap dacapobench: Tradebeans ebizzy: rocksdb: Rand Fill rocksdb: Rand Read rocksdb: Update Rand rocksdb: Seq Fill rocksdb: Read While Writing rocksdb: Read Rand Write Rand fio: Rand Read - IO_uring - No - Yes - 4KB - Default Test Directory fio: Rand Write - IO_uring - No - Yes - 4KB - Default Test Directory draco: Lion draco: Church Facade hackbench: 8 - Thread hackbench: 16 - Thread hackbench: 8 - Process hackbench: 16 - Process hackbench: 32 - Process influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 keydb: leveldb: Hot Read leveldb: Fill Sync leveldb: Overwrite leveldb: Overwrite leveldb: Rand Fill leveldb: Rand Fill leveldb: Rand Read leveldb: Seek Rand leveldb: Rand Delete leveldb: Seq Fill leveldb: Seq Fill compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed memtier-benchmark: Redis nginx: 20 nginx: 100 nginx: 200 node-express-loadtest: node-web-tooling: openssl: SHA256 openssl: RSA4096 openssl: RSA4096 osbench: Create Files osbench: Create Threads osbench: Launch Programs osbench: Create Processes compress-pbzip2: FreeBSD-13.0-RELEASE-amd64-memstick.img Compression pgbench: 100 - 50 - Read Only pgbench: 100 - 50 - Read Only - Average Latency pgbench: 100 - 100 - Read Only pgbench: 100 - 100 - Read Only - Average Latency pgbench: 100 - 100 - Read Write pgbench: 100 - 100 - Read Write - Average Latency postmark: Disk Transaction Performance redis: GET redis: SET renaissance: Finagle HTTP Requests renaissance: In-Memory Database Shootout sockperf: Throughput sockperf: Latency Under Load sqlite-speedtest: Timed Time - Size 1,000 stress-ng: SENDFILE stress-ng: Semaphores stress-ng: Socket Activity stress-ng: Context Switching stress-ng: System V Message Passing build-eigen: Time To Compile build-gdb: Time To Compile build-godot: Time To Compile build-linux-kernel: defconfig build-mplayer: Time To Compile wireguard: compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed New Default - Generic Retpolines AMD Retpolines Former Default 86357 69393 86420 82500.75 86362.18 80298.15 4268 3761 3660 5762 619271 605572 61331721 393101 657523 1734878 1391000 239667 223933 5088 6697 31.275 62.203 29.950 58.782 119.831 1326859.1 1638195.7 136793.59 10.537 1820.488 17.1 103.365 17.2 103.217 10.577 17.139 99.876 16.9 105.116 61.24 14455.1 59.57 14586.2 1324327.32 215287.79 215190.56 213684.40 6692 11.26 13238722860 2372.3 155003.1 14.706538 17.376741 49.362977 43.140253 10.784 295179 0.170 302523 0.331 10383 9.641 6944 2506720.25 1998825.92 2378.7 4788.2 539724 3.666 50.483 200771.29 1326767.92 10751.13 4296558.49 4521841.84 61.685 54.955 111.507 78.618 30.709 257.104 1343.6 4255.1 51.6 3694.5 87210 69308 85956 83397.57 86408.48 80970.15 4228 3697 3628 5634 596554 617250 61586232 398526 670760 1746524 1386586 243000 229500 5070 6649 31.046 62.029 28.828 58.356 117.984 1323460.2 1639910.0 142573.34 10.502 1806.759 16.9 104.608 17.1 103.680 10.545 17.011 98.953 16.5 106.946 61.36 14817.3 59.69 14807.4 1313054.11 210605.86 207615.65 206147.39 6910 11.31 13234833737 2374.1 155031.8 13.723014 17.703374 48.627853 43.362777 10.664 295730 0.169 302399 0.330 10440 9.590 7282 2438721.71 1995692.89 2366.9 4878.7 587523 3.641 49.007 193652.98 1342170.84 10567.09 4326244.50 4530010.92 61.098 55.032 111.580 78.591 30.802 248.756 1338.6 4160.1 51.9 3668.5 OpenBenchmarking.org
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Compression Rating AMD Retpolines Former Default New Default - Generic Retpolines 20K 40K 60K 80K 100K SE +/- 870.19, N = 5 SE +/- 438.70, N = 3 87210 86357 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Decompression Rating AMD Retpolines Former Default New Default - Generic Retpolines 15K 30K 45K 60K 75K SE +/- 59.62, N = 5 SE +/- 8.76, N = 3 69308 69393 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Writes AMD Retpolines Former Default New Default - Generic Retpolines 20K 40K 60K 80K 100K SE +/- 518.75, N = 3 SE +/- 31.48, N = 3 85956 86420
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 AMD Retpolines Former Default New Default - Generic Retpolines 20K 40K 60K 80K 100K SE +/- 311.80, N = 3 SE +/- 56.43, N = 3 83397.57 82500.75 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 AMD Retpolines Former Default New Default - Generic Retpolines 20K 40K 60K 80K 100K SE +/- 123.84, N = 3 SE +/- 557.73, N = 3 86408.48 86362.18 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 500 AMD Retpolines Former Default New Default - Generic Retpolines 20K 40K 60K 80K 100K SE +/- 149.43, N = 3 SE +/- 264.49, N = 3 80970.15 80298.15 1. (CC) gcc options: -shared -fPIC -O2 -pthread
DaCapo Benchmark Java Test: H2 OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: H2 AMD Retpolines Former Default New Default - Generic Retpolines 900 1800 2700 3600 4500 SE +/- 25.85, N = 16 SE +/- 26.89, N = 4 4228 4268
DaCapo Benchmark Java Test: Jython OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Jython AMD Retpolines Former Default New Default - Generic Retpolines 800 1600 2400 3200 4000 SE +/- 24.35, N = 20 SE +/- 28.58, N = 4 3697 3761
DaCapo Benchmark Java Test: Tradesoap OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradesoap AMD Retpolines Former Default New Default - Generic Retpolines 800 1600 2400 3200 4000 SE +/- 33.50, N = 4 SE +/- 40.14, N = 4 3628 3660
DaCapo Benchmark Java Test: Tradebeans OpenBenchmarking.org msec, Fewer Is Better DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans AMD Retpolines Former Default New Default - Generic Retpolines 1200 2400 3600 4800 6000 SE +/- 38.99, N = 4 SE +/- 34.00, N = 4 5634 5762
ebizzy OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 AMD Retpolines Former Default New Default - Generic Retpolines 130K 260K 390K 520K 650K SE +/- 6095.03, N = 15 SE +/- 5150.50, N = 15 596554 619271 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
Facebook RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Random Fill AMD Retpolines Former Default New Default - Generic Retpolines 130K 260K 390K 520K 650K SE +/- 3544.93, N = 3 SE +/- 5871.64, N = 3 617250 605572 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Random Read AMD Retpolines Former Default New Default - Generic Retpolines 13M 26M 39M 52M 65M SE +/- 78029.55, N = 3 SE +/- 195189.61, N = 3 61586232 61331721 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Update Random AMD Retpolines Former Default New Default - Generic Retpolines 90K 180K 270K 360K 450K SE +/- 1232.53, N = 3 SE +/- 2036.18, N = 3 398526 393101 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Sequential Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Sequential Fill AMD Retpolines Former Default New Default - Generic Retpolines 140K 280K 420K 560K 700K SE +/- 3478.74, N = 3 SE +/- 1947.52, N = 3 670760 657523 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Read While Writing AMD Retpolines Former Default New Default - Generic Retpolines 400K 800K 1200K 1600K 2000K SE +/- 20451.18, N = 4 SE +/- 24226.58, N = 3 1746524 1734878 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.22.1 Test: Read Random Write Random AMD Retpolines Former Default New Default - Generic Retpolines 300K 600K 900K 1200K 1500K SE +/- 6491.63, N = 3 SE +/- 2292.69, N = 3 1386586 1391000 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Flexible IO Tester Type: Random Read - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 3.29 Type: Random Read - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory AMD Retpolines Former Default New Default - Generic Retpolines 50K 100K 150K 200K 250K SE +/- 577.35, N = 3 SE +/- 333.33, N = 3 243000 239667 1. (CC) gcc options: -rdynamic -ltcmalloc -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
Flexible IO Tester Type: Random Write - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 3.29 Type: Random Write - Engine: IO_uring - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory AMD Retpolines Former Default New Default - Generic Retpolines 50K 100K 150K 200K 250K SE +/- 2958.04, N = 4 SE +/- 3027.60, N = 15 229500 223933 1. (CC) gcc options: -rdynamic -ltcmalloc -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Lion AMD Retpolines Former Default New Default - Generic Retpolines 1100 2200 3300 4400 5500 SE +/- 0.88, N = 3 SE +/- 7.37, N = 3 5070 5088 1. (CXX) g++ options: -O3
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Church Facade AMD Retpolines Former Default New Default - Generic Retpolines 1400 2800 4200 5600 7000 SE +/- 5.21, N = 3 SE +/- 17.74, N = 3 6649 6697 1. (CXX) g++ options: -O3
Hackbench Count: 8 - Type: Thread OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 8 - Type: Thread AMD Retpolines Former Default New Default - Generic Retpolines 7 14 21 28 35 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 31.05 31.28 1. (CC) gcc options: -lpthread
Hackbench Count: 16 - Type: Thread OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 16 - Type: Thread AMD Retpolines Former Default New Default - Generic Retpolines 14 28 42 56 70 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 62.03 62.20 1. (CC) gcc options: -lpthread
Hackbench Count: 8 - Type: Process OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 8 - Type: Process AMD Retpolines Former Default New Default - Generic Retpolines 7 14 21 28 35 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 28.83 29.95 1. (CC) gcc options: -lpthread
Hackbench Count: 16 - Type: Process OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 16 - Type: Process AMD Retpolines Former Default New Default - Generic Retpolines 13 26 39 52 65 SE +/- 0.23, N = 3 SE +/- 0.16, N = 3 58.36 58.78 1. (CC) gcc options: -lpthread
Hackbench Count: 32 - Type: Process OpenBenchmarking.org Seconds, Fewer Is Better Hackbench Count: 32 - Type: Process AMD Retpolines Former Default New Default - Generic Retpolines 30 60 90 120 150 SE +/- 0.59, N = 3 SE +/- 0.30, N = 3 117.98 119.83 1. (CC) gcc options: -lpthread
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 AMD Retpolines Former Default New Default - Generic Retpolines 300K 600K 900K 1200K 1500K SE +/- 2326.89, N = 3 SE +/- 1561.10, N = 3 1323460.2 1326859.1
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 AMD Retpolines Former Default New Default - Generic Retpolines 400K 800K 1200K 1600K 2000K SE +/- 1145.84, N = 3 SE +/- 3111.87, N = 3 1639910.0 1638195.7
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.2.0 AMD Retpolines Former Default New Default - Generic Retpolines 30K 60K 90K 120K 150K SE +/- 1055.00, N = 15 SE +/- 1054.95, N = 10 142573.34 136793.59 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
LevelDB Benchmark: Hot Read OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Hot Read AMD Retpolines Former Default New Default - Generic Retpolines 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.11, N = 6 10.50 10.54 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Fill Sync OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Fill Sync AMD Retpolines Former Default New Default - Generic Retpolines 400 800 1200 1600 2000 SE +/- 3.85, N = 3 SE +/- 8.19, N = 3 1806.76 1820.49 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Overwrite OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Overwrite AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.11, N = 14 SE +/- 0.15, N = 15 16.9 17.1 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Overwrite OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Overwrite AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 0.70, N = 14 SE +/- 0.87, N = 15 104.61 103.37 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Random Fill OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Random Fill AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.22, N = 3 SE +/- 0.15, N = 15 17.1 17.2 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Random Fill OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Fill AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 1.34, N = 3 SE +/- 0.90, N = 15 103.68 103.22 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Random Read OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Read AMD Retpolines Former Default New Default - Generic Retpolines 3 6 9 12 15 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 10.55 10.58 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Seek Random OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Seek Random AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.05, N = 3 17.01 17.14 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Random Delete OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Delete AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 1.08, N = 4 SE +/- 0.42, N = 3 98.95 99.88 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Sequential Fill OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Sequential Fill AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 16.5 16.9 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LevelDB Benchmark: Sequential Fill OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Sequential Fill AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.43, N = 3 106.95 105.12 1. (CXX) g++ options: -O3 -lsnappy -ltcmalloc -lpthread
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed AMD Retpolines Former Default New Default - Generic Retpolines 14 28 42 56 70 SE +/- 0.30, N = 3 SE +/- 0.47, N = 15 61.36 61.24 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed AMD Retpolines Former Default New Default - Generic Retpolines 3K 6K 9K 12K 15K SE +/- 22.12, N = 3 SE +/- 24.84, N = 15 14817.3 14455.1 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed AMD Retpolines Former Default New Default - Generic Retpolines 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.63, N = 3 59.69 59.57 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed AMD Retpolines Former Default New Default - Generic Retpolines 3K 6K 9K 12K 15K SE +/- 61.43, N = 3 SE +/- 17.39, N = 3 14807.4 14586.2 1. (CC) gcc options: -O3
Memtier_benchmark Protocol: Redis OpenBenchmarking.org Ops/sec, More Is Better Memtier_benchmark 1.3 Protocol: Redis AMD Retpolines Former Default New Default - Generic Retpolines 300K 600K 900K 1200K 1500K SE +/- 1363.28, N = 3 SE +/- 3316.11, N = 3 1313054.11 1324327.32 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
nginx Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 20 AMD Retpolines Former Default New Default - Generic Retpolines 50K 100K 150K 200K 250K SE +/- 247.73, N = 3 SE +/- 627.93, N = 3 210605.86 215287.79 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 100 AMD Retpolines Former Default New Default - Generic Retpolines 50K 100K 150K 200K 250K SE +/- 369.20, N = 3 SE +/- 1627.81, N = 3 207615.65 215190.56 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 200 AMD Retpolines Former Default New Default - Generic Retpolines 50K 100K 150K 200K 250K SE +/- 466.89, N = 3 SE +/- 275.98, N = 3 206147.39 213684.40 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Node.js Express HTTP Load Test OpenBenchmarking.org Requests Per Second, More Is Better Node.js Express HTTP Load Test AMD Retpolines Former Default New Default - Generic Retpolines 1500 3000 4500 6000 7500 SE +/- 43.97, N = 3 SE +/- 34.82, N = 3 6910 6692
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark AMD Retpolines Former Default New Default - Generic Retpolines 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 11.31 11.26
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.0 Algorithm: SHA256 AMD Retpolines Former Default New Default - Generic Retpolines 3000M 6000M 9000M 12000M 15000M SE +/- 9115647.24, N = 3 SE +/- 6459514.72, N = 3 13234833737 13238722860 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 AMD Retpolines Former Default New Default - Generic Retpolines 500 1000 1500 2000 2500 SE +/- 0.15, N = 3 SE +/- 2.17, N = 3 2374.1 2372.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 AMD Retpolines Former Default New Default - Generic Retpolines 30K 60K 90K 120K 150K SE +/- 8.58, N = 3 SE +/- 7.94, N = 3 155031.8 155003.1 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OSBench Test: Create Files OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Files AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 13.72 14.71 1. (CC) gcc options: -lm
OSBench Test: Create Threads OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Threads AMD Retpolines Former Default New Default - Generic Retpolines 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 17.70 17.38 1. (CC) gcc options: -lm
OSBench Test: Launch Programs OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Launch Programs AMD Retpolines Former Default New Default - Generic Retpolines 11 22 33 44 55 SE +/- 0.53, N = 5 SE +/- 0.10, N = 3 48.63 49.36 1. (CC) gcc options: -lm
OSBench Test: Create Processes OpenBenchmarking.org us Per Event, Fewer Is Better OSBench Test: Create Processes AMD Retpolines Former Default New Default - Generic Retpolines 10 20 30 40 50 SE +/- 0.54, N = 3 SE +/- 0.23, N = 3 43.36 43.14 1. (CC) gcc options: -lm
Parallel BZIP2 Compression FreeBSD-13.0-RELEASE-amd64-memstick.img Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.1.13 FreeBSD-13.0-RELEASE-amd64-memstick.img Compression AMD Retpolines Former Default New Default - Generic Retpolines 3 6 9 12 15 SE +/- 0.15, N = 3 SE +/- 0.11, N = 6 10.66 10.78 1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only AMD Retpolines Former Default New Default - Generic Retpolines 60K 120K 180K 240K 300K SE +/- 3223.57, N = 12 SE +/- 2460.01, N = 12 295730 295179 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency AMD Retpolines Former Default New Default - Generic Retpolines 0.0383 0.0766 0.1149 0.1532 0.1915 SE +/- 0.002, N = 12 SE +/- 0.001, N = 12 0.169 0.170 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only AMD Retpolines Former Default New Default - Generic Retpolines 60K 120K 180K 240K 300K SE +/- 303.18, N = 3 SE +/- 3587.18, N = 4 302399 302523 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency AMD Retpolines Former Default New Default - Generic Retpolines 0.0745 0.149 0.2235 0.298 0.3725 SE +/- 0.000, N = 3 SE +/- 0.004, N = 4 0.330 0.331 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Write AMD Retpolines Former Default New Default - Generic Retpolines 2K 4K 6K 8K 10K SE +/- 108.62, N = 12 SE +/- 102.24, N = 12 10440 10383 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency AMD Retpolines Former Default New Default - Generic Retpolines 3 6 9 12 15 SE +/- 0.099, N = 12 SE +/- 0.094, N = 12 9.590 9.641 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
PostMark Disk Transaction Performance OpenBenchmarking.org TPS, More Is Better PostMark 1.51 Disk Transaction Performance AMD Retpolines Former Default New Default - Generic Retpolines 1600 3200 4800 6400 8000 SE +/- 70.00, N = 3 7282 6944 1. (CC) gcc options: -O3
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET AMD Retpolines Former Default New Default - Generic Retpolines 500K 1000K 1500K 2000K 2500K SE +/- 22769.87, N = 6 SE +/- 15897.48, N = 3 2438721.71 2506720.25 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET AMD Retpolines Former Default New Default - Generic Retpolines 400K 800K 1200K 1600K 2000K SE +/- 18125.41, N = 7 SE +/- 2446.32, N = 3 1995692.89 1998825.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Renaissance Test: Finagle HTTP Requests OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.12 Test: Finagle HTTP Requests AMD Retpolines Former Default New Default - Generic Retpolines 500 1000 1500 2000 2500 SE +/- 15.84, N = 3 SE +/- 24.88, N = 3 2366.9 2378.7 MIN: 2178.29 / MAX: 2434.21 MIN: 2177.37 / MAX: 2449.07
Renaissance Test: In-Memory Database Shootout OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.12 Test: In-Memory Database Shootout AMD Retpolines Former Default New Default - Generic Retpolines 1000 2000 3000 4000 5000 SE +/- 64.75, N = 3 SE +/- 22.81, N = 3 4878.7 4788.2 MIN: 4512.55 / MAX: 5542.48 MIN: 4452.21 / MAX: 5160.31
Sockperf Test: Throughput OpenBenchmarking.org Messages Per Second, More Is Better Sockperf 3.7 Test: Throughput AMD Retpolines Former Default New Default - Generic Retpolines 130K 260K 390K 520K 650K SE +/- 5255.38, N = 25 SE +/- 4906.55, N = 25 587523 539724 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.7 Test: Latency Under Load AMD Retpolines Former Default New Default - Generic Retpolines 0.8249 1.6498 2.4747 3.2996 4.1245 SE +/- 0.033, N = 25 SE +/- 0.035, N = 5 3.641 3.666 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 AMD Retpolines Former Default New Default - Generic Retpolines 11 22 33 44 55 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 49.01 50.48 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: SENDFILE AMD Retpolines Former Default New Default - Generic Retpolines 40K 80K 120K 160K 200K SE +/- 443.51, N = 3 SE +/- 510.20, N = 3 193652.98 200771.29 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Semaphores AMD Retpolines Former Default New Default - Generic Retpolines 300K 600K 900K 1200K 1500K SE +/- 10886.78, N = 3 SE +/- 4542.73, N = 3 1342170.84 1326767.92 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Socket Activity AMD Retpolines Former Default New Default - Generic Retpolines 2K 4K 6K 8K 10K SE +/- 13.84, N = 3 SE +/- 12.31, N = 3 10567.09 10751.13 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: Context Switching AMD Retpolines Former Default New Default - Generic Retpolines 900K 1800K 2700K 3600K 4500K SE +/- 26201.07, N = 3 SE +/- 18653.91, N = 3 4326244.50 4296558.49 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.13.02 Test: System V Message Passing AMD Retpolines Former Default New Default - Generic Retpolines 1000K 2000K 3000K 4000K 5000K SE +/- 10539.69, N = 3 SE +/- 16005.42, N = 3 4530010.92 4521841.84 1. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lsctp -lz -ldl -pthread -lc -latomic
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.3.9 Time To Compile AMD Retpolines Former Default New Default - Generic Retpolines 14 28 42 56 70 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 61.10 61.69
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile AMD Retpolines Former Default New Default - Generic Retpolines 12 24 36 48 60 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 55.03 54.96
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.22, N = 3 111.58 111.51
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.16 Build: defconfig AMD Retpolines Former Default New Default - Generic Retpolines 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 78.59 78.62
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.5 Time To Compile AMD Retpolines Former Default New Default - Generic Retpolines 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 30.80 30.71
WireGuard + Linux Networking Stack Stress Test OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test AMD Retpolines Former Default New Default - Generic Retpolines 60 120 180 240 300 SE +/- 0.99, N = 3 SE +/- 1.39, N = 3 248.76 257.10
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed AMD Retpolines Former Default New Default - Generic Retpolines 300 600 900 1200 1500 SE +/- 10.10, N = 3 SE +/- 3.86, N = 3 1338.6 1343.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed AMD Retpolines Former Default New Default - Generic Retpolines 900 1800 2700 3600 4500 SE +/- 4.29, N = 3 SE +/- 54.59, N = 3 4160.1 4255.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed AMD Retpolines Former Default New Default - Generic Retpolines 12 24 36 48 60 SE +/- 0.45, N = 15 SE +/- 0.47, N = 15 51.9 51.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed AMD Retpolines Former Default New Default - Generic Retpolines 800 1600 2400 3200 4000 SE +/- 14.48, N = 15 SE +/- 18.20, N = 15 3668.5 3694.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
Phoronix Test Suite v10.8.4