cur AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.7 BIOS) and NVIDIA GeForce RTX 3080 10GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2401261-NE-CUR11415260&grw&sor&rro .
cur Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB AMD EPYC 7R13 48-Core AMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads) Supermicro H12SSL-I v1.02 (2.7 BIOS) AMD Starship/Matisse 256GB 15363GB Micron_7450_MTFDKCC15T3TFR NVIDIA GeForce RTX 3080 10GB NVIDIA GA102 HD Audio 38GN950 2 x Intel X710 for 10GbE SFP+ EndeavourOS rolling 6.7.0-zen3-1-zen (x86_64) Xfce 4.18 X Server 1.21.1.11 NVIDIA 545.29.06 4.6.0 GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3 btrfs 3840x1600 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: always Environment Details - NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin" Compiler Details - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details - Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d1 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
cur onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU povray: Trace Time memcached: 1:1 memcached: 1:5 memcached: 5:1 memcached: 1:10 memcached: 1:100 redis: GET - 50 redis: SET - 50 redis: GET - 500 redis: LPOP - 50 redis: SADD - 50 redis: SET - 500 redis: GET - 1000 redis: LPOP - 500 redis: LPUSH - 50 redis: SADD - 500 redis: SET - 1000 redis: LPOP - 1000 redis: LPUSH - 500 redis: SADD - 1000 redis: LPUSH - 1000 rocksdb: Rand Fill rocksdb: Rand Read rocksdb: Update Rand rocksdb: Seq Fill rocksdb: Rand Fill Sync rocksdb: Read While Writing rocksdb: Read Rand Write Rand AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB AMD EPYC 7R13 48-Core 1402255.55 3892133.86 859655.41 4933024.30 5095243.46 3291152.25 2296166.75 2652247.58 3346418.75 2450984.92 2040798.23 2523813.88 3032015.83 1995175.85 2293241.02 2092101.7 2529330.97 1743815.39 2343177.00 1797052.21 496967 223375461 447591 470790 478763 5897561 2962759 1.79340 3.90328 1.00977 11.5275 3.00691 1195.24 686.396 10.652 OpenBenchmarking.org
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 0.4035 0.807 1.2105 1.614 2.0175 SE +/- 0.02416, N = 15 1.79340 MIN: 1.34 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 0.8782 1.7564 2.6346 3.5128 4.391 SE +/- 0.02463, N = 3 3.90328 MIN: 3.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 0.2272 0.4544 0.6816 0.9088 1.136 SE +/- 0.00838, N = 3 1.00977 MIN: 0.93 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 3 6 9 12 15 SE +/- 0.09, N = 15 11.53 MIN: 9.63 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 0.6766 1.3532 2.0298 2.7064 3.383 SE +/- 0.01842, N = 3 3.00691 MIN: 1.88 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 300 600 900 1200 1500 SE +/- 25.92, N = 15 1195.24 MIN: 1032.55 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU AMD EPYC 7R13 48-Core 150 300 450 600 750 SE +/- 1.94, N = 3 686.40 MIN: 630.01 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time AMD EPYC 7R13 48-Core 3 6 9 12 15 SE +/- 0.02, N = 3 10.65 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Memcached Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:1 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 300K 600K 900K 1200K 1500K SE +/- 5721.30, N = 3 1402255.55 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:5 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 800K 1600K 2400K 3200K 4000K SE +/- 15518.47, N = 3 3892133.86 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 5:1 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 200K 400K 600K 800K 1000K SE +/- 2839.29, N = 3 859655.41 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 9855.19, N = 3 4933024.30 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 8768.35, N = 3 5095243.46 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis Test: GET - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 50 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 700K 1400K 2100K 2800K 3500K SE +/- 38278.62, N = 3 3291152.25 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 50 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 20591.72, N = 3 2296166.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 500 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 600K 1200K 1800K 2400K 3000K SE +/- 23868.25, N = 3 2652247.58 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPOP - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPOP - Parallel Connections: 50 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 700K 1400K 2100K 2800K 3500K SE +/- 12011.34, N = 3 3346418.75 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SADD - Parallel Connections: 50 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 12288.07, N = 3 2450984.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 500 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 400K 800K 1200K 1600K 2000K SE +/- 34542.79, N = 15 2040798.23 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET - Parallel Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 1000 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 50493.06, N = 15 2523813.88 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPOP - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPOP - Parallel Connections: 500 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 600K 1200K 1800K 2400K 3000K SE +/- 13897.94, N = 3 3032015.83 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPUSH - Parallel Connections: 50 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 400K 800K 1200K 1600K 2000K SE +/- 23682.92, N = 4 1995175.85 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SADD - Parallel Connections: 500 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 36861.44, N = 12 2293241.02 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET - Parallel Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 1000 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 400K 800K 1200K 1600K 2000K SE +/- 4868.26, N = 3 2092101.7 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPOP - Parallel Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPOP - Parallel Connections: 1000 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 189107.44, N = 12 2529330.97 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPUSH - Parallel Connections: 500 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 400K 800K 1200K 1600K 2000K SE +/- 21242.42, N = 15 1743815.39 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SADD - Parallel Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SADD - Parallel Connections: 1000 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 500K 1000K 1500K 2000K 2500K SE +/- 17528.59, N = 3 2343177.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: LPUSH - Parallel Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: LPUSH - Parallel Connections: 1000 AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 400K 800K 1200K 1600K 2000K SE +/- 16752.86, N = 3 1797052.21 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Fill AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 110K 220K 330K 440K 550K SE +/- 1048.18, N = 3 496967 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Read AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 50M 100M 150M 200M 250M SE +/- 214809.66, N = 3 223375461 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 100K 200K 300K 400K 500K SE +/- 4418.47, N = 3 447591 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Sequential Fill OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Sequential Fill AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 100K 200K 300K 400K 500K SE +/- 1051.04, N = 3 470790 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Random Fill Sync OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Fill Sync AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 100K 200K 300K 400K 500K SE +/- 369.08, N = 3 478763 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read While Writing AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 22994.63, N = 3 5897561 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB 600K 1200K 1800K 2400K 3000K SE +/- 20667.77, N = 3 2962759 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Phoronix Test Suite v10.8.5