cur AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.7 BIOS) and NVIDIA GeForce RTX 3080 10GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2401217-NE-CUR12004850&sro&grs
cur
Result identifiers: AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB; AMD EPYC 7R13 48-Core

Processor: AMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads)
Motherboard: Supermicro H12SSL-I v1.02 (2.7 BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 15363GB Micron_7450_MTFDKCC15T3TFR
Graphics: NVIDIA GeForce RTX 3080 10GB
Audio: NVIDIA GA102 HD Audio
Monitor: 38GN950
Network: 2 x Intel X710 for 10GbE SFP+
OS: EndeavourOS rolling
Kernel: 6.7.0-zen3-1-zen (x86_64)
Desktop: Xfce 4.18
Display Server: X Server 1.21.1.11
Display Driver: NVIDIA 545.29.06
OpenGL: 4.6.0
Compiler: GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3
File-System: btrfs
Screen Resolution: 3840x1600

Kernel Details - Transparent Huge Pages: always
Environment Details - NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"
Compiler Details - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
Processor Details - Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d1
Security Details:
  gather_data_sampling: Not affected
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  mmio_stale_data: Not affected
  retbleed: Not affected
  spec_rstack_overflow: Mitigation of Safe RET
  spec_store_bypass: Mitigation of SSB disabled via prctl
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
  srbds: Not affected
  tsx_async_abort: Not affected
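The Security Details entries use the same names the Linux kernel exposes under sysfs. To compare another machine against this list, a minimal sketch along the following lines may help. It assumes the standard sysfs directory `/sys/devices/system/cpu/vulnerabilities`, which the export itself does not name, and the function name `read_mitigations` is hypothetical. The kernel's wording can differ slightly from the export's (for example "Mitigation: ..." rather than "Mitigation of ...").

```python
from pathlib import Path

# Assumption: standard Linux sysfs location; not stated in the export itself.
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_mitigations(vuln_dir: Path = VULN_DIR) -> dict[str, str]:
    """Return {vulnerability name: kernel-reported status}.

    Returns an empty dict when the directory does not exist
    (for example on non-Linux systems).
    """
    if not vuln_dir.is_dir():
        return {}
    return {
        entry.name: entry.read_text().strip()
        for entry in sorted(vuln_dir.iterdir())
        if entry.is_file()
    }


if __name__ == "__main__":
    # Print one "name: status" line per vulnerability entry.
    for name, status in read_mitigations().items():
        print(f"{name}: {status}")
```

Passing a different directory to `read_mitigations` makes the helper usable against a copied snapshot of the sysfs files as well.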
cur
Columns: (A) AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB; (B) AMD EPYC 7R13 48-Core

Test                                                    A             B
memcached: 1:100                                        5095243.46    -
memcached: 1:10                                         4933024.30    -
memcached: 5:1                                          859655.41     -
memcached: 1:5                                          3892133.86    -
memcached: 1:1                                          1402255.55    -
povray: Trace Time                                      -             10.652
onednn: Recurrent Neural Network Inference - f32 - CPU  -             686.396
onednn: Deconvolution Batch shapes_3d - f32 - CPU       -             3.00691
onednn: Deconvolution Batch shapes_1d - f32 - CPU       -             11.5275
onednn: Convolution Batch Shapes Auto - f32 - CPU       -             1.00977
onednn: IP Shapes 3D - f32 - CPU                        -             3.90328
onednn: IP Shapes 1D - f32 - CPU                        -             1.79340
rocksdb: Read Rand Write Rand                           2962759       -
rocksdb: Read While Writing                             5897561       -
rocksdb: Rand Fill Sync                                 478763        -
rocksdb: Seq Fill                                       470790        -
rocksdb: Update Rand                                    447591        -
rocksdb: Rand Read                                      223375461     -
rocksdb: Rand Fill                                      496967        -
onednn: Recurrent Neural Network Training - f32 - CPU   -             1195.24
openvino: Face Detection FP16 - CPU                     -             -
Memcached 1.6.19 (Ops/sec, More Is Better)
Result identifier: AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB
Set To Get Ratio: 1:100    5095243.46 (SE +/- 8768.35, N = 3)
Set To Get Ratio: 1:10     4933024.30 (SE +/- 9855.19, N = 3)
Set To Get Ratio: 5:1      859655.41 (SE +/- 2839.29, N = 3)
Set To Get Ratio: 1:5      3892133.86 (SE +/- 15518.47, N = 3)
Set To Get Ratio: 1:1      1402255.55 (SE +/- 5721.30, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
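Each Memcached result is reported as a mean with a standard error (SE) over N = 3 runs. To judge run-to-run noise on a common scale, the SE can be expressed as a percentage of the mean. The sketch below does that arithmetic on the figures copied from the Memcached results above. The names `MEMCACHED_RESULTS` and `relative_standard_error` are illustrative only and are not part of the Phoronix Test Suite.

```python
# (mean ops/sec, standard error) per Set To Get Ratio, copied from the
# Memcached results in this export.
MEMCACHED_RESULTS = {
    "1:100": (5095243.46, 8768.35),
    "1:10": (4933024.30, 9855.19),
    "5:1": (859655.41, 2839.29),
    "1:5": (3892133.86, 15518.47),
    "1:1": (1402255.55, 5721.30),
}


def relative_standard_error(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


if __name__ == "__main__":
    for ratio, (mean, se) in MEMCACHED_RESULTS.items():
        rse = relative_standard_error(mean, se)
        print(f"Set To Get Ratio {ratio:>5}: SE is {rse:.2f}% of the mean")
```

For these five results the SE stays below half a percent of the mean in every case.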
POV-Ray 3.7.0.7 (Seconds, Fewer Is Better)
Result identifier: AMD EPYC 7R13 48-Core
Trace Time    10.65 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
oneDNN 3.3 (ms, Fewer Is Better)
Result identifier: AMD EPYC 7R13 48-Core
Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU    686.40 (SE +/- 1.94, N = 3; MIN: 630.01)
Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU         3.00691 (SE +/- 0.01842, N = 3; MIN: 1.88)
Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU         11.53 (SE +/- 0.09, N = 15; MIN: 9.63)
Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU         1.00977 (SE +/- 0.00838, N = 3; MIN: 0.93)
Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU                          3.90328 (SE +/- 0.02463, N = 3; MIN: 3.53)
Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU                          1.79340 (SE +/- 0.02416, N = 15; MIN: 1.34)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
RocksDB 8.0 (Op/s, More Is Better)
Result identifier: AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB
Test: Read Random Write Random    2962759 (SE +/- 20667.77, N = 3)
Test: Read While Writing          5897561 (SE +/- 22994.63, N = 3)
Test: Random Fill Sync            478763 (SE +/- 369.08, N = 3)
Test: Sequential Fill             470790 (SE +/- 1051.04, N = 3)
Test: Update Random               447591 (SE +/- 4418.47, N = 3)
Test: Random Read                 223375461 (SE +/- 214809.66, N = 3)
Test: Random Fill                 496967 (SE +/- 1048.18, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
oneDNN 3.3 (ms, Fewer Is Better)
Result identifier: AMD EPYC 7R13 48-Core
Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU    1195.24 (SE +/- 25.92, N = 15; MIN: 1032.55)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Phoronix Test Suite v10.8.5