cur

AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.7 BIOS) and NVIDIA GeForce RTX 3080 10GB on EndeavourOS rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2401261-NE-CUR11415260&rdt.

curProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GBAMD EPYC 7R13 48-CoreAMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads)Supermicro H12SSL-I v1.02 (2.7 BIOS)AMD Starship/Matisse256GB15363GB Micron_7450_MTFDKCC15T3TFRNVIDIA GeForce RTX 3080 10GBNVIDIA GA102 HD Audio38GN9502 x Intel X710 for 10GbE SFP+EndeavourOS rolling6.7.0-zen3-1-zen (x86_64)Xfce 4.18X Server 1.21.1.11NVIDIA 545.29.064.6.0GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3btrfs3840x1600OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysEnvironment Details- NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"Compiler Details- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details- Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d1Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

currocksdb: Rand Fillrocksdb: Rand Readrocksdb: Update Randrocksdb: Seq Fillrocksdb: Rand Fill Syncrocksdb: Read While Writingrocksdb: Read Rand Write Randonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUpovray: Trace Timememcached: 1:1memcached: 1:5memcached: 5:1memcached: 1:10memcached: 1:100redis: GET - 50redis: SET - 50redis: GET - 500redis: LPOP - 50redis: SADD - 50redis: SET - 500redis: GET - 1000redis: LPOP - 500redis: LPUSH - 50redis: SADD - 500redis: SET - 1000redis: LPOP - 1000redis: LPUSH - 500redis: SADD - 1000redis: LPUSH - 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GBAMD EPYC 7R13 48-Core496967223375461447591470790478763589756129627591402255.553892133.86859655.414933024.305095243.463291152.252296166.752652247.583346418.752450984.922040798.232523813.883032015.831995175.852293241.022092101.72529330.971743815.392343177.001797052.211.793403.903281.0097711.52753.006911195.24686.39610.652OpenBenchmarking.org

RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Random FillAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB110K220K330K440K550KSE +/- 1048.18, N = 34969671. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Random ReadAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB50M100M150M200M250MSE +/- 214809.66, N = 32233754611. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Update RandomAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB100K200K300K400K500KSE +/- 4418.47, N = 34475911. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Sequential Fill

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Sequential FillAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB100K200K300K400K500KSE +/- 1051.04, N = 34707901. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Random Fill Sync

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Random Fill SyncAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB100K200K300K400K500KSE +/- 369.08, N = 34787631. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read While WritingAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB1.3M2.6M3.9M5.2M6.5MSE +/- 22994.63, N = 358975611. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read Random Write RandomAMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB600K1200K1800K2400K3000KSE +/- 20667.77, N = 329627591. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core0.40350.8071.21051.6142.0175SE +/- 0.02416, N = 151.79340MIN: 1.341. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core0.87821.75642.63463.51284.391SE +/- 0.02463, N = 33.90328MIN: 3.531. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core0.22720.45440.68160.90881.136SE +/- 0.00838, N = 31.00977MIN: 0.931. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core3691215SE +/- 0.09, N = 1511.53MIN: 9.631. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core0.67661.35322.02982.70643.383SE +/- 0.01842, N = 33.00691MIN: 1.881. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core30060090012001500SE +/- 25.92, N = 151195.24MIN: 1032.551. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUAMD EPYC 7R13 48-Core150300450600750SE +/- 1.94, N = 3686.40MIN: 630.011. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeAMD EPYC 7R13 48-Core3691215SE +/- 0.02, N = 310.651. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Memcached

Set To Get Ratio: 1:1

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:1AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB300K600K900K1200K1500KSE +/- 5721.30, N = 31402255.551. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:5

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB800K1600K2400K3200K4000KSE +/- 15518.47, N = 33892133.861. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 5:1

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 5:1AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB200K400K600K800K1000KSE +/- 2839.29, N = 3859655.411. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB1.1M2.2M3.3M4.4M5.5MSE +/- 9855.19, N = 34933024.301. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB1.1M2.2M3.3M4.4M5.5MSE +/- 8768.35, N = 35095243.461. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis

Test: GET - Parallel Connections: 50

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 50AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB700K1400K2100K2800K3500KSE +/- 38278.62, N = 33291152.251. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET - Parallel Connections: 50

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 50AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 20591.72, N = 32296166.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET - Parallel Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 500AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB600K1200K1800K2400K3000KSE +/- 23868.25, N = 32652247.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP - Parallel Connections: 50

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPOP - Parallel Connections: 50AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB700K1400K2100K2800K3500KSE +/- 12011.34, N = 33346418.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD - Parallel Connections: 50

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SADD - Parallel Connections: 50AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 12288.07, N = 32450984.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET - Parallel Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 500AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB400K800K1200K1600K2000KSE +/- 34542.79, N = 152040798.231. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET - Parallel Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 50493.06, N = 152523813.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP - Parallel Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPOP - Parallel Connections: 500AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB600K1200K1800K2400K3000KSE +/- 13897.94, N = 33032015.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH - Parallel Connections: 50

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPUSH - Parallel Connections: 50AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB400K800K1200K1600K2000KSE +/- 23682.92, N = 41995175.851. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD - Parallel Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SADD - Parallel Connections: 500AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 36861.44, N = 122293241.021. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET - Parallel Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB400K800K1200K1600K2000KSE +/- 4868.26, N = 32092101.71. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP - Parallel Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPOP - Parallel Connections: 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 189107.44, N = 122529330.971. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH - Parallel Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPUSH - Parallel Connections: 500AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB400K800K1200K1600K2000KSE +/- 21242.42, N = 151743815.391. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD - Parallel Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SADD - Parallel Connections: 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB500K1000K1500K2000K2500KSE +/- 17528.59, N = 32343177.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH - Parallel Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: LPUSH - Parallel Connections: 1000AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 3080 10GB400K800K1200K1600K2000KSE +/- 16752.86, N = 31797052.211. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3


Phoronix Test Suite v10.8.4