cpuall_v2

AMD EPYC 7413 24-Core testing with a GIGABYTE MZ32-AR0-00 v01000100 (M18 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 on Rocky Linux 9.3 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2403189-NE-2403174NE64&sro&grw.

cpuall_v2ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelOpenCLCompilerFile-SystemScreen ResolutionDesktopDisplay Server6000_4ch_ll4090x2Intel 0000% @ 3.30GHz (48 Cores / 96 Threads)ASUS Pro WS W790E-SAGE SE (0215 BIOS)Intel Alder Lake-S PCH64GB4001GB CT4000P3SSD8 + 0GB Virtual HDisk0ASPEEDRealtek ALC12202 x Intel X710 for 10GBASE-TFedora 396.7.7-200.fc39.x86_64 (x86_64)OpenCL 3.0 + OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3GCC 13.2.1 20231205 + Clang 17.0.6 + LLVM 17.0.6xfs1920x1200AMD EPYC 7413 24-Core @ 2.65GHz (24 Cores)GIGABYTE MZ32-AR0-00 v01000100 (M18 BIOS)AMD Starship/Matisse6 x 16 GB DDR4-2667MT/s 18ASF2G72PZ-2G6D2960GB INTEL SSDPE21D960GA + 2 x 1600GB Toshiba KXG50PNV2T04 + 4001GB Nextorage SSD NE1N4TB + 3 x 59GB INTEL SSDPEK1A058GAGigabyte NVIDIA GeForce RTX 4090NVIDIA AD102 HD AudioAquantia AQC107 NBase-T/IEEE + Mellanox MT27500Rocky Linux 9.35.14.0-362.24.1.el9_3.x86_64 (x86_64)GNOME Shell 40.10X Server 1.20.11GCC 11.4.1 20230605 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.31024x768OpenBenchmarking.orgKernel Details- 6000_4ch_ll: Transparent Huge Pages: madvise- 4090x2: Transparent Huge Pages: alwaysCompiler Details- 6000_4ch_ll: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - 4090x2: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Disk Details- NONE / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Details- 6000_4ch_ll: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xd0004b1- 4090x2: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Security Details- 6000_4ch_ll: SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected - 4090x2: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

cpuall_v2intel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writesintel-mlc: Max Bandwidth - Stream-Triad Likestream: Copyintel-mlc: Max Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - 1:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - Stream-Triad Likeintel-mlc: Max Bandwidth - 3:1 Reads-Writesstream: Scaleintel-mlc: Idle Latencystream: Triadstream: Addcachebench: Readcachebench: Writecachebench: Read / Modify / Writeintel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - All Readsintel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - All Readsfio: Rand Read - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 32 - Default Test Directorywhisper-cpp: ggml-medium.en - 2016 State of the Unionbuild-linux-kernel: defconfig6000_4ch_ll4090x297387.895035.2790064.093920.6688062.6595038.598042.6489640.2114.794108.693836.812582.59535485773.38262593157.94456992578.4113954.6587976.7113973.357.71477157.51472038999667389996201652.1643351.72731776.032011.5532574.730554.8929284.5032055.032025.2321120.991.523746.523560.99190.90335751622.472353102381.01713930510.838822.9329207.138812.030.6784442.21079825364667253649671218.0726572.107OpenBenchmarking.org

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 3:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 3:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 19.44, N = 3SE +/- 30.53, N = 331776.097387.8

Intel Memory Latency Checker

Test: Max Bandwidth - Stream-Triad Like

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - Stream-Triad Like4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.09, N = 3SE +/- 8.34, N = 332011.5595035.27

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy4090x26000_4ch_ll20K40K60K80K100KSE +/- 31.85, N = 5SE +/- 16.98, N = 532574.790064.01. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Intel Memory Latency Checker

Test: Max Bandwidth - 2:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 2:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.39, N = 3SE +/- 7.15, N = 330554.8993920.66

Intel Memory Latency Checker

Test: Max Bandwidth - 1:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 1:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 19.96, N = 3SE +/- 4.74, N = 329284.5088062.65

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - Stream-Triad Like

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - Stream-Triad Like4090x26000_4ch_ll20K40K60K80K100KSE +/- 16.97, N = 3SE +/- 8.22, N = 332055.095038.5

Intel Memory Latency Checker

Test: Max Bandwidth - 3:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 3:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 27.70, N = 3SE +/- 47.67, N = 332025.2398042.64

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale4090x26000_4ch_ll20K40K60K80K100KSE +/- 16.22, N = 5SE +/- 63.75, N = 521120.989640.21. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Intel Memory Latency Checker

Test: Idle Latency

OpenBenchmarking.orgns, Fewer Is BetterIntel Memory Latency Checker 3.10Test: Idle Latency4090x26000_4ch_ll306090120150SE +/- 0.03, N = 3SE +/- 0.37, N = 391.5114.7

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad4090x26000_4ch_ll20K40K60K80K100KSE +/- 14.55, N = 5SE +/- 12.86, N = 523746.594108.61. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add4090x26000_4ch_ll20K40K60K80K100KSE +/- 17.00, N = 5SE +/- 26.62, N = 523560.993836.81. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read4090x26000_4ch_ll3K6K9K12K15KSE +/- 6.47, N = 3SE +/- 0.15, N = 39190.9012582.60MIN: 9160.88 / MAX: 9203.86MIN: 12577.1 / MAX: 12583.241. (CC) gcc options: -O3 -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Write4090x26000_4ch_ll20K40K60K80K100KSE +/- 41.10, N = 3SE +/- 23.70, N = 351622.4785773.38MIN: 39519.8 / MAX: 54896.66MIN: 51047.3 / MAX: 97997.281. (CC) gcc options: -O3 -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / Write4090x26000_4ch_ll20K40K60K80K100KSE +/- 367.10, N = 3SE +/- 6.98, N = 3102381.0293157.94MIN: 77271.95 / MAX: 109243.01MIN: 81727.05 / MAX: 98992.881. (CC) gcc options: -O3 -lrt

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 2:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 2:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.15, N = 3SE +/- 22.58, N = 330510.892578.4

Intel Memory Latency Checker

Test: Max Bandwidth - All Reads

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - All Reads4090x26000_4ch_ll20K40K60K80K100KSE +/- 94.69, N = 3SE +/- 27.22, N = 338822.93113954.65

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 1:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 1:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 50.27, N = 3SE +/- 29.53, N = 329207.187976.7

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - All Reads

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - All Reads4090x26000_4ch_ll20K40K60K80K100KSE +/- 88.37, N = 3SE +/- 26.33, N = 338812.0113973.3

Flexible IO Tester

Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll1326395265SE +/- 0.03, N = 3SE +/- 0.54, N = 730.657.7-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll3K6K9K12K15KSE +/- 5.78, N = 3SE +/- 137.52, N = 7784414771-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll1326395265SE +/- 1.10, N = 15SE +/- 0.62, N = 542.257.5-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll3K6K9K12K15KSE +/- 280.58, N = 15SE +/- 159.37, N = 51079814720-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll80160240320400SE +/- 0.67, N = 3SE +/- 7.12, N = 15253389-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll20K40K60K80K100KSE +/- 120.19, N = 3SE +/- 1816.90, N = 156466799667-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll80160240320400SE +/- 0.33, N = 3SE +/- 5.69, N = 15253389-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Flexible IO Tester

Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll20K40K60K80K100KSE +/- 66.67, N = 3SE +/- 1450.85, N = 156496799620-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Whisper.cpp

Model: ggml-medium.en - Input: 2016 State of the Union

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisper.cpp 1.4Model: ggml-medium.en - Input: 2016 State of the Union4090x26000_4ch_ll400800120016002000SE +/- 14.14, N = 9SE +/- 10.69, N = 31218.071652.161. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig4090x26000_4ch_ll1632486480SE +/- 0.70, N = 3SE +/- 0.51, N = 1572.1151.73


Phoronix Test Suite v10.8.5