cpuall_v2

AMD EPYC 7413 24-Core testing with a GIGABYTE MZ32-AR0-00 v01000100 (M18 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 on Rocky Linux 9.3 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2403189-NE-2403174NE64
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
6000_4ch_ll
March 17
  4 Hours, 41 Minutes
4090x2
March 17
  5 Hours, 21 Minutes
Invert Behavior (Only Show Selected Data)
  5 Hours, 1 Minute
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


cpuall_v2ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelOpenCLCompilerFile-SystemScreen ResolutionDesktopDisplay Server6000_4ch_ll4090x2Intel 0000% @ 3.30GHz (48 Cores / 96 Threads)ASUS Pro WS W790E-SAGE SE (0215 BIOS)Intel Alder Lake-S PCH64GB4001GB CT4000P3SSD8 + 0GB Virtual HDisk0ASPEEDRealtek ALC12202 x Intel X710 for 10GBASE-TFedora 396.7.7-200.fc39.x86_64 (x86_64)OpenCL 3.0 + OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3GCC 13.2.1 20231205 + Clang 17.0.6 + LLVM 17.0.6xfs1920x1200AMD EPYC 7413 24-Core @ 2.65GHz (24 Cores)GIGABYTE MZ32-AR0-00 v01000100 (M18 BIOS)AMD Starship/Matisse6 x 16 GB DDR4-2667MT/s 18ASF2G72PZ-2G6D2960GB INTEL SSDPE21D960GA + 2 x 1600GB Toshiba KXG50PNV2T04 + 4001GB Nextorage SSD NE1N4TB + 3 x 59GB INTEL SSDPEK1A058GAGigabyte NVIDIA GeForce RTX 4090NVIDIA AD102 HD AudioAquantia AQC107 NBase-T/IEEE + Mellanox MT27500Rocky Linux 9.35.14.0-362.24.1.el9_3.x86_64 (x86_64)GNOME Shell 40.10X Server 1.20.11GCC 11.4.1 20230605 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.31024x768OpenBenchmarking.orgKernel Details- 6000_4ch_ll: Transparent Huge Pages: madvise- 4090x2: Transparent Huge Pages: alwaysCompiler Details- 6000_4ch_ll: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - 4090x2: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Disk Details- NONE / attr2,inode64,logbsize=32k,logbufs=8,noquota,relatime,rw,seclabel / Block Size: 4096Processor Details- 6000_4ch_ll: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xd0004b1- 4090x2: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa0011d1Security Details- 6000_4ch_ll: SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected - 4090x2: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

6000_4ch_ll vs. 4090x2 ComparisonPhoronix Test SuiteBaseline+81.1%+81.1%+162.2%+162.2%+243.3%+243.3%35.6%25.4%9.9%Scale324.4%Add298.3%Triad296.3%Max Bandwidth - 2:1 Reads-Writes207.4%P.I.B - 3:1 Reads-Writes206.5%Max Bandwidth - 3:1 Reads-Writes206.1%P.I.B - 2:1 Reads-Writes203.4%P.I.B - 1:1 Reads-Writes201.2%Max Bandwidth - 1:1 Reads-Writes200.7%Max Bandwidth - Stream-Triad Like196.9%P.I.B - Stream-Triad Like196.5%P.I.B - All Reads193.7%Max Bandwidth - All Reads193.5%Copy176.5%Rand Read - POSIX AIO - Yes - 4KB - 188.6%Rand Read - POSIX AIO - Yes - 4KB - 188.3%Write66.2%Rand Write - POSIX AIO - Yes - 4KB - 154.1%Rand Write - POSIX AIO - Yes - 4KB - 153.8%Rand Write - POSIX AIO - Yes - 4KB - 3253.8%Rand Write - POSIX AIO - Yes - 4KB - 3253.3%defconfig39.4%Read36.9%Rand Read - POSIX AIO - Yes - 4KB - 3236.3%Rand Read - POSIX AIO - Yes - 4KB - 3236.3%ggml-medium.en - 2.S.o.t.UIdle LatencyR.M.WStreamStreamStreamIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerIntel Memory Latency CheckerStreamFlexible IO TesterFlexible IO TesterCacheBenchFlexible IO TesterFlexible IO TesterFlexible IO TesterFlexible IO TesterTimed Linux Kernel CompilationCacheBenchFlexible IO TesterFlexible IO TesterWhisper.cppIntel Memory Latency CheckerCacheBench6000_4ch_ll4090x2

cpuall_v2fio: Rand Read - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Read - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 1 - Default Test Directoryfio: Rand Write - POSIX AIO - Yes - 4KB - 32 - Default Test Directoryintel-mlc: Max Bandwidth - All Readsintel-mlc: Max Bandwidth - 3:1 Reads-Writesintel-mlc: Max Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - 1:1 Reads-Writesintel-mlc: Max Bandwidth - Stream-Triad Likeintel-mlc: Peak Injection Bandwidth - All Readsintel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - Stream-Triad Likestream: Copystream: Scalestream: Triadstream: Addcachebench: Readcachebench: Writecachebench: Read / Modify / Writeintel-mlc: Idle Latencybuild-linux-kernel: defconfigwhisper-cpp: ggml-medium.en - 2016 State of the Union6000_4ch_ll4090x21477114720996679962057.757.5389389113954.6598042.6493920.6688062.6595035.27113973.397387.892578.487976.795038.590064.089640.294108.693836.812582.59535485773.38262593157.944569114.751.7271652.16433784410798646676496730.642.225325338822.9332025.2330554.8929284.5032011.5538812.031776.030510.829207.132055.032574.721120.923746.523560.99190.90335751622.472353102381.01713991.572.1071218.07265OpenBenchmarking.org

Flexible IO Tester

FIO, the Flexible I/O Tester, is an advanced Linux disk benchmark supporting multiple I/O engines and a wealth of options. FIO was written by Jens Axboe for testing of the Linux I/O subsystem and schedulers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll3K6K9K12K15KSE +/- 5.78, N = 3SE +/- 137.52, N = 7784414771-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll3K6K9K12K15KSE +/- 280.58, N = 15SE +/- 159.37, N = 51079814720-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll20K40K60K80K100KSE +/- 120.19, N = 3SE +/- 1816.90, N = 156466799667-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll20K40K60K80K100KSE +/- 66.67, N = 3SE +/- 1450.85, N = 156496799620-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll1326395265SE +/- 0.03, N = 3SE +/- 0.54, N = 730.657.7-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Read - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll1326395265SE +/- 1.10, N = 15SE +/- 0.62, N = 542.257.5-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 1 - Disk Target: Default Test Directory4090x26000_4ch_ll80160240320400SE +/- 0.67, N = 3SE +/- 7.12, N = 15253389-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Random Write - Engine: POSIX AIO - Direct: Yes - Block Size: 4KB - Job Count: 32 - Disk Target: Default Test Directory4090x26000_4ch_ll80160240320400SE +/- 0.33, N = 3SE +/- 5.69, N = 15253389-libverbs -lrdmacm -lcurl -lssl -lcrypto1. (CC) gcc options: -rdynamic -lz -lm -laio -lpthread -ldl -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

Intel Memory Latency Checker

Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. If the download fails you may need to manually download the file from https://www.intel.com/content/www/us/en/developer/articles/tool/intelr-memory-latency-checker.html and place it in your PTS download cache. On some systems root privileges are needed to run the MLC tester. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - All Reads4090x26000_4ch_ll20K40K60K80K100KSE +/- 94.69, N = 3SE +/- 27.22, N = 338822.93113954.65

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 3:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 27.70, N = 3SE +/- 47.67, N = 332025.2398042.64

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 2:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.39, N = 3SE +/- 7.15, N = 330554.8993920.66

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - 1:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 19.96, N = 3SE +/- 4.74, N = 329284.5088062.65

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Max Bandwidth - Stream-Triad Like4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.09, N = 3SE +/- 8.34, N = 332011.5595035.27

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - All Reads4090x26000_4ch_ll20K40K60K80K100KSE +/- 88.37, N = 3SE +/- 26.33, N = 338812.0113973.3

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 3:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 19.44, N = 3SE +/- 30.53, N = 331776.097387.8

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 2:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 25.15, N = 3SE +/- 22.58, N = 330510.892578.4

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - 1:1 Reads-Writes4090x26000_4ch_ll20K40K60K80K100KSE +/- 50.27, N = 3SE +/- 29.53, N = 329207.187976.7

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency Checker 3.10Test: Peak Injection Bandwidth - Stream-Triad Like4090x26000_4ch_ll20K40K60K80K100KSE +/- 16.97, N = 3SE +/- 8.22, N = 332055.095038.5

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy4090x26000_4ch_ll20K40K60K80K100KSE +/- 31.85, N = 5SE +/- 16.98, N = 532574.790064.01. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale4090x26000_4ch_ll20K40K60K80K100KSE +/- 16.22, N = 5SE +/- 63.75, N = 521120.989640.21. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad4090x26000_4ch_ll20K40K60K80K100KSE +/- 14.55, N = 5SE +/- 12.86, N = 523746.594108.61. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add4090x26000_4ch_ll20K40K60K80K100KSE +/- 17.00, N = 5SE +/- 26.62, N = 523560.993836.81. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read4090x26000_4ch_ll3K6K9K12K15KSE +/- 6.47, N = 3SE +/- 0.15, N = 39190.9012582.60MIN: 9160.88 / MAX: 9203.86MIN: 12577.1 / MAX: 12583.241. (CC) gcc options: -O3 -lrt

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Write4090x26000_4ch_ll20K40K60K80K100KSE +/- 41.10, N = 3SE +/- 23.70, N = 351622.4785773.38MIN: 39519.8 / MAX: 54896.66MIN: 51047.3 / MAX: 97997.281. (CC) gcc options: -O3 -lrt

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / Write6000_4ch_ll4090x220K40K60K80K100KSE +/- 6.98, N = 3SE +/- 367.10, N = 393157.94102381.02MIN: 81727.05 / MAX: 98992.88MIN: 77271.95 / MAX: 109243.011. (CC) gcc options: -O3 -lrt

Intel Memory Latency Checker

Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. If the download fails you may need to manually download the file from https://www.intel.com/content/www/us/en/developer/articles/tool/intelr-memory-latency-checker.html and place it in your PTS download cache. On some systems root privileges are needed to run the MLC tester. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns, Fewer Is BetterIntel Memory Latency Checker 3.10Test: Idle Latency6000_4ch_ll4090x2306090120150SE +/- 0.37, N = 3SE +/- 0.03, N = 3114.791.5

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig4090x26000_4ch_ll1632486480SE +/- 0.70, N = 3SE +/- 0.51, N = 1572.1151.73

Whisper.cpp

Whisper.cpp is a port of OpenAI's Whisper model in C/C++. Whisper.cpp is developed by Georgi Gerganov for transcribing WAV audio files to text / speech recognition. Whisper.cpp supports ARM NEON, x86 AVX, and other advanced CPU features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisper.cpp 1.4Model: ggml-medium.en - Input: 2016 State of the Union6000_4ch_ll4090x2400800120016002000SE +/- 10.69, N = 3SE +/- 14.14, N = 91652.161218.071. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread