EPYC 9754 1P 320W cTDP

AMD EPYC 9754 128-Core benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2307273-NE-EPYC9754112
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

C++ Boost Tests 2 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 4 Tests
CPU Massive 5 Tests
Creator Workloads 3 Tests
Cryptography 2 Tests
Database Test Suite 3 Tests
Game Development 2 Tests
HPC - High Performance Computing 2 Tests
Common Kernel Benchmarks 2 Tests
Multi-Core 7 Tests
OpenMPI Tests 2 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 4 Tests
Software Defined Radio 2 Tests
Server 4 Tests
Server CPU Tests 5 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
360W Default
July 27 2023
  7 Hours, 59 Minutes
320W cTDP
July 26 2023
  8 Hours, 19 Minutes
Invert Hiding All Results Option
  8 Hours, 9 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC 9754 1P 320W cTDPOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9754 128-Core @ 2.25GHz (128 Cores / 256 Threads)AMD Titanite_4G (RTI1007B BIOS)AMD Device 14a4768GB3201GB Micron_7450_MTFDKCC3T2TFSASPEEDBroadcom NetXtreme BCM5720 PCIeUbuntu 23.046.2.0-23-generic (x86_64)GNOME Shell 44.0X Server 1.21.1.7GCC 12.2.0ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionEPYC 9754 1P 320W CTDP BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-Pa930Z/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa0010b - Python 3.11.2- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

360W Default vs. 320W cTDP ComparisonPhoronix Test SuiteBaseline+20.1%+20.1%+40.2%+40.2%+60.3%+60.3%9.1%8.6%4.6%4.1%P.P.B.T.T80.2%d.M.M.S - Mesh Time11.6%Read While WritingRedis - 100 - 1:10Redis - 100 - 1:5d.S.M.S - Execution Time4.6%d.S.M.S - Mesh TimeH.H3.5%Tomographic Model3.4%W.l.H2.3%P.D.F - CPU2.1%AES-256-GCM2.1%P.D.F - CPU2.1%W.P.D.F.I - CPU2%W.P.D.F.I - CPU2%Mount St. Helens2%srsRAN ProjectOpenFOAMRocksDBRedis 7.0.12 + memtier_benchmarkRedis 7.0.12 + memtier_benchmarkOpenFOAMOpenFOAMSPECFEM3DSPECFEM3DSPECFEM3DOpenVINOOpenSSLOpenVINOOpenVINOOpenVINOSPECFEM3D360W Default320W cTDP

EPYC 9754 1P 320W cTDPopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenssl: RSA4096openssl: RSA4096openssl: SHA256openssl: SHA512openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20openssl: ChaCha20-Poly1305build-llvm: Ninjabuild-llvm: Unix Makefilesbuild-godot: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-nodejs: Time To Compilespecfem3d: Layered Halfspacespecfem3d: Water-layered Halfspacespecfem3d: Homogeneous Halfspacespecfem3d: Mount St. Helensspecfem3d: Tomographic Modelrocksdb: Read While Writingrocksdb: Read Rand Write Randrocksdb: Update Randcryptopp: Unkeyed Algorithmscryptopp: Keyed Algorithmsmemtier-benchmark: Redis - 100 - 1:10memtier-benchmark: Redis - 100 - 1:5memtier-benchmark: Redis - 100 - 1:1blender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyblender: Barbershop - CPU-Onlycompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timecouchdb: 100 - 1000 - 30couchdb: 100 - 3000 - 30couchdb: 300 - 1000 - 30couchdb: 300 - 3000 - 30srsran: PUSCH Processor Benchmark, Throughput Totalsrsran: Downlink Processor Benchmarkliquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512liquid-dsp: 256 - 256 - 512libxsmm: 32libxsmm: 64libxsmm: 128360W Default320W cTDP117.91540.7672957.031.5928.092253.8811803.3110.835751.9411.146238.3110.25587.69108.8354024.71891226.71648780794475384691526311761663506671007760124223659262686183462253216250127.313242.718120.63734.015234.336137.66316.05958228418.5889511729.2369237136.3877690247.410253218106868682555380649664388.284611511.6761192859027.942785376.162613205.7612.8131.2016.5739.21116.7073189079104528.53121729.237995148.18484422.3965295.644321.656175.319651.74516942.1621.066515666713148333331718000000760.51457.92704.8115.71550.9371961.661.6227.502300.6711567.6611.055664.8511.316173.5310.36592.35107.9754045.51891039.1162135232660529226415371153720597800986743892193659092380787456325201830127.742242.369121.44033.747237.760138.37116.15679941019.0195645419.5608487376.5160156587.658590024116610122523158646819389.111227512.4742663104598.682913398.092563279.6712.9231.7116.7139.72117.6372399778088427.40741230.572754165.31432420.4611694.842318.634174.870648.4889402.1616.766599666712908000001688433333756.91456.62721.6OpenBenchmarking.org

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU360W Default320W cTDP306090120150SE +/- 0.03, N = 3SE +/- 0.01, N = 3117.91115.711. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU360W Default320W cTDP120240360480600SE +/- 0.07, N = 3SE +/- 0.06, N = 3540.76550.93MIN: 474.84 / MAX: 585.36MIN: 279.71 / MAX: 585.921. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU360W Default320W cTDP16K32K48K64K80KSE +/- 52.01, N = 3SE +/- 170.16, N = 372957.0371961.661. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU360W Default320W cTDP0.36450.7291.09351.4581.8225SE +/- 0.00, N = 3SE +/- 0.01, N = 31.591.62MIN: 0.55 / MAX: 19.36MIN: 0.55 / MAX: 19.661. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU360W Default320W cTDP714212835SE +/- 0.32, N = 15SE +/- 0.33, N = 1528.0927.501. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU360W Default320W cTDP5001000150020002500SE +/- 23.27, N = 15SE +/- 24.52, N = 152253.882300.67MIN: 1038.91 / MAX: 3097.84MIN: 1002.98 / MAX: 3164.361. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU360W Default320W cTDP3K6K9K12K15KSE +/- 3.80, N = 3SE +/- 1.00, N = 311803.3111567.661. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU360W Default320W cTDP3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 310.8311.05MIN: 5.01 / MAX: 33.12MIN: 5.13 / MAX: 30.281. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU360W Default320W cTDP12002400360048006000SE +/- 88.00, N = 15SE +/- 89.35, N = 155751.945664.851. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU360W Default320W cTDP3691215SE +/- 0.15, N = 15SE +/- 0.16, N = 1511.1411.31MIN: 4.35 / MAX: 50.85MIN: 4.45 / MAX: 67.661. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU360W Default320W cTDP13002600390052006500SE +/- 48.42, N = 10SE +/- 48.53, N = 106238.316173.531. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU360W Default320W cTDP3691215SE +/- 0.08, N = 10SE +/- 0.08, N = 1010.2510.36MIN: 6.44 / MAX: 44.37MIN: 6.14 / MAX: 43.841. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU360W Default320W cTDP130260390520650SE +/- 4.43, N = 12SE +/- 5.31, N = 7587.69592.351. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU360W Default320W cTDP20406080100SE +/- 0.78, N = 12SE +/- 0.93, N = 7108.83107.97MIN: 50.16 / MAX: 184.06MIN: 49.87 / MAX: 181.071. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096360W Default320W cTDP12K24K36K48K60KSE +/- 13.52, N = 3SE +/- 10.43, N = 354024.754045.51. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096360W Default320W cTDP400K800K1200K1600K2000KSE +/- 136.08, N = 3SE +/- 86.14, N = 31891226.71891039.11. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256360W Default320W cTDP40000M80000M120000M160000M200000MSE +/- 153601820.35, N = 3SE +/- 157626960.97, N = 31648780794471621352326601. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512360W Default320W cTDP12000M24000M36000M48000M60000MSE +/- 12064131.86, N = 3SE +/- 83975512.16, N = 353846915263529226415371. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCM360W Default320W cTDP300000M600000M900000M1200000M1500000MSE +/- 1233880309.13, N = 3SE +/- 589073272.79, N = 3117616635066711537205978001. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCM360W Default320W cTDP200000M400000M600000M800000M1000000MSE +/- 2213247486.05, N = 3SE +/- 3274830551.84, N = 310077601242239867438921931. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20360W Default320W cTDP140000M280000M420000M560000M700000MSE +/- 123237011.79, N = 3SE +/- 28567150.55, N = 36592626861836590923807871. (CC) gcc options: -pthread -m64 -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305360W Default320W cTDP100000M200000M300000M400000M500000MSE +/- 116990620.17, N = 3SE +/- 121169876.06, N = 34622532162504563252018301. (CC) gcc options: -pthread -m64 -O3 -ldl

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja360W Default320W cTDP306090120150SE +/- 0.27, N = 3SE +/- 0.19, N = 3127.31127.74

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix Makefiles360W Default320W cTDP50100150200250SE +/- 0.14, N = 3SE +/- 0.39, N = 3242.72242.37

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To Compile360W Default320W cTDP306090120150SE +/- 0.06, N = 3SE +/- 0.04, N = 3120.64121.44

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfig360W Default320W cTDP816243240SE +/- 0.38, N = 5SE +/- 0.36, N = 534.0233.75

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfig360W Default320W cTDP50100150200250SE +/- 0.53, N = 3SE +/- 0.34, N = 3234.34237.76

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compile360W Default320W cTDP306090120150SE +/- 0.30, N = 3SE +/- 0.23, N = 3137.66138.37

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspace360W Default320W cTDP48121620SE +/- 0.18, N = 3SE +/- 0.15, N = 316.0616.161. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspace360W Default320W cTDP510152025SE +/- 0.26, N = 3SE +/- 0.18, N = 318.5919.021. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspace360W Default320W cTDP3691215SE +/- 0.019071134, N = 4SE +/- 0.048446701, N = 49.2369237139.5608487371. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helens360W Default320W cTDP246810SE +/- 0.057309149, N = 7SE +/- 0.041202140, N = 56.3877690246.5160156581. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Model360W Default320W cTDP246810SE +/- 0.052359982, N = 12SE +/- 0.074647921, N = 67.4102532187.6585900241. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read While Writing360W Default320W cTDP2M4M6M8M10MSE +/- 87349.44, N = 9SE +/- 96904.96, N = 1510686868116610121. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read Random Write Random360W Default320W cTDP500K1000K1500K2000K2500KSE +/- 5700.62, N = 3SE +/- 21499.49, N = 3255538025231581. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Update Random360W Default320W cTDP140K280K420K560K700KSE +/- 419.71, N = 3SE +/- 588.18, N = 36496646468191. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.8Test: Unkeyed Algorithms360W Default320W cTDP80160240320400SE +/- 0.70, N = 3SE +/- 0.15, N = 3388.28389.111. (CXX) g++ options: -g2 -O3 -fPIC -fno-devirtualize -pthread -pipe

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.8Test: Keyed Algorithms360W Default320W cTDP110220330440550SE +/- 0.74, N = 3SE +/- 0.18, N = 3511.68512.471. (CXX) g++ options: -g2 -O3 -fPIC -fno-devirtualize -pthread -pipe

Redis 7.0.12 + memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10360W Default320W cTDP700K1400K2100K2800K3500KSE +/- 38852.94, N = 2SE +/- 16613.14, N = 22859027.943104598.681. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5360W Default320W cTDP600K1200K1800K2400K3000KSE +/- 28005.28, N = 5SE +/- 30433.60, N = 42785376.162913398.091. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterRedis 7.0.12 + memtier_benchmark 2.0Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1360W Default320W cTDP600K1200K1800K2400K3000KSE +/- 17553.33, N = 2SE +/- 25832.18, N = 152613205.762563279.671. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Only360W Default320W cTDP3691215SE +/- 0.03, N = 4SE +/- 0.02, N = 412.8112.92

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Only360W Default320W cTDP714212835SE +/- 0.06, N = 3SE +/- 0.07, N = 331.2031.71

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Only360W Default320W cTDP48121620SE +/- 0.06, N = 3SE +/- 0.05, N = 316.5716.71

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Only360W Default320W cTDP918273645SE +/- 0.06, N = 3SE +/- 0.09, N = 339.2139.72

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Only360W Default320W cTDP306090120150SE +/- 0.15, N = 3SE +/- 0.17, N = 3116.70117.63

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Rating360W Default320W cTDP160K320K480K640K800KSE +/- 1550.06, N = 3SE +/- 856.27, N = 37318907239971. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Rating360W Default320W cTDP200K400K600K800K1000KSE +/- 1173.34, N = 3SE +/- 1176.64, N = 37910457808841. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time360W Default320W cTDP71421283528.5327.411. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time360W Default320W cTDP71421283529.2430.571. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time360W Default320W cTDP4080120160200148.18165.311. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time360W Default320W cTDP90180270360450422.40420.461. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Apache CouchDB

This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 100 - Inserts: 1000 - Rounds: 30360W Default320W cTDP20406080100SE +/- 0.16, N = 3SE +/- 0.26, N = 395.6494.841. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 100 - Inserts: 3000 - Rounds: 30360W Default320W cTDP70140210280350SE +/- 0.42, N = 3SE +/- 0.52, N = 3321.66318.631. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 300 - Inserts: 1000 - Rounds: 30360W Default320W cTDP4080120160200SE +/- 1.51, N = 12SE +/- 1.44, N = 12175.32174.871. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 300 - Inserts: 3000 - Rounds: 30360W Default320W cTDP140280420560700SE +/- 1.86, N = 3SE +/- 2.02, N = 3651.75648.491. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

srsRAN Project

srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Total360W Default320W cTDP4K8K12K16K20KSE +/- 46.90, N = 3SE +/- 755.30, N = 1516942.19402.11. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: Downlink Processor Benchmark360W Default320W cTDP130260390520650SE +/- 1.11, N = 3SE +/- 2.84, N = 3621.0616.71. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512360W Default320W cTDP140M280M420M560M700MSE +/- 2452443.23, N = 3SE +/- 1318488.19, N = 36651566676659966671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512360W Default320W cTDP300M600M900M1200M1500MSE +/- 2425100.23, N = 3SE +/- 2013289.18, N = 3131483333312908000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 512360W Default320W cTDP400M800M1200M1600M2000MSE +/- 907377.17, N = 3SE +/- 592546.29, N = 3171800000016884333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32360W Default320W cTDP160320480640800SE +/- 1.44, N = 6SE +/- 2.23, N = 6760.5756.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64360W Default320W cTDP30060090012001500SE +/- 0.99, N = 6SE +/- 2.67, N = 61457.91456.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128360W Default320W cTDP6001200180024003000SE +/- 6.55, N = 3SE +/- 9.18, N = 32704.82721.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

64 Results Shown

OpenVINO:
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    FPS
    ms
  Person Detection FP16 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16-INT8 - CPU:
    FPS
    ms
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
OpenSSL:
  RSA4096:
    sign/s
    verify/s
  SHA256:
    byte/s
  SHA512:
    byte/s
  AES-128-GCM:
    byte/s
  AES-256-GCM:
    byte/s
  ChaCha20:
    byte/s
  ChaCha20-Poly1305:
    byte/s
Timed LLVM Compilation:
  Ninja
  Unix Makefiles
Timed Godot Game Engine Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Timed Node.js Compilation
SPECFEM3D:
  Layered Halfspace
  Water-layered Halfspace
  Homogeneous Halfspace
  Mount St. Helens
  Tomographic Model
RocksDB:
  Read While Writing
  Read Rand Write Rand
  Update Rand
Crypto++:
  Unkeyed Algorithms
  Keyed Algorithms
Redis 7.0.12 + memtier_benchmark:
  Redis - 100 - 1:10
  Redis - 100 - 1:5
  Redis - 100 - 1:1
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
  Pabellon Barcelona - CPU-Only
  Barbershop - CPU-Only
7-Zip Compression:
  Compression Rating
  Decompression Rating
OpenFOAM:
  drivaerFastback, Small Mesh Size - Mesh Time
  drivaerFastback, Small Mesh Size - Execution Time
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
Apache CouchDB:
  100 - 1000 - 30
  100 - 3000 - 30
  300 - 1000 - 30
  300 - 3000 - 30
srsRAN Project:
  PUSCH Processor Benchmark, Throughput Total
  Downlink Processor Benchmark
Liquid-DSP:
  64 - 256 - 512
  128 - 256 - 512
  256 - 256 - 512
libxsmm:
  32
  64
  128