AMD EPYC 9754 Bergamo SMT On/Off Comparison

Benchmarks by Michael Larabel for a future article (post 19th) comparing SMT on and off, with SMT toggled via the BIOS. SMT comparison testing of AMD EPYC 9754 128-Core CPUs on Titanite with Ubuntu 22.04 LTS.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2307190-NE-BERGAMOSM27
Run Management

Result Identifier - Date Run - Test Duration
EPYC 9754 1P: SMT On - July 14 2023 - 16 Hours, 51 Minutes
EPYC 9754 1P: SMT Off - July 13 2023 - 15 Hours, 43 Minutes
EPYC 9754 2P: SMT On - July 11 2023 - 14 Hours, 12 Minutes
EPYC 9754 2P: SMT Off - July 12 2023 - 13 Hours, 40 Minutes
Average - 15 Hours, 6 Minutes


EPYC 9754 2P: SMT On
  Processor: 2 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores / 512 Threads)
  Motherboard: AMD Titanite_4G (RTI1007B BIOS)
  Chipset: AMD Device 14a4
  Memory: 1520GB
  Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007
  Graphics: ASPEED
  Network: Broadcom NetXtreme BCM5720 PCIe
  OS: Ubuntu 22.04
  Kernel: 5.19.0-41-generic (x86_64)
  Desktop: GNOME Shell 42.5
  Display Server: X Server 1.21.1.4
  Vulkan: 1.3.224
  Compiler: GCC 11.3.0
  File-System: ext4
  Screen Resolution: 1024x768

The remaining columns list only the fields that change from the preceding configuration:

EPYC 9754 2P: SMT Off
  Processor: 2 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores)

EPYC 9754 1P: SMT Off
  Processor: AMD EPYC 9754 128-Core @ 2.25GHz (128 Cores)
  Memory: 768GB

EPYC 9754 1P: SMT On
  Processor: AMD EPYC 9754 128-Core @ 2.25GHz (128 Cores / 256 Threads)

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xaa0010b

Python Details
- Python 3.10.6

Security Details
- All four configurations: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: (see below) RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- STIBP is the only field that differs: always-on for EPYC 9754 2P: SMT On and EPYC 9754 1P: SMT On; disabled for EPYC 9754 2P: SMT Off and EPYC 9754 1P: SMT Off.
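The SMT state and the STIBP setting reported in the security details can be checked on a running Linux system. The following is a minimal sketch, assuming the kernel exposes the usual sysfs entries under /sys/devices/system/cpu/smt and /sys/devices/system/cpu/vulnerabilities; the helper names read_sysfs and stibp_state are illustrative and not part of the Phoronix Test Suite.

```python
import re
from pathlib import Path

# Standard Linux sysfs locations (assumed to be present on the test system).
SMT_DIR = Path("/sys/devices/system/cpu/smt")
VULN_DIR = Path("/sys/devices/system/cpu/vulnerabilities")


def read_sysfs(path):
    """Return the stripped contents of a sysfs file, or None if unreadable."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def stibp_state(spectre_v2_line):
    """Extract the STIBP setting (e.g. 'always-on', 'disabled') from a
    spectre_v2 status string; return None if no STIBP field is present."""
    match = re.search(r"STIBP:\s*([A-Za-z-]+)", spectre_v2_line)
    return match.group(1) if match else None


if __name__ == "__main__":
    print("SMT control:", read_sysfs(SMT_DIR / "control"))
    print("SMT active: ", read_sysfs(SMT_DIR / "active"))
    line = read_sysfs(VULN_DIR / "spectre_v2")
    if line is not None:
        print("STIBP:      ", stibp_state(line))
```

Applied to the spectre_v2 strings in this result file, stibp_state returns "always-on" for the SMT On runs and "disabled" for the SMT Off runs.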

Result overview table: all tests for EPYC 9754 2P (SMT On, SMT Off) and EPYC 9754 1P (SMT Off, SMT On). The individual results follow per test.

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 43.73 (SE +/- 0.55, N = 14)
  EPYC 9754 1P SMT Off: 11.49 (SE +/- 0.10, N = 13)
  EPYC 9754 2P SMT Off: 10.26 (SE +/- 0.12, N = 13)
  EPYC 9754 2P SMT On: 11.03 (SE +/- 0.15, N = 15)
  1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
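Because some results are "Fewer Is Better" and others "More Is Better", the SMT effect is easiest to compare as a direction-aware ratio. The sketch below is illustrative only: the function name smt_speedup is hypothetical, and the inputs assume the chart values above read in the order 1P SMT On, 1P SMT Off, 2P SMT Off, 2P SMT On.

```python
def smt_speedup(smt_on, smt_off, higher_is_better):
    """Return how many times faster the SMT On run is than the SMT Off run.

    A value above 1.0 means SMT helped; below 1.0 means SMT hurt.
    """
    if smt_on <= 0 or smt_off <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        return smt_on / smt_off
    # For latency or runtime, a smaller number is the better result.
    return smt_off / smt_on


# OpenVINO Vehicle Detection FP16 latency in ms (Fewer Is Better).
one_p = smt_speedup(43.73, 11.49, higher_is_better=False)
two_p = smt_speedup(11.03, 10.26, higher_is_better=False)
print(f"1P SMT On vs Off: {one_p:.2f}x")
print(f"2P SMT On vs Off: {two_p:.2f}x")
```

Under that reading, SMT On delivers roughly a quarter of the SMT Off performance on one socket for this model, while the two-socket runs are much closer.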

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.1 - Algorithm: SHA256 (byte/s, More Is Better)
  EPYC 9754 1P SMT On: 163633625553 (SE +/- 161352968.33, N = 3)
  EPYC 9754 1P SMT Off: 111414557280 (SE +/- 15184699.42, N = 3)
  EPYC 9754 2P SMT Off: 222269428867 (SE +/- 224959781.68, N = 3)
  EPYC 9754 2P SMT On: 327926038513 (SE +/- 207719324.78, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
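Each result carries a standard error (SE) and a run count (N). As a rough guide to reading them, the sketch below uses the textbook definition, sample standard deviation divided by the square root of N; whether the Phoronix Test Suite uses the sample or population deviation is not stated in this result file, so treat that choice as an assumption. The per-run samples are hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(samples) / math.sqrt(len(samples))


def relative_se_percent(mean, se):
    """Express a standard error as a percentage of the reported mean."""
    return 100.0 * se / mean


# Hypothetical per-run throughput samples (not from the result file).
runs = [100.0, 102.0, 98.0]
print(f"SE of hypothetical runs: {standard_error(runs):.4f}")

# Reported OpenSSL SHA256 figures, read as the 1P SMT On result.
rel = relative_se_percent(163633625553, 161352968.33)
print(f"Relative SE: {rel:.3f}%")
```

A relative SE of around a tenth of a percent, as here, indicates that run-to-run noise is far smaller than the differences between configurations.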

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

toyBrot Fractal Generator 2020-11-18 - Implementation: TBB (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 3591 (SE +/- 21.75, N = 9)
  EPYC 9754 1P SMT Off: 5590 (SE +/- 43.30, N = 15)
  EPYC 9754 2P SMT Off: 2976 (SE +/- 31.80, N = 15)
  EPYC 9754 2P SMT On: 2014 (SE +/- 23.04, N = 15)
  1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
  EPYC 9754 1P SMT On: 17.194492779 (SE +/- 0.142353904, N = 3)
  EPYC 9754 1P SMT Off: 13.419454501 (SE +/- 0.021567327, N = 3)
  EPYC 9754 2P SMT Off: 6.218069257 (SE +/- 0.062799204, N = 15)
  EPYC 9754 2P SMT On: 9.778381437 (SE +/- 0.063515594, N = 4)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
  EPYC 9754 1P SMT On: 791787 (SE +/- 1608.29, N = 3)
  EPYC 9754 1P SMT Off: 510470 (SE +/- 429.17, N = 3)
  EPYC 9754 2P SMT Off: 913091 (SE +/- 1897.81, N = 3)
  EPYC 9754 2P SMT On: 1353957 (SE +/- 5499.45, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.5 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 417.12 (SE +/- 0.46, N = 3)
  EPYC 9754 1P SMT Off: 504.36 (SE +/- 5.07, N = 15)
  EPYC 9754 2P SMT Off: 1058.68 (SE +/- 0.90, N = 3)
  EPYC 9754 2P SMT On: 797.62 (SE +/- 0.32, N = 3)

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 2023.03.14 - Test: Blowfish (Real C/S, More Is Better)
  EPYC 9754 1P SMT On: 216115 (SE +/- 117.40, N = 3)
  EPYC 9754 1P SMT Off: 163263 (SE +/- 12.67, N = 3)
  EPYC 9754 2P SMT Off: 320885 (SE +/- 1247.49, N = 3)
  EPYC 9754 2P SMT On: 409850 (SE +/- 3726.56, N = 3)
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper 2023.03.14 - Test: bcrypt (Real C/S, More Is Better)
  EPYC 9754 1P SMT On: 216064 (SE +/- 92.54, N = 3)
  EPYC 9754 1P SMT Off: 163220 (SE +/- 33.22, N = 3)
  EPYC 9754 2P SMT Off: 317046 (SE +/- 1964.45, N = 3)
  EPYC 9754 2P SMT On: 407860 (SE +/- 2298.36, N = 3)
  1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 73.21 (SE +/- 0.25, N = 3)
  EPYC 9754 1P SMT Off: 97.09 (SE +/- 0.06, N = 3)
  EPYC 9754 2P SMT Off: 182.60 (SE +/- 0.18, N = 3)
  EPYC 9754 2P SMT On: 139.88 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse 1.5 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 968.63 (SE +/- 0.74, N = 3)
  EPYC 9754 1P SMT Off: 1275.77 (SE +/- 5.02, N = 3)
  EPYC 9754 2P SMT Off: 2409.80 (SE +/- 1.63, N = 3)
  EPYC 9754 2P SMT On: 1868.33 (SE +/- 1.60, N = 3)

Neural Magic DeepSparse 1.5 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 73.38 (SE +/- 0.25, N = 3)
  EPYC 9754 1P SMT Off: 96.97 (SE +/- 0.09, N = 3)
  EPYC 9754 2P SMT Off: 182.45 (SE +/- 0.07, N = 3)
  EPYC 9754 2P SMT On: 139.66 (SE +/- 0.15, N = 3)

Neural Magic DeepSparse 1.5 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 624.48 (SE +/- 0.29, N = 3)
  EPYC 9754 1P SMT Off: 812.10 (SE +/- 3.93, N = 3)
  EPYC 9754 2P SMT Off: 1541.95 (SE +/- 1.38, N = 3)
  EPYC 9754 2P SMT On: 1190.72 (SE +/- 1.02, N = 3)
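To summarize several related results with one figure, a geometric mean of the per-test SMT On / SMT Off ratios is a common choice because it treats each test's relative change equally. The sketch below is an illustration, not the site's own calculation: it takes the four DeepSparse throughput results above for the 2P configuration, assuming the chart values are ordered 1P SMT On, 1P SMT Off, 2P SMT Off, 2P SMT On.

```python
from statistics import geometric_mean

# (2P SMT On, 2P SMT Off) throughput in items/sec, More Is Better.
deepsparse_2p = {
    "NLP Token Classification, BERT base uncased conll2003": (139.88, 182.60),
    "CV Classification, ResNet-50 ImageNet": (1868.33, 2409.80),
    "NLP Document Classification, oBERT base uncased on IMDB": (139.66, 182.45),
    "NLP Text Classification, DistilBERT mnli": (1190.72, 1541.95),
}

# Ratio above 1.0 would mean SMT On is faster; below 1.0 means it is slower.
ratios = [on / off for on, off in deepsparse_2p.values()]
overall = geometric_mean(ratios)

for name, ratio in zip(deepsparse_2p, ratios):
    print(f"{name}: {ratio:.3f}")
print(f"Geometric mean of SMT On / SMT Off: {overall:.3f}")
```

Under that reading, the four models land close together, with SMT On delivering a little over three quarters of the SMT Off throughput on the two-socket system.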

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.1 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
  EPYC 9754 1P SMT On: 125.58 (SE +/- 0.13, N = 7)
  EPYC 9754 1P SMT Off: 85.29 (SE +/- 0.05, N = 6)
  EPYC 9754 2P SMT Off: 146.21 (SE +/- 0.13, N = 7)
  EPYC 9754 2P SMT On: 210.07 (SE +/- 0.16, N = 9)

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
  EPYC 9754 1P SMT On: 316.05 (SE +/- 1.02, N = 3)
  EPYC 9754 1P SMT Off: 404.23 (SE +/- 2.15, N = 3)
  EPYC 9754 2P SMT Off: 770.14 (SE +/- 0.92, N = 3)
  EPYC 9754 2P SMT On: 602.26 (SE +/- 0.72, N = 3)

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0 - Scale: 26 (sssp max_TEPS, More Is Better)
  EPYC 9754 1P SMT On: 445912000
  EPYC 9754 1P SMT Off: 493535000
  EPYC 9754 2P SMT Off: 1075900000
  EPYC 9754 2P SMT On: 960471000
  1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenSSL

OpenSSL 3.1 - Algorithm: ChaCha20 (byte/s, More Is Better)
  EPYC 9754 1P SMT On: 659346857987 (SE +/- 23916253.09, N = 3)
  EPYC 9754 1P SMT Off: 550700307433 (SE +/- 70893114.79, N = 3)
  EPYC 9754 2P SMT Off: 1100453394570 (SE +/- 107897909.37, N = 3)
  EPYC 9754 2P SMT On: 1317549954027 (SE +/- 46014499.85, N = 3)
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Embree

Embree 4.1 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
  EPYC 9754 1P SMT On: 157.65 (SE +/- 0.09, N = 8)
  EPYC 9754 1P SMT Off: 107.36 (SE +/- 0.09, N = 6)
  EPYC 9754 2P SMT Off: 178.30 (SE +/- 0.26, N = 8)
  EPYC 9754 2P SMT On: 255.99 (SE +/- 0.48, N = 9)

SPECFEM3D

SPECFEM3D 4.0 - Model: Mount St. Helens (Seconds, Fewer Is Better)
  EPYC 9754 1P SMT On: 6.216379849 (SE +/- 0.034771470, N = 5)
  EPYC 9754 1P SMT Off: 5.030031580 (SE +/- 0.069356320, N = 12)
  EPYC 9754 2P SMT Off: 2.630122599 (SE +/- 0.013893024, N = 5)
  EPYC 9754 2P SMT On: 3.906675095 (SE +/- 0.014293172, N = 5)
  1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Graph500

Graph500 3.0 - Scale: 26 (bfs max_TEPS, More Is Better)
  EPYC 9754 1P SMT On: 880249000
  EPYC 9754 1P SMT Off: 928320000
  EPYC 9754 2P SMT Off: 2078880000
  EPYC 9754 2P SMT On: 1724030000
  1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
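The Graph500 results are reported in TEPS, traversed edges per second. As a toy illustration of the metric only (this is not the Graph500 reference code, which runs over MPI on a generated graph), the sketch below runs a breadth-first search over a small hand-written undirected graph and divides the number of edges in the traversed component by the elapsed time. The function name bfs_teps and the example graph are hypothetical.

```python
import time
from collections import deque


def bfs_teps(adjacency, root):
    """Breadth-first search from root over an undirected adjacency dict.

    Returns (parent map, undirected edges traversed, edges per second).
    """
    start = time.perf_counter()
    parent = {root: root}
    queue = deque([root])
    edge_visits = 0
    while queue:
        u = queue.popleft()
        for v in adjacency[u]:
            edge_visits += 1
            if v not in parent:
                parent[v] = u
                queue.append(v)
    elapsed = time.perf_counter() - start
    # Each undirected edge appears in two adjacency lists, so halve the count.
    edges = edge_visits // 2
    teps = edges / elapsed if elapsed > 0 else float("inf")
    return parent, edges, teps


# Tiny example graph with edges 0-1, 0-2, 1-3.
graph = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}
parent, edges, teps = bfs_teps(graph, root=0)
print(f"edges traversed: {edges}, TEPS: {teps:.0f}")
```

On a graph this small the timing is dominated by measurement overhead, so the TEPS figure is meaningful only as a demonstration of how the metric is formed.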

Blender

Blender 3.6 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
  EPYC 9754 1P SMT On: 31.12 (SE +/- 0.04, N = 3)
  EPYC 9754 1P SMT Off: 38.44 (SE +/- 0.04, N = 3)
  EPYC 9754 2P SMT Off: 20.24 (SE +/- 0.04, N = 3)
  EPYC 9754 2P SMT On: 16.28 (SE +/- 0.03, N = 3)

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
  EPYC 9754 1P SMT On: 279662.55 (SE +/- 2132.06, N = 6)
  EPYC 9754 1P SMT Off: 289518.14 (SE +/- 1485.96, N = 6)
  EPYC 9754 2P SMT Off: 658754.21 (SE +/- 4916.03, N = 15)
  EPYC 9754 2P SMT On: 591505.17 (SE +/- 7199.59, N = 15)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.2

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 512 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
  EPYC 9754 1P SMT On: 1783700000 (SE +/- 1814754.35, N = 3)
  EPYC 9754 1P SMT Off: 1414700000 (SE +/- 2946183.97, N = 3)
  EPYC 9754 2P SMT Off: 2610933333 (SE +/- 3555434.03, N = 3)
  EPYC 9754 2P SMT On: 3330166667 (SE +/- 569600.25, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 0.11 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 32972 (SE +/- 17.58, N = 3)
  EPYC 9754 1P SMT Off: 43371 (SE +/- 76.61, N = 3)
  EPYC 9754 2P SMT Off: 24238 (SE +/- 17.91, N = 3)
  EPYC 9754 2P SMT On: 18455 (SE +/- 34.44, N = 3)
  1. (CXX) g++ options: -O3 -lm -ldl

OSPRay Studio 0.11 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 16484 (SE +/- 11.89, N = 3)
  EPYC 9754 1P SMT Off: 21666 (SE +/- 15.65, N = 3)
  EPYC 9754 2P SMT Off: 12127 (SE +/- 8.65, N = 3)
  EPYC 9754 2P SMT On: 9253 (SE +/- 27.17, N = 3)
  1. (CXX) g++ options: -O3 -lm -ldl

OSPRay Studio 0.11 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 13759 (SE +/- 5.78, N = 3)
  EPYC 9754 1P SMT Off: 18004 (SE +/- 18.26, N = 3)
  EPYC 9754 2P SMT Off: 10046 (SE +/- 14.52, N = 3)
  EPYC 9754 2P SMT On: 7698 (SE +/- 7.80, N = 3)
  1. (CXX) g++ options: -O3 -lm -ldl

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. More details on the CP2K benchmark test cases can be found at https://www.cp2k.org/performance. Learn more via the OpenBenchmarking.org test page.

CP2K Molecular Dynamics 2023.1 - Input: H2O-DFT-LS (Seconds, Fewer Is Better)
  EPYC 9754 1P SMT On: 5012.26
  EPYC 9754 1P SMT Off: 4957.69
  EPYC 9754 2P SMT Off: 2363.58
  EPYC 9754 2P SMT On: 2143.51
  1. (F9X) gfortran options: -fopenmp -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kdbm -lcp2kgrid -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -lhdf5 -lhdf5_hl -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

OSPRay Studio

OSPRay Studio 0.11 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 861 (SE +/- 0.88, N = 3)
  EPYC 9754 1P SMT Off: 1127 (SE +/- 1.20, N = 3)
  EPYC 9754 2P SMT Off: 631 (SE +/- 0.58, N = 3)
  EPYC 9754 2P SMT On: 482 (SE +/- 2.85, N = 3)
  1. (CXX) g++ options: -O3 -lm -ldl

OSPRay Studio 0.11 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer (ms, Fewer Is Better)
  EPYC 9754 1P SMT On: 13963 (SE +/- 6.12, N = 3)
  EPYC 9754 1P SMT Off: 18269 (SE +/- 11.93, N = 3)
  EPYC 9754 2P SMT Off: 10232 (SE +/- 5.29, N = 3)
  EPYC 9754 2P SMT On: 7817 (SE +/- 19.63, N = 3)
  1. (CXX) g++ options: -O3 -lm -ldl

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf - Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  EPYC 9754 1P SMT On: 12.00 (SE +/- 0.11, N = 4)
  EPYC 9754 1P SMT Off: 9.27 (SE +/- 0.09, N = 5)
  EPYC 9754 2P SMT Off: 15.15 (SE +/- 0.04, N = 4)
  EPYC 9754 2P SMT On: 21.65 (SE +/- 0.27, N = 4)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

OSPRay Studio

OSPRay Studio 0.11, Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1032, 1358, 763, 582. SE +/- 0.58 (N = 3), 0.33 (N = 3), 0.88 (N = 3), 0.33 (N = 3).
1. (CXX) g++ options: -O3 -lm -ldl

OSPRay Studio 0.11, Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 27885, 36539, 20545, 15731. SE +/- 21.53 (N = 3), 53.62 (N = 3), 96.56 (N = 3), 41.40 (N = 3).
1. (CXX) g++ options: -O3 -lm -ldl

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.1, Algorithm: ChaCha20-Poly1305 (byte/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 462415837320, 392782689987, 784489570523, 909817360110. SE +/- 15599558.91 (N = 3), 86138373.27 (N = 3), 39882779.79 (N = 3), 267574958.87 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OSPRay Studio

OSPRay Studio 0.11, Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 873, 1146, 645, 495. SE +/- 0.88 (N = 3), 1.53 (N = 3), 0.33 (N = 3), 1.45 (N = 3).
1. (CXX) g++ options: -O3 -lm -ldl

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data-intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0, Scale: 26 (sssp median_TEPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 333445000, 363750000, 770180000, 672541000.
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OSPRay Studio

OSPRay Studio 0.11, Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 27538, 35926, 20133, 15619. SE +/- 10.48 (N = 3), 29.21 (N = 3), 38.84 (N = 3), 64.83 (N = 3).
1. (CXX) g++ options: -O3 -lm -ldl

Blender

Blender 3.6, Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 39.27, 50.01, 27.47, 21.75. SE +/- 0.08 (N = 3), 0.14 (N = 3), 0.09 (N = 3), 0.08 (N = 3).

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 2023.03.14, Test: WPA PSK (Real C/S, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 810375, 676218, 1301500, 1524533. SE +/- 505.98 (N = 3), 6933.79 (N = 3), 15887.63 (N = 4), 18163.51 (N = 15).
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.18.1, Variant: Wownero - Hash Count: 1M (H/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 74803.6, 63182.2, 100754.3, 142082.7. SE +/- 513.11 (N = 15), 13.77 (N = 4), 741.13 (N = 4), 677.82 (N = 5).
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test of both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 4.0, Preset: Exhaustive (MT/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 8.1743, 7.3147, 14.4099, 15.9305. SE +/- 0.0006 (N = 5), 0.0028 (N = 5), 0.0257 (N = 6), 0.0081 (N = 6).
1. (CXX) g++ options: -O3 -flto -pthread

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3, Model: Person Detection FP32 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 2378.07, 1102.82, 1579.14, 1552.15. SE +/- 12.47 (N = 15), 10.77 (N = 5), 4.61 (N = 3), 8.82 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 582.08, 545.13, 1174.99, 1159.99. SE +/- 6.51 (N = 15), 4.46 (N = 15), 4.51 (N = 3), 7.44 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Person Detection FP16 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 2377.79, 1105.96, 1545.23, 1559.95. SE +/- 16.41 (N = 12), 8.36 (N = 9), 10.43 (N = 3), 4.07 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution of SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Layered Halfspace (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 15.964931688, 15.028121929, 7.470772930, 9.997447570. SE +/- 0.034780206 (N = 3), 0.159428812 (N = 3), 0.056354216 (N = 4), 0.066481426 (N = 15).
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blender 3.6, Blend File: Barbershop - Compute: CPU-Only (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 116.54, 147.22, 85.54, 69.03. SE +/- 0.11 (N = 3), 0.17 (N = 3), 0.08 (N = 3), 0.19 (N = 3).

Blender 3.6, Blend File: BMW27 - Compute: CPU-Only (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 12.77, 15.15, 8.45, 7.12. SE +/- 0.02 (N = 4), 0.05 (N = 4), 0.05 (N = 5), 0.02 (N = 6).

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.
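For readers unfamiliar with the term, a vampire number has an even number of digits and factors into two "fangs" of half that length which together use exactly the digits of the original, with the fangs not both ending in zero. The following is a brute-force sketch of that definition only; it is not Helsing's algorithm, and the function name `fangs` is made up for illustration:

```python
def fangs(n):
    """Return all fang pairs (a, b) with a <= b for n, or [] if n is not a vampire number."""
    digits = str(n)
    if len(digits) % 2:
        return []  # vampire numbers have an even digit count
    half = len(digits) // 2
    lo, hi = 10 ** (half - 1), 10 ** half  # fangs must have exactly `half` digits
    target = sorted(digits)
    pairs = []
    for a in range(lo, hi):
        if a * a > n:
            break  # only consider a <= b to avoid duplicate pairs
        if n % a:
            continue
        b = n // a
        if not lo <= b < hi:
            continue
        if a % 10 == 0 and b % 10 == 0:
            continue  # both fangs may not end in zero
        if sorted(str(a) + str(b)) == target:
            pairs.append((a, b))
    return pairs


print(fangs(1260))  # 1260 = 21 * 60
print([n for n in range(1000, 10000) if fangs(n)])
```

Trial division like this is hopeless at the 14-digit range benchmarked here, which is why a dedicated multi-threaded generator is used.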

Helsing 1.0-beta, Digit Range: 14 digit (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 50.47, 57.96, 50.05, 27.28. SE +/- 0.09 (N = 3), 0.03 (N = 3), 0.31 (N = 3), 0.37 (N = 3).
1. (CC) gcc options: -O2 -pthread

Graph500

Graph500 3.0, Scale: 26 (bfs median_TEPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 857890000, 893624000, 1815160000, 1571100000.
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenSSL

OpenSSL 3.1, Algorithm: RSA4096 (verify/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1890935.3, 1799293.0, 3598946.3, 3782091.8. SE +/- 405.88 (N = 3), 1031.37 (N = 3), 1067.40 (N = 3), 163.02 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 128129.56, 136942.13, 268721.05, 249109.09. SE +/- 296.36 (N = 10), 104.98 (N = 10), 599.41 (N = 10), 1284.96 (N = 11).
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

ASTC Encoder

ASTC Encoder 4.0, Preset: Fast (MT/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1190.75, 1278.65, 693.40, 610.11. SE +/- 1.79 (N = 6), 1.20 (N = 7), 1.25 (N = 5), 1.29 (N = 5).
1. (CXX) g++ options: -O3 -flto -pthread

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 2.12, Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 32.68, 25.72, 44.71, 53.83. SE +/- 0.07 (N = 3), 0.01 (N = 3), 0.02 (N = 3), 0.09 (N = 3).

OpenSSL

OpenSSL 3.1, Algorithm: RSA4096 (sign/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 54195.1, 56647.3, 113251.4, 108490.5. SE +/- 16.66 (N = 3), 1.80 (N = 3), 3.85 (N = 3), 3.93 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

John The Ripper

John The Ripper 2023.03.14, Test: MD5 (Real C/S, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 20312667, 16751667, 30221667, 34879333. SE +/- 52818.35 (N = 3), 35950.58 (N = 3), 140717.61 (N = 3), 83819.91 (N = 3).
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OSPRay

OSPRay 2.12, Benchmark: particle_volume/scivis/real_time (Items Per Second, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 30.82, 23.72, 41.55, 49.23. SE +/- 0.02 (N = 3), 0.01 (N = 3), 0.03 (N = 3), 0.03 (N = 3).

Blender

Blender 3.6, Blend File: Fishy Cat - Compute: CPU-Only (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 16.49, 20.47, 11.72, 9.87. SE +/- 0.06 (N = 3), 0.06 (N = 3), 0.04 (N = 4), 0.02 (N = 5).

OSPRay

OSPRay 2.12, Benchmark: particle_volume/ao/real_time (Items Per Second, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 30.85, 23.73, 41.66, 49.13. SE +/- 0.03 (N = 3), 0.02 (N = 3), 0.02 (N = 3), 0.05 (N = 3).

OSPRay 2.12, Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 31.92, 25.51, 44.29, 52.73. SE +/- 0.09 (N = 3), 0.03 (N = 3), 0.07 (N = 3), 0.03 (N = 3).

OpenSSL

OpenSSL 3.1, Algorithm: SHA512 (byte/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 53005879330, 51804333637, 102232233590, 106049655203. SE +/- 4276543.39 (N = 3), 19506083.54 (N = 3), 660141733.06 (N = 3), 22577791.79 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenVINO

OpenVINO 2022.3, Model: Face Detection FP16 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1049.31, 513.23, 526.46, 526.98. SE +/- 0.05 (N = 3), 0.07 (N = 3), 0.28 (N = 3), 0.18 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenSSL

OpenSSL 3.1, Algorithm: AES-128-GCM (byte/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1169673735557, 1152838928063, 2307524535457, 2339890700107. SE +/- 404585301.69 (N = 3), 772883363.35 (N = 3), 4718005576.14 (N = 3), 4494053056.14 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL 3.1, Algorithm: AES-256-GCM (byte/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1012537766210, 997034621883, 1998234148863, 2019317070327. SE +/- 2018584981.53 (N = 3), 917614258.55 (N = 3), 2904032106.06 (N = 3), 676471418.80 (N = 3).
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenVINO

OpenVINO 2022.3, Model: Machine Translation EN To DE FP16 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 110.00, 58.71, 54.42, 55.12. SE +/- 1.12 (N = 15), 0.45 (N = 15), 0.21 (N = 3), 0.35 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Face Detection FP16-INT8 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 541.29, 268.82, 271.24, 270.93. SE +/- 0.09 (N = 3), 0.06 (N = 3), 0.06 (N = 3), 0.07 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Face Detection FP16-INT8 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 117.88, 119.01, 235.37, 235.65. SE +/- 0.04 (N = 3), 0.02 (N = 3), 0.08 (N = 3), 0.09 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Face Detection FP16 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 60.77, 62.18, 121.08, 121.03. SE +/- 0.02 (N = 3), 0.01 (N = 3), 0.02 (N = 3), 0.05 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.5, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (items/sec, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1376.21, 1780.44, 2730.50, 2627.31. SE +/- 1.21 (N = 3), 3.84 (N = 3), 33.47 (N = 15), 6.95 (N = 3).

ASTC Encoder

ASTC Encoder 4.0, Preset: Thorough (MT/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 75.08, 68.31, 127.21, 134.89. SE +/- 0.02 (N = 6), 0.01 (N = 6), 0.05 (N = 6), 0.22 (N = 6).
1. (CXX) g++ options: -O3 -flto -pthread

OpenVINO

OpenVINO 2022.3, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 11794.32, 11710.59, 22955.64, 22954.44. SE +/- 2.13 (N = 3), 13.11 (N = 3), 15.58 (N = 3), 16.03 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3, Model: Weld Porosity Detection FP16 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 6067.78, 5837.62, 11373.95, 11299.32. SE +/- 0.58 (N = 3), 11.25 (N = 3), 2.58 (N = 3), 9.34 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 0.20702, 0.20595, 0.13969, 0.10646. SE +/- 0.00095 (N = 4), 0.00018 (N = 4), 0.00135 (N = 5), 0.00040 (N = 3).

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6, Threads: 256 - Buffer Length: 256 - Filter Length: 512 (samples/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1696766667, 1313933333, 2542633333, 2544400000. SE +/- 470224.53 (N = 3), 592546.29 (N = 3), 1068228.02 (N = 3), 1877054.43 (N = 3).
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

OpenVINO 2022.3, Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 6148.89, 5118.29, 9878.88, 9889.61. SE +/- 48.86 (N = 15), 5.00 (N = 3), 10.15 (N = 3), 11.31 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1, Build: allmodconfig (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 227.52, 177.76, 118.10, 145.93. SE +/- 1.35 (N = 3), 0.48 (N = 3), 0.55 (N = 3), 0.51 (N = 3).

OpenVINO

OpenVINO 2022.3, Model: Weld Porosity Detection FP16 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 10.53, 5.47, 5.61, 5.64. SE +/- 0.00 (N = 3), 0.01 (N = 3), 0.00 (N = 3), 0.00 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
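As a reminder of the underlying technique, here is a minimal textbook sieve of Eratosthenes. It is only a sketch: Primesieve itself uses a far more elaborate, cache-friendly implementation, and the function name `sieve` below is an illustrative choice rather than anything from Primesieve's API.

```python
def sieve(limit):
    """Return all primes <= limit using a plain sieve of Eratosthenes."""
    if limit < 2:
        return []
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross off every multiple of p, starting at p*p.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytes(count)
        p += 1
    return [i for i, flag in enumerate(is_prime) if flag]


print(sieve(30))
```

The flag array for this sketch grows linearly with the limit, so it would not fit in cache (or memory) at the 1e12 and 1e13 lengths benchmarked here; that is the problem a segmented sieve addresses.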

Primesieve 8.0, Length: 1e13 (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 21.29, 21.13, 11.15, 11.12. SE +/- 0.02 (N = 3), 0.02 (N = 3), 0.03 (N = 5), 0.01 (N = 5).
1. (CXX) g++ options: -O3

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

libxsmm 2-1.17-3645, M N K: 256 (GFLOPS/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 3331.7, 3813.4, 6373.0, 6112.6. SE +/- 2.32 (N = 3), 16.75 (N = 3), 66.66 (N = 9), 1.43 (N = 3).
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threading Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.
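To illustrate why a workload of this kind parallelizes so readily, the sketch below shows the classic Mandelbrot escape-time iteration, in which every pixel is computed independently of every other. It is a generic textbook version, not toyBrot's code; the function names, image size, and iteration limit are arbitrary choices for the example.

```python
def escape_time(c, max_iter=100):
    """Count iterations of z -> z*z + c before |z| exceeds 2 (max_iter if it never does)."""
    z = 0j
    for i in range(max_iter):
        z = z * z + c
        if abs(z) > 2.0:
            return i
    return max_iter


def render(width=60, height=24, max_iter=50):
    """Render the set as ASCII rows; each pixel is independent of the others."""
    rows = []
    for y in range(height):
        im = -1.2 + 2.4 * y / (height - 1)
        row = ""
        for x in range(width):
            re = -2.0 + 3.0 * x / (width - 1)
            inside = escape_time(complex(re, im), max_iter) == max_iter
            row += "#" if inside else "."
        rows.append(row)
    return rows


print("\n".join(render()))
```

Because no pixel depends on another, the outer loop can be split across threads with essentially no synchronization, which is the pattern an OpenMP or TBB implementation would exploit.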

toyBrot Fractal Generator 2020-11-18, Implementation: OpenMP (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 4081, 6242, 3671, 3321. SE +/- 17.08 (N = 8), 0.20 (N = 7), 53.03 (N = 15), 23.29 (N = 12).
1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

OpenVINO

OpenVINO 2022.3, Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 85400.88, 71673.85, 118225.55, 133931.53. SE +/- 192.97 (N = 3), 126.57 (N = 3), 420.06 (N = 3), 602.14 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

miniBUDE

MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (GFInst/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 5972.64, 5903.21, 10989.94, 7888.09. SE +/- 0.21 (N = 3), 6.13 (N = 3), 150.60 (N = 12), 10.75 (N = 4).
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 238.91, 236.13, 439.60, 315.52. SE +/- 0.01 (N = 3), 0.25 (N = 3), 6.02 (N = 12), 0.43 (N = 4).
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 5300.29, 5315.15, 8635.30, 9849.01. SE +/- 27.01 (N = 6), 29.88 (N = 6), 105.10 (N = 15), 104.60 (N = 15).
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 292243.61, 298801.44, 536518.74, 491231.83. SE +/- 396.74 (N = 5), 269.21 (N = 5), 3792.90 (N = 12), 4903.75 (N = 15).
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 245.54, 248.56, 430.27, 433.60. SE +/- 0.52 (N = 5), 0.09 (N = 5), 0.77 (N = 6), 0.86 (N = 7).
1. (CXX) g++ options: -O3

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 131909.91, 133415.42, 231041.57, 224243.28. SE +/- 290.95 (N = 4), 228.86 (N = 4), 1880.70 (N = 6), 1293.24 (N = 6).
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6, Scene: LuxCore Benchmark - Acceleration: CPU (M samples/sec, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 12.18, 8.88, 6.97, 9.86. SE +/- 0.09 (N = 15), 0.08 (N = 8), 0.11 (N = 12), 0.11 (N = 15).

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, more is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 128.12, 128.60, 223.58, 221.77. SE +/- 0.15 (N = 4), 0.01 (N = 4), 1.28 (N = 5), 1.37 (N = 5).
1. (CXX) g++ options: -O3

Primesieve

Primesieve 8.0, Length: 1e12 (Seconds, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 1.944, 1.796, 1.160, 1.508. SE +/- 0.006 (N = 11), 0.003 (N = 11), 0.003 (N = 12), 0.010 (N = 14).
1. (CXX) g++ options: -O3

OpenVINO

OpenVINO 2022.3, Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, fewer is better). Systems: EPYC 9754 1P and 2P, SMT On and SMT Off.
Results: 10.40, 6.24, 6.47, 6.46. SE +/- 0.08 (N = 15), 0.01 (N = 3), 0.01 (N = 3), 0.01 (N = 3).
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 256 - Model: GoogLeNet
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 504.09 (SE +/- 3.39, N = 3); 525.39 (SE +/- 0.35, N = 3); 452.99 (SE +/- 5.52, N = 4); 329.22 (SE +/- 1.69, N = 3).

Appleseed

Appleseed is an open-source production renderer: a physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

Appleseed 2.0 Beta - Scene: Material Tester
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Three results in chart order: 166.68; 167.90; 265.89.

Scene: Material Tester

EPYC 9754 2P: SMT On: The test run did not produce a result.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C
Total Mop/s, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 140791.52 (SE +/- 1029.98, N = 8); 147448.72 (SE +/- 93.32, N = 8); 224178.10 (SE +/- 2978.54, N = 13); 211432.89 (SE +/- 1757.80, N = 9).
Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Compression Rating
MIPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 726271 (SE +/- 1345.49, N = 3); 592741 (SE +/- 4792.89, N = 3); 771462 (SE +/- 4721.80, N = 3); 925820 (SE +/- 5539.49, N = 3).
Notes: (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 1.3.1 - Benchmark: vklBenchmark ISPC
Items / Sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1396 (SE +/- 1.86, N = 3); 1107 (SE +/- 0.33, N = 3); 1530 (SE +/- 1.33, N = 3); 1720 (SE +/- 6.06, N = 3).

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 512 - Model: ResNet-50
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 122.77 (SE +/- 0.90, N = 3); 123.88 (SE +/- 1.35, N = 3); 189.40 (SE +/- 0.13, N = 3); 172.36 (SE +/- 0.59, N = 3).

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Person Detection FP16 - Device: CPU
FPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 26.61 (SE +/- 0.20, N = 12); 28.74 (SE +/- 0.23, N = 9); 41.05 (SE +/- 0.29, N = 3); 40.67 (SE +/- 0.14, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Person Detection FP32 - Device: CPU
FPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 26.57 (SE +/- 0.15, N = 15); 28.83 (SE +/- 0.29, N = 5); 40.20 (SE +/- 0.14, N = 3); 40.90 (SE +/- 0.22, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 512 - Model: GoogLeNet
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 416.03 (SE +/- 4.74, N = 12); 429.16 (SE +/- 5.79, N = 12); 634.13 (SE +/- 6.73, N = 3); 538.52 (SE +/- 4.51, N = 15).

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
FPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 120515.11 (SE +/- 390.62, N = 3); 113162.29 (SE +/- 202.90, N = 3); 142135.58 (SE +/- 158.47, N = 3); 168372.89 (SE +/- 676.62, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C
Total Mop/s, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 45686.88 (SE +/- 441.00, N = 15); 48672.24 (SE +/- 285.31, N = 8); 66822.97 (SE +/- 568.39, N = 8); 67554.74 (SE +/- 348.18, N = 8).
Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1 - Build: defconfig
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 26.23 (SE +/- 0.23, N = 7); 22.98 (SE +/- 0.21, N = 7); 18.47 (SE +/- 0.12, N = 13); 20.34 (SE +/- 0.14, N = 13).

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: DLSC - Acceleration: CPU
M samples/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 16.34 (SE +/- 0.10, N = 3); 13.27 (SE +/- 0.05, N = 3); 14.45 (SE +/- 0.10, N = 3); 18.61 (SE +/- 0.20, N = 4).

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.5 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 859.87 (SE +/- 0.14, N = 3); 644.98 (SE +/- 0.16, N = 3); 676.81 (SE +/- 0.02, N = 3); 902.86 (SE +/- 0.27, N = 3).

Neural Magic DeepSparse 1.5 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 859.88 (SE +/- 0.49, N = 3); 645.00 (SE +/- 0.12, N = 3); 676.43 (SE +/- 0.21, N = 3); 902.78 (SE +/- 0.45, N = 3).

Neural Magic DeepSparse 1.5 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 65.97 (SE +/- 0.05, N = 3); 49.51 (SE +/- 0.20, N = 3); 51.81 (SE +/- 0.04, N = 3); 68.36 (SE +/- 0.06, N = 3).

Neural Magic DeepSparse 1.5 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 102.23 (SE +/- 0.04, N = 3); 77.81 (SE +/- 0.37, N = 3); 81.20 (SE +/- 0.05, N = 3); 107.18 (SE +/- 0.07, N = 3).

Neural Magic DeepSparse 1.5 - Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 46.43 (SE +/- 0.04, N = 3); 35.55 (SE +/- 0.10, N = 3); 45.88 (SE +/- 0.57, N = 15); 48.62 (SE +/- 0.13, N = 3).

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.0 - Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only
Images / Sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1.74 (SE +/- 0.00, N = 4); 1.72 (SE +/- 0.01, N = 4); 2.09 (SE +/- 0.02, N = 5); 2.35 (SE +/- 0.00, N = 5).

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.5 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 201.63 (SE +/- 0.54, N = 3); 156.06 (SE +/- 0.81, N = 3); 162.02 (SE +/- 0.09, N = 3); 211.48 (SE +/- 0.18, N = 3).

Neural Magic DeepSparse 1.5 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 152.99 (SE +/- 0.16, N = 3); 125.45 (SE +/- 1.18, N = 15); 118.13 (SE +/- 0.10, N = 3); 159.89 (SE +/- 0.05, N = 3).

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

MariaDB 11.0.1 - Clients: 2048
Queries Per Second, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 780 (SE +/- 1.81, N = 3); 783 (SE +/- 1.06, N = 3); 591 (SE +/- 0.73, N = 3); 580 (SE +/- 8.04, N = 3).
Notes: (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Appleseed

Appleseed is an open-source production renderer: a physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

Appleseed 2.0 Beta - Scene: Emily
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 122.59; 123.30; 159.91; 164.57.

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.0 - Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only
Images / Sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 3.62 (SE +/- 0.00, N = 7); 3.57 (SE +/- 0.02, N = 15); 4.31 (SE +/- 0.03, N = 15); 4.78 (SE +/- 0.01, N = 7).

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
ms, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 0.83 (SE +/- 0.00, N = 3); 0.70 (SE +/- 0.00, N = 3); 0.67 (SE +/- 0.00, N = 3); 0.62 (SE +/- 0.00, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

Aircrack-ng 1.7
k/s, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Three results in chart order: 171120.35 (SE +/- 101.44, N = 3); 149056.95 (SE +/- 1020.90, N = 3); 128050.22 (SE +/- 1109.71, N = 3).
Notes: (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpcre -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread

EPYC 9754 2P: SMT On: The test run did not produce a result.

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 2.0 - Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only
Images / Sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 3.62 (SE +/- 0.00, N = 7); 3.57 (SE +/- 0.02, N = 7); 4.35 (SE +/- 0.03, N = 7); 4.76 (SE +/- 0.01, N = 7).

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 256 - Model: AlexNet
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1422.08 (SE +/- 3.81, N = 3); 1375.33 (SE +/- 8.29, N = 3); 1581.66 (SE +/- 11.49, N = 15); 1225.41 (SE +/- 14.23, N = 15).

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

MariaDB 11.0.1 - Clients: 4096
Queries Per Second, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 655 (SE +/- 8.08, N = 3); 695 (SE +/- 3.99, N = 3); 579 (SE +/- 1.48, N = 3); 545 (SE +/- 5.45, N = 6).
Notes: (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Ninja
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 125.36 (SE +/- 0.29, N = 3); 119.58 (SE +/- 0.18, N = 3); 99.22 (SE +/- 0.70, N = 3); 107.72 (SE +/- 0.91, N = 3).

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 512 - Model: AlexNet
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1628.80 (SE +/- 1.79, N = 3); 1526.81 (SE +/- 3.13, N = 3); 1908.76 (SE +/- 18.22, N = 3); 1770.91 (SE +/- 15.22, N = 15).

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 19.8.1 - Time To Compile
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 113.70 (SE +/- 0.01, N = 3); 116.12 (SE +/- 0.07, N = 3); 93.95 (SE +/- 0.04, N = 3); 93.27 (SE +/- 0.09, N = 3).

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.12 - Device: CPU - Batch Size: 256 - Model: ResNet-50
images/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 118.45 (SE +/- 1.07, N = 12); 121.47 (SE +/- 1.45, N = 12); 146.74 (SE +/- 0.68, N = 3); 124.98 (SE +/- 1.70, N = 3).

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2 - Problem Size: Small
CG Mflops, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 51784.1 (SE +/- 52.11, N = 5); 51741.0 (SE +/- 25.22, N = 5); 53798.6 (SE +/- 478.13, N = 5); 62774.2 (SE +/- 411.42, N = 5).
Notes: (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
ms, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1.21 (SE +/- 0.00, N = 3); 1.30 (SE +/- 0.00, N = 3); 1.13 (SE +/- 0.00, N = 3); 1.08 (SE +/- 0.00, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Appleseed

Appleseed is an open-source production renderer: a physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

Appleseed 2.0 Beta - Scene: Disney Material
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Three results in chart order: 44.30; 38.49; 40.57.

Scene: Disney Material

EPYC 9754 2P: SMT On: The test run did not produce a result.

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation 21.2 - Time To Compile
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 161.65 (SE +/- 0.32, N = 3); 148.48 (SE +/- 0.50, N = 3); 148.38 (SE +/- 1.24, N = 3); 152.59 (SE +/- 1.29, N = 3).

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Unix Makefiles
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 211.08 (SE +/- 0.15, N = 3); 213.91 (SE +/- 0.70, N = 3); 198.75 (SE +/- 0.11, N = 3); 199.37 (SE +/- 0.14, N = 3).

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 4.0 - Time To Compile
Seconds, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 105.80 (SE +/- 0.14, N = 3); 102.59 (SE +/- 0.17, N = 3); 100.24 (SE +/- 0.30, N = 3); 100.79 (SE +/- 1.01, N = 3).

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.3 - Model: Weld Porosity Detection FP16-INT8 - Device: CPU
ms, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 10.84 (SE +/- 0.00, N = 3); 10.91 (SE +/- 0.01, N = 3); 11.10 (SE +/- 0.01, N = 3); 11.00 (SE +/- 0.00, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

nekRS

nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

nekRS 23.0 - Input: Kershaw
flops/rank, more is better. Series: SMT On, SMT Off.
Results in chart order: 5808636667 (SE +/- 8226739.60, N = 3); 5734266667 (SE +/- 37190783.06, N = 3).
Notes: (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: Kershaw

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

nekRS 23.0 - CPU Power Consumption Monitor
Watts, fewer is better. Series: SMT On, SMT Off.
Readings in chart order: Min 21.11 / Avg 288.26 / Max 349.64; Min 20.65 / Avg 287.58 / Max 350.2.

nekRS 23.0 - Input: TurboPipe Periodic
flops/rank Per Watt, more is better. Series: SMT On, SMT Off.
Results in chart order: 8806106.19; 8992432.84.

CPU Power Consumption Monitor

CPU Power Consumption Monitor - Phoronix Test Suite System Monitoring
Watts. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 10.52 / Avg 248.93 / Max 397.25; Min 10.61 / Avg 238.75 / Max 362.1; Min 97.83 / Avg 446.01 / Max 702.85; Min 21.61 / Avg 460.57 / Max 792.38.

OpenVINO

OpenVINO 2022.3 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 21.17 / Avg 297.12 / Max 335.54; Min 20.85 / Avg 299.11 / Max 323.89; Min 195.61 / Avg 613.08 / Max 656.04; Min 42.91 / Avg 602.26 / Max 657.52.

OpenVINO 2022.3 - Model: Vehicle Detection FP16-INT8 - Device: CPU
ms, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 12.07 (SE +/- 0.20, N = 15); 4.74 (SE +/- 0.00, N = 3); 4.86 (SE +/- 0.00, N = 3); 4.83 (SE +/- 0.00, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Vehicle Detection FP16-INT8 - Device: CPU
FPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 5311.48 (SE +/- 103.73, N = 15); 6744.66 (SE +/- 3.11, N = 3); 13141.51 (SE +/- 2.74, N = 3); 13214.20 (SE +/- 7.09, N = 3).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO 2022.3 - Model: Vehicle Detection FP16 - Device: CPU
FPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1464.29 (SE +/- 21.50, N = 14); 2784.67 (SE +/- 25.69, N = 13); 6242.68 (SE +/- 87.31, N = 13); 5810.41 (SE +/- 93.68, N = 15).
Notes: (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Neural Magic DeepSparse 1.5 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 21.09 / Avg 235.18 / Max 327.39; Min 20.67 / Avg 228.97 / Max 321.66; Min 101.92 / Avg 460.64 / Max 658.21; Min 42.89 / Avg 453.79 / Max 677.58.

Neural Magic DeepSparse 1.5 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 499.59 (SE +/- 0.68, N = 3); 468.78 (SE +/- 8.17, N = 15); 426.74 (SE +/- 3.79, N = 3); 521.31 (SE +/- 0.26, N = 3).

Neural Magic DeepSparse 1.5 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream
items/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 126.93 (SE +/- 0.15, N = 3); 134.57 (SE +/- 2.61, N = 15); 290.76 (SE +/- 2.51, N = 3); 242.96 (SE +/- 0.12, N = 3).

Neural Magic DeepSparse 1.5 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 20.98 / Avg 231.96 / Max 341.24; Min 20.52 / Avg 236.78 / Max 330.52; Min 192.96 / Avg 504.41 / Max 673.83; Min 42.65 / Avg 496.12 / Max 694.42.

Neural Magic DeepSparse 1.5 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream
ms/batch, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 267.71 (SE +/- 6.66, N = 15); 259.28 (SE +/- 6.30, N = 15); 212.68 (SE +/- 3.96, N = 15); 229.16 (SE +/- 2.29, N = 15).

Neural Magic DeepSparse 1.5 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream
items/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 240.87 (SE +/- 7.96, N = 15); 246.37 (SE +/- 8.20, N = 15); 590.88 (SE +/- 11.89, N = 15); 556.94 (SE +/- 6.10, N = 15).

PostgreSQL

PostgreSQL 15 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 19.92 / Avg 141 / Max 209.56; Min 19.47 / Avg 137.98 / Max 261.97; Min 191.46 / Avg 297.52 / Max 385.38; Min 40.03 / Avg 294.82 / Max 404.71.

PostgreSQL 15 - Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency
ms, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 0.952 (SE +/- 0.040, N = 9); 0.917 (SE +/- 0.020, N = 12); 1.018 (SE +/- 0.006, N = 3); 0.974 (SE +/- 0.025, N = 12).
Notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 15 - Scaling Factor: 1000 - Clients: 800 - Mode: Read Only
TPS, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 855569 (SE +/- 45813.37, N = 9); 877722 (SE +/- 21185.07, N = 12); 785968 (SE +/- 4149.87, N = 3); 827878 (SE +/- 23693.25, N = 12).
Notes: (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
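The average-latency and TPS figures for this PostgreSQL workload can be cross-checked against each other. With a fixed number of concurrent clients, throughput is roughly the client count divided by the average latency (Little's law). The sketch below applies that estimate to the four latency/TPS pairs reported above, paired in chart order. It assumes each client keeps exactly one transaction in flight, which is a simplification; small gaps are expected because both figures are averages over several runs.

```python
CLIENTS = 800  # from the test arguments: "Clients: 800"


def estimated_tps(clients, avg_latency_ms):
    """Little's law estimate: throughput = concurrency / latency."""
    return clients / (avg_latency_ms / 1000.0)


# (average latency in ms, reported TPS), paired in the order the charts list them.
reported = [(0.952, 855569), (0.917, 877722), (1.018, 785968), (0.974, 827878)]

for latency_ms, tps in reported:
    est = estimated_tps(CLIENTS, latency_ms)
    gap = (est - tps) / tps
    print(f"{latency_ms:.3f} ms -> ~{est:,.0f} TPS estimated, "
          f"{tps:,} reported ({gap:+.1%})")
```

The estimates land within a few percent of the reported TPS, which suggests the two charts describe the same runs from two angles rather than independent measurements.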

Stockfish

Stockfish 15 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 21.02 / Avg 326.55 / Max 356.32; Min 20.41 / Avg 309.55 / Max 341.8; Min 41.98 / Avg 598.99 / Max 681.91; Min 42.5 / Avg 649.7 / Max 710.41.

Stockfish 15 - Total Time
Nodes Per Second Per Watt, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 1117842.33; 881021.76; 746293.69; 896399.09.

Stockfish 15 - Total Time
Nodes Per Second, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 365034349 (SE +/- 7021012.10, N = 12); 272722940 (SE +/- 5762415.88, N = 15); 447023143 (SE +/- 6859221.31, N = 12); 582386924 (SE +/- 9265130.36, N = 15).
Notes: (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
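The "Per Watt" figures in this section appear consistent with dividing each benchmark result by the average CPU power recorded during that test. The sketch below checks this for the Stockfish numbers, pairing values in chart order. This is an observation about the reported data, not a statement of how the Phoronix Test Suite computes the metric internally; the helper name `per_watt` is made up for this example.

```python
def per_watt(result, avg_watts):
    """Efficiency as result divided by average power draw."""
    return result / avg_watts


# Stockfish 15, in chart order:
# (nodes per second, average CPU watts, reported nodes per second per watt)
rows = [
    (365034349, 326.55, 1117842.33),
    (272722940, 309.55, 881021.76),
    (447023143, 598.99, 746293.69),
    (582386924, 649.70, 896399.09),
]

for nodes_per_sec, avg_watts, reported in rows:
    computed = per_watt(nodes_per_sec, avg_watts)
    print(f"{nodes_per_sec:>11,} / {avg_watts:6.2f} W = {computed:,.0f} "
          f"(reported {reported:,.2f})")
```

The computed values agree with the reported ones to within rounding of the average power reading, so the same division can be applied to other tests in this file that list a result alongside a power monitor chart.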

LuxCoreRender

LuxCoreRender 2.6 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 20.9 / Avg 142.31 / Max 202.02; Min 20.14 / Avg 136.45 / Max 184.56; Min 40.47 / Avg 279.04 / Max 366.94; Min 41.26 / Avg 281.59 / Max 380.32.

LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: CPU
M samples/sec Per Watt, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 0.147; 0.105; 0.048; 0.068.

LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: CPU
M samples/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 20.88 (SE +/- 0.03, N = 5); 14.37 (SE +/- 0.04, N = 4); 13.51 (SE +/- 0.35, N = 15); 19.02 (SE +/- 0.03, N = 5).

LuxCoreRender 2.6 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 21.26 / Avg 298.35 / Max 337.28; Min 20.39 / Avg 293.46 / Max 329.9; Min 41.2 / Avg 501.47 / Max 651.41; Min 42.69 / Avg 558.11 / Max 673.06.

LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: CPU
M samples/sec Per Watt, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 0.082; 0.071; 0.050; 0.062.

LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: CPU
M samples/sec, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 24.47 (SE +/- 0.30, N = 15); 20.98 (SE +/- 0.03, N = 3); 25.11 (SE +/- 0.60, N = 15); 34.45 (SE +/- 1.35, N = 15).

srsRAN Project

srsRAN Project 23.5 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 20.98 / Avg 261.42 / Max 327.2; Min 20.63 / Avg 260.57 / Max 317.08; Min 41.48 / Avg 517.72 / Max 634.9; Min 43.09 / Avg 560.78 / Max 670.82.

srsRAN Project 23.5 - Test: PUSCH Processor Benchmark, Throughput Total
Mbps Per Watt, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 32.09; 78.41; 70.64; 31.90.

srsRAN Project 23.5 - Test: PUSCH Processor Benchmark, Throughput Total
Mbps, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 8389.0 (SE +/- 50.60, N = 3); 20430.9 (SE +/- 54.33, N = 3); 36573.8 (SE +/- 211.99, N = 3); 17891.4 (SE +/- 831.80, N = 15).
Notes: (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Xmrig

Xmrig 6.18.1 - CPU Power Consumption Monitor
Watts, fewer is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Readings in chart order: Min 20.63 / Avg 221.93 / Max 329.48; Min 20.83 / Avg 268.02 / Max 330.69; Min 40.87 / Avg 462.3 / Max 653.34; Min 41.43 / Avg 488.1 / Max 682.33.

Xmrig 6.18.1 - Variant: Monero - Hash Count: 1M
H/s Per Watt, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 109.99; 191.10; 185.91; 177.29.

Xmrig 6.18.1 - Variant: Monero - Hash Count: 1M
H/s, more is better. Series: EPYC 9754 1P and 2P, each with SMT On and SMT Off.
Results in chart order: 24409.4 (SE +/- 587.76, N = 15); 51218.9 (SE +/- 513.99, N = 3); 85946.0 (SE +/- 83.35, N = 4); 86533.7 (SE +/- 871.70, N = 4).
Notes: (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

nekRS

nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

nekRS 23.0 - Input: TurboPipe Periodic
flops/rank, more is better. Series: SMT On, SMT Off.
Results in chart order: 2538406923 (SE +/- 62315945.32, N = 13); 2586083333 (SE +/- 87729075.59, N = 15).
Notes: (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: TurboPipe Periodic

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

SPECFEM3D

SPECFEM3D 4.0 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.79 / Avg: 218.63 / Max: 337.15
  Min: 20.06 / Avg: 188.68 / Max: 325.98
  Min: 40.89 / Avg: 314 / Max: 658.8
  Min: 41.07 / Avg: 338.57 / Max: 663.27

SPECFEM3D 4.0 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  9.404348500 (SE +/- 0.048294894, N = 4)
  6.092750299 (SE +/- 0.044108994, N = 15)
  3.451830169 (SE +/- 0.065109589, N = 12)
  4.727830148 (SE +/- 0.120152278, N = 15)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D 4.0 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.85 / Avg: 203.93 / Max: 337.24
  Min: 20.14 / Avg: 175.44 / Max: 323.55
  Min: 41.07 / Avg: 299.85 / Max: 655.45
  Min: 42.2 / Avg: 321.36 / Max: 664.42

SPECFEM3D 4.0 - Model: Tomographic Model (Seconds, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  7.480674706 (SE +/- 0.080164897, N = 5)
  4.937265506 (SE +/- 0.048140615, N = 15)
  2.709205428 (SE +/- 0.014728964, N = 6)
  3.782741798 (SE +/- 0.086611165, N = 15)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe - Highly Efficient FFT for Exascale 2.3 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.59 / Avg: 233.52 / Max: 280.97
  Min: 20.21 / Avg: 229.75 / Max: 280.78
  Min: 41.26 / Avg: 404.72 / Max: 596.32
  Min: 43.55 / Avg: 408.39 / Max: 622.11

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  0.284
  0.296
  0.521
  0.507

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  66.34 (SE +/- 3.14, N = 15)
  67.99 (SE +/- 3.41, N = 15)
  210.93 (SE +/- 1.62, N = 5)
  207.20 (SE +/- 0.28, N = 5)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale 2.3 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.58 / Avg: 248.88 / Max: 282.9
  Min: 20.26 / Avg: 246.69 / Max: 279.29
  Min: 40.61 / Avg: 446.09 / Max: 590.49
  Min: 41.88 / Avg: 458.33 / Max: 608.28

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  0.140
  0.144
  0.252
  0.239

HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  34.85 (SE +/- 2.09, N = 15)
  35.40 (SE +/- 2.10, N = 15)
  112.41 (SE +/- 1.48, N = 3)
  109.65 (SE +/- 0.77, N = 3)
1. (CXX) g++ options: -O3

libxsmm

libxsmm 2-1.17-3645 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.34 / Avg: 205.01 / Max: 330.64
  Min: 20.02 / Avg: 204.17 / Max: 323.63
  Min: 41.28 / Avg: 324.27 / Max: 648.28
  Min: 42.27 / Avg: 322.92 / Max: 664.29

libxsmm 2-1.17-3645 - M N K: 128 (GFLOPS/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  13.24
  13.21
  13.89
  15.41

libxsmm 2-1.17-3645 - M N K: 128 (GFLOPS/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  2713.4 (SE +/- 0.99, N = 3)
  2696.5 (SE +/- 19.26, N = 3)
  4505.5 (SE +/- 114.55, N = 9)
  4976.7 (SE +/- 62.14, N = 4)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

miniBUDE

miniBUDE 20210901 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.2 / Avg: 155.92 / Max: 319.62
  Min: 19.77 / Avg: 154.01 / Max: 315.25
  Min: 40.2 / Avg: 223.4 / Max: 628.17
  Min: 40.84 / Avg: 269.38 / Max: 547.76

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  1.525
  1.524
  1.965
  0.940

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1 (Billion Interactions/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  237.76 (SE +/- 0.05, N = 9)
  234.68 (SE +/- 0.10, N = 9)
  439.07 (SE +/- 7.21, N = 15)
  253.12 (SE +/- 0.21, N = 9)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1 (GFInst/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  5944.06 (SE +/- 1.30, N = 9)
  5867.11 (SE +/- 2.53, N = 9)
  10976.68 (SE +/- 180.32, N = 15)
  6328.07 (SE +/- 5.35, N = 9)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 19.9 / Avg: 127.5 / Max: 326.83
  Min: 19.52 / Avg: 119.85 / Max: 307.11
  Min: 39.68 / Avg: 227.42 / Max: 648.76
  Min: 41.07 / Avg: 228.71 / Max: 655.77

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  1171.42
  1347.26
  1082.90
  1034.03

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  149355.54 (SE +/- 1110.64, N = 9)
  161475.24 (SE +/- 885.17, N = 9)
  246272.79 (SE +/- 8884.57, N = 15)
  236490.76 (SE +/- 732.52, N = 9)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  Min: 20.22 / Avg: 238.61 / Max: 342.73
  Min: 19.6 / Avg: 229.32 / Max: 332.77
  Min: 40.26 / Avg: 375.87 / Max: 680.44
  Min: 41.42 / Avg: 399.82 / Max: 682.45

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s Per Watt, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  55.59
  62.25
  69.13
  59.29

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
Legend: EPYC 9754 1P, EPYC 9754 2P / SMT On, SMT Off
Results in chart order:
  13264.79 (SE +/- 214.86, N = 15)
  14274.53 (SE +/- 54.02, N = 5)
  25983.74 (SE +/- 940.02, N = 12)
  23705.40 (SE +/- 413.32, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

184 Results Shown
