AMD EPYC 9754 Bergamo SMT On/Off Comparison

Benchmarks by Michael Larabel for a future article (post 19th) looking at SMT on/off comparison toggled via BIOS. SMT comparison testing of AMD EPYC 9754 128-Core CPUs on Titanite with Ubuntu 22.04 LTS.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2307190-NE-BERGAMOSM27
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
EPYC 9754 1P: SMT On
July 14 2023
  16 Hours, 51 Minutes
EPYC 9754 1P: SMT Off
July 13 2023
  15 Hours, 43 Minutes
EPYC 9754 2P: SMT On
July 11 2023
  14 Hours, 12 Minutes
EPYC 9754 2P: SMT Off
July 12 2023
  13 Hours, 40 Minutes
Invert Behavior (Only Show Selected Data)
  15 Hours, 6 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionEPYC 9754 1PEPYC 9754 2P SMT On SMT Off SMT On SMT OffAMD EPYC 9754 128-Core @ 2.25GHz (128 Cores / 256 Threads)AMD Titanite_4G (RTI1007B BIOS)AMD Device 14a4768GB2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007ASPEEDBroadcom NetXtreme BCM5720 PCIeUbuntu 22.045.19.0-41-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.41.3.224GCC 11.3.0ext41024x768AMD EPYC 9754 128-Core @ 2.25GHz (128 Cores)2 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores / 512 Threads)1520GB2 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa0010bPython Details- Python 3.10.6Security Details- EPYC 9754 1P: SMT On: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 1P: SMT Off: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 2P: SMT On: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 2P: SMT Off: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

specfem3d: Layered Halfspaceheffte: c2c - FFTW - double - 512libxsmm: 128specfem3d: Water-layered Halfspacespecfem3d: Tomographic Modelspecfem3d: Homogeneous Halfspaceheffte: r2c - FFTW - float - 512toybrot: TBBtoybrot: OpenMPastcenc: Fastastcenc: Thoroughastcenc: Exhaustivexmrig: Monero - 1Mxmrig: Wownero - 1Mgraph500: 26graph500: 26heffte: c2c - FFTW - float - 512libxsmm: 256graph500: 26graph500: 26minibude: OpenMP - BM1minibude: OpenMP - BM1specfem3d: Mount St. Helensminibude: OpenMP - BM2minibude: OpenMP - BM2nekrs: Kershawnekrs: TurboPipe Periodictensorflow: CPU - 256 - AlexNettensorflow: CPU - 512 - AlexNettensorflow: CPU - 256 - GoogLeNettensorflow: CPU - 256 - ResNet-50tensorflow: CPU - 512 - GoogLeNettensorflow: CPU - 512 - ResNet-50cloverleaf: Lagrangian-Eulerian Hydrodynamicsdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamhelsing: 14 digitdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamheffte: r2c - FFTW - double - 512deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamnpb: BT.Cnpb: CG.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Cnamd: ATPase Simulation - 327,506 Atomsopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUminife: Smallcp2k: H2O-DFT-LSaircrack-ng: primesieve: 1e12primesieve: 1e13stockfish: Total Timecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingjohn-the-ripper: bcryptjohn-the-ripper: WPA PSKjohn-the-ripper: Blowfishjohn-the-ripper: MD5build-llvm: Ninjabuild-llvm: Unix Makefilesbuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlybuild-godot: Time To Compileembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonoidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlyoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyoidn: RTLightmap.hdr.4096x4096 - CPU-Onlyopenvkl: vklBenchmark ISPCluxcorerender: DLSC - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUospray: particle_volume/ao/real_timeospray: particle_volume/scivis/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/scivis/real_timeospray-studio: 1 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracerospray-studio: 3 - 4K - 16 - Path Tracerospray-studio: 3 - 4K - 32 - Path Tracerappleseed: Emilyappleseed: Disney Materialappleseed: Material Testerbuild-gem5: Time To Compilebuild-nodejs: Time To Compileliquid-dsp: 256 - 256 - 512liquid-dsp: 512 - 256 - 512srsran: PUSCH Processor Benchmark, Throughput Totalopenssl: SHA256openssl: SHA512openssl: RSA4096openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305pgbench: 1000 - 800 - Read Onlypgbench: 1000 - 800 - Read Only - Average Latencymysqlslap: 2048mysqlslap: 4096EPYC 9754 1PEPYC 9754 2P SMT On SMT Off SMT On SMT Off15.96493168834.84512713.417.1944927797.4806747069.404348500245.542359140811190.754975.08318.174324409.474803.6857890000880249000128.1243331.73334450004459120005944.062237.7636.2163798495972.643238.905580863666725384069231422.081628.80504.09118.45416.03122.7712.0073.380750.473859.86531376.214246.4318240.8667267.7088417.1186152.9895968.6267624.4789102.2265126.9310499.5866316.0466201.625873.2077859.87766.341865.9747292243.6145686.8813264.79140791.525300.29279662.55128129.56149355.54131909.910.2070260.771049.3126.612377.7926.572378.071464.2943.73117.88541.295311.4812.076067.7810.53582.08110.0011794.3210.846148.8910.40120515.110.8385400.881.2151784.15012.26171120.3541.94421.28636503434972627179178721606481037521611520312667125.364211.07926.225227.51712.7731.1216.49116.5439.27105.797125.5813157.65043.623.621.74139616.3424.4712.1820.8830.850730.816532.675031.92458618731032137592753813963278851648432972122.58770944.30492166.676131161.648113.704169676666717837000008389.01636336255535300587933054195.11890935.3659346857987116967373555710125377662104624158373208555690.95278065515.02812192935.40492696.513.4194545014.9372655066.092750299248.557559062421278.646668.30727.314751218.963182.2893624000928320000128.5973813.43637500004935350005867.108234.6845.0300315805903.205236.128573426666725860833331375.331526.81525.39121.47429.16123.889.2796.969757.956644.97681780.442135.5549246.3732259.2779504.3578125.44671275.7707812.102377.8148134.5717468.7809404.2338156.056397.0913645.001967.990749.5111298801.4448672.2414274.53147448.725315.15289518.14136942.13161475.24133415.420.2059562.18513.2328.741105.9628.831102.822784.6711.49119.01268.826744.664.745837.625.47545.1358.7111710.5910.915118.296.24113162.290.7071673.851.3051741.04957.686149056.9531.79621.13227272294059274151047016322067621816326316751667119.576213.90722.984177.75815.1538.4420.47147.2250.01102.58885.2893107.36083.573.571.72110713.2720.988.8814.3723.733323.721025.717825.5121112711461358180043592618269365392166643371123.29839638.492263167.900597148.484116.1221313933333141470000020430.91114145572805180433363756647.31799293.055070030743311528389280639970346218833927826899878777220.9177836959.997447570109.6454976.79.7783814373.7827417984.727830148433.60120143321610.1137134.885315.930586533.7142082.715711000001724030000221.7656112.66725410009604710006328.069253.1233.9066750957888.085315.5241225.411770.91329.22124.98538.52172.3621.65139.659027.280902.85932627.314248.6206556.9411229.1624797.6248159.88611868.33101190.7168107.1835242.9589521.3108602.2602211.4848139.8821902.7792207.19768.3638491231.8367554.7423705.40211432.899849.01591505.17249109.09236490.76224243.280.10646121.03526.9840.671559.9540.901552.155810.4111.03235.65270.9313214.204.8311299.325.641159.9955.1222954.4411.009889.616.46168372.890.62133931.531.0862774.22143.5131.50811.1205823869249258201353957407860152453340985034879333107.721199.36520.344145.9297.1216.289.8769.0321.75100.790210.0714255.98644.764.782.35172018.6134.459.8619.0249.132849.231153.829552.7318482495582769815619781715731925318455164.567164152.58693.2712544400000333016666717891.4327926038513106049655203108490.53782091.81317549954027233989070010720193170703279098173601108278780.9745805457.470772930112.4134505.56.2180692572.7092054283.451830169430.26529763671693.3961127.208814.409985946.0100754.318151600002078880000223.5846373.0770180000107590000010976.682439.0672.63012259910989.938439.5971581.661908.76452.99146.74634.13189.4015.15182.445350.054676.81282730.498045.8765590.8789212.68211058.6776118.12852409.80351541.951281.1994290.7562426.7362770.1392162.0191182.5974676.4342210.93351.8103536518.7466822.9725983.74224178.108635.30658754.21268721.05246272.79231041.570.13969121.08526.4641.051545.2340.201579.146242.6810.26235.37271.2413141.514.8611373.955.611174.9954.4222955.6411.109878.886.47142135.580.67118225.551.1353798.62363.579128050.2191.16011.15244702314377146291309131704613015003208853022166799.215198.75018.474118.0998.4520.2411.7285.5427.47100.240146.2051178.30464.354.312.09153014.4525.116.9713.5141.656741.553144.710444.2889631645763100462013310232205451212724238159.90661440.570691265.885808148.37693.9522542633333261093333336573.8222269428867102232233590113251.43598946.31100453394570230752453545719982341488637844895705237859681.018591579OpenBenchmarking.org

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered HalfspaceSMT OffSMT On48121620SE +/- 0.056354216, N = 4SE +/- 0.066481426, N = 15SE +/- 0.159428812, N = 3SE +/- 0.034780206, N = 37.4707729309.99744757015.02812192915.9649316881. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512SMT OffSMT On306090120150SE +/- 1.48, N = 3SE +/- 0.77, N = 3SE +/- 2.10, N = 15SE +/- 2.09, N = 15112.41109.6535.4034.851. (CXX) g++ options: -O3

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128SMT OnSMT Off11002200330044005500SE +/- 62.14, N = 4SE +/- 114.55, N = 9SE +/- 0.99, N = 3SE +/- 19.26, N = 34976.74505.52713.42696.51. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered HalfspaceSMT OffSMT On48121620SE +/- 0.062799204, N = 15SE +/- 0.063515594, N = 4SE +/- 0.021567327, N = 3SE +/- 0.142353904, N = 36.2180692579.77838143713.41945450117.1944927791. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic ModelSMT OffSMT On246810SE +/- 0.014728964, N = 6SE +/- 0.086611165, N = 15SE +/- 0.048140615, N = 15SE +/- 0.080164897, N = 52.7092054283.7827417984.9372655067.4806747061. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous HalfspaceSMT OffSMT On3691215SE +/- 0.065109589, N = 12SE +/- 0.120152278, N = 15SE +/- 0.044108994, N = 15SE +/- 0.048294894, N = 43.4518301694.7278301486.0927502999.4043485001. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512SMT OnSMT Off90180270360450SE +/- 0.86, N = 7SE +/- 0.77, N = 6SE +/- 0.09, N = 5SE +/- 0.52, N = 5433.60430.27248.56245.541. (CXX) g++ options: -O3

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBSMT OnSMT Off12002400360048006000SE +/- 23.04, N = 15SE +/- 31.80, N = 15SE +/- 21.75, N = 9SE +/- 43.30, N = 1520142976359155901. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPSMT OnSMT Off13002600390052006500SE +/- 23.29, N = 12SE +/- 53.03, N = 15SE +/- 17.08, N = 8SE +/- 0.20, N = 733213671408162421. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: FastSMT OffSMT On30060090012001500SE +/- 1.20, N = 7SE +/- 1.79, N = 6SE +/- 1.25, N = 5SE +/- 1.29, N = 51278.651190.75693.40610.111. (CXX) g++ options: -O3 -flto -pthread

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughSMT OnSMT Off306090120150SE +/- 0.22, N = 6SE +/- 0.05, N = 6SE +/- 0.02, N = 6SE +/- 0.01, N = 6134.89127.2175.0868.311. (CXX) g++ options: -O3 -flto -pthread

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveSMT OnSMT Off48121620SE +/- 0.0081, N = 6SE +/- 0.0257, N = 6SE +/- 0.0006, N = 5SE +/- 0.0028, N = 515.930514.40998.17437.31471. (CXX) g++ options: -O3 -flto -pthread

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MSMT OnSMT Off20K40K60K80K100KSE +/- 871.70, N = 4SE +/- 83.35, N = 4SE +/- 513.99, N = 3SE +/- 587.76, N = 1586533.785946.051218.924409.41. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Wownero - Hash Count: 1MSMT OnSMT Off30K60K90K120K150KSE +/- 677.82, N = 5SE +/- 741.13, N = 4SE +/- 513.11, N = 15SE +/- 13.77, N = 4142082.7100754.374803.663182.21. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OffSMT On400M800M1200M1600M2000M181516000015711000008936240008578900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OffSMT On400M800M1200M1600M2000M207888000017240300009283200008802490001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512SMT OffSMT On50100150200250SE +/- 1.28, N = 5SE +/- 1.37, N = 5SE +/- 0.01, N = 4SE +/- 0.15, N = 4223.58221.77128.60128.121. (CXX) g++ options: -O3

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256SMT OffSMT On14002800420056007000SE +/- 66.66, N = 9SE +/- 1.43, N = 3SE +/- 16.75, N = 3SE +/- 2.32, N = 36373.06112.63813.43331.71. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OffSMT On160M320M480M640M800M7701800006725410003637500003334450001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OffSMT On200M400M600M800M1000M10759000009604710004935350004459120001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1SMT OffSMT On2K4K6K8K10KSE +/- 180.32, N = 15SE +/- 5.35, N = 9SE +/- 1.30, N = 9SE +/- 2.53, N = 910976.686328.075944.065867.111. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1SMT OffSMT On100200300400500SE +/- 7.21, N = 15SE +/- 0.21, N = 9SE +/- 0.05, N = 9SE +/- 0.10, N = 9439.07253.12237.76234.681. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. HelensSMT OffSMT On246810SE +/- 0.013893024, N = 5SE +/- 0.014293172, N = 5SE +/- 0.069356320, N = 12SE +/- 0.034771470, N = 52.6301225993.9066750955.0300315806.2163798491. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2SMT OffSMT On2K4K6K8K10KSE +/- 150.60, N = 12SE +/- 10.75, N = 4SE +/- 0.21, N = 3SE +/- 6.13, N = 310989.947888.095972.645903.211. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2SMT OffSMT On100200300400500SE +/- 6.02, N = 12SE +/- 0.43, N = 4SE +/- 0.01, N = 3SE +/- 0.25, N = 3439.60315.52238.91236.131. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: KershawSMT OnSMT Off1200M2400M3600M4800M6000MSE +/- 8226739.60, N = 3SE +/- 37190783.06, N = 3580863666757342666671. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: Kershaw

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe PeriodicSMT OffSMT On600M1200M1800M2400M3000MSE +/- 87729075.59, N = 15SE +/- 62315945.32, N = 13258608333325384069231. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: TurboPipe Periodic

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: AlexNetSMT OffSMT On30060090012001500SE +/- 11.49, N = 15SE +/- 14.23, N = 15SE +/- 3.81, N = 3SE +/- 8.29, N = 31581.661225.411422.081375.33

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: AlexNetSMT OffSMT On400800120016002000SE +/- 18.22, N = 3SE +/- 15.22, N = 15SE +/- 1.79, N = 3SE +/- 3.13, N = 31908.761770.911628.801526.81

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: GoogLeNetSMT OffSMT On110220330440550SE +/- 0.35, N = 3SE +/- 3.39, N = 3SE +/- 5.52, N = 4SE +/- 1.69, N = 3525.39504.09452.99329.22

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: ResNet-50SMT OffSMT On306090120150SE +/- 0.68, N = 3SE +/- 1.70, N = 3SE +/- 1.45, N = 12SE +/- 1.07, N = 12146.74124.98121.47118.45

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: GoogLeNetSMT OffSMT On140280420560700SE +/- 6.73, N = 3SE +/- 4.51, N = 15SE +/- 5.79, N = 12SE +/- 4.74, N = 12634.13538.52429.16416.03

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: ResNet-50SMT OffSMT On4080120160200SE +/- 0.13, N = 3SE +/- 0.59, N = 3SE +/- 1.35, N = 3SE +/- 0.90, N = 3189.40172.36123.88122.77

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSMT OffSMT On510152025SE +/- 0.09, N = 5SE +/- 0.11, N = 4SE +/- 0.04, N = 4SE +/- 0.27, N = 49.2712.0015.1521.651. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSMT OffSMT On4080120160200SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.25, N = 3182.45139.6696.9773.38

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitSMT OnSMT Off1326395265SE +/- 0.37, N = 3SE +/- 0.31, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 327.2850.0550.4757.961. (CC) gcc options: -O2 -pthread

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSMT OffSMT On2004006008001000SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 3644.98859.87676.81902.86

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSMT OffSMT On6001200180024003000SE +/- 33.47, N = 15SE +/- 6.95, N = 3SE +/- 3.84, N = 3SE +/- 1.21, N = 32730.502627.311780.441376.21

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSMT OffSMT On1122334455SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.57, N = 15SE +/- 0.13, N = 335.5546.4345.8848.62

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSMT OffSMT On130260390520650SE +/- 11.89, N = 15SE +/- 6.10, N = 15SE +/- 8.20, N = 15SE +/- 7.96, N = 15590.88556.94246.37240.87

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSMT OffSMT On60120180240300SE +/- 3.96, N = 15SE +/- 2.29, N = 15SE +/- 6.30, N = 15SE +/- 6.66, N = 15212.68229.16259.28267.71

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSMT OffSMT On2004006008001000SE +/- 0.90, N = 3SE +/- 0.32, N = 3SE +/- 5.07, N = 15SE +/- 0.46, N = 31058.68797.62504.36417.12

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSMT OffSMT On4080120160200SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 1.18, N = 15SE +/- 0.16, N = 3118.13159.89125.45152.99

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSMT OffSMT On5001000150020002500SE +/- 1.63, N = 3SE +/- 1.60, N = 3SE +/- 5.02, N = 3SE +/- 0.74, N = 32409.801868.331275.77968.63

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSMT OffSMT On30060090012001500SE +/- 1.38, N = 3SE +/- 1.02, N = 3SE +/- 3.93, N = 3SE +/- 0.29, N = 31541.951190.72812.10624.48

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSMT OffSMT On20406080100SE +/- 0.37, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 377.81102.2381.20107.18

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSMT OffSMT On60120180240300SE +/- 2.51, N = 3SE +/- 0.12, N = 3SE +/- 2.61, N = 15SE +/- 0.15, N = 3290.76242.96134.57126.93

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSMT OffSMT On110220330440550SE +/- 3.79, N = 3SE +/- 0.26, N = 3SE +/- 8.17, N = 15SE +/- 0.68, N = 3426.74521.31468.78499.59

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSMT OffSMT On170340510680850SE +/- 0.92, N = 3SE +/- 0.72, N = 3SE +/- 2.15, N = 3SE +/- 1.02, N = 3770.14602.26404.23316.05

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSMT OffSMT On50100150200250SE +/- 0.81, N = 3SE +/- 0.54, N = 3SE +/- 0.09, N = 3SE +/- 0.18, N = 3156.06201.63162.02211.48

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSMT OffSMT On4080120160200SE +/- 0.18, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.25, N = 3182.60139.8897.0973.21

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSMT OffSMT On2004006008001000SE +/- 0.12, N = 3SE +/- 0.49, N = 3SE +/- 0.21, N = 3SE +/- 0.45, N = 3645.00859.88676.43902.78

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512SMT OffSMT On50100150200250SE +/- 1.62, N = 5SE +/- 0.28, N = 5SE +/- 3.41, N = 15SE +/- 3.14, N = 15210.93207.2067.9966.341. (CXX) g++ options: -O3

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSMT OffSMT On1530456075SE +/- 0.20, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 349.5165.9751.8168.36

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CSMT OffSMT On110K220K330K440K550KSE +/- 3792.90, N = 12SE +/- 4903.75, N = 15SE +/- 269.21, N = 5SE +/- 396.74, N = 5536518.74491231.83298801.44292243.611. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CSMT OnSMT Off14K28K42K56K70KSE +/- 348.18, N = 8SE +/- 568.39, N = 8SE +/- 285.31, N = 8SE +/- 441.00, N = 1567554.7466822.9748672.2445686.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DSMT OffSMT On6K12K18K24K30KSE +/- 940.02, N = 12SE +/- 413.32, N = 15SE +/- 54.02, N = 5SE +/- 214.86, N = 1525983.7423705.4014274.5313264.791. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CSMT OffSMT On50K100K150K200K250KSE +/- 2978.54, N = 13SE +/- 1757.80, N = 9SE +/- 93.32, N = 8SE +/- 1029.98, N = 8224178.10211432.89147448.72140791.521. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DSMT OnSMT Off2K4K6K8K10KSE +/- 104.60, N = 15SE +/- 105.10, N = 15SE +/- 29.88, N = 6SE +/- 27.01, N = 69849.018635.305315.155300.291. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CSMT OffSMT On140K280K420K560K700KSE +/- 4916.03, N = 15SE +/- 7199.59, N = 15SE +/- 1485.96, N = 6SE +/- 2132.06, N = 6658754.21591505.17289518.14279662.551. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CSMT OffSMT On60K120K180K240K300KSE +/- 599.41, N = 10SE +/- 1284.96, N = 11SE +/- 104.98, N = 10SE +/- 296.36, N = 10268721.05249109.09136942.13128129.561. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BSMT OffSMT On50K100K150K200K250KSE +/- 8884.57, N = 15SE +/- 732.52, N = 9SE +/- 885.17, N = 9SE +/- 1110.64, N = 9246272.79236490.76161475.24149355.541. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CSMT OffSMT On50K100K150K200K250KSE +/- 1880.70, N = 6SE +/- 1293.24, N = 6SE +/- 228.86, N = 4SE +/- 290.95, N = 4231041.57224243.28133415.42131909.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsSMT OnSMT Off0.04660.09320.13980.18640.233SE +/- 0.00040, N = 3SE +/- 0.00135, N = 5SE +/- 0.00018, N = 4SE +/- 0.00095, N = 40.106460.139690.205950.20702

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSMT OffSMT On306090120150SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3121.08121.0362.1860.771. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSMT OffSMT On2004006008001000SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.28, N = 3SE +/- 0.18, N = 3513.231049.31526.46526.981. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSMT OffSMT On918273645SE +/- 0.29, N = 3SE +/- 0.14, N = 3SE +/- 0.23, N = 9SE +/- 0.20, N = 1241.0540.6728.7426.611. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSMT OffSMT On5001000150020002500SE +/- 8.36, N = 9SE +/- 16.41, N = 12SE +/- 10.43, N = 3SE +/- 4.07, N = 31105.962377.791545.231559.951. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSMT OnSMT Off918273645SE +/- 0.22, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 5SE +/- 0.15, N = 1540.9040.2028.8326.571. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSMT OffSMT On5001000150020002500SE +/- 10.77, N = 5SE +/- 12.47, N = 15SE +/- 8.82, N = 3SE +/- 4.61, N = 31102.822378.071552.151579.141. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSMT OffSMT On13002600390052006500SE +/- 87.31, N = 13SE +/- 93.68, N = 15SE +/- 25.69, N = 13SE +/- 21.50, N = 146242.685810.412784.671464.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSMT OffSMT On1020304050SE +/- 0.12, N = 13SE +/- 0.15, N = 15SE +/- 0.10, N = 13SE +/- 0.55, N = 1410.2611.0311.4943.731. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSMT OnSMT Off50100150200250SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3235.65235.37119.01117.881. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSMT OffSMT On120240360480600SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3268.82541.29270.93271.241. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSMT OnSMT Off3K6K9K12K15KSE +/- 7.09, N = 3SE +/- 2.74, N = 3SE +/- 3.11, N = 3SE +/- 103.73, N = 1513214.2013141.516744.665311.481. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSMT OffSMT On3691215SE +/- 0.00, N = 3SE +/- 0.20, N = 15SE +/- 0.00, N = 3SE +/- 0.00, N = 34.7412.074.834.861. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSMT OffSMT On2K4K6K8K10KSE +/- 2.58, N = 3SE +/- 9.34, N = 3SE +/- 0.58, N = 3SE +/- 11.25, N = 311373.9511299.326067.785837.621. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSMT OffSMT On3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.4710.535.615.641. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSMT OffSMT On30060090012001500SE +/- 4.51, N = 3SE +/- 7.44, N = 3SE +/- 6.51, N = 15SE +/- 4.46, N = 151174.991159.99582.08545.131. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSMT OffSMT On20406080100SE +/- 0.21, N = 3SE +/- 0.35, N = 3SE +/- 0.45, N = 15SE +/- 1.12, N = 1554.4255.1258.71110.001. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSMT OffSMT On5K10K15K20K25KSE +/- 15.58, N = 3SE +/- 16.03, N = 3SE +/- 2.13, N = 3SE +/- 13.11, N = 322955.6422954.4411794.3211710.591. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSMT OnSMT Off3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.8410.9111.0011.101. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT OnSMT Off2K4K6K8K10KSE +/- 11.31, N = 3SE +/- 10.15, N = 3SE +/- 48.86, N = 15SE +/- 5.00, N = 39889.619878.886148.895118.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT OffSMT On3691215SE +/- 0.01, N = 3SE +/- 0.08, N = 15SE +/- 0.01, N = 3SE +/- 0.01, N = 36.2410.406.466.471. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSMT OnSMT Off40K80K120K160K200KSE +/- 676.62, N = 3SE +/- 158.47, N = 3SE +/- 390.62, N = 3SE +/- 202.90, N = 3168372.89142135.58120515.11113162.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSMT OnSMT Off0.18680.37360.56040.74720.934SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.670.700.831. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSMT OnSMT Off30K60K90K120K150KSE +/- 602.14, N = 3SE +/- 420.06, N = 3SE +/- 192.97, N = 3SE +/- 126.57, N = 3133931.53118225.5585400.8871673.851. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSMT OnSMT Off0.29250.5850.87751.171.4625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.081.131.211.301. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallSMT OnSMT Off13K26K39K52K65KSE +/- 411.42, N = 5SE +/- 478.13, N = 5SE +/- 52.11, N = 5SE +/- 25.22, N = 562774.253798.651784.151741.01. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. More details on the CP2K benchmark test cases and details can be found @ https://www.cp2k.org/performance Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2023.1Input: H2O-DFT-LSSMT OnSMT Off110022003300440055002143.512363.584957.695012.261. (F9X) gfortran options: -fopenmp -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kdbm -lcp2kgrid -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -lhdf5 -lhdf5_hl -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.7SMT OnSMT Off40K80K120K160K200KSE +/- 101.44, N = 3SE +/- 1020.90, N = 3SE +/- 1109.71, N = 3171120.35149056.95128050.221. (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpcre -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread

EPYC 9754 2P: SMT On: The test run did not produce a result.

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 8.0Length: 1e12SMT OffSMT On0.43740.87481.31221.74962.187SE +/- 0.003, N = 12SE +/- 0.010, N = 14SE +/- 0.003, N = 11SE +/- 0.006, N = 111.1601.5081.7961.9441. (CXX) g++ options: -O3

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 8.0Length: 1e13SMT OnSMT Off510152025SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.02, N = 3SE +/- 0.02, N = 311.1211.1521.1321.291. (CXX) g++ options: -O3

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeSMT OnSMT Off120M240M360M480M600MSE +/- 9265130.36, N = 15SE +/- 6859221.31, N = 12SE +/- 7021012.10, N = 12SE +/- 5762415.88, N = 155823869244470231433650343492727229401. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingSMT OnSMT Off200K400K600K800K1000KSE +/- 5539.49, N = 3SE +/- 4721.80, N = 3SE +/- 1345.49, N = 3SE +/- 4792.89, N = 39258207714627262715927411. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingSMT OnSMT Off300K600K900K1200K1500KSE +/- 5499.45, N = 3SE +/- 1897.81, N = 3SE +/- 1608.29, N = 3SE +/- 429.17, N = 313539579130917917875104701. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptSMT OnSMT Off90K180K270K360K450KSE +/- 2298.36, N = 3SE +/- 1964.45, N = 3SE +/- 92.54, N = 3SE +/- 33.22, N = 34078603170462160641632201. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKSMT OnSMT Off300K600K900K1200K1500KSE +/- 18163.51, N = 15SE +/- 15887.63, N = 4SE +/- 505.98, N = 3SE +/- 6933.79, N = 3152453313015008103756762181. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishSMT OnSMT Off90K180K270K360K450KSE +/- 3726.56, N = 3SE +/- 1247.49, N = 3SE +/- 117.40, N = 3SE +/- 12.67, N = 34098503208852161151632631. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5SMT OnSMT Off7M14M21M28M35MSE +/- 83819.91, N = 3SE +/- 140717.61, N = 3SE +/- 52818.35, N = 3SE +/- 35950.58, N = 3348793333022166720312667167516671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaSMT OffSMT On306090120150SE +/- 0.70, N = 3SE +/- 0.91, N = 3SE +/- 0.18, N = 3SE +/- 0.29, N = 399.22107.72119.58125.36

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix MakefilesSMT OffSMT On50100150200250SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.70, N = 3198.75199.37211.08213.91

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigSMT OffSMT On612182430SE +/- 0.12, N = 13SE +/- 0.14, N = 13SE +/- 0.21, N = 7SE +/- 0.23, N = 718.4720.3422.9826.23

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigSMT OffSMT On50100150200250SE +/- 0.55, N = 3SE +/- 0.51, N = 3SE +/- 0.48, N = 3SE +/- 1.35, N = 3118.10145.93177.76227.52

Blender

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-OnlySMT OnSMT Off48121620SE +/- 0.02, N = 6SE +/- 0.05, N = 5SE +/- 0.02, N = 4SE +/- 0.05, N = 47.128.4512.7715.15

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-OnlySMT OnSMT Off918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 316.2820.2431.1238.44

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-OnlySMT OnSMT Off510152025SE +/- 0.02, N = 5SE +/- 0.04, N = 4SE +/- 0.06, N = 3SE +/- 0.06, N = 39.8711.7216.4920.47

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-OnlySMT OnSMT Off306090120150SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 369.0385.54116.54147.22

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-OnlySMT OnSMT Off1122334455SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 321.7527.4739.2750.01

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileSMT OffSMT On20406080100SE +/- 0.30, N = 3SE +/- 1.01, N = 3SE +/- 0.17, N = 3SE +/- 0.14, N = 3100.24100.79102.59105.80

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: CrownSMT OnSMT Off50100150200250SE +/- 0.16, N = 9SE +/- 0.13, N = 7SE +/- 0.13, N = 7SE +/- 0.05, N = 6210.07146.21125.5885.29

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian DragonSMT OnSMT Off60120180240300SE +/- 0.48, N = 9SE +/- 0.26, N = 8SE +/- 0.09, N = 8SE +/- 0.09, N = 6255.99178.30157.65107.36

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-OnlySMT OnSMT Off1.0712.1423.2134.2845.355SE +/- 0.01, N = 7SE +/- 0.03, N = 7SE +/- 0.00, N = 7SE +/- 0.02, N = 74.764.353.623.57

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-OnlySMT OnSMT Off1.07552.1513.22654.3025.3775SE +/- 0.01, N = 7SE +/- 0.03, N = 15SE +/- 0.00, N = 7SE +/- 0.02, N = 154.784.313.623.57

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-OnlySMT OnSMT Off0.52881.05761.58642.11522.644SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.00, N = 4SE +/- 0.01, N = 42.352.091.741.72

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCSMT OnSMT Off400800120016002000SE +/- 6.06, N = 3SE +/- 1.33, N = 3SE +/- 1.86, N = 3SE +/- 0.33, N = 31720153013961107

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUSMT OnSMT Off510152025SE +/- 0.20, N = 4SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 318.6114.4516.3413.27

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPUSMT OnSMT Off816243240SE +/- 1.35, N = 15SE +/- 0.60, N = 15SE +/- 0.30, N = 15SE +/- 0.03, N = 334.4525.1124.4720.98

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPUSMT OnSMT Off3691215SE +/- 0.09, N = 15SE +/- 0.08, N = 8SE +/- 0.11, N = 15SE +/- 0.11, N = 1212.188.889.866.97

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPUSMT OnSMT Off510152025SE +/- 0.03, N = 5SE +/- 0.04, N = 4SE +/- 0.03, N = 5SE +/- 0.35, N = 1520.8814.3719.0213.51

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timeSMT OnSMT Off1122334455SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 349.1341.6630.8523.73

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timeSMT OnSMT Off1122334455SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 349.2341.5530.8223.72

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeSMT OnSMT Off1224364860SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 353.8344.7132.6825.72

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeSMT OnSMT Off1224364860SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 352.7344.2931.9225.51

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off2004006008001000SE +/- 2.85, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 348263186111271. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off2004006008001000SE +/- 1.45, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 1.53, N = 349564587311461. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off30060090012001500SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3582763103213581. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off4K8K12K16K20KSE +/- 7.80, N = 3SE +/- 14.52, N = 3SE +/- 5.78, N = 3SE +/- 18.26, N = 376981004613759180041. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off8K16K24K32K40KSE +/- 64.83, N = 3SE +/- 38.84, N = 3SE +/- 10.48, N = 3SE +/- 29.21, N = 3156192013327538359261. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off4K8K12K16K20KSE +/- 19.63, N = 3SE +/- 5.29, N = 3SE +/- 6.12, N = 3SE +/- 11.93, N = 378171023213963182691. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off8K16K24K32K40KSE +/- 41.40, N = 3SE +/- 96.56, N = 3SE +/- 21.53, N = 3SE +/- 53.62, N = 3157312054527885365391. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off5K10K15K20K25KSE +/- 27.17, N = 3SE +/- 8.65, N = 3SE +/- 11.89, N = 3SE +/- 15.65, N = 392531212716484216661. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off9K18K27K36K45KSE +/- 34.44, N = 3SE +/- 17.91, N = 3SE +/- 17.58, N = 3SE +/- 76.61, N = 3184552423832972433711. (CXX) g++ options: -O3 -lm -ldl

Appleseed

Appleseed is an open-source production renderer focused on physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilySMT OnSMT Off4080120160200122.59123.30159.91164.57

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialSMT OffSMT On102030405038.4944.3040.57

Scene: Disney Material

EPYC 9754 2P: SMT On: The test run did not produce a result.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterSMT OnSMT Off60120180240300166.68167.90265.89

Scene: Material Tester

EPYC 9754 2P: SMT On: The test run did not produce a result.

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To CompileSMT OffSMT On4080120160200SE +/- 1.24, N = 3SE +/- 1.29, N = 3SE +/- 0.50, N = 3SE +/- 0.32, N = 3148.38152.59148.48161.65

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To CompileSMT OnSMT Off306090120150SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 393.2793.95113.70116.12

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 512SMT OnSMT Off500M1000M1500M2000M2500MSE +/- 1877054.43, N = 3SE +/- 1068228.02, N = 3SE +/- 470224.53, N = 3SE +/- 592546.29, N = 325444000002542633333169676666713139333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 512 - Buffer Length: 256 - Filter Length: 512SMT OnSMT Off700M1400M2100M2800M3500MSE +/- 569600.25, N = 3SE +/- 3555434.03, N = 3SE +/- 1814754.35, N = 3SE +/- 2946183.97, N = 333301666672610933333178370000014147000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput TotalSMT OffSMT On8K16K24K32K40KSE +/- 211.99, N = 3SE +/- 831.80, N = 15SE +/- 54.33, N = 3SE +/- 50.60, N = 336573.817891.420430.98389.01. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256SMT OnSMT Off70000M140000M210000M280000M350000MSE +/- 207719324.78, N = 3SE +/- 224959781.68, N = 3SE +/- 161352968.33, N = 3SE +/- 15184699.42, N = 33279260385132222694288671636336255531114145572801. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512SMT OnSMT Off20000M40000M60000M80000M100000MSE +/- 22577791.79, N = 3SE +/- 660141733.06, N = 3SE +/- 4276543.39, N = 3SE +/- 19506083.54, N = 310604965520310223223359053005879330518043336371. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096SMT OffSMT On20K40K60K80K100KSE +/- 3.85, N = 3SE +/- 3.93, N = 3SE +/- 1.80, N = 3SE +/- 16.66, N = 3113251.4108490.556647.354195.11. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096SMT OnSMT Off800K1600K2400K3200K4000KSE +/- 163.02, N = 3SE +/- 1067.40, N = 3SE +/- 405.88, N = 3SE +/- 1031.37, N = 33782091.83598946.31890935.31799293.01. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20SMT OnSMT Off300000M600000M900000M1200000M1500000MSE +/- 46014499.85, N = 3SE +/- 107897909.37, N = 3SE +/- 23916253.09, N = 3SE +/- 70893114.79, N = 3131754995402711004533945706593468579875507003074331. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMSMT OnSMT Off500000M1000000M1500000M2000000M2500000MSE +/- 4494053056.14, N = 3SE +/- 4718005576.14, N = 3SE +/- 404585301.69, N = 3SE +/- 772883363.35, N = 323398907001072307524535457116967373555711528389280631. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMSMT OnSMT Off400000M800000M1200000M1600000M2000000MSE +/- 676471418.80, N = 3SE +/- 2904032106.06, N = 3SE +/- 2018584981.53, N = 3SE +/- 917614258.55, N = 32019317070327199823414886310125377662109970346218831. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305SMT OnSMT Off200000M400000M600000M800000M1000000MSE +/- 267574958.87, N = 3SE +/- 39882779.79, N = 3SE +/- 15599558.91, N = 3SE +/- 86138373.27, N = 39098173601107844895705234624158373203927826899871. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 1000 - Clients: 800 - Mode: Read OnlySMT OffSMT On200K400K600K800K1000KSE +/- 21185.07, N = 12SE +/- 45813.37, N = 9SE +/- 23693.25, N = 12SE +/- 4149.87, N = 38777228555698278787859681. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average LatencySMT OffSMT On0.22910.45820.68730.91641.1455SE +/- 0.020, N = 12SE +/- 0.040, N = 9SE +/- 0.025, N = 12SE +/- 0.006, N = 30.9170.9520.9741.0181. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 2048SMT OffSMT On2004006008001000SE +/- 1.06, N = 3SE +/- 1.81, N = 3SE +/- 0.73, N = 3SE +/- 8.04, N = 37837805915801. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096SMT OffSMT On150300450600750SE +/- 3.99, N = 3SE +/- 8.08, N = 3SE +/- 1.48, N = 3SE +/- 5.45, N = 66956555795451. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

CPU Power Consumption Monitor

EPYC 9754 1PEPYC 9754 2POpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringSMT OffSMT On140280420560700Min: 10.61 / Avg: 238.75 / Max: 362.1Min: 10.52 / Avg: 248.93 / Max: 397.25Min: 97.83 / Avg: 446.01 / Max: 702.85Min: 21.61 / Avg: 460.57 / Max: 792.38

154 Results Shown

SPECFEM3D
HeFFTe - Highly Efficient FFT for Exascale
libxsmm
SPECFEM3D:
  Water-layered Halfspace
  Tomographic Model
  Homogeneous Halfspace
HeFFTe - Highly Efficient FFT for Exascale
toyBrot Fractal Generator:
  TBB
  OpenMP
ASTC Encoder:
  Fast
  Thorough
  Exhaustive
Xmrig:
  Monero - 1M
  Wownero - 1M
Graph500:
  26:
    bfs median_TEPS
    bfs max_TEPS
HeFFTe - Highly Efficient FFT for Exascale
libxsmm
Graph500:
    sssp median_TEPS
    sssp max_TEPS
miniBUDE:
  OpenMP - BM1:
    GFInst/s
    Billion Interactions/s
SPECFEM3D
miniBUDE:
  OpenMP - BM2:
    GFInst/s
    Billion Interactions/s
nekRS:
  Kershaw
  TurboPipe Periodic
TensorFlow:
  CPU - 256 - AlexNet
  CPU - 512 - AlexNet
  CPU - 256 - GoogLeNet
  CPU - 256 - ResNet-50
  CPU - 512 - GoogLeNet
  CPU - 512 - ResNet-50
CloverLeaf
Neural Magic DeepSparse
Helsing
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
HeFFTe - Highly Efficient FFT for Exascale
Neural Magic DeepSparse
NAS Parallel Benchmarks:
  BT.C
  CG.C
  EP.D
  FT.C
  IS.D
  LU.C
  MG.C
  SP.B
  SP.C
NAMD
OpenVINO:
  Face Detection FP16 - CPU:
    FPS
    ms
  Person Detection FP16 - CPU:
    FPS
    ms
  Person Detection FP32 - CPU:
    FPS
    ms
  Vehicle Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    FPS
    ms
miniFE
CP2K Molecular Dynamics
Aircrack-ng
Primesieve:
  1e12
  1e13
Stockfish
7-Zip Compression:
  Compression Rating
  Decompression Rating
John The Ripper:
  bcrypt
  WPA PSK
  Blowfish
  MD5
Timed LLVM Compilation:
  Ninja
  Unix Makefiles
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
  Barbershop - CPU-Only
  Pabellon Barcelona - CPU-Only
Timed Godot Game Engine Compilation
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
Intel Open Image Denoise:
  RT.hdr_alb_nrm.3840x2160 - CPU-Only
  RT.ldr_alb_nrm.3840x2160 - CPU-Only
  RTLightmap.hdr.4096x4096 - CPU-Only
OpenVKL
LuxCoreRender:
  DLSC - CPU
  Orange Juice - CPU
  LuxCore Benchmark - CPU
  Rainbow Colors and Prism - CPU
OSPRay:
  particle_volume/ao/real_time
  particle_volume/scivis/real_time
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
OSPRay Studio:
  1 - 4K - 1 - Path Tracer
  2 - 4K - 1 - Path Tracer
  3 - 4K - 1 - Path Tracer
  1 - 4K - 16 - Path Tracer
  1 - 4K - 32 - Path Tracer
  2 - 4K - 16 - Path Tracer
  2 - 4K - 32 - Path Tracer
  3 - 4K - 16 - Path Tracer
  3 - 4K - 32 - Path Tracer
Appleseed:
  Emily
  Disney Material
  Material Tester
Timed Gem5 Compilation
Timed Node.js Compilation
Liquid-DSP:
  256 - 256 - 512
  512 - 256 - 512
srsRAN Project
OpenSSL:
  SHA256
  SHA512
  RSA4096
  RSA4096
  ChaCha20
  AES-128-GCM
  AES-256-GCM
  ChaCha20-Poly1305
PostgreSQL:
  1000 - 800 - Read Only
  1000 - 800 - Read Only - Average Latency
MariaDB:
  2048
  4096
CPU Power Consumption Monitor