AMD EPYC 9754 Bergamo SMT On/Off Comparison

Benchmarks by Michael Larabel for a future article (post 19th) looking at SMT on/off comparison toggled via BIOS. SMT comparison testing of AMD EPYC 9754 128-Core CPUs on Titanite with Ubuntu 22.04 LTS.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2307190-NE-BERGAMOSM27
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
EPYC 9754 1P: SMT On
July 14 2023
  16 Hours, 51 Minutes
EPYC 9754 1P: SMT Off
July 13 2023
  15 Hours, 43 Minutes
EPYC 9754 2P: SMT On
July 11 2023
  14 Hours, 12 Minutes
EPYC 9754 2P: SMT Off
July 12 2023
  13 Hours, 40 Minutes
Invert Behavior (Only Show Selected Data)
  15 Hours, 6 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionEPYC 9754 2PEPYC 9754 1P SMT On SMT Off SMT Off SMT On2 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores / 512 Threads)AMD Titanite_4G (RTI1007B BIOS)AMD Device 14a41520GB2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007ASPEEDBroadcom NetXtreme BCM5720 PCIeUbuntu 22.045.19.0-41-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.41.3.224GCC 11.3.0ext41024x7682 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores)AMD EPYC 9754 128-Core @ 2.25GHz (128 Cores)768GBAMD EPYC 9754 128-Core @ 2.25GHz (128 Cores / 256 Threads)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa0010bPython Details- Python 3.10.6Security Details- EPYC 9754 2P: SMT On: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 2P: SMT Off: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 1P: SMT Off: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - EPYC 9754 1P: SMT On: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

cp2k: H2O-DFT-LSlibxsmm: 128pgbench: 1000 - 800 - Read Only - Average Latencypgbench: 1000 - 800 - Read Onlytensorflow: CPU - 256 - ResNet-50tensorflow: CPU - 512 - GoogLeNetopenvkl: vklBenchmark ISPCtensorflow: CPU - 512 - ResNet-50libxsmm: 256stockfish: Total Timeopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Orange Juice - CPUdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streammysqlslap: 4096build-llvm: Unix Makefilesnekrs: TurboPipe Periodicopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenssl: SHA256openssl: SHA512openssl: AES-256-GCMopenssl: ChaCha20openssl: AES-128-GCMopenssl: ChaCha20-Poly1305build-linux-kernel: allmodconfigospray: particle_volume/scivis/real_timeospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracermysqlslap: 2048build-gem5: Time To Compilegraph500: 26graph500: 26graph500: 26graph500: 26openvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUheffte: c2c - FFTW - double - 512openvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUospray-studio: 3 - 4K - 32 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracersrsran: PUSCH Processor Benchmark, Throughput Totalopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 1 - 4K - 1 - Path Tracerospray: particle_volume/ao/real_timeospray-studio: 3 - 4K - 16 - Path Tracerdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streambuild-llvm: Ninjaospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerblender: Barbershop - CPU-Onlybuild-nodejs: Time To Compilebuild-godot: Time To Compileappleseed: Material Testerappleseed: Emilydeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamheffte: r2c - FFTW - double - 512deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamtensorflow: CPU - 256 - GoogLeNetbuild-linux-kernel: defconfigtensorflow: CPU - 512 - AlexNettensorflow: CPU - 256 - AlexNetxmrig: Monero - 1Mopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUluxcorerender: DLSC - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUjohn-the-ripper: WPA PSKospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUjohn-the-ripper: MD5openssl: RSA4096openssl: RSA4096deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamhelsing: 14 digitdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamspecfem3d: Layered Halfspacecompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingspecfem3d: Homogeneous Halfspaceblender: Pabellon Barcelona - CPU-Onlynpb: EP.Dliquid-dsp: 512 - 256 - 512specfem3d: Water-layered Halfspaceliquid-dsp: 256 - 256 - 512xmrig: Wownero - 1Mnamd: ATPase Simulation - 327,506 Atomsjohn-the-ripper: bcryptjohn-the-ripper: Blowfishspecfem3d: Tomographic Modelminibude: OpenMP - BM2minibude: OpenMP - BM2luxcorerender: Rainbow Colors and Prism - CPUblender: Classroom - CPU-Onlynekrs: Kershawnpb: BT.Caircrack-ng: npb: IS.Doidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyappleseed: Disney Materialnpb: LU.Ccloverleaf: Lagrangian-Eulerian Hydrodynamicsspecfem3d: Mount St. Helensprimesieve: 1e13heffte: c2c - FFTW - float - 512blender: Fishy Cat - CPU-Onlyblender: BMW27 - CPU-Onlyastcenc: Exhaustiveminife: Smalltoybrot: TBBoidn: RTLightmap.hdr.4096x4096 - CPU-Onlynpb: SP.Castcenc: Fastastcenc: Thoroughtoybrot: OpenMPheffte: r2c - FFTW - float - 512npb: CG.Coidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlyembree: Pathtracer ISPC - Crownnpb: FT.Cembree: Pathtracer ISPC - Asian Dragonnpb: SP.Bminibude: OpenMP - BM1minibude: OpenMP - BM1npb: MG.Cprimesieve: 1e12EPYC 9754 2PEPYC 9754 1P SMT On SMT Off SMT Off SMT On2143.5134976.70.974827878124.98538.521720172.366112.658238692411.035810.419.8634.45229.1624556.9411545199.36555.121159.99327926038513106049655203201931707032713175499540272339890700107909817360110145.92949.231178177698580152.586960471000672541000172403000015711000001559.9540.67109.6451552.1540.90184551573117891.46.469889.614.8313214.201561948249.13289253521.3108242.9589107.72158249569.0393.271100.790164.56716448.62062627.3142207.197159.8861797.6248329.2220.3441770.911225.4186533.7270.93235.6518.61526.98121.03152453352.731853.829511.0022954.440.62168372.891.08133931.535.6411299.32348793333782091.8108490.5902.8593139.6590902.7792139.8821211.4848602.260227.280107.18351190.716868.36381868.33109.99744757013539579258204.72783014821.7523705.4033301666679.7783814372544400000142082.70.106464078604098503.782741798315.5247888.08519.0216.28491231.839849.014.78591505.1721.653.90667509511.120221.7659.877.1215.930562774.220142.35224243.28610.1137134.88533321433.60167554.744.76210.0714211432.89255.9864236490.76253.1236328.069249109.091.5082363.5794505.51.018785968146.74634.131530189.406373.044702314310.266242.686.9725.11212.6821590.8789579198.75054.421174.99222269428867102232233590199823414886311004533945702307524535457784489570523118.09941.55311023210046591148.3761075900000770180000207888000018151600001545.2341.05112.4131579.1440.20242382054536573.86.479878.884.8613141.512013363141.656712127426.7362290.756299.21576364585.5493.952100.240265.885808159.90661445.87652730.4980210.933118.12851058.6776452.9918.4741908.761581.6685946.0271.24235.3714.45526.46121.08130150044.288944.710411.1022955.640.67142135.581.13118225.555.6111373.95302216673598946.3113251.4676.8128182.4453676.4342182.5974162.0191770.139250.05481.19941541.951251.81032409.80357.4707729309130917714623.45183016927.4725983.7426109333336.2180692572542633333100754.30.139693170463208852.709205428439.59710989.93813.5120.24536518.74128050.2198635.304.3140.570691658754.2115.152.63012259911.152223.58411.728.4514.409953798.629762.09231041.57693.3961127.20883671430.26566822.974.35146.2051224178.10178.3046246272.79439.06710976.682268721.051.1604957.6862696.50.917877722121.47429.161107123.883813.427272294011.492784.678.8820.98259.2779246.3732695213.907258608333358.71545.13111414557280518043336379970346218835507003074331152838928063392782689987177.75823.72101826918004783148.4844935350003637500009283200008936240001105.9628.7435.40491102.8228.83433713653920430.96.245118.294.746744.6635926112723.733321666468.7809134.5717119.57613581146147.22116.122102.588167.900597123.29839635.55491780.442167.9907125.4467504.3578525.3922.9841526.811375.3351218.9268.82119.0113.27513.2362.1867621825.512125.717810.9111710.590.70113162.291.3071673.855.475837.62167516671799293.056647.3644.976896.9697645.001997.0913156.0563404.233857.95677.8148812.102349.51111275.770715.0281219295104705927416.09275029950.0114274.53141470000013.419454501131393333363182.20.205951632201632634.937265506236.1285903.20514.3738.445734266667298801.44149056.9535315.153.5738.492263289518.149.275.03003158021.132128.59720.4715.157.314751741.055901.72133415.421278.646668.30726242248.55748672.243.5785.2893147448.72107.3608161475.24234.6845867.108136942.131.7965012.262713.40.952855569118.45416.031396122.773331.736503434943.731464.2912.1824.47267.7088240.8667655211.0792538406923110.00582.081636336255535300587933010125377662106593468579871169673735557462415837320227.51730.81651396313759780161.6484459120003334450008802490008578900002377.7926.6134.84512378.0726.5732972278858389.010.406148.8912.075311.482753886130.850716484499.5866126.9310125.3641032873116.54113.704105.797166.676131122.58770946.43181376.214266.3418152.9895417.1186504.0926.2251628.801422.0824409.4541.29117.8816.341049.3160.7781037531.924532.675010.8411794.320.83120515.111.2185400.8810.536067.78203126671890935.354195.1859.865373.3807859.87773.2077201.6258316.046650.473102.2265624.478965.9747968.626715.9649316887917877262719.40434850039.2713264.79178370000017.194492779169676666774803.60.207022160642161157.480674706238.9055972.64320.8831.125808636667292243.61171120.3545300.293.6244.30492279662.5512.006.21637984921.286128.12416.4912.778.174351784.135911.74131909.911190.754975.08314081245.54245686.883.62125.5813140791.52157.6504149355.54237.7635944.062128129.561.944OpenBenchmarking.org

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. More details on the CP2K benchmark test cases and details can be found @ https://www.cp2k.org/performance Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2023.1Input: H2O-DFT-LSSMT OnSMT Off110022003300440055002143.512363.584957.695012.261. (F9X) gfortran options: -fopenmp -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kdbm -lcp2kgrid -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -lhdf5 -lhdf5_hl -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128SMT OnSMT Off11002200330044005500SE +/- 62.14, N = 4SE +/- 114.55, N = 9SE +/- 19.26, N = 3SE +/- 0.99, N = 34976.74505.52696.52713.41. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 15Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average LatencySMT OnSMT Off0.22910.45820.68730.91641.1455SE +/- 0.025, N = 12SE +/- 0.006, N = 3SE +/- 0.020, N = 12SE +/- 0.040, N = 90.9741.0180.9170.9521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTPS, More Is BetterPostgreSQL 15Scaling Factor: 1000 - Clients: 800 - Mode: Read OnlySMT OnSMT Off200K400K600K800K1000KSE +/- 23693.25, N = 12SE +/- 4149.87, N = 3SE +/- 21185.07, N = 12SE +/- 45813.37, N = 98278787859688777228555691. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: ResNet-50SMT OnSMT Off306090120150SE +/- 1.70, N = 3SE +/- 0.68, N = 3SE +/- 1.45, N = 12SE +/- 1.07, N = 12124.98146.74121.47118.45

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: GoogLeNetSMT OnSMT Off140280420560700SE +/- 4.51, N = 15SE +/- 6.73, N = 3SE +/- 5.79, N = 12SE +/- 4.74, N = 12538.52634.13429.16416.03

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCSMT OnSMT Off400800120016002000SE +/- 6.06, N = 3SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 1.86, N = 31720153011071396

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: ResNet-50SMT OnSMT Off4080120160200SE +/- 0.59, N = 3SE +/- 0.13, N = 3SE +/- 1.35, N = 3SE +/- 0.90, N = 3172.36189.40123.88122.77

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256SMT OnSMT Off14002800420056007000SE +/- 1.43, N = 3SE +/- 66.66, N = 9SE +/- 16.75, N = 3SE +/- 2.32, N = 36112.66373.03813.43331.71. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeSMT OnSMT Off120M240M360M480M600MSE +/- 9265130.36, N = 15SE +/- 6859221.31, N = 12SE +/- 5762415.88, N = 15SE +/- 7021012.10, N = 125823869244470231432727229403650343491. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSMT OnSMT Off1020304050SE +/- 0.15, N = 15SE +/- 0.12, N = 13SE +/- 0.10, N = 13SE +/- 0.55, N = 1411.0310.2611.4943.731. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSMT OnSMT Off13002600390052006500SE +/- 93.68, N = 15SE +/- 87.31, N = 13SE +/- 25.69, N = 13SE +/- 21.50, N = 145810.416242.682784.671464.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPUSMT OnSMT Off3691215SE +/- 0.11, N = 15SE +/- 0.11, N = 12SE +/- 0.08, N = 8SE +/- 0.09, N = 159.866.978.8812.18

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPUSMT OnSMT Off816243240SE +/- 1.35, N = 15SE +/- 0.60, N = 15SE +/- 0.03, N = 3SE +/- 0.30, N = 1534.4525.1120.9824.47

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off60120180240300SE +/- 2.29, N = 15SE +/- 3.96, N = 15SE +/- 6.30, N = 15SE +/- 6.66, N = 15229.16212.68259.28267.71

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off130260390520650SE +/- 6.10, N = 15SE +/- 11.89, N = 15SE +/- 8.20, N = 15SE +/- 7.96, N = 15556.94590.88246.37240.87

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 4096SMT OnSMT Off150300450600750SE +/- 5.45, N = 6SE +/- 1.48, N = 3SE +/- 3.99, N = 3SE +/- 8.08, N = 35455796956551. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix MakefilesSMT OnSMT Off50100150200250SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.70, N = 3SE +/- 0.15, N = 3199.37198.75213.91211.08

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe PeriodicSMT OffSMT On600M1200M1800M2400M3000MSE +/- 87729075.59, N = 15SE +/- 62315945.32, N = 13258608333325384069231. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: TurboPipe Periodic

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSMT OnSMT Off20406080100SE +/- 0.35, N = 3SE +/- 0.21, N = 3SE +/- 0.45, N = 15SE +/- 1.12, N = 1555.1254.4258.71110.001. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSMT OnSMT Off30060090012001500SE +/- 7.44, N = 3SE +/- 4.51, N = 3SE +/- 4.46, N = 15SE +/- 6.51, N = 151159.991174.99545.13582.081. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256SMT OnSMT Off70000M140000M210000M280000M350000MSE +/- 207719324.78, N = 3SE +/- 224959781.68, N = 3SE +/- 15184699.42, N = 3SE +/- 161352968.33, N = 33279260385132222694288671114145572801636336255531. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512SMT OnSMT Off20000M40000M60000M80000M100000MSE +/- 22577791.79, N = 3SE +/- 660141733.06, N = 3SE +/- 19506083.54, N = 3SE +/- 4276543.39, N = 310604965520310223223359051804333637530058793301. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMSMT OnSMT Off400000M800000M1200000M1600000M2000000MSE +/- 676471418.80, N = 3SE +/- 2904032106.06, N = 3SE +/- 917614258.55, N = 3SE +/- 2018584981.53, N = 32019317070327199823414886399703462188310125377662101. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20SMT OnSMT Off300000M600000M900000M1200000M1500000MSE +/- 46014499.85, N = 3SE +/- 107897909.37, N = 3SE +/- 70893114.79, N = 3SE +/- 23916253.09, N = 3131754995402711004533945705507003074336593468579871. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMSMT OnSMT Off500000M1000000M1500000M2000000M2500000MSE +/- 4494053056.14, N = 3SE +/- 4718005576.14, N = 3SE +/- 772883363.35, N = 3SE +/- 404585301.69, N = 323398907001072307524535457115283892806311696737355571. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305SMT OnSMT Off200000M400000M600000M800000M1000000MSE +/- 267574958.87, N = 3SE +/- 39882779.79, N = 3SE +/- 86138373.27, N = 3SE +/- 15599558.91, N = 39098173601107844895705233927826899874624158373201. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigSMT OnSMT Off50100150200250SE +/- 0.51, N = 3SE +/- 0.55, N = 3SE +/- 0.48, N = 3SE +/- 1.35, N = 3145.93118.10177.76227.52

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timeSMT OnSMT Off1122334455SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 349.2341.5523.7230.82

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off4K8K12K16K20KSE +/- 19.63, N = 3SE +/- 5.29, N = 3SE +/- 11.93, N = 3SE +/- 6.12, N = 378171023218269139631. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off4K8K12K16K20KSE +/- 7.80, N = 3SE +/- 14.52, N = 3SE +/- 18.26, N = 3SE +/- 5.78, N = 376981004618004137591. (CXX) g++ options: -O3 -lm -ldl

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 11.0.1Clients: 2048SMT OnSMT Off2004006008001000SE +/- 8.04, N = 3SE +/- 0.73, N = 3SE +/- 1.06, N = 3SE +/- 1.81, N = 35805917837801. (CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To CompileSMT OnSMT Off4080120160200SE +/- 1.29, N = 3SE +/- 1.24, N = 3SE +/- 0.50, N = 3SE +/- 0.32, N = 3152.59148.38148.48161.65

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OnSMT Off200M400M600M800M1000M96047100010759000004935350004459120001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OnSMT Off160M320M480M640M800M6725410007701800003637500003334450001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OnSMT Off400M800M1200M1600M2000M172403000020788800009283200008802490001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26SMT OnSMT Off400M800M1200M1600M2000M157110000018151600008936240008578900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSMT OnSMT Off5001000150020002500SE +/- 4.07, N = 3SE +/- 10.43, N = 3SE +/- 8.36, N = 9SE +/- 16.41, N = 121559.951545.231105.962377.791. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSMT OnSMT Off918273645SE +/- 0.14, N = 3SE +/- 0.29, N = 3SE +/- 0.23, N = 9SE +/- 0.20, N = 1240.6741.0528.7426.611. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512SMT OnSMT Off306090120150SE +/- 0.77, N = 3SE +/- 1.48, N = 3SE +/- 2.10, N = 15SE +/- 2.09, N = 15109.65112.4135.4034.851. (CXX) g++ options: -O3

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSMT OnSMT Off5001000150020002500SE +/- 8.82, N = 3SE +/- 4.61, N = 3SE +/- 10.77, N = 5SE +/- 12.47, N = 151552.151579.141102.822378.071. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSMT OnSMT Off918273645SE +/- 0.22, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 5SE +/- 0.15, N = 1540.9040.2028.8326.571. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off9K18K27K36K45KSE +/- 34.44, N = 3SE +/- 17.91, N = 3SE +/- 76.61, N = 3SE +/- 17.58, N = 3184552423843371329721. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off8K16K24K32K40KSE +/- 41.40, N = 3SE +/- 96.56, N = 3SE +/- 53.62, N = 3SE +/- 21.53, N = 3157312054536539278851. (CXX) g++ options: -O3 -lm -ldl

srsRAN Project

srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput TotalSMT OnSMT Off8K16K24K32K40KSE +/- 831.80, N = 15SE +/- 211.99, N = 3SE +/- 54.33, N = 3SE +/- 50.60, N = 317891.436573.820430.98389.01. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT OnSMT Off3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 156.466.476.2410.401. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT OnSMT Off2K4K6K8K10KSE +/- 11.31, N = 3SE +/- 10.15, N = 3SE +/- 5.00, N = 3SE +/- 48.86, N = 159889.619878.885118.296148.891. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSMT OnSMT Off3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.20, N = 154.834.864.7412.071. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSMT OnSMT Off3K6K9K12K15KSE +/- 7.09, N = 3SE +/- 2.74, N = 3SE +/- 3.11, N = 3SE +/- 103.73, N = 1513214.2013141.516744.665311.481. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerSMT OnSMT Off8K16K24K32K40KSE +/- 64.83, N = 3SE +/- 38.84, N = 3SE +/- 29.21, N = 3SE +/- 10.48, N = 3156192013335926275381. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off2004006008001000SE +/- 2.85, N = 3SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 348263111278611. (CXX) g++ options: -O3 -lm -ldl

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timeSMT OnSMT Off1122334455SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 349.1341.6623.7330.85

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerSMT OnSMT Off5K10K15K20K25KSE +/- 27.17, N = 3SE +/- 8.65, N = 3SE +/- 15.65, N = 3SE +/- 11.89, N = 392531212721666164841. (CXX) g++ options: -O3 -lm -ldl

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSMT OnSMT Off110220330440550SE +/- 0.26, N = 3SE +/- 3.79, N = 3SE +/- 8.17, N = 15SE +/- 0.68, N = 3521.31426.74468.78499.59

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSMT OnSMT Off60120180240300SE +/- 0.12, N = 3SE +/- 2.51, N = 3SE +/- 2.61, N = 15SE +/- 0.15, N = 3242.96290.76134.57126.93

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaSMT OnSMT Off306090120150SE +/- 0.91, N = 3SE +/- 0.70, N = 3SE +/- 0.18, N = 3SE +/- 0.29, N = 3107.7299.22119.58125.36

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off30060090012001500SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3582763135810321. (CXX) g++ options: -O3 -lm -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerSMT OnSMT Off2004006008001000SE +/- 1.45, N = 3SE +/- 0.33, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 349564511468731. (CXX) g++ options: -O3 -lm -ldl

Blender

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-OnlySMT OnSMT Off306090120150SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 3SE +/- 0.11, N = 369.0385.54147.22116.54

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To CompileSMT OnSMT Off306090120150SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 393.2793.95116.12113.70

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileSMT OnSMT Off20406080100SE +/- 1.01, N = 3SE +/- 0.30, N = 3SE +/- 0.17, N = 3SE +/- 0.14, N = 3100.79100.24102.59105.80

Appleseed

Appleseed is an open-source production renderer focused on physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterSMT OffSMT On60120180240300265.89167.90166.68

Scene: Material Tester

EPYC 9754 2P: SMT On: The test run did not produce a result.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilySMT OnSMT Off4080120160200164.57159.91123.30122.59

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSMT OnSMT Off1122334455SE +/- 0.13, N = 3SE +/- 0.57, N = 15SE +/- 0.10, N = 3SE +/- 0.04, N = 348.6245.8835.5546.43

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSMT OnSMT Off6001200180024003000SE +/- 6.95, N = 3SE +/- 33.47, N = 15SE +/- 3.84, N = 3SE +/- 1.21, N = 32627.312730.501780.441376.21

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512SMT OnSMT Off50100150200250SE +/- 0.28, N = 5SE +/- 1.62, N = 5SE +/- 3.41, N = 15SE +/- 3.14, N = 15207.20210.9367.9966.341. (CXX) g++ options: -O3

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSMT OnSMT Off4080120160200SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 1.18, N = 15SE +/- 0.16, N = 3159.89118.13125.45152.99

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSMT OnSMT Off2004006008001000SE +/- 0.32, N = 3SE +/- 0.90, N = 3SE +/- 5.07, N = 15SE +/- 0.46, N = 3797.621058.68504.36417.12

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: GoogLeNetSMT OnSMT Off110220330440550SE +/- 1.69, N = 3SE +/- 5.52, N = 4SE +/- 0.35, N = 3SE +/- 3.39, N = 3329.22452.99525.39504.09

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigSMT OnSMT Off612182430SE +/- 0.14, N = 13SE +/- 0.12, N = 13SE +/- 0.21, N = 7SE +/- 0.23, N = 720.3418.4722.9826.23

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: AlexNetSMT OnSMT Off400800120016002000SE +/- 15.22, N = 15SE +/- 18.22, N = 3SE +/- 3.13, N = 3SE +/- 1.79, N = 31770.911908.761526.811628.80

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: AlexNetSMT OnSMT Off30060090012001500SE +/- 14.23, N = 15SE +/- 11.49, N = 15SE +/- 8.29, N = 3SE +/- 3.81, N = 31225.411581.661375.331422.08

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MSMT OnSMT Off20K40K60K80K100KSE +/- 871.70, N = 4SE +/- 83.35, N = 4SE +/- 513.99, N = 3SE +/- 587.76, N = 1586533.785946.051218.924409.41. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSMT OnSMT Off120240360480600SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3270.93271.24268.82541.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSMT OnSMT Off50100150200250SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3235.65235.37119.01117.881. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUSMT OnSMT Off510152025SE +/- 0.20, N = 4SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 318.6114.4513.2716.34

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSMT OnSMT Off2004006008001000SE +/- 0.18, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3526.98526.46513.231049.311. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSMT OnSMT Off306090120150SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3121.03121.0862.1860.771. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKSMT OnSMT Off300K600K900K1200K1500KSE +/- 18163.51, N = 15SE +/- 15887.63, N = 4SE +/- 6933.79, N = 3SE +/- 505.98, N = 3152453313015006762188103751. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeSMT OnSMT Off1224364860SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 352.7344.2925.5131.92

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeSMT OnSMT Off1224364860SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 353.8344.7125.7232.68

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSMT OnSMT Off3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 311.0011.1010.9110.841. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSMT OnSMT Off5K10K15K20K25KSE +/- 16.03, N = 3SE +/- 15.58, N = 3SE +/- 13.11, N = 3SE +/- 2.13, N = 322954.4422955.6411710.5911794.321. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSMT OnSMT Off0.18680.37360.56040.74720.934SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.670.700.831. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSMT OnSMT Off40K80K120K160K200KSE +/- 676.62, N = 3SE +/- 158.47, N = 3SE +/- 202.90, N = 3SE +/- 390.62, N = 3168372.89142135.58113162.29120515.111. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSMT OnSMT Off0.29250.5850.87751.171.4625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.081.131.301.211. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSMT OnSMT Off30K60K90K120K150KSE +/- 602.14, N = 3SE +/- 420.06, N = 3SE +/- 126.57, N = 3SE +/- 192.97, N = 3133931.53118225.5571673.8585400.881. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSMT OnSMT Off3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.645.615.4710.531. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSMT OnSMT Off2K4K6K8K10KSE +/- 9.34, N = 3SE +/- 2.58, N = 3SE +/- 11.25, N = 3SE +/- 0.58, N = 311299.3211373.955837.626067.781. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5SMT OnSMT Off7M14M21M28M35MSE +/- 83819.91, N = 3SE +/- 140717.61, N = 3SE +/- 35950.58, N = 3SE +/- 52818.35, N = 3348793333022166716751667203126671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096SMT OnSMT Off800K1600K2400K3200K4000KSE +/- 163.02, N = 3SE +/- 1067.40, N = 3SE +/- 1031.37, N = 3SE +/- 405.88, N = 33782091.83598946.31799293.01890935.31. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096SMT OnSMT Off20K40K60K80K100KSE +/- 3.93, N = 3SE +/- 3.85, N = 3SE +/- 1.80, N = 3SE +/- 16.66, N = 3108490.5113251.456647.354195.11. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSMT OnSMT Off2004006008001000SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 3902.86676.81644.98859.87

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSMT OnSMT Off4080120160200SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.25, N = 3139.66182.4596.9773.38

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off2004006008001000SE +/- 0.45, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 3SE +/- 0.49, N = 3902.78676.43645.00859.88

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off4080120160200SE +/- 0.08, N = 3SE +/- 0.18, N = 3SE +/- 0.06, N = 3SE +/- 0.25, N = 3139.88182.6097.0973.21

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off50100150200250SE +/- 0.18, N = 3SE +/- 0.09, N = 3SE +/- 0.81, N = 3SE +/- 0.54, N = 3211.48162.02156.06201.63

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSMT OnSMT Off170340510680850SE +/- 0.72, N = 3SE +/- 0.92, N = 3SE +/- 2.15, N = 3SE +/- 1.02, N = 3602.26770.14404.23316.05

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitSMT OnSMT Off1326395265SE +/- 0.37, N = 3SE +/- 0.31, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 327.2850.0557.9650.471. (CC) gcc options: -O2 -pthread

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSMT OnSMT Off20406080100SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.37, N = 3SE +/- 0.04, N = 3107.1881.2077.81102.23

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSMT OnSMT Off30060090012001500SE +/- 1.02, N = 3SE +/- 1.38, N = 3SE +/- 3.93, N = 3SE +/- 0.29, N = 31190.721541.95812.10624.48

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSMT OnSMT Off1530456075SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 368.3651.8149.5165.97

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSMT OnSMT Off5001000150020002500SE +/- 1.60, N = 3SE +/- 1.63, N = 3SE +/- 5.02, N = 3SE +/- 0.74, N = 31868.332409.801275.77968.63

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered HalfspaceSMT OnSMT Off48121620SE +/- 0.066481426, N = 15SE +/- 0.056354216, N = 4SE +/- 0.159428812, N = 3SE +/- 0.034780206, N = 39.9974475707.47077293015.02812192915.9649316881. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingSMT OnSMT Off300K600K900K1200K1500KSE +/- 5499.45, N = 3SE +/- 1897.81, N = 3SE +/- 429.17, N = 3SE +/- 1608.29, N = 313539579130915104707917871. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingSMT OnSMT Off200K400K600K800K1000KSE +/- 5539.49, N = 3SE +/- 4721.80, N = 3SE +/- 4792.89, N = 3SE +/- 1345.49, N = 39258207714625927417262711. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous HalfspaceSMT OnSMT Off3691215SE +/- 0.120152278, N = 15SE +/- 0.065109589, N = 12SE +/- 0.044108994, N = 15SE +/- 0.048294894, N = 44.7278301483.4518301696.0927502999.4043485001. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-OnlySMT OnSMT Off1122334455SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 321.7527.4750.0139.27

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DSMT OnSMT Off6K12K18K24K30KSE +/- 413.32, N = 15SE +/- 940.02, N = 12SE +/- 54.02, N = 5SE +/- 214.86, N = 1523705.4025983.7414274.5313264.791. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 512 - Buffer Length: 256 - Filter Length: 512SMT OnSMT Off700M1400M2100M2800M3500MSE +/- 569600.25, N = 3SE +/- 3555434.03, N = 3SE +/- 2946183.97, N = 3SE +/- 1814754.35, N = 333301666672610933333141470000017837000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered HalfspaceSMT OnSMT Off48121620SE +/- 0.063515594, N = 4SE +/- 0.062799204, N = 15SE +/- 0.021567327, N = 3SE +/- 0.142353904, N = 39.7783814376.21806925713.41945450117.1944927791. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 512SMT OnSMT Off500M1000M1500M2000M2500MSE +/- 1877054.43, N = 3SE +/- 1068228.02, N = 3SE +/- 592546.29, N = 3SE +/- 470224.53, N = 325444000002542633333131393333316967666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Wownero - Hash Count: 1MSMT OnSMT Off30K60K90K120K150KSE +/- 677.82, N = 5SE +/- 741.13, N = 4SE +/- 13.77, N = 4SE +/- 513.11, N = 15142082.7100754.363182.274803.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsSMT OnSMT Off0.04660.09320.13980.18640.233SE +/- 0.00040, N = 3SE +/- 0.00135, N = 5SE +/- 0.00018, N = 4SE +/- 0.00095, N = 40.106460.139690.205950.20702

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptSMT OnSMT Off90K180K270K360K450KSE +/- 2298.36, N = 3SE +/- 1964.45, N = 3SE +/- 33.22, N = 3SE +/- 92.54, N = 34078603170461632202160641. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishSMT OnSMT Off90K180K270K360K450KSE +/- 3726.56, N = 3SE +/- 1247.49, N = 3SE +/- 12.67, N = 3SE +/- 117.40, N = 34098503208851632632161151. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic ModelSMT OnSMT Off246810SE +/- 0.086611165, N = 15SE +/- 0.014728964, N = 6SE +/- 0.048140615, N = 15SE +/- 0.080164897, N = 53.7827417982.7092054284.9372655067.4806747061. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2SMT OnSMT Off100200300400500SE +/- 0.43, N = 4SE +/- 6.02, N = 12SE +/- 0.25, N = 3SE +/- 0.01, N = 3315.52439.60236.13238.911. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2SMT OnSMT Off2K4K6K8K10KSE +/- 10.75, N = 4SE +/- 150.60, N = 12SE +/- 6.13, N = 3SE +/- 0.21, N = 37888.0910989.945903.215972.641. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPUSMT OnSMT Off510152025SE +/- 0.03, N = 5SE +/- 0.35, N = 15SE +/- 0.04, N = 4SE +/- 0.03, N = 519.0213.5114.3720.88

Blender

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-OnlySMT OnSMT Off918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 316.2820.2438.4431.12

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: KershawSMT OffSMT On1200M2400M3600M4800M6000MSE +/- 37190783.06, N = 3SE +/- 8226739.60, N = 3573426666758086366671. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: Kershaw

EPYC 9754 2P: SMT On: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

EPYC 9754 2P: SMT Off: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-QuantaGrid-D54Q-2U exited on signal 11 (Segmentation fault).

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CSMT OnSMT Off110K220K330K440K550KSE +/- 4903.75, N = 15SE +/- 3792.90, N = 12SE +/- 269.21, N = 5SE +/- 396.74, N = 5491231.83536518.74298801.44292243.611. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.7SMT OffSMT On40K80K120K160K200KSE +/- 1109.71, N = 3SE +/- 1020.90, N = 3SE +/- 101.44, N = 3128050.22149056.95171120.351. (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpcre -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread

EPYC 9754 2P: SMT On: The test run did not produce a result.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DSMT OnSMT Off2K4K6K8K10KSE +/- 104.60, N = 15SE +/- 105.10, N = 15SE +/- 29.88, N = 6SE +/- 27.01, N = 69849.018635.305315.155300.291. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-OnlySMT OnSMT Off1.07552.1513.22654.3025.3775SE +/- 0.01, N = 7SE +/- 0.03, N = 15SE +/- 0.02, N = 15SE +/- 0.00, N = 74.784.313.573.62

Appleseed

Appleseed is an open-source production renderer focused on physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialSMT OffSMT On102030405040.5738.4944.30

Scene: Disney Material

EPYC 9754 2P: SMT On: The test run did not produce a result.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CSMT OnSMT Off140K280K420K560K700KSE +/- 7199.59, N = 15SE +/- 4916.03, N = 15SE +/- 1485.96, N = 6SE +/- 2132.06, N = 6591505.17658754.21289518.14279662.551. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSMT OnSMT Off510152025SE +/- 0.27, N = 4SE +/- 0.04, N = 4SE +/- 0.09, N = 5SE +/- 0.11, N = 421.6515.159.2712.001. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. HelensSMT OnSMT Off246810SE +/- 0.014293172, N = 5SE +/- 0.013893024, N = 5SE +/- 0.069356320, N = 12SE +/- 0.034771470, N = 53.9066750952.6301225995.0300315806.2163798491. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 8.0Length: 1e13SMT OnSMT Off510152025SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.02, N = 3SE +/- 0.02, N = 311.1211.1521.1321.291. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512SMT OnSMT Off50100150200250SE +/- 1.37, N = 5SE +/- 1.28, N = 5SE +/- 0.01, N = 4SE +/- 0.15, N = 4221.77223.58128.60128.121. (CXX) g++ options: -O3

Blender

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-OnlySMT OnSMT Off510152025SE +/- 0.02, N = 5SE +/- 0.04, N = 4SE +/- 0.06, N = 3SE +/- 0.06, N = 39.8711.7220.4716.49

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-OnlySMT OnSMT Off48121620SE +/- 0.02, N = 6SE +/- 0.05, N = 5SE +/- 0.05, N = 4SE +/- 0.02, N = 47.128.4515.1512.77

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveSMT OnSMT Off48121620SE +/- 0.0081, N = 6SE +/- 0.0257, N = 6SE +/- 0.0028, N = 5SE +/- 0.0006, N = 515.930514.40997.31478.17431. (CXX) g++ options: -O3 -flto -pthread

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallSMT OnSMT Off13K26K39K52K65KSE +/- 411.42, N = 5SE +/- 478.13, N = 5SE +/- 25.22, N = 5SE +/- 52.11, N = 562774.253798.651741.051784.11. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBSMT OnSMT Off12002400360048006000SE +/- 23.04, N = 15SE +/- 31.80, N = 15SE +/- 43.30, N = 15SE +/- 21.75, N = 920142976559035911. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-OnlySMT OnSMT Off0.52881.05761.58642.11522.644SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 4SE +/- 0.00, N = 42.352.091.721.74

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CSMT OnSMT Off50K100K150K200K250KSE +/- 1293.24, N = 6SE +/- 1880.70, N = 6SE +/- 228.86, N = 4SE +/- 290.95, N = 4224243.28231041.57133415.42131909.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: FastSMT OnSMT Off30060090012001500SE +/- 1.29, N = 5SE +/- 1.25, N = 5SE +/- 1.20, N = 7SE +/- 1.79, N = 6610.11693.401278.651190.751. (CXX) g++ options: -O3 -flto -pthread

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughSMT OnSMT Off306090120150SE +/- 0.22, N = 6SE +/- 0.05, N = 6SE +/- 0.01, N = 6SE +/- 0.02, N = 6134.89127.2168.3175.081. (CXX) g++ options: -O3 -flto -pthread

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPSMT OnSMT Off13002600390052006500SE +/- 23.29, N = 12SE +/- 53.03, N = 15SE +/- 0.20, N = 7SE +/- 17.08, N = 833213671624240811. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512SMT OnSMT Off90180270360450SE +/- 0.86, N = 7SE +/- 0.77, N = 6SE +/- 0.09, N = 5SE +/- 0.52, N = 5433.60430.27248.56245.541. (CXX) g++ options: -O3

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CSMT OnSMT Off14K28K42K56K70KSE +/- 348.18, N = 8SE +/- 568.39, N = 8SE +/- 285.31, N = 8SE +/- 441.00, N = 1567554.7466822.9748672.2445686.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-OnlySMT OnSMT Off1.0712.1423.2134.2845.355SE +/- 0.01, N = 7SE +/- 0.03, N = 7SE +/- 0.02, N = 7SE +/- 0.00, N = 74.764.353.573.62

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: CrownSMT OnSMT Off50100150200250SE +/- 0.16, N = 9SE +/- 0.13, N = 7SE +/- 0.05, N = 6SE +/- 0.13, N = 7210.07146.2185.29125.58

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CSMT OnSMT Off50K100K150K200K250KSE +/- 1757.80, N = 9SE +/- 2978.54, N = 13SE +/- 93.32, N = 8SE +/- 1029.98, N = 8211432.89224178.10147448.72140791.521. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian DragonSMT OnSMT Off60120180240300SE +/- 0.48, N = 9SE +/- 0.26, N = 8SE +/- 0.09, N = 6SE +/- 0.09, N = 8255.99178.30107.36157.65

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BSMT OnSMT Off50K100K150K200K250KSE +/- 732.52, N = 9SE +/- 8884.57, N = 15SE +/- 885.17, N = 9SE +/- 1110.64, N = 9236490.76246272.79161475.24149355.541. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1SMT OnSMT Off100200300400500SE +/- 0.21, N = 9SE +/- 7.21, N = 15SE +/- 0.10, N = 9SE +/- 0.05, N = 9253.12439.07234.68237.761. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1SMT OnSMT Off2K4K6K8K10KSE +/- 5.35, N = 9SE +/- 180.32, N = 15SE +/- 2.53, N = 9SE +/- 1.30, N = 96328.0710976.685867.115944.061. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CSMT OnSMT Off60K120K180K240K300KSE +/- 1284.96, N = 11SE +/- 599.41, N = 10SE +/- 104.98, N = 10SE +/- 296.36, N = 10249109.09268721.05136942.13128129.561. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 8.0Length: 1e12SMT OnSMT Off0.43740.87481.31221.74962.187SE +/- 0.010, N = 14SE +/- 0.003, N = 12SE +/- 0.003, N = 11SE +/- 0.006, N = 111.5081.1601.7961.9441. (CXX) g++ options: -O3

CPU Power Consumption Monitor

EPYC 9754 2PEPYC 9754 1POpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringSMT OnSMT Off140280420560700Min: 21.61 / Avg: 460.57 / Max: 792.38Min: 97.83 / Avg: 446.01 / Max: 702.85Min: 10.61 / Avg: 238.75 / Max: 362.1Min: 10.52 / Avg: 248.93 / Max: 397.25

154 Results Shown

CP2K Molecular Dynamics
libxsmm
PostgreSQL:
  1000 - 800 - Read Only - Average Latency
  1000 - 800 - Read Only
TensorFlow:
  CPU - 256 - ResNet-50
  CPU - 512 - GoogLeNet
OpenVKL
TensorFlow
libxsmm
Stockfish
OpenVINO:
  Vehicle Detection FP16 - CPU:
    ms
    FPS
LuxCoreRender:
  LuxCore Benchmark - CPU
  Orange Juice - CPU
Neural Magic DeepSparse:
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
MariaDB
Timed LLVM Compilation
nekRS
OpenVINO:
  Machine Translation EN To DE FP16 - CPU:
    ms
    FPS
OpenSSL:
  SHA256
  SHA512
  AES-256-GCM
  ChaCha20
  AES-128-GCM
  ChaCha20-Poly1305
Timed Linux Kernel Compilation
OSPRay
OSPRay Studio:
  2 - 4K - 16 - Path Tracer
  1 - 4K - 16 - Path Tracer
MariaDB
Timed Gem5 Compilation
Graph500:
  26:
    sssp max_TEPS
    sssp median_TEPS
    bfs max_TEPS
    bfs median_TEPS
OpenVINO:
  Person Detection FP16 - CPU:
    ms
    FPS
HeFFTe - Highly Efficient FFT for Exascale
OpenVINO:
  Person Detection FP32 - CPU:
    ms
    FPS
OSPRay Studio:
  3 - 4K - 32 - Path Tracer
  2 - 4K - 32 - Path Tracer
srsRAN Project
OpenVINO:
  Person Vehicle Bike Detection FP16 - CPU:
    ms
    FPS
  Vehicle Detection FP16-INT8 - CPU:
    ms
    FPS
OSPRay Studio:
  1 - 4K - 32 - Path Tracer
  1 - 4K - 1 - Path Tracer
OSPRay
OSPRay Studio
Neural Magic DeepSparse:
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    ms/batch
    items/sec
Timed LLVM Compilation
OSPRay Studio:
  3 - 4K - 1 - Path Tracer
  2 - 4K - 1 - Path Tracer
Blender
Timed Node.js Compilation
Timed Godot Game Engine Compilation
Appleseed:
  Material Tester
  Emily
Neural Magic DeepSparse:
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    ms/batch
    items/sec
HeFFTe - Highly Efficient FFT for Exascale
Neural Magic DeepSparse:
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
    ms/batch
    items/sec
TensorFlow
Timed Linux Kernel Compilation
TensorFlow:
  CPU - 512 - AlexNet
  CPU - 256 - AlexNet
Xmrig
OpenVINO:
  Face Detection FP16-INT8 - CPU:
    ms
    FPS
LuxCoreRender
OpenVINO:
  Face Detection FP16 - CPU:
    ms
    FPS
John The Ripper
OSPRay:
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/ao/real_time
OpenVINO:
  Weld Porosity Detection FP16-INT8 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    ms
    FPS
  Weld Porosity Detection FP16 - CPU:
    ms
    FPS
John The Ripper
OpenSSL:
  RSA4096:
    verify/s
    sign/s
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
Helsing
Neural Magic DeepSparse:
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
    ms/batch
    items/sec
SPECFEM3D
7-Zip Compression:
  Decompression Rating
  Compression Rating
SPECFEM3D
Blender
NAS Parallel Benchmarks
Liquid-DSP
SPECFEM3D
Liquid-DSP
Xmrig
NAMD
John The Ripper:
  bcrypt
  Blowfish
SPECFEM3D
miniBUDE:
  OpenMP - BM2:
    Billion Interactions/s
    GFInst/s
LuxCoreRender
Blender
nekRS
NAS Parallel Benchmarks
Aircrack-ng
NAS Parallel Benchmarks
Intel Open Image Denoise
Appleseed
NAS Parallel Benchmarks
CloverLeaf
SPECFEM3D
Primesieve
HeFFTe - Highly Efficient FFT for Exascale
Blender:
  Fishy Cat - CPU-Only
  BMW27 - CPU-Only
ASTC Encoder
miniFE
toyBrot Fractal Generator
Intel Open Image Denoise
NAS Parallel Benchmarks
ASTC Encoder:
  Fast
  Thorough
toyBrot Fractal Generator
HeFFTe - Highly Efficient FFT for Exascale
NAS Parallel Benchmarks
Intel Open Image Denoise
Embree
NAS Parallel Benchmarks
Embree
NAS Parallel Benchmarks
miniBUDE:
  OpenMP - BM1:
    Billion Interactions/s
    GFInst/s
NAS Parallel Benchmarks
Primesieve
CPU Power Consumption Monitor