Ubuntu 23.10 Server Benchmarks

2 x Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310301-NE-UBUNTU23169
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Xeon Platinum 8490H 2P
October 29 2023
  23 Hours, 25 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Ubuntu 23.10 Server BenchmarksOpenBenchmarking.orgPhoronix Test Suite2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads)Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS)Intel Device 1bce1008GB3201GB Micron_7450_MTFDKCC3T2TFSASPEED2 x Intel X710 for 10GBASE-TUbuntu 23.106.6.0-rc5-phx-patched (x86_64)GNOME Shell 45.0X Server 1.21.1.7GCC 13.2.0ext41920x1200ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionUbuntu 23.10 Server Benchmarks PerformanceSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 - OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu1)- Python 3.11.6- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

Ubuntu 23.10 Server Benchmarkstscp: AI Chess Performancestress-ng: NUMAstress-ng: CPU Stressstress-ng: Matrix Mathstress-ng: AVX-512 VNNIspecfem3d: Layered Halfspacestress-ng: Vector Mathstress-ng: Floating Pointstress-ng: Matrix 3D Mathhelsing: 14 digitstress-ng: Memory Copyingstress-ng: Vector Shufflestress-ng: Wide Vector Mathspecfem3d: Mount St. Helensstress-ng: Fused Multiply-Addastcenc: Mediumspecfem3d: Tomographic Modelstress-ng: IO_uringspecfem3d: Homogeneous Halfspacecrafty: Elapsed Timespecfem3d: Water-layered Halfspacestress-ng: Vector Floating Pointastcenc: Thoroughrawtherapee: Total Benchmark Timelibxsmm: 128astcenc: Exhaustivelibxsmm: 256toktx: Zstd Compression 9libxsmm: 64heffte: c2c - FFTW - float - 128heffte: c2c - FFTW - float - 256toktx: Zstd Compression 19webp: Defaultwebp: Quality 100webp: Quality 100, Losslesseasywave: e2Asean Grid + BengkuluSept2007 Source - 240webp: Quality 100, Highest Compressioneasywave: e2Asean Grid + BengkuluSept2007 Source - 1200webp: Quality 100, Lossless, Highest Compressionsecuremark: SecureMark-TLSxmrig: Monero - 1Measywave: e2Asean Grid + BengkuluSept2007 Source - 2400xmrig: Wownero - 1Mgraph500: 26graph500: 26graph500: 26graph500: 26minibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2openradioss: Bumper Beamopenradioss: Chrysler Neon 1Mopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containermafft: Multiple Sequence Alignment - LSU RNAtensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 32 - ResNet-50tensorflow: CPU - 64 - ResNet-50tensorflow: CPU - 256 - ResNet-50cloverleaf: clover_bmcloverleaf: clover_bm16cloverleaf: clover_bm64_shortgromacs: MPI CPU - water_GMX50_barehpcg: 104 104 104 - 60hpcg: 144 144 144 - 60npb: BT.Cnpb: CG.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Crodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusternamd: ATPase Simulation - 327,506 Atomsonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUaskap: tConvolve MT - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve OpenMP - Degriddingaskap: Hogbom Clean OpenMPmt-dgemm: Sustained Floating-Point Ratepennant: sedovbigpennant: leblancbigamg: lulesh: nwchem: C240 Buckyballqmcpack: H4_aeqmcpack: Li2_STO_aeqmcpack: LiH_ae_MSDqmcpack: simple-H2Oqmcpack: O_ae_pyscf_UHFqmcpack: FeCO6_b3lyp_gmsminife: Smallincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directioncoremark: CoreMark Size 666 - Iterations Per Secondaircrack-ng: primesieve: 1e12primesieve: 1e13build-apache: Time To Compilestockfish: Total Timecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingjohn-the-ripper: bcryptjohn-the-ripper: WPA PSKjohn-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512john-the-ripper: MD5build-llvm: Ninjaasmfish: 1024 Hash Memory, 26 Depthcpuminer-opt: Magicpuminer-opt: scryptcpuminer-opt: Deepcoincpuminer-opt: Ringcoincpuminer-opt: Blake-2 Scpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: Myriad-Groestlcpuminer-opt: LBC, LBRY Creditscpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Triple SHA-256, Onecoinbuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigsysbench: RAM / Memorysysbench: CPUkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacetachyon: Total Timesvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Kx265: Bosphorus 4Kblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyuvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fasteravifenc: 0avifenc: 2avifenc: 6build-godot: Time To Compileembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyopenvkl: vklBenchmarkCPU ISPCospray: particle_volume/ao/real_timeospray: particle_volume/scivis/real_timeospray: particle_volume/pathtracer/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/pathtracer/real_timeospray-studio: 1 - 4K - 1 - Path Tracer - CPUospray-studio: 2 - 4K - 1 - Path Tracer - CPUospray-studio: 3 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 4K - 16 - Path Tracer - CPUospray-studio: 1 - 4K - 32 - Path Tracer - CPUospray-studio: 2 - 4K - 16 - Path Tracer - CPUospray-studio: 2 - 4K - 32 - Path Tracer - CPUospray-studio: 3 - 4K - 16 - Path Tracer - CPUospray-studio: 3 - 4K - 32 - Path Tracer - CPUluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUrays1bench: Large Sceneappleseed: Emilyappleseed: Disney Materialappleseed: Material Testerbuild-gem5: Time To Compilebuild-mesa: Time To Compilebuild-nodejs: Time To Compileliquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512liquid-dsp: 240 - 256 - 512duckdb: IMDBduckdb: TPC-H Parquetopenssl: SHA256openssl: SHA512openssl: RSA4096openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runmemcached: 1:10memcached: 1:100redis: GET - 500redis: SET - 500rocksdb: Rand Readrocksdb: Update Randrocksdb: Read While Writingrocksdb: Read Rand Write Randcassandra: Writespgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average LatencyXeon Platinum 8490H 2P1304138727.71255998.33535547.479132978.0316.091583044545315.6139013.8017872.3550.29532927.66640010.147628210.057.195225644252134621.62489.10709.7190037942907148.3111.898440020875526317.623881851249388.3079.235953.6143007.07.95171615.43.3271734.1143.877173.52719.39415.6210.021.313.2173.0353.9950.5226914544685.4132.60060163.0105259000010928300003808310005235610003375.861135.0343493.899139.75679.76114.3126.98127.7288.39107.377.82865.8385.04104.50125.6913.80218.8524.4315.32562.146859.7630335832.7061383.7314291.23117358.863923.37263425.33163287.91151986.4926.58881.69632.5554.0567.7660.2493110.609753.449190.3314930.4313570.4361141069.83812.068217.15138.01636.8847.07637.5947.023888.767.70413.46289.7414886.198.041529.8719.595775.4120.7026988.024.3721058.315.691905.2962.92738.3340.4247236.732.527621.2315.644412.5727.16140158.490.382993.0640.07149361.160.297114.1711918.038215.443732.620770.949109.4783.45845.89265411.409432.584014317973333344308.3411866.613.32100.92094.09829.113217.29128.4741132.9195.2646741.748079886.255921464481522.21032654928.7012.33627.41820.43126369930760231955107917628477874817594918342533316127600110.9502305201682698.321196.44323706852.3953490020904.24133870468105921027097338487727.638184.1148820.88308395.3931.6632.3767.9968.8070.76265978811691759259765100612.99104.76279.513187.774186.68728.4514.0437.0919.62146.6247.1824.9528.0154.2055.8356.336.29710.20667.13736.9042.991110.069122.2639151.96114.47270034.706234.5618157.84136.052635.133714.0187889907105913995331701425833766167923855710.887.3116.848.168.80610.24129.68837347.686893229.486398163.30320.673108.24576785000012096000001422633333123.742157.6821214133001774344053917354325.11927627.662029137328715347571046631306072098197437758714310494.97506.27503.663315903.313375000.053706874.502424580.58511803968491801931344820198331210187537131.3356762914.787OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceXeon Platinum 8490H 2P300K600K900K1200K1500KSE +/- 956.11, N = 1213041381. (CC) gcc options: -O3 -march=native

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: NUMAXeon Platinum 8490H 2P160320480640800SE +/- 2.38, N = 3727.711. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU StressXeon Platinum 8490H 2P50K100K150K200K250KSE +/- 95.00, N = 3255998.331. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix MathXeon Platinum 8490H 2P110K220K330K440K550KSE +/- 4843.33, N = 3535547.471. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVX-512 VNNIXeon Platinum 8490H 2P2M4M6M8M10MSE +/- 44564.47, N = 39132978.031. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered HalfspaceXeon Platinum 8490H 2P48121620SE +/- 0.05, N = 316.091. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector MathXeon Platinum 8490H 2P120K240K360K480K600KSE +/- 220.90, N = 3545315.611. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Floating PointXeon Platinum 8490H 2P8K16K24K32K40KSE +/- 8.85, N = 339013.801. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix 3D MathXeon Platinum 8490H 2P4K8K12K16K20KSE +/- 35.59, N = 317872.351. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitXeon Platinum 8490H 2P1122334455SE +/- 0.68, N = 350.301. (CC) gcc options: -O2 -pthread

Digit Range: 12 digit

Xeon Platinum 8490H 2P: The test run did not produce a result.

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Memory CopyingXeon Platinum 8490H 2P7K14K21K28K35KSE +/- 92.04, N = 332927.661. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector ShuffleXeon Platinum 8490H 2P140K280K420K560K700KSE +/- 917.04, N = 3640010.141. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Wide Vector MathXeon Platinum 8490H 2P1.6M3.2M4.8M6.4M8MSE +/- 1953.33, N = 37628210.051. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. HelensXeon Platinum 8490H 2P246810SE +/- 0.043768819, N = 47.1952256441. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Fused Multiply-AddXeon Platinum 8490H 2P50M100M150M200M250MSE +/- 1449917.13, N = 3252134621.621. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: MediumXeon Platinum 8490H 2P110220330440550SE +/- 0.82, N = 7489.111. (CXX) g++ options: -O3 -flto -pthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic ModelXeon Platinum 8490H 2P36912159.7190037941. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: IO_uringXeon Platinum 8490H 2P600K1200K1800K2400K3000KSE +/- 7213.99, N = 32907148.311. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous HalfspaceXeon Platinum 8490H 2P3691215SE +/- 0.12, N = 311.901. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeXeon Platinum 8490H 2P2M4M6M8M10MSE +/- 4555.37, N = 387552631. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered HalfspaceXeon Platinum 8490H 2P48121620SE +/- 0.05, N = 317.621. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Floating PointXeon Platinum 8490H 2P50K100K150K200K250KSE +/- 1218.29, N = 3249388.301. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ThoroughXeon Platinum 8490H 2P20406080100SE +/- 0.03, N = 679.241. (CXX) g++ options: -O3 -flto -pthread

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark TimeXeon Platinum 8490H 2P1224364860SE +/- 0.10, N = 353.611. RawTherapee, version 5.9, command line.

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

Xeon Platinum 8490H 2P: The test run did not produce a result. E: sh: 1: exec: ./heffte: not found

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128Xeon Platinum 8490H 2P6001200180024003000SE +/- 36.06, N = 93007.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: ExhaustiveXeon Platinum 8490H 2P246810SE +/- 0.0630, N = 57.95171. (CXX) g++ options: -O3 -flto -pthread

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256Xeon Platinum 8490H 2P30060090012001500SE +/- 13.65, N = 31615.41. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

KTX-Software toktx

This is a benchmark of The Khronos Group's KTX-Software library and tools. KTX-Software provides "toktx" for converting/creating in the KTX container format for image textures. This benchmark times how long it takes to convert to KTX 2.0 format with various settings using a reference PNG sample input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 9Xeon Platinum 8490H 2P0.74861.49722.24582.99443.743SE +/- 0.006, N = 93.327

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64Xeon Platinum 8490H 2P400800120016002000SE +/- 11.92, N = 151734.11. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

HeFFTe - Highly Efficient FFT for Exascale

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128Xeon Platinum 8490H 2P306090120150SE +/- 1.02, N = 15143.881. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256Xeon Platinum 8490H 2P4080120160200SE +/- 2.20, N = 6173.531. (CXX) g++ options: -O3

KTX-Software toktx

This is a benchmark of The Khronos Group's KTX-Software library and tools. KTX-Software provides "toktx" for converting/creating in the KTX container format for image textures. This benchmark times how long it takes to convert to KTX 2.0 format with various settings using a reference PNG sample input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 19Xeon Platinum 8490H 2P510152025SE +/- 0.02, N = 319.39

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: DefaultXeon Platinum 8490H 2P48121620SE +/- 0.00, N = 1115.621. (CC) gcc options: -fvisibility=hidden -O2 -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100Xeon Platinum 8490H 2P3691215SE +/- 0.01, N = 1010.021. (CC) gcc options: -fvisibility=hidden -O2 -lm

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessXeon Platinum 8490H 2P0.29480.58960.88441.17921.474SE +/- 0.00, N = 31.311. (CC) gcc options: -fvisibility=hidden -O2 -lm

easyWave

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240Xeon Platinum 8490H 2P0.72381.44762.17142.89523.619SE +/- 0.070, N = 153.2171. (CXX) g++ options: -O3 -fopenmp

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionXeon Platinum 8490H 2P0.68181.36362.04542.72723.409SE +/- 0.00, N = 63.031. (CC) gcc options: -fvisibility=hidden -O2 -lm

easyWave

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200Xeon Platinum 8490H 2P1224364860SE +/- 0.66, N = 1554.001. (CXX) g++ options: -O3 -fopenmp

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionXeon Platinum 8490H 2P0.1170.2340.3510.4680.585SE +/- 0.00, N = 30.521. (CC) gcc options: -fvisibility=hidden -O2 -lm

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSXeon Platinum 8490H 2P60K120K180K240K300KSE +/- 444.49, N = 32691451. (CC) gcc options: -pedantic -O3

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1MXeon Platinum 8490H 2P10K20K30K40K50KSE +/- 322.54, N = 344685.41. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

easyWave

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400Xeon Platinum 8490H 2P306090120150SE +/- 1.64, N = 12132.601. (CXX) g++ options: -O3 -fopenmp

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Wownero - Hash Count: 1MXeon Platinum 8490H 2P13K26K39K52K65KSE +/- 483.28, N = 360163.01. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26Xeon Platinum 8490H 2P200M400M600M800M1000M10525900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26Xeon Platinum 8490H 2P200M400M600M800M1000M10928300001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26Xeon Platinum 8490H 2P80M160M240M320M400M3808310001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26Xeon Platinum 8490H 2P110M220M330M440M550M5235610001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1Xeon Platinum 8490H 2P7001400210028003500SE +/- 7.09, N = 73375.861. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1Xeon Platinum 8490H 2P306090120150SE +/- 0.28, N = 7135.031. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Xeon Platinum 8490H 2P7001400210028003500SE +/- 26.25, N = 33493.901. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Xeon Platinum 8490H 2P306090120150SE +/- 1.05, N = 3139.761. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenRadioss

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamXeon Platinum 8490H 2P20406080100SE +/- 0.39, N = 379.76

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MXeon Platinum 8490H 2P306090120150SE +/- 0.07, N = 3114.31

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestXeon Platinum 8490H 2P612182430SE +/- 0.06, N = 326.98

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldXeon Platinum 8490H 2P306090120150SE +/- 0.28, N = 3127.72

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationXeon Platinum 8490H 2P20406080100SE +/- 0.29, N = 388.39

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop ContainerXeon Platinum 8490H 2P20406080100SE +/- 0.48, N = 3107.37

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

Test: Basic - Device: CPU

Xeon Platinum 8490H 2P: The test quit with a non-zero exit status. E: MPI_ABORT was invoked on rank 99 in communicator MPI_COMM_WORLD

WRF

WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.

Input: conus 2.5km

Xeon Platinum 8490H 2P: The test quit with a non-zero exit status. E: MPI_ABORT was invoked on rank 88 in communicator MPI_COMM_WORLD

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAXeon Platinum 8490H 2P246810SE +/- 0.016, N = 67.8281. (CC) gcc options: -std=c99 -O3 -lm -lpthread

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50Xeon Platinum 8490H 2P1530456075SE +/- 0.44, N = 365.83

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50Xeon Platinum 8490H 2P20406080100SE +/- 0.84, N = 385.04

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50Xeon Platinum 8490H 2P20406080100SE +/- 0.55, N = 3104.50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: ResNet-50Xeon Platinum 8490H 2P306090120150SE +/- 1.44, N = 3125.69

CloverLeaf

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmXeon Platinum 8490H 2P48121620SE +/- 0.13, N = 413.801. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16Xeon Platinum 8490H 2P50100150200250SE +/- 1.43, N = 3218.851. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortXeon Platinum 8490H 2P612182430SE +/- 0.12, N = 324.431. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bareXeon Platinum 8490H 2P48121620SE +/- 0.01, N = 315.331. (CXX) g++ options: -O3

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60Xeon Platinum 8490H 2P1428425670SE +/- 0.32, N = 362.151. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60Xeon Platinum 8490H 2P1326395265SE +/- 0.71, N = 459.761. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CXeon Platinum 8490H 2P70K140K210K280K350KSE +/- 481.05, N = 5335832.701. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CXeon Platinum 8490H 2P13K26K39K52K65KSE +/- 513.63, N = 1561383.731. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DXeon Platinum 8490H 2P3K6K9K12K15KSE +/- 511.43, N = 1414291.231. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CXeon Platinum 8490H 2P30K60K90K120K150KSE +/- 827.96, N = 15117358.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DXeon Platinum 8490H 2P8001600240032004000SE +/- 29.28, N = 43923.371. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CXeon Platinum 8490H 2P60K120K180K240K300KSE +/- 522.02, N = 5263425.331. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CXeon Platinum 8490H 2P30K60K90K120K150KSE +/- 923.73, N = 9163287.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CXeon Platinum 8490H 2P30K60K90K120K150KSE +/- 733.56, N = 4151986.491. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDXeon Platinum 8490H 2P612182430SE +/- 0.05, N = 326.591. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DXeon Platinum 8490H 2P20406080100SE +/- 0.71, N = 381.701. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteXeon Platinum 8490H 2P816243240SE +/- 0.73, N = 1232.561. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverXeon Platinum 8490H 2P0.91261.82522.73783.65044.563SE +/- 0.036, N = 84.0561. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterXeon Platinum 8490H 2P246810SE +/- 0.029, N = 67.7661. (CXX) g++ options: -O2 -lOpenCL

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsXeon Platinum 8490H 2P0.05610.11220.16830.22440.2805SE +/- 0.00196, N = 30.24931

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P3691215SE +/- 0.20, N = 1510.61MIN: 4.081. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P0.77611.55222.32833.10443.8805SE +/- 0.01627, N = 53.44919MIN: 2.61. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P0.07460.14920.22380.29840.373SE +/- 0.002464, N = 70.331493MIN: 0.281. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P0.09710.19420.29130.38840.4855SE +/- 0.002789, N = 150.431357MIN: 0.341. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P0.09810.19620.29430.39240.4905SE +/- 0.000459, N = 90.436114MIN: 0.371. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P2004006008001000SE +/- 11.34, N = 31069.83MIN: 1032.81. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUXeon Platinum 8490H 2P2004006008001000SE +/- 2.08, N = 3812.07MIN: 794.511. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenVINO

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUXeon Platinum 8490H 2P50100150200250SE +/- 0.02, N = 3217.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUXeon Platinum 8490H 2P306090120150SE +/- 0.01, N = 3138.01MIN: 106.14 / MAX: 239.921. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUXeon Platinum 8490H 2P140280420560700SE +/- 0.89, N = 3636.881. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUXeon Platinum 8490H 2P1122334455SE +/- 0.07, N = 347.07MIN: 35.66 / MAX: 89.011. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUXeon Platinum 8490H 2P140280420560700SE +/- 0.62, N = 3637.591. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUXeon Platinum 8490H 2P1122334455SE +/- 0.05, N = 347.02MIN: 36.07 / MAX: 90.911. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUXeon Platinum 8490H 2P8001600240032004000SE +/- 1.48, N = 33888.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUXeon Platinum 8490H 2P246810SE +/- 0.00, N = 37.70MIN: 6.8 / MAX: 26.791. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P90180270360450SE +/- 0.73, N = 3413.461. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P60120180240300SE +/- 0.50, N = 3289.74MIN: 209.96 / MAX: 343.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUXeon Platinum 8490H 2P3K6K9K12K15KSE +/- 6.23, N = 314886.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUXeon Platinum 8490H 2P246810SE +/- 0.00, N = 38.04MIN: 6.98 / MAX: 23.031. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUXeon Platinum 8490H 2P30060090012001500SE +/- 0.78, N = 31529.871. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUXeon Platinum 8490H 2P510152025SE +/- 0.01, N = 319.59MIN: 17.9 / MAX: 51.931. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P12002400360048006000SE +/- 7.17, N = 35775.411. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P510152025SE +/- 0.03, N = 320.70MIN: 13.41 / MAX: 40.21. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUXeon Platinum 8490H 2P6K12K18K24K30KSE +/- 19.46, N = 326988.021. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUXeon Platinum 8490H 2P0.98331.96662.94993.93324.9165SE +/- 0.00, N = 34.37MIN: 3.7 / MAX: 47.421. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUXeon Platinum 8490H 2P5K10K15K20K25KSE +/- 3.18, N = 321058.311. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUXeon Platinum 8490H 2P1.28032.56063.84095.12126.4015SE +/- 0.00, N = 35.69MIN: 5.01 / MAX: 17.971. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUXeon Platinum 8490H 2P400800120016002000SE +/- 5.24, N = 31905.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUXeon Platinum 8490H 2P1428425670SE +/- 0.17, N = 362.92MIN: 49.92 / MAX: 113.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUXeon Platinum 8490H 2P160320480640800SE +/- 8.50, N = 3738.331. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUXeon Platinum 8490H 2P918273645SE +/- 0.46, N = 340.42MIN: 23.35 / MAX: 265.51. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P10K20K30K40K50KSE +/- 28.85, N = 347236.731. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUXeon Platinum 8490H 2P0.5671.1341.7012.2682.835SE +/- 0.00, N = 32.52MIN: 2.28 / MAX: 17.011. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUXeon Platinum 8490H 2P16003200480064008000SE +/- 16.20, N = 37621.231. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUXeon Platinum 8490H 2P48121620SE +/- 0.03, N = 315.64MIN: 12.92 / MAX: 33.671. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUXeon Platinum 8490H 2P9001800270036004500SE +/- 19.21, N = 34412.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUXeon Platinum 8490H 2P612182430SE +/- 0.11, N = 327.16MIN: 21.69 / MAX: 63.961. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUXeon Platinum 8490H 2P30K60K90K120K150KSE +/- 1481.71, N = 4140158.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUXeon Platinum 8490H 2P0.08550.1710.25650.3420.4275SE +/- 0.00, N = 40.38MIN: 0.31 / MAX: 40.781. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUXeon Platinum 8490H 2P6001200180024003000SE +/- 1.85, N = 32993.061. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUXeon Platinum 8490H 2P918273645