Ubuntu 22.04 Server Benchmarks

AMD EPYC 7713 64-Core testing with an AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2209132-NE-UBUNTU22004
Run Management

Result Identifier: EPYC 7713 2P
Date Run: September 08 2022
Test Duration: 1 Day, 15 Hours, 49 Minutes

Result Identifier: EPYC 7713
Date Run: September 11 2022
Test Duration: 1 Day, 16 Hours, 26 Minutes

Average Test Duration: 1 Day, 16 Hours, 7 Minutes


Ubuntu 22.04 Server Benchmarks

EPYC 7713 2P
Processor: 2 x AMD EPYC 7713 64-Core @ 2.00GHz (128 Cores / 256 Threads)
Motherboard: AMD DAYTONA_X (RYM1009B BIOS)
Chipset: AMD Starship/Matisse
Memory: 512GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04
Kernel: 5.15.0-47-generic (x86_64)
Desktop: GNOME Shell 42.4
Display Server: X Server 1.21.1.3
Vulkan: 1.2.204
Compiler: GCC 11.2.0
File-System: ext4
Screen Resolution: 1920x1080

EPYC 7713 (all fields not listed are the same as EPYC 7713 2P)
Processor: AMD EPYC 7713 64-Core @ 2.00GHz (64 Cores / 128 Threads)
Memory: 256GB

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa001173

Java Details
- OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04)

Python Details
- Python 3.10.4

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

[Chart: EPYC 7713 2P vs. EPYC 7713 Comparison (Phoronix Test Suite). Per-test percentage difference of EPYC 7713 2P relative to the EPYC 7713 baseline.]

[Table: Ubuntu 22.04 Server Benchmarks. Summary of every test result for EPYC 7713 2P and EPYC 7713; the individual results are listed per test below.]

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 22.1 - Input: Carbon Nanotube
Seconds, Fewer Is Better
EPYC 7713: 71.66 (SE +/- 0.19, N = 3)
EPYC 7713 2P: 43.86 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
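The two GPAW timings above (71.66 seconds on EPYC 7713, 43.86 seconds on EPYC 7713 2P) can be turned into a scaling factor. The following is a minimal sketch and not part of the Phoronix Test Suite; the helper name `speedup` and its parameters are hypothetical. It handles both "Fewer Is Better" and "More Is Better" results so that a value above 1.0 always means the dual-socket run was faster.

```python
def speedup(single: float, dual: float, fewer_is_better: bool) -> float:
    """Return how many times faster the 2P result is than the 1P result."""
    if single <= 0 or dual <= 0:
        raise ValueError("results must be positive")
    # For time-like metrics a smaller number wins, so the ratio is inverted.
    return single / dual if fewer_is_better else dual / single


# GPAW 22.1, Input: Carbon Nanotube (Seconds, Fewer Is Better)
gpaw = speedup(single=71.66, dual=43.86, fewer_is_better=True)
print(f"GPAW 2P speedup: {gpaw:.2f}x")
```

The same helper applies to throughput-style results by passing `fewer_is_better=False`, which keeps every test on one "higher means 2P wins" scale.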

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. The test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2022.1 - Implementation: MPI CPU - Input: water_GMX50_bare
Ns Per Day, More Is Better
EPYC 7713: 5.130 (SE +/- 0.010, N = 3)
EPYC 7713 2P: 8.215 (SE +/- 0.024, N = 3)
1. (CXX) g++ options: -O3

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better
EPYC 7713: 0.45457 (SE +/- 0.00038, N = 3)
EPYC 7713 2P: 0.26712 (SE +/- 0.00056, N = 3)
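NAMD reports days/ns (fewer is better), while GROMACS reports ns per day (more is better). As a small sketch, the NAMD figures can be inverted to put both molecular dynamics results in the same unit; the function name `ns_per_day` is a hypothetical helper, not something provided by NAMD or the Phoronix Test Suite.

```python
def ns_per_day(days_per_ns: float) -> float:
    """Convert a days/ns figure into simulated nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days_per_ns must be positive")
    return 1.0 / days_per_ns


# NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
namd_days_per_ns = {"EPYC 7713": 0.45457, "EPYC 7713 2P": 0.26712}
namd_ns_per_day = {name: ns_per_day(v) for name, v in namd_days_per_ns.items()}

for name, value in namd_ns_per_day.items():
    print(f"{name}: {value:.3f} ns/day")
```

The converted numbers are only comparable across systems for the same simulation; the NAMD and GROMACS inputs differ, so the two packages should not be ranked against each other this way.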

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0 - Scale: 26

bfs median_TEPS, More Is Better
EPYC 7713: 622681000
EPYC 7713 2P: 642516000

bfs max_TEPS, More Is Better
EPYC 7713: 642278000
EPYC 7713 2P: 659467000

sssp median_TEPS, More Is Better
EPYC 7713: 254477000
EPYC 7713 2P: 302338000

sssp max_TEPS, More Is Better
EPYC 7713: 323593000
EPYC 7713 2P: 390377000

1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
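Graph500 produces four TEPS figures per system, so a single summary number is convenient. A common choice for averaging ratios is the geometric mean; the sketch below applies it to the four 2P-to-1P ratios using the values reported above. The dictionary layout and variable names are illustrative assumptions, not output of the Phoronix Test Suite.

```python
from statistics import geometric_mean

# Graph500 3.0, Scale: 26 (TEPS, More Is Better): (EPYC 7713, EPYC 7713 2P)
graph500 = {
    "bfs median_TEPS": (622681000, 642516000),
    "bfs max_TEPS": (642278000, 659467000),
    "sssp median_TEPS": (254477000, 302338000),
    "sssp max_TEPS": (323593000, 390377000),
}

# Ratio above 1.0 means the dual-socket system achieved more TEPS.
ratios = {name: dual / single for name, (single, dual) in graph500.items()}
overall = geometric_mean(list(ratios.values()))

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.3f}x")
print(f"geometric mean: {overall:.3f}x")
```

The geometric mean is used here rather than the arithmetic mean because it treats a ratio and its reciprocal symmetrically, which matters when some tests favor one system and some the other.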

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient benchmark, a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1
GFLOP/s, More Is Better
EPYC 7713: 19.11 (SE +/- 0.01, N = 3)
EPYC 7713 2P: 37.10 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

WRF

WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.

WRF 4.2.2 - Input: conus 2.5km
Seconds, Fewer Is Better
EPYC 7713: 16959.85
EPYC 7713 2P: 8650.84
1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

RELION 3.1.1 - Test: Basic - Device: CPU
Seconds, Fewer Is Better
EPYC 7713: 563.27 (SE +/- 3.66, N = 3)
EPYC 7713 2P: 290.96 (SE +/- 3.51, N = 4)
1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -lmpi_cxx -lmpi
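Each result carries an "SE +/-" figure and a run count N. Assuming SE denotes the standard error of the mean (SE = s / sqrt(N)), and assuming the SE figures are listed in the same order as the systems, the run-to-run standard deviation and the relative error can be recovered as sketched below. The helper names are hypothetical.

```python
import math


def stddev_from_se(se: float, n: int) -> float:
    """Recover the sample standard deviation from a standard error of the mean."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return se * math.sqrt(n)


def relative_se(se: float, mean: float) -> float:
    """Express the standard error as a percentage of the mean."""
    return 100.0 * se / mean


# RELION 3.1.1, Basic - CPU: (mean seconds, SE, N).
# The pairing of each SE with a system is an assumption based on listing order.
relion = {
    "EPYC 7713": (563.27, 3.66, 3),
    "EPYC 7713 2P": (290.96, 3.51, 4),
}

summary = {
    name: (stddev_from_se(se, n), relative_se(se, mean))
    for name, (mean, se, n) in relion.items()
}

for name, (sd, rel) in summary.items():
    print(f"{name}: stddev ~ {sd:.2f} s, relative SE ~ {rel:.2f}%")
```

A relative SE of around one percent or less, as here, suggests the roughly 2x gap between the two systems is far larger than the measurement noise.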

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2
Figure Of Merit, More Is Better
EPYC 7713: 1012051667 (SE +/- 863687.12, N = 3)
EPYC 7713 2P: 1923306667 (SE +/- 699794.57, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d
Seconds, Fewer Is Better
EPYC 7713: 609.68 (SE +/- 1.64, N = 3)
EPYC 7713 2P: 300.69 (SE +/- 0.29, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4
Throughput FoM, More Is Better
EPYC 7713: 270722527 (SE +/- 2567603.42, N = 15)
EPYC 7713 2P: 143950267 (SE +/- 886909.06, N = 3)
1. (CXX) g++ options: -O3 -fopenmp

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3
z/s, More Is Better
EPYC 7713: 19305.66 (SE +/- 64.60, N = 3)
EPYC 7713 2P: 36456.23 (SE +/- 104.62, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
EPYC 7713: 6.133060 (SE +/- 0.094028, N = 15)
EPYC 7713 2P: 3.556570 (SE +/- 0.079319, N = 15)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
EPYC 7713: 11.427230 (SE +/- 0.119128, N = 4)
EPYC 7713 2P: 5.675895 (SE +/- 0.039464, N = 6)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2, Problem Size: Small (CG Mflops, More Is Better)
EPYC 7713: 21875.1 (SE +/- 3.33, N = 4)
EPYC 7713 2P: 24664.8 (SE +/- 292.61, N = 15)
1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)
EPYC 7713: 20.88 (SE +/- 0.08, N = 5)
EPYC 7713 2P: 32.07 (SE +/- 0.29, N = 6)
1. (CC) gcc options: -O3 -march=native -fopenmp
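To give a feel for what a DGEMM GFLOP/s figure measures, here is a minimal sketch that times a matrix multiply and divides the conventional flop count (2·m·n·k) by the elapsed time. It is not the ACES DGEMM code: the naive single-threaded loop, the function names and the problem size are assumptions for illustration, and the real benchmark may count operations slightly differently.

```python
import time


def dgemm_flops(m, n, k):
    """Conventional flop count for C = A*B with A (m x k) and B (k x n)."""
    return 2 * m * n * k


def naive_dgemm(a, b):
    """Plain triple-loop matrix multiply on lists of lists (illustration only)."""
    m, k, n = len(a), len(b), len(b[0])
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for p in range(k):
            aip = a[i][p]
            for j in range(n):
                c[i][j] += aip * b[p][j]
    return c


size = 64  # hypothetical, deliberately tiny problem size
a = [[1.0] * size for _ in range(size)]
b = [[2.0] * size for _ in range(size)]

start = time.perf_counter()
c = naive_dgemm(a, b)
elapsed = time.perf_counter() - start

# Sustained rate = floating-point operations / seconds, scaled to 1e9.
gflops = dgemm_flops(size, size, size) / elapsed / 1e9
print(f"{gflops:.4f} GFLOP/s")
```

An interpreted triple loop will report a rate far below the figures above; the point is only the relationship between operation count, wall time and the reported unit.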

NWChem

NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.

NWChem 7.0.2, Input: C240 Buckyball (Seconds, Fewer Is Better)
EPYC 7713: 2491.1
EPYC 7713 2P: 2183.6
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

Quantum ESPRESSO 7.0, Input: AUSURF112 (Seconds, Fewer Is Better)
EPYC 7713: 397.64 (SE +/- 0.15, N = 3)
EPYC 7713 2P: 399.92 (SE +/- 0.17, N = 3)
1. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
EPYC 7713: 127998.61 (SE +/- 152.57, N = 3)
EPYC 7713 2P: 235808.54 (SE +/- 391.85, N = 4)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
EPYC 7713: 4406.02 (SE +/- 45.43, N = 15)
EPYC 7713 2P: 8338.27 (SE +/- 61.53, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
EPYC 7713: 4668.96 (SE +/- 87.49, N = 15)
EPYC 7713 2P: 9109.24 (SE +/- 151.57, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
EPYC 7713: 60659.25 (SE +/- 292.35, N = 6)
EPYC 7713 2P: 116679.26 (SE +/- 650.27, N = 8)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
EPYC 7713: 135917.43 (SE +/- 264.51, N = 4)
EPYC 7713 2P: 259899.49 (SE +/- 2852.04, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
EPYC 7713: 89950.76 (SE +/- 824.60, N = 15)
EPYC 7713 2P: 142675.55 (SE +/- 1089.48, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
EPYC 7713: 51921.71 (SE +/- 33.24, N = 3)
EPYC 7713 2P: 116527.63 (SE +/- 212.54, N = 4)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
EPYC 7713: 2648.96 (SE +/- 13.59, N = 4)
EPYC 7713 2P: 4690.06 (SE +/- 43.17, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
EPYC 7713: 57329.70 (SE +/- 290.32, N = 9)
EPYC 7713 2P: 100740.89 (SE +/- 291.87, N = 10)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
EPYC 7713: 24085.73 (SE +/- 119.51, N = 6)
EPYC 7713 2P: 45587.10 (SE +/- 340.17, N = 11)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
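One way to read paired results like these is as a speedup of the 2P configuration over the single-socket one. The sketch below is not part of the Phoronix Test Suite output: the helper name is made up, and it assumes "2P" denotes the dual-socket system so that ideal scaling would be 2x. The values are copied from a few of the NPB results above.

```python
def speedup(single, dual, higher_is_better=True):
    """Ratio by which the 2P result improves on the 1P result."""
    return dual / single if higher_is_better else single / dual


# (EPYC 7713, EPYC 7713 2P) Total Mop/s values from the NPB results above.
npb = {
    "BT.C": (127998.61, 235808.54),
    "EP.C": (4406.02, 8338.27),
    "SP.B": (89950.76, 142675.55),
    "SP.C": (51921.71, 116527.63),
}

for name, (one_p, two_p) in npb.items():
    s = speedup(one_p, two_p)
    # Efficiency relative to the ideal 2x from doubling the socket count.
    print(f"{name}: {s:.2f}x speedup, {s / 2:.0%} of ideal 2x")
```

For "Fewer Is Better" metrics such as seconds, pass `higher_is_better=False` so the ratio is inverted and a value above 1 still means the 2P system is faster.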

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
EPYC 7713: 6.845 (SE +/- 0.014, N = 6)
EPYC 7713 2P: 6.283 (SE +/- 0.026, N = 7)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
EPYC 7713: 46.55 (SE +/- 0.08, N = 3)
EPYC 7713 2P: 26.74 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
EPYC 7713: 43.39 (SE +/- 0.47, N = 4)
EPYC 7713 2P: 47.28 (SE +/- 0.41, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
EPYC 7713: 88.02 (SE +/- 1.40, N = 15)
EPYC 7713 2P: 89.28 (SE +/- 1.37, N = 15)
1. (CXX) g++ options: -O2 -lOpenCL

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 9, Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better)
EPYC 7713: 139.89
EPYC 7713 2P: 124.96
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenFOAM 9, Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better)
EPYC 7713: 633.93
EPYC 7713 2P: 281.70
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenFOAM 9, Input: drivaerFastback, Large Mesh Size - Mesh Time (Seconds, Fewer Is Better)
EPYC 7713: 888.82
EPYC 7713 2P: 776.38
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenFOAM 9, Input: drivaerFastback, Large Mesh Size - Execution Time (Seconds, Fewer Is Better)
EPYC 7713: 14972.97
EPYC 7713 2P: 7052.62
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm

OpenVINO

This is a test of Intel OpenVINO, a toolkit for neural networks, using its built-in benchmarking support to analyze the throughput and latency of various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2022.2.dev, Model: Face Detection FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 8.97 (SE +/- 0.01, N = 3)
EPYC 7713 2P: 17.59 (SE +/- 0.18, N = 5)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Face Detection FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 3555.78 (SE +/- 2.50, N = 3; MIN: 3306.65 / MAX: 3702.18)
EPYC 7713 2P: 3597.47 (SE +/- 39.69, N = 5; MIN: 1881.18 / MAX: 6268.95)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
EPYC 7713: 24.59 (SE +/- 0.02, N = 3)
EPYC 7713 2P: 45.84 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 1293.51 (SE +/- 1.10, N = 3; MIN: 1118.15 / MAX: 1339.17)
EPYC 7713 2P: 1388.41 (SE +/- 2.23, N = 3; MIN: 1233.32 / MAX: 1772.3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 30742.17 (SE +/- 11.83, N = 3)
EPYC 7713 2P: 37561.07 (SE +/- 533.11, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 2.03 (SE +/- 0.00, N = 3; MIN: 0.97 / MAX: 18.02)
EPYC 7713 2P: 3.27 (SE +/- 0.06, N = 3; MIN: 0.88 / MAX: 81.36)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better)
EPYC 7713: 49691.03 (SE +/- 78.11, N = 3)
EPYC 7713 2P: 57763.89 (SE +/- 378.72, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 1.06 (SE +/- 0.00, N = 3; MIN: 0.55 / MAX: 15.24)
EPYC 7713 2P: 2.11 (SE +/- 0.01, N = 3; MIN: 0.53 / MAX: 66.38)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Detection FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 6.82 (SE +/- 0.02, N = 3)
EPYC 7713 2P: 13.10 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 4601.78 (SE +/- 8.17, N = 3; MIN: 2414.7 / MAX: 5219.01)
EPYC 7713 2P: 4777.15 (SE +/- 16.64, N = 3; MIN: 2420.5 / MAX: 6044.24)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Detection FP32 - Device: CPU (FPS, More Is Better)
EPYC 7713: 6.81 (SE +/- 0.01, N = 3)
EPYC 7713 2P: 13.09 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Detection FP32 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 4600.94 (SE +/- 5.38, N = 3; MIN: 2381.08 / MAX: 5204.91)
EPYC 7713 2P: 4774.37 (SE +/- 9.45, N = 3; MIN: 2371.17 / MAX: 6026.05)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
EPYC 7713: 2462.40 (SE +/- 1.64, N = 3)
EPYC 7713 2P: 4559.42 (SE +/- 0.96, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 25.97 (SE +/- 0.02, N = 3; MIN: 12.2 / MAX: 39.52)
EPYC 7713 2P: 28.05 (SE +/- 0.01, N = 3; MIN: 10.65 / MAX: 66.97)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 1030.76 (SE +/- 0.14, N = 3)
EPYC 7713 2P: 1905.51 (SE +/- 2.71, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 31.02 (SE +/- 0.01, N = 3; MIN: 15.59 / MAX: 49.44)
EPYC 7713 2P: 33.55 (SE +/- 0.05, N = 3; MIN: 14.43 / MAX: 183.11)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
EPYC 7713: 1689.18 (SE +/- 1.36, N = 3)
EPYC 7713 2P: 3136.14 (SE +/- 2.08, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 18.93 (SE +/- 0.02, N = 3; MIN: 11.41 / MAX: 69.86)
EPYC 7713 2P: 20.38 (SE +/- 0.01, N = 3; MIN: 8.69 / MAX: 87.48)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Vehicle Detection FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 935.58 (SE +/- 3.92, N = 3)
EPYC 7713 2P: 1788.34 (SE +/- 12.59, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 34.17 (SE +/- 0.14, N = 3; MIN: 17.07 / MAX: 68.69)
EPYC 7713 2P: 35.75 (SE +/- 0.25, N = 3; MIN: 18.65 / MAX: 151.87)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 1535.08 (SE +/- 0.60, N = 3)
EPYC 7713 2P: 2884.44 (SE +/- 3.95, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 20.82 (SE +/- 0.01, N = 3; MIN: 12.46 / MAX: 44.77)
EPYC 7713 2P: 22.16 (SE +/- 0.03, N = 3; MIN: 11.48 / MAX: 97.44)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
EPYC 7713: 116.33 (SE +/- 0.19, N = 3)
EPYC 7713 2P: 218.13 (SE +/- 0.45, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

OpenVINO 2022.2.dev, Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
EPYC 7713: 274.57 (SE +/- 0.42, N = 3; MIN: 116.27 / MAX: 347.98)
EPYC 7713 2P: 292.82 (SE +/- 0.60, N = 3; MIN: 139.85 / MAX: 449.45)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -flto -shared

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.11, Model: yolov4 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 417 (SE +/- 1.15, N = 3)
EPYC 7713 2P: 330 (SE +/- 5.46, N = 12)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11, Model: fcn-resnet101-11 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 185 (SE +/- 3.69, N = 12)
EPYC 7713 2P: 237 (SE +/- 2.05, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11, Model: super-resolution-10 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 6631 (SE +/- 6.02, N = 3)
EPYC 7713 2P: 4569 (SE +/- 45.38, N = 12)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11, Model: bertsquad-12 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 713 (SE +/- 0.50, N = 3)
EPYC 7713 2P: 668 (SE +/- 2.52, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11, Model: GPT-2 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 9368 (SE +/- 26.19, N = 3)
EPYC 7713 2P: 7878 (SE +/- 48.25, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11, Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
EPYC 7713: 1538 (SE +/- 5.25, N = 3)
EPYC 7713 2P: 877 (SE +/- 14.15, N = 12)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0, Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
EPYC 7713: 20252.1 (SE +/- 173.09, N = 3)
EPYC 7713 2P: 39742.0 (SE +/- 448.28, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
EPYC 7713: 21943.0 (SE +/- 76.73, N = 3)
EPYC 7713 2P: 43735.8 (SE +/- 263.02, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0, Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
EPYC 7713: 520.84 (SE +/- 1.11, N = 4)
EPYC 7713 2P: 319.76 (SE +/- 1.13, N = 4)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently uses CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
EPYC 7713: 12.00 (SE +/- 0.07, N = 4)
EPYC 7713 2P: 19.44 (SE +/- 0.36, N = 15)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28, Backend: BLAS (Nodes Per Second, More Is Better)
EPYC 7713: 3893 (SE +/- 30.99, N = 3)
EPYC 7713 2P: 4449 (SE +/- 49.21, N = 4)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.28, Backend: Eigen (Nodes Per Second, More Is Better)
EPYC 7713: 3598 (SE +/- 38.17, N = 3)
EPYC 7713 2P: 4093 (SE +/- 42.95, N = 4)
1. (CXX) g++ options: -flto -pthread

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.6, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7713: 0.934706 (SE +/- 0.000519, N = 7; MIN: 0.87)
EPYC 7713 2P: 0.640833 (SE +/- 0.003112, N = 7; MIN: 0.59)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
EPYC 7713: 12.56 (SE +/- 0.24, N = 12; MIN: 9.16)
EPYC 7713 2P: 30.01 (SE +/- 0.34, N = 4; MIN: 23.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread