AMD EPYC 7713 64-Core testing with an AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
EPYC 7713 2P Processor: 2 x AMD EPYC 7713 64-Core @ 2.00GHz (128 Cores / 256 Threads), Motherboard: AMD DAYTONA_X (RYM1009B BIOS), Chipset: AMD Starship/Matisse, Memory: 512GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 22.04, Kernel: 5.15.0-47-generic (x86_64), Desktop: GNOME Shell 42.4, Display Server: X Server 1.21.1.3, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173
Java Notes: OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04)
Python Notes: Python 3.10.4
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
EPYC 7713: Changed Processor to AMD EPYC 7713 64-Core @ 2.00GHz (64 Cores / 128 Threads).
Changed Memory to 256GB.
EPYC 7713 2P vs. EPYC 7713 Comparison (Phoronix Test Suite): chart of per-test percentage differences relative to the baseline, ranging from 2% to 396.9%.
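The percentages in the comparison above can be reproduced, at least approximately, from the raw results. The following is a minimal sketch, assuming the chart reports how far the better result leads the worse one; the function name `compare` and its `higher_is_better` flag are made up for this example and are not part of the Phoronix Test Suite. The inputs are the OpenFOAM drivaerFastback Large Mesh Size mesh times from this result file.

```python
def compare(name_a, a, name_b, b, higher_is_better):
    """Return (winner, lead_in_percent) for two positive benchmark results.

    Hypothetical helper: the lead is max/min - 1, which is the same for
    "more is better" and "fewer is better" metrics; only the winner differs.
    On a tie, name_b is returned with a lead of 0.0.
    """
    if a <= 0 or b <= 0:
        raise ValueError("results must be positive")
    a_wins = (a > b) if higher_is_better else (a < b)
    winner = name_a if a_wins else name_b
    lead = (max(a, b) / min(a, b) - 1.0) * 100.0
    return winner, round(lead, 1)


if __name__ == "__main__":
    # OpenFOAM mesh time in seconds (fewer is better), values from this file.
    print(compare("EPYC 7713 2P", 776.38, "EPYC 7713", 888.82, higher_is_better=False))
```

The same helper applies to throughput metrics by passing `higher_is_better=True`.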
Ubuntu 22.04 Server Benchmarks
Test | EPYC 7713 2P | EPYC 7713
wrf: conus 2.5km | 8650.839 | 16959.848
openfoam: drivaerFastback, Large Mesh Size - Execution Time | 7052.62 | 14972.97
openfoam: drivaerFastback, Large Mesh Size - Mesh Time | 776.38 | 888.82
spec-jbb2015: SPECjbb2015-Composite critical-jOPS | 68946 | 85389
spec-jbb2015: SPECjbb2015-Composite max-jOPS | 130899 | 127957
mysqlslap: 4096 | 140 | 247
mysqlslap: 2048 | 150 | 615
nwchem: C240 Buckyball | 2183.6 | 2491.1
renaissance: ALS Movie Lens | 24613.1 | 18814.1
relion: Basic - CPU | 290.961 | 563.271
brl-cad: VGR Performance Metric | 3240515 | 697698
incompact3d: X3D-benchmarking input.i3d | 300.691142 | 609.677470
rodinia: OpenMP HotSpot3D | 89.280 | 88.019
stockfish: Total Time | 279486942 | 149383689
lczero: Eigen | 4093 | 3598
lczero: BLAS | 4449 | 3893
renaissance: Savina Reactors.IO | 12254.2 | 8004.9
qe: AUSURF112 | 399.92 | 397.64
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 2840.05 | 1229.56
pgbench: 100 - 250 - Read Write - Average Latency | 15.076 | 3.400
pgbench: 100 - 250 - Read Write | 16633 | 73527
asmfish: 1024 Hash Memory, 26 Depth | 248870519 | 138776102
pgbench: 100 - 500 - Read Write - Average Latency | 35.094 | 7.980
pgbench: 100 - 500 - Read Write | 14261 | 62654
renaissance: Apache Spark PageRank | 3985.7 | 3067.3
graph500: 26 | 390377000 | 323593000
graph500: 26 | 302338000 | 254477000
graph500: 26 | 659467000 | 642278000
graph500: 26 | 642516000 | 622681000
onnx: ArcFace ResNet-100 - CPU - Standard | 877 | 1538
onnx: fcn-resnet101-11 - CPU - Standard | 237 | 185
onnx: yolov4 - CPU - Standard | 330 | 417
onnx: super-resolution-10 - CPU - Standard | 4569 | 6631
jpegxl: PNG - 8 | 1.00 | 1.02
securemark: SecureMark-TLS | 249582 | 247747
hpcg: | 37.1011 | 19.1088
luaradio: Complex Phase | 623.8 | 628.9
luaradio: Hilbert Transform | 98.4 | 98.6
luaradio: FM Deemphasis Filter | 364.3 | 365.4
luaradio: Five Back to Back FIR Filters | 1211.6 | 1189.0
blender: Barbershop - CPU-Only | 171.52 | 299.30
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 2754.45 | 1253.50
mlpack: scikit_qda | 31.84 | 29.19
build-linux-kernel: allmodconfig | 160.463 | 272.609
tnn: CPU - DenseNet | 3065.920 | 3107.102
openfoam: drivaerFastback, Small Mesh Size - Execution Time | 281.7 | 633.93
openfoam: drivaerFastback, Small Mesh Size - Mesh Time | 124.96 | 139.89
build-llvm: Unix Makefiles | 178.675 | 218.445
numpy: | 469.38 | 464.56
luxcorerender: Danish Mood - CPU | 7.50 | 6.20
luxcorerender: LuxCore Benchmark - CPU | 7.63 | 6.84
compress-lz4: 3 - Decompression Speed | 10743.2 | 14123.3
compress-lz4: 3 - Compression Speed | 54.18 | 55.43
openssl: SHA256 | 134629156527 | 68101809853
build-gem5: Time To Compile | 159.015 | 176.430
appleseed: Material Tester | 336.583119 | 152.394204
webp2: Quality 95, Compression Effort 7 | 0.31 | 0.24
ospray-studio: 3 - 4K - 32 - Path Tracer | 52785 | 100051
build-nodejs: Time To Compile | 113.466 | 164.001
pgbench: 100 - 500 - Read Only - Average Latency | 0.248 | 0.365
pgbench: 100 - 500 - Read Only | 2012722 | 1370935
pgbench: 100 - 250 - Read Only - Average Latency | 0.126 | 0.193
pgbench: 100 - 250 - Read Only | 1983155 | 1297684
ospray-studio: 3 - 4K - 16 - Path Tracer | 26510 | 47134
ngspice: C2670 | 137.362 | 136.020
build-llvm: Ninja | 105.032 | 159.575
vpxenc: Speed 5 - Bosphorus 4K | 13.86 | 15.86
luxcorerender: Orange Juice - CPU | 18.60 | 12.48
cassandra: Writes | 209921 | 250596
clickhouse: 100M Rows Web Analytics Dataset, Third Run | 396.82 | 394.98
clickhouse: 100M Rows Web Analytics Dataset, Second Run | 387.01 | 392.92
clickhouse: 100M Rows Web Analytics Dataset, First Run / Cold Cache | 377.52 | 378.44
ospray-studio: 3 - 4K - 1 - Path Tracer | 1646 | 2922
ospray-studio: 2 - 4K - 32 - Path Tracer | 45460 | 86081
onnx: GPT-2 - CPU - Standard | 7878 | 9368
onnx: bertsquad-12 - CPU - Standard | 668 | 713
ospray-studio: 1 - 4K - 32 - Path Tracer | 44624 | 84968
ospray-studio: 2 - 4K - 1 - Path Tracer | 1424 | 2496
ospray-studio: 2 - 4K - 16 - Path Tracer | 22742 | 40058
ospray-studio: 1 - 4K - 1 - Path Tracer | 1379 | 2463
ospray-studio: 1 - 4K - 16 - Path Tracer | 22158 | 39568
renaissance: Genetic Algorithm Using Jenetics + Futures | 2314.9 | 2243.2
npb: EP.D | 9109.24 | 4668.96
renaissance: In-Memory Database Shootout | 6136.4 | 5009.2
renaissance: Finagle HTTP Requests | 10535.4 | 6720.8
ngspice: C7552 | 105.951 | 103.387
etcd: RANGE - 100 - 100 - Average Latency | 2.7 | 1.3
etcd: RANGE - 100 - 100 | 37230.3273 | 79219.6509
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 7473.91 | 2984.47
jpegxl: PNG - 7 | 10.74 | 11.11
appleseed: Emily | 151.531071 | 133.04347
openvino: Face Detection FP16 - CPU | 3597.47 | 3555.78
openvino: Face Detection FP16 - CPU | 17.59 | 8.97
simdjson: DistinctUserID | 4.42 | 4.40
simdjson: TopTweet | 4.39 | 4.39
apache: 1000 | 84767.73 | 109150.30
helsing: 14 digit | 62.129 | 120.999
simdjson: PartialTweets | 3.84 | 3.86
apache: 500 | 91255.79 | 115006.99
nginx: 500 | 90312.00 | 173899.19
nginx: 1000 | 94018.12 | 174524.79
sysbench: CPU | 500125.79 | 252440.57
build-python: Released Build, PGO + LTO Optimized | 261.337 | 261.365
etcd: PUT - 100 - 100 - Average Latency | 2.6 | 1.3
etcd: PUT - 100 - 100 | 38209.4202 | 78829.5888
onednn: Recurrent Neural Network Training - f32 - CPU | 7531.12 | 2943.08
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 7435.34 | 2950.43
etcd: RANGE - 500 - 100 - Average Latency | 2.6 | 1.2
etcd: RANGE - 500 - 100 | 38506.6051 | 82098.4178
etcd: PUT - 500 - 100 - Average Latency | 2.6 | 1.2
etcd: PUT - 500 - 100 | 38992.8158 | 81895.9623
etcd: PUT - 100 - 1000 - Average Latency | 23.7 | 12.4
etcd: PUT - 100 - 1000 | 41860.4825 | 79978.1905
etcd: RANGE - 100 - 1000 - Average Latency | 23.4 | 12.4
etcd: RANGE - 100 - 1000 | 42305.6183 | 80131.7958
ebizzy: | 453258 | 246494
node-web-tooling: | 10.50 | 10.60
onednn: Recurrent Neural Network Inference - f32 - CPU | 2840.33 | 1266.20
etcd: PUT - 500 - 1000 - Average Latency | 22.8 | 11.9
etcd: PUT - 500 - 1000 | 43696.7618 | 83927.6558
etcd: RANGE - 500 - 1000 - Average Latency | 22.6 | 11.8
etcd: RANGE - 500 - 1000 | 44040.7135 | 84602.7803
blender: Pabellon Barcelona - CPU-Only | 53.88 | 96.30
openvino: Person Detection FP16 - CPU | 4777.15 | 4601.78
openvino: Person Detection FP16 - CPU | 13.10 | 6.82
compress-lz4: 9 - Decompression Speed | 10851.4 | 14156.7
compress-lz4: 9 - Compression Speed | 52.51 | 53.89
openvino: Person Detection FP32 - CPU | 4774.37 | 4600.94
openvino: Person Detection FP32 - CPU | 13.09 | 6.81
dragonflydb: 50 - 1:5 | 724442.87 | 3599739.57
dragonflydb: 50 - 1:1 | 724466.22 | 3382696.79
dragonflydb: 50 - 5:1 | 724635.93 | 3244708.36
openvino: Face Detection FP16-INT8 - CPU | 1388.41 | 1293.51
openvino: Face Detection FP16-INT8 - CPU | 45.84 | 24.59
pyperformance: python_startup | 7.71 | 7.64
pjsip: INVITE | 4692 | 4690
pjsip: OPTIONS, Stateful | 8819 | 9115
openvino: Machine Translation EN To DE FP16 - CPU | 292.82 | 274.57
openvino: Machine Translation EN To DE FP16 - CPU | 218.13 | 116.33
openvino: Person Vehicle Bike Detection FP16 - CPU | 22.16 | 20.82
openvino: Person Vehicle Bike Detection FP16 - CPU | 2884.44 | 1535.08
luxcorerender: DLSC - CPU | 12.23 | 8.33
openvino: Vehicle Detection FP16-INT8 - CPU | 20.38 | 18.93
openvino: Vehicle Detection FP16-INT8 - CPU | 3136.14 | 1689.18
openvino: Vehicle Detection FP16 - CPU | 35.75 | 34.17
openvino: Vehicle Detection FP16 - CPU | 1788.34 | 935.58
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU | 2.11 | 1.06
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU | 57763.89 | 49691.03
openvino: Weld Porosity Detection FP16-INT8 - CPU | 28.05 | 25.97
openvino: Weld Porosity Detection FP16-INT8 - CPU | 4559.42 | 2462.40
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 3.27 | 2.03
openvino: Age Gender Recognition Retail 0013 FP16 - CPU | 37561.07 | 30742.17
openvino: Weld Porosity Detection FP16 - CPU | 33.55 | 31.02
openvino: Weld Porosity Detection FP16 - CPU | 1905.51 | 1030.76
rocksdb: Update Rand | 304654 | 358336
rocksdb: Read Rand Write Rand | 2922167 | 3561841
rocksdb: Read While Writing | 14564673 | 9182167
graphics-magick: Sharpen | 779 | 511
openssl: RSA4096 | 1638050.5 | 825904.1
openssl: RSA4096 | 25009.6 | 12613.1
graphics-magick: Enhanced | 1344 | 941
rocksdb: Rand Read | 480175332 | 242074798
graphics-magick: Rotate | 730 | 746
graphics-magick: HWB Color Space | 1093 | 1303
etcpak: Single-Threaded - ETC2 | 229.637 | 230.432
coremark: CoreMark Size 666 - Iterations Per Second | 4105747.139772 | 2045229.645896
blender: Classroom - CPU-Only | 40.89 | 77.14
gpaw: Carbon Nanotube | 43.857 | 71.658
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 19.44 | 12.00
simdjson: Kostya | 2.9 | 2.9
build2: Time To Compile | 53.232 | 58.956
aom-av1: Speed 10 Realtime - Bosphorus 4K | 57.04 | 63.90
natron: Spaceship | 1.9 | 6.0
rodinia: OpenMP Leukocyte | 47.284 | 43.392
svt-hevc: 1 - Bosphorus 4K | 15.08 | 9.91
build-wasmer: Time To Compile | 51.869 | 51.965
simdjson: LargeRand | 1 | 1
srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM | 143.6 | 143.6
srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM | 392.3 | 396.0
minife: Small | 24664.8 | 21875.1
luxcorerender: Rainbow Colors and Prism - CPU | 16.99 | 21.13
mlpack: scikit_ica | 46.40 | 45.27
askap: tConvolve MPI - Gridding | 43735.8 | 21943.0
askap: tConvolve MPI - Degridding | 39742.0 | 20252.1
webp2: Quality 75, Compression Effort 7 | 0.55 | 0.48
compress-zstd: 19 - Decompression Speed | 3473.0 | 3401.6
compress-zstd: 19 - Compression Speed | 84.7 | 84.9
compress-zstd: 19, Long Mode - Decompression Speed | 3485.8 | 3460.3
compress-zstd: 19, Long Mode - Compression Speed | 39.9 | 47.4
build-linux-kernel: defconfig | 21.584 | 29.409
compress-7zip: Decompression Rating | 634086 | 354608
compress-7zip: Compression Rating | 516617 | 351077
kripke: | 143950267 | 270722527
jpegxl-decode: 1 | 66.93 | 67.62
webp: Quality 100, Lossless, Highest Compression | 0.56 | 0.56
build-godot: Time To Compile | 41.167 | 43.354
primesieve: 1e13 | 28.718 | 55.255
srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM | 141.0 | 141.4
srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM | 426.1 | 431.8
avifenc: 2 | 41.213 | 40.766
pyperformance: chaos | 95.8 | 96.5
gromacs: MPI CPU - water_GMX50_bare | 8.215 | 5.130
build-php: Time To Compile | 38.338 | 39.763
redis: GET - 1000 | 1364479.13 | 1990067.33
amg: | 1923306667 | 1012051667
appleseed: Disney Material | 50.464258 | 61.265637
kvazaar: Bosphorus 4K - Ultra Fast | 58.52 | 56.53
rodinia: OpenMP LavaMD | 26.740 | 46.547
srsran: OFDM_Test | 130333333 | 136633333
srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM | 133.6 | 133.7
srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM | 394.1 | 394.4
embree: Pathtracer - Asian Dragon | 68.8759 | 62.9814
embree: Pathtracer ISPC - Asian Dragon | 67.1736 | 56.8098
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 30.0116 | 12.5608
pyperformance: regex_compile | 155 | 156
pjsip: OPTIONS, Stateless | 67617 | 66602
libraw: Post-Processing Benchmark | 37.96 | 39.40
blender: Fishy Cat - CPU-Only | 22.25 | 38.85
npb: IS.D | 4690.06 | 2648.96
renaissance: Apache Spark Bayes | 638.1 | 585.6
aircrack-ng: | 149778.797 | 149666.943
encode-flac: WAV To FLAC | 18.133 | 18.129
x265: Bosphorus 4K | 21.48 | 26.69
synthmark: VoiceMark_100 | 748.057 | 747.308
quantlib: | 2808.8 | 2795.3
xmrig: Monero - 1M | 50749.6 | 28612.4
namd: ATPase Simulation - 327,506 Atoms | 0.26712 | 0.45457
pennant: leblancbig | 3.556570 | 6.133060
phpbench: PHP Benchmark Suite | 726003 | 733332
jpegxl: JPEG - 7 | 100.77 | 104.58
xmrig: Wownero - 1M | 41982.9 | 36467.2
mlpack: scikit_svm | 21.45 | 21.33
dacapobench: Tradebeans | 4568 | 3998
blender: BMW27 - CPU-Only | 17.19 | 30.43
lulesh: | 36456.229 | 19305.656
pyperformance: pickle_pure_python | 385 | 387
tnn: CPU - MobileNet v2 | 341.281 | 332.611
cython-bench: N-Queens | 23.312 | 23.384
npb: SP.C | 116527.63 | 51921.71
pybench: Total For Average Test Times | 953 | 959
node-express-loadtest: | 6198 | 6367
srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM | 62.2 | 62.4
srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM | 127.9 | 128.9
oidn: RT.hdr_alb_nrm.3840x2160 | 2.26 | 1.40
npb: BT.C | 235808.54 | 127998.61
svt-hevc: 10 - Bosphorus 4K | 195.62 | 223.66
oidn: RT.ldr_alb_nrm.3840x2160 | 2.26 | 1.41
build-apache: Time To Compile | 20.521 | 20.100
askap: Hogbom Clean OpenMP | 319.756 | 520.840
liquid-dsp: 256 - 256 - 57 | 5350900000 | 2767166667
kvazaar: Bosphorus 4K - Very Fast | 54.24 | 36.71
liquid-dsp: 128 - 256 - 57 | 5085066667 | 2706900000
liquid-dsp: 64 - 256 - 57 | 3191533333 | 2577666667
liquid-dsp: 32 - 256 - 57 | 1614266667 | 1605566667
npb: SP.B | 142675.55 | 89950.76
srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM | 149.6 | 149.6
srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM | 427.4 | 430.5
build-mesa: Time To Compile | 18.100 | 20.610
tnn: CPU - SqueezeNet v1.1 | 273.848 | 273.481
build-ffmpeg: Time To Compile | 14.271 | 18.146
jpegxl-decode: All | 569.97 | 731.88
basis: UASTC Level 3 | 11.402 | 15.385
sysbench: RAM / Memory | 7260.66 | 9990.12
npb: LU.C | 259899.49 | 135917.43
svt-av1: Preset 12 - Bosphorus 4K | 179.550 | 184.711
aom-av1: Speed 9 Realtime - Bosphorus 4K | 55.62 | 65.49
svt-av1: Preset 8 - Bosphorus 4K | 68.797 | 70.783
astcenc: Exhaustive | 6.4499 | 3.2659
webp: Quality 100, Lossless | 1.42 | 1.40
embree: Pathtracer ISPC - Crown | 82.9263 | 47.5583
jpegxl: JPEG - 8 | 28.92 | 29.24
povray: Trace Time | 7.590 | 10.680
mt-dgemm: Sustained Floating-Point Rate | 32.071227 | 20.880271
basis: UASTC Level 2 | 8.773 | 10.615
embree: Pathtracer - Crown | 89.6006 | 51.8334
astcenc: Thorough | 58.4744 | 30.4554
encode-mp3: WAV To MP3 | 7.490 | 7.472
draco: Church Facade | 7525 | 7411
npb: CG.C | 45587.10 | 24085.73
onednn: Convolution Batch Shapes Auto - f32 - CPU | 0.640833 | 0.934706
pennant: sedovbig | 5.675895 | 11.42723
m-queens: Time To Solve | 6.376 | 12.238
avifenc: 6, Lossless | 7.334 | 7.184
draco: Lion | 5735 | 5613
webp: Quality 100, Highest Compression | 3.47 | 3.48
rodinia: OpenMP CFD Solver | 6.283 | 6.845
svt-av1: Preset 10 - Bosphorus 4K | 123.806 | 131.270
npb: FT.C | 116679.26 | 60659.25
basis: UASTC Level 0 | 6.379 | 6.221
webp2: Default | 9.70 | 9.00
octave-benchmark: | 6.545 | 6.582
dacapobench: Jython | 4106 | 4033
svt-hevc: 7 - Bosphorus 4K | 144.70 | 137.74
avifenc: 10, Lossless | 5.558 | 5.398
astcenc: Medium | 380.6467 | 236.6020
tnn: CPU - SqueezeNet v2 | 65.894 | 65.617
npb: EP.C | 8338.27 | 4406.02
etcpak: Multi-Threaded - ETC2 | 6678.565 | 6702.518
avifenc: 6 | 3.957 | 3.846
npb: MG.C | 100740.89 | 57329.70
webp2: Quality 100, Compression Effort 5 | 6.41 | 10.85
webp: Quality 100 | 10.65 | 10.69
webp: Default | 16.75 | 16.78
build-python: Default | 15.39 | 15.708
ctx-clock: Context Switch Time | 120 | 120
blake2: | 3.39 | 3.39
WRF
WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.
OpenFOAM
OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenFOAM 9 - Input: drivaerFastback, Large Mesh Size - Mesh Time (Seconds, Fewer Is Better)
EPYC 7713 2P: 776.38
EPYC 7713: 888.82
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
SPECjbb 2015
This is a benchmark of SPECjbb 2015. For this test profile to work, you must have a valid license/copy of the SPECjbb 2015 ISO (SPECjbb2015-1.02.iso) in your Phoronix Test Suite download cache. Learn more via the OpenBenchmarking.org test page.
NWChem
NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
RELION
RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Quantum ESPRESSO
Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.
oneDNN
This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.
PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Write (TPS, More Is Better)
EPYC 7713: 73527 (SE +/- 106.66, N = 3)
EPYC 7713 2P: 16633 (SE +/- 294.20, N = 12)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 500 - Mode: Read Write (TPS, More Is Better)
EPYC 7713: 62654 (SE +/- 172.39, N = 3)
EPYC 7713 2P: 14261 (SE +/- 133.03, N = 12)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
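The pgbench average-latency values listed in the results overview track these TPS figures closely: with every client kept busy, average latency is roughly the client count divided by throughput. The sketch below is a back-of-the-envelope check under that assumption; `approx_latency_ms` is a hypothetical helper name, and results averaged over many runs (such as the N = 12 EPYC 7713 2P numbers) may not match the reported latency exactly.

```python
def approx_latency_ms(clients: int, tps: float) -> float:
    """Estimate average latency in milliseconds, assuming all clients are always busy."""
    if clients <= 0 or tps <= 0:
        raise ValueError("clients and tps must be positive")
    return clients / tps * 1000.0


if __name__ == "__main__":
    # (label, clients, TPS) taken from the Read Write results in this file.
    runs = [
        ("EPYC 7713, 250 clients", 250, 73527),
        ("EPYC 7713, 500 clients", 500, 62654),
        ("EPYC 7713 2P, 250 clients", 250, 16633),
        ("EPYC 7713 2P, 500 clients", 500, 14261),
    ]
    for label, clients, tps in runs:
        print(f"{label}: ~{approx_latency_ms(clients, tps):.3f} ms")
```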
Graph500
This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
Graph500 3.0 - Scale: 26 (sssp median_TEPS, More Is Better)
EPYC 7713 2P: 302338000
EPYC 7713: 254477000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0 - Scale: 26 (bfs max_TEPS, More Is Better)
EPYC 7713 2P: 659467000
EPYC 7713: 642278000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0 - Scale: 26 (bfs median_TEPS, More Is Better)
EPYC 7713 2P: 642516000
EPYC 7713: 622681000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
ONNX Runtime
ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
JPEG XL libjxl
The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.
SecureMark
SecureMark is an objective, standardized benchmarking framework developed by EEMBC for measuring the efficiency of cryptographic processing solutions. SecureMark-TLS benchmarks Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.
High Performance Conjugate Gradient
HPCG is the High Performance Conjugate Gradient, a newer scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
LuaRadio
LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.
LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters (MiB/s, More Is Better)
EPYC 7713 2P: 1211.6 (SE +/- 4.33, N = 3)
EPYC 7713: 1189.0 (SE +/- 13.28, N = 3)
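When two results are this close, the reported standard errors give a rough sense of whether the gap exceeds run-to-run noise. The sketch below is one coarse way to express that, assuming the runs are independent and combining the two standard errors in quadrature; `gap_in_se_units` is a hypothetical helper, not something the Phoronix Test Suite computes, and with N = 3 per system the figure is only indicative.

```python
import math


def gap_in_se_units(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Return the difference of two means divided by their combined standard error.

    Assumes independent measurements; the combined SE is sqrt(se_a^2 + se_b^2).
    """
    combined = math.hypot(se_a, se_b)
    if combined == 0:
        raise ValueError("at least one standard error must be non-zero")
    return (mean_a - mean_b) / combined


if __name__ == "__main__":
    # LuaRadio Five Back to Back FIR Filters, MiB/s, values from this file.
    gap = gap_in_se_units(1211.6, 4.33, 1189.0, 13.28)
    print(f"gap = {gap:.2f} combined standard errors")
```

A gap of only one or two combined standard errors suggests the two systems are effectively tied on this test.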
Timed Linux Kernel Compilation
This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenFOAM 9 - Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better)
EPYC 7713 2P: 124.96
EPYC 7713: 139.89
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
LuxCoreRender
LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation
This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.