Benchmarks for a future article. AMD EPYC 8534PN 64-Core testing with a AMD Cinnabar (RCB1009C BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.
Zen 1 - EPYC 7601 Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads), Motherboard: TYAN B8026T70AE24HR (V1.02.B10 BIOS), Chipset: AMD 17h, Memory: 128GB, Disk: 280GB INTEL SSDPE21D280GA + 1000GB INTEL SSDPE2KX010T8, Graphics: llvmpipe, Monitor: VE228, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.6.9-060609-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 256 bits), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x800126eJava Notes: OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Zen 4C - EPYC 8534PN Processor: AMD EPYC 8534PN 64-Core @ 2.00GHz (64 Cores / 128 Threads) , Motherboard: AMD Cinnabar (RCB1009C BIOS) , Chipset: AMD Device 14a4 , Memory: 6 x 32 GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG , Disk: 1000GB INTEL SSDPE2KX010T8 , Graphics: ASPEED , Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.6.9-060609-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 256 bits), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212Java Notes: OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD EPYC Zen Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads) TYAN B8026T70AE24HR (V1.02.B10 BIOS) AMD 17h 128GB 280GB INTEL SSDPE21D280GA + 1000GB INTEL SSDPE2KX010T8 llvmpipe VE228 2 x Broadcom NetXtreme BCM5720 PCIe Ubuntu 23.10 6.6.9-060609-generic (x86_64) GNOME Shell 45.0 X Server 1.21.1.7 4.5 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 256 bits) GCC 13.2.0 ext4 1920x1080 AMD EPYC 8534PN 64-Core @ 2.00GHz (64 Cores / 128 Threads) AMD Cinnabar (RCB1009C BIOS) AMD Device 14a4 6 x 32 GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG 1000GB INTEL SSDPE2KX010T8 ASPEED 1920x1200 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Zen 1 - EPYC 7601: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x800126e - Zen 4C - EPYC 8534PN: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212 Java Details - OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10) Python Details - Python 3.11.6 Security Details - Zen 1 - EPYC 7601: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - Zen 4C - EPYC 8534PN: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Zen 1 - EPYC 7601 vs. Zen 4C - EPYC 8534PN Comparison Phoronix Test Suite Baseline +588.7% +588.7% +1177.4% +1177.4% +1766.1% +1766.1% 881.3% 726.8% 726.8% 720.4% 720.4% 649.8% 607.4% 589% 588.6% 581.4% 570.9% 553.6% 550.2% 521.7% 505.5% 473.6% 459% 455.2% 452.1% 440.9% 428.6% 412.3% 411.9% 396.6% 373.5% 357.7% 356.8% 331.9% 326.9% 314.7% 308.9% 305% 303.3% 283.3% 281.1% 280.8% 280.8% 269.8% 256.9% 255.5% 253.5% 251.4% 250.6% 243.6% 241.8% 241.6% 239.6% 235% 231.5% 226.5% 225.3% 223.3% 221.1% 219.2% 215.9% 213.6% 212.6% 207.8% 202.5% 202.1% 201.9% 2354.6% 198.7% 197.5% 195.1% 194.1% 190.1% 189.1% 188.9% 186.1% 185.8% 185.6% 183.9% 179.9% 177.6% 175.9% 169.7% 165.7% 164.6% 164.2% 160.6% 156% 154.8% 153.9% 150.2% 150.1% 147.8% 145.4% 141.2% 136.1% 134.7% 131.3% 127.6% 125.5% 123.5% 123.3% 123.3% 123.1% 120.3% 118% 110.7% 105.7% 105.2% 102.8% 1300.8% 1298% 1128.6% 907.5% 97.4% 95.4% 90.8% 89.7% 89.3% 87.5% 82.9% 79.5% 79.2% 78.7% 78.1% 78% 78% 77.6% 77.2% 77.2% 76.9% 75.9% 74.6% 73.9% 73.9% 73.5% 72.4% 70.7% 70.4% 69.8% 68.9% 68.8% 67.7% 67.5% 66.3% 65.7% 65.5% 64.5% 64.1% 63.7% 63.3% 62.8% 61.8% 61.1% 61% 60.7% 59.2% 59.1% 58.4% 57% 56.4% 53.7% 51.5% 47.8% 47.5% 47.5% 45.5% 45.2% 45.1% 43.6% 42.8% 38.8% 38.3% 37.7% 32.8% 32% 30.1% 28.2% 27.7% 25% 24.8% 24.2% 23.9% 17.8% 7.7% V.D.F.I - CPU OpenMP - BM2 OpenMP - BM2 OpenMP - BM1 OpenMP - BM1 M.T.E.T.D.F - CPU W.P.D.F.I - CPU ChaCha20-Poly1305 C.S.9.P.Y.P - A.M.S gravity_spheres_volume/dim_512/scivis/real_time gravity_spheres_volume/dim_512/ao/real_time R.S.A.F.I - CPU H.E.R.F.I - CPU ChaCha20 S.F.P.R A.G.R.R.0.F.I - CPU RSA4096 N.T.C.B.b.u.S.S.I - A.M.S P.V.B.D.F - CPU AES-128-GCM B.L.N.Q.A.S.I - A.M.S AES-256-GCM CPU - 16 - ResNet-50 P.D.F - CPU gravity_spheres_volume/dim_512/pathtracer/real_time 3 - 4K - 1 - Path Tracer - CPU 1 - 4K - 1 - Path Tracer - CPU 3 - 4K - 32 - Path Tracer - CPU 1 - 4K - 32 - Path Tracer - CPU 3 - 4K - 16 - Path Tracer - CPU GhostRider - 1M A.G.R.R.0.F.I - CPU 1 - 4K - 16 - Path Tracer - CPU RT.ldr_alb_nrm.3840x2160 - CPU-Only Pathtracer ISPC - Asian Dragon Wownero - 1M e.G.B.S - 1200 Pathtracer ISPC - Crown Rand Read N.T.C.D.m - A.M.S B.L.N.Q.A - A.M.S F.D.F.I - CPU Rand Read C.S.9.P.Y.P - A.M.S N.T.C.B.b.u.c - A.M.S N.D.C.o.b.u.o.I - A.M.S R.5.B - A.M.S C.C.R.5.I - A.M.S 1B particle_volume/scivis/real_time H.E.R.F.I - CPU particle_volume/ao/real_time e.G.B.S - 2400 Barbershop - CPU-Only SHA512 C.D.Y.C.S.I - A.M.S C.D.Y.C - A.M.S 500M RSA4096 D.R CPU R.5.S.I - A.M.S KawPow - 1M C.F.U - 1M SHA256 Monero - 1M CryptoNight-Heavy - 1M Preset 13 - Bosphorus 4K Compression Rating Fishy Cat - CPU-Only CPU - Supercar Pabellon Barcelona - CPU-Only Classroom - CPU-Only CPU - Bedroom N.T.C.B.b.u.S.S.I - A.M.S Preset 12 - Bosphorus 4K BMW27 - CPU-Only Multi-Threaded 1000 B.L.N.Q.A.S.I - A.M.S Redis - 100 - 1:10 Preset 8 - Bosphorus 4K 500 Redis - 100 - 1:5 Ninja ATPase Simulation - 327,506 Atoms V.D.F.I - CPU allmodconfig Time To Compile Bosphorus 4K - Very Fast Bosphorus 4K - Super Fast d.S.M.S - Execution Time Bosphorus 4K - Medium Carbon Nanotube 20k Atoms libx265 - Platform libx265 - Video On Demand libx265 - Upload Time To Compile Preset 4 - Bosphorus 4K Read While Writing Bosphorus 4K - Ultra Fast Read While Writing Unix Makefiles F.D.F.I - CPU W.P.D.F.I - CPU R.5.S.I - A.M.S F.D.R.F.I - CPU libx265 - Live clover_bm16 Bosphorus 4K - Fast Bosphorus 4K clover_bm64_short M.T.E.T.D.F - CPU defconfig R.R.W.R i.i.1.C.P.D Time To Compile Bosphorus 4K - Faster N.T.C.D.m - A.M.S B.L.N.Q.A - A.M.S 500 - 100 - 800 - 100 500 - 100 - 800 - 100 10 Update Rand R.R.W.R 800 - 100 - 500 - 100 500 - 100 - 800 - 400 800 - 100 - 500 - 100 d.M.M.S - Execution Time CPU - 256 - ResNet-50 N.D.C.o.b.u.o.I - A.M.S N.T.C.B.b.u.c - A.M.S R.5.B - A.M.S CPU - 1 - ResNet-50 particle_volume/pathtracer/real_time Chrysler Neon 1M C.C.R.5.I - A.M.S Update Rand 500 - 100 - 800 - 400 6 Writes CORAL2 P1 R.S.A.F.I - CPU 500 - 100 - 500 - 400 800 - 100 - 500 - 400 800 - 100 - 500 - 400 500 - 100 - 500 - 100 500 - 100 - 500 - 100 5 IMDB P.P.A C.D.Y.C.S.I - A.M.S C.D.Y.C - A.M.S 500 - 100 - 500 - 400 1 1000 800 - 100 - 200 - 100 800 - 100 - 200 - 400 X.b.i.i d.M.M.S - Mesh Time 800 - 100 - 200 - 100 d.S.M.S - Mesh Time CTS2 P.V.B.D.F - CPU 800 - 100 - 200 - 400 TPC-H Parquet 500 - 100 - 200 - 100 500 - 100 - 200 - 400 500 - 100 - 200 - 100 F.D.R.F.I - CPU 800 - 100 - 800 - 100 800 - 100 - 800 - 100 500 - 100 - 200 - 400 P.D.F - CPU 800 - 100 - 800 - 400 800 - 100 - 800 - 400 CORAL2 P2 OpenVINO miniBUDE miniBUDE miniBUDE miniBUDE OpenVINO OpenVINO OpenSSL Neural Magic DeepSparse OSPRay OSPRay OpenVINO OpenVINO OpenSSL ACES DGEMM OpenVINO OpenSSL Neural Magic DeepSparse OpenVINO OpenSSL Neural Magic DeepSparse OpenSSL TensorFlow OpenVINO OSPRay OSPRay Studio OSPRay Studio OSPRay Studio OSPRay Studio OSPRay Studio Xmrig OpenVINO OSPRay Studio Intel Open Image Denoise Embree Xmrig easyWave Embree RocksDB Neural Magic DeepSparse Neural Magic DeepSparse OpenVINO Speedb Neural Magic DeepSparse Neural Magic DeepSparse Neural Magic DeepSparse Neural Magic DeepSparse Neural Magic DeepSparse Y-Cruncher OSPRay OpenVINO OSPRay easyWave Blender OpenSSL Neural Magic DeepSparse Neural Magic DeepSparse Y-Cruncher OpenSSL 7-Zip Compression Chaos Group V-RAY Neural Magic DeepSparse Xmrig Xmrig OpenSSL Xmrig Xmrig SVT-AV1 7-Zip Compression Blender IndigoBench Blender Blender IndigoBench Neural Magic DeepSparse SVT-AV1 Blender QuantLib nginx Neural Magic DeepSparse Redis 7.0.12 + memtier_benchmark SVT-AV1 nginx Redis 7.0.12 + memtier_benchmark Timed LLVM Compilation NAMD OpenVINO Timed Linux Kernel Compilation Timed Node.js Compilation uvg266 uvg266 OpenFOAM uvg266 GPAW LAMMPS Molecular Dynamics Simulator FFmpeg FFmpeg FFmpeg Timed FFmpeg Compilation SVT-AV1 Speedb uvg266 RocksDB Timed LLVM Compilation OpenVINO OpenVINO Neural Magic DeepSparse OpenVINO FFmpeg CloverLeaf VVenC x265 CloverLeaf OpenVINO Timed Linux Kernel Compilation Speedb Xcompact3d Incompact3d Timed Gem5 Compilation VVenC Neural Magic DeepSparse Neural Magic DeepSparse Apache IoTDB Apache IoTDB rav1e Speedb RocksDB Apache IoTDB Apache IoTDB Apache IoTDB OpenFOAM PyTorch Neural Magic DeepSparse Neural Magic DeepSparse Neural Magic DeepSparse PyTorch OSPRay OpenRadioss Neural Magic DeepSparse RocksDB Apache IoTDB rav1e Apache Cassandra Quicksilver OpenVINO Apache IoTDB Apache IoTDB Kripke Apache IoTDB Apache IoTDB Apache IoTDB rav1e DuckDB Timed MrBayes Analysis Neural Magic DeepSparse Neural Magic DeepSparse Apache IoTDB rav1e Apache HTTP Server Apache IoTDB Apache IoTDB Xcompact3d Incompact3d OpenFOAM Apache IoTDB OpenFOAM Quicksilver OpenVINO Apache IoTDB DuckDB Apache IoTDB Apache IoTDB Apache IoTDB OpenVINO Apache IoTDB Apache IoTDB Apache IoTDB OpenVINO Apache IoTDB Apache IoTDB Quicksilver Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN
AMD EPYC Zen nginx: 500 nginx: 1000 openssl: RSA4096 openssl: RSA4096 openssl: SHA256 openssl: SHA512 openssl: AES-128-GCM openssl: AES-256-GCM openssl: ChaCha20 openssl: ChaCha20-Poly1305 apache: 1000 apache-iotdb: 500 - 100 - 200 - 100 apache-iotdb: 500 - 100 - 200 - 100 apache-iotdb: 500 - 100 - 200 - 400 apache-iotdb: 500 - 100 - 200 - 400 apache-iotdb: 500 - 100 - 500 - 100 apache-iotdb: 500 - 100 - 500 - 100 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 500 - 400 apache-iotdb: 500 - 100 - 800 - 100 apache-iotdb: 500 - 100 - 800 - 100 apache-iotdb: 500 - 100 - 800 - 400 apache-iotdb: 500 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 200 - 100 apache-iotdb: 800 - 100 - 200 - 100 apache-iotdb: 800 - 100 - 200 - 400 apache-iotdb: 800 - 100 - 200 - 400 apache-iotdb: 800 - 100 - 500 - 100 apache-iotdb: 800 - 100 - 500 - 100 apache-iotdb: 800 - 100 - 500 - 400 apache-iotdb: 800 - 100 - 500 - 400 apache-iotdb: 800 - 100 - 800 - 100 apache-iotdb: 800 - 100 - 800 - 100 apache-iotdb: 800 - 100 - 800 - 400 apache-iotdb: 800 - 100 - 800 - 400 duckdb: IMDB duckdb: TPC-H Parquet cassandra: Writes speedb: Rand Read speedb: Read While Writing speedb: Read Rand Write Rand speedb: Update Rand rocksdb: Rand Read rocksdb: Read While Writing rocksdb: Read Rand Write Rand rocksdb: Update Rand memtier-benchmark: Redis - 100 - 1:10 memtier-benchmark: Redis - 100 - 1:5 compress-7zip: Compression Rating compress-7zip: Decompression Rating build-linux-kernel: defconfig build-linux-kernel: allmodconfig build-llvm: Ninja build-llvm: Unix Makefiles build-nodejs: Time To Compile build-gem5: Time To Compile build-ffmpeg: Time To Compile quantlib: Multi-Threaded specfem3d: Layered Halfspace specfem3d: Water-layered Halfspace specfem3d: Homogeneous Halfspace specfem3d: Mount St. Helens specfem3d: Tomographic Model openfoam: drivaerFastback, Small Mesh Size - Mesh Time openfoam: drivaerFastback, Small Mesh Size - Execution Time openfoam: drivaerFastback, Medium Mesh Size - Mesh Time openfoam: drivaerFastback, Medium Mesh Size - Execution Time openradioss: Chrysler Neon 1M minibude: OpenMP - BM1 minibude: OpenMP - BM1 minibude: OpenMP - BM2 minibude: OpenMP - BM2 easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 gpaw: Carbon Nanotube lammps: 20k Atoms cloverleaf: clover_bm64_short cloverleaf: clover_bm16 kripke: incompact3d: input.i3d 193 Cells Per Direction incompact3d: X3D-benchmarking input.i3d namd: ATPase Simulation - 327,506 Atoms gromacs: MPI CPU - water_GMX50_bare quicksilver: CORAL2 P1 quicksilver: CORAL2 P2 quicksilver: CTS2 y-cruncher: 500M y-cruncher: 1B mt-dgemm: Sustained Floating-Point Rate mrbayes: Primate Phylogeny Analysis blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Pabellon Barcelona - CPU-Only blender: Barbershop - CPU-Only embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Crown oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 16 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU ospray-studio: 3 - 4K - 32 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/pathtracer/real_time ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time v-ray: CPU indigobench: CPU - Supercar indigobench: CPU - Bedroom ffmpeg: libx265 - Live ffmpeg: libx265 - Upload ffmpeg: libx265 - Platform ffmpeg: libx265 - Video On Demand svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 4 - Bosphorus 4K rav1e: 10 rav1e: 6 rav1e: 5 rav1e: 1 x265: Bosphorus 4K uvg266: Bosphorus 4K - Medium uvg266: Bosphorus 4K - Very Fast uvg266: Bosphorus 4K - Super Fast uvg266: Bosphorus 4K - Ultra Fast vvenc: Bosphorus 4K - Fast vvenc: Bosphorus 4K - Faster openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream tensorflow: CPU - 16 - ResNet-50 pytorch: CPU - 1 - ResNet-50 pytorch: CPU - 256 - ResNet-50 xmrig: Monero - 1M xmrig: Wownero - 1M xmrig: CryptoNight-Femto UPX2 - 1M xmrig: CryptoNight-Heavy - 1M xmrig: KawPow - 1M xmrig: GhostRider - 1M Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 103269.83 100664.35 4510.7 293924.1 27056830400 8316184170 97645009487 89948428697 47920025967 30315490690 88038.28 38712263 46.29 38176456 166.00 53759940 85.07 52849125 294.75 58774090 127.26 59444334 421.30 46049589 39.40 45921074 147.89 56993811 82.07 57037441 318.24 58403756 129.77 60237625 488.64 199.014 245.742 152887 86867041 6145679 1436074 201555 84798204 3371876 1509184 290618 1183656.97 1142464.16 123934 134417 76.300 755.017 435.054 529.717 387.652 388.023 39.789 64481.7 78.679904384 74.950625896 38.063179250 29.951886203 30.283593040 44.44531 126.1908 231.74446 1179.2541 499.30 356.122 14.245 355.501 14.220 148.077 352.687 142.165 13.872 108.60 971.50 187375933 37.4064191 818.054118 0.97959 1.992 12996667 15013333 11426667 15.693 33.923 3.790799 201.182 73.00 192.80 101.20 247.24 767.15 21.8315 18.3613 0.48 8478 145528 280606 10050 171188 331643 2.57829 2.46692 4.14841 5.28931 5.22116 98.4413 20175 8.580 3.991 58.23 10.39 21.03 21.04 67.934 70.525 26.252 3.060 6.971 2.937 2.249 0.563 14.52 8.73 22.27 23.44 27.21 3.291 6.493 3.82 2094.21 13011.10 2.43 43.91 181.92 377.40 84.68 295.80 27.01 471.38 16.95 41.32 193.33 877.49 9.1 189.05 169.04 149.49 53.47 260.2496 61.3934 90.6624 176.4317 144.6829 110.4484 10.7545 1457.9155 70.3395 226.9518 70.5586 226.3744 10.8070 1460.7041 9.2992 1701.3747 142.7964 111.8988 155.7019 102.6582 13.2597 1200.1003 131.7605 121.2093 9.82 26.72 21.21 7044.2 10414.7 6957.2 7081.0 6937.9 1070.1 263177.49 266322.55 25217.0 889214.2 79857401783 26271960547 528178897353 460814853190 297915350493 208877254247 130116.68 51396698 35.58 50396884 133.01 86530272 52.95 86278348 191.77 104173220 71.67 103397638 254.31 67937226 27.16 67712944 106.91 99531709 47.19 91869780 195.49 73031288 101.60 70984166 394.51 125.100 178.435 251434 304542944 12947507 2577564 356602 302674440 6918267 2654286 483387 3084431.37 2901182.23 358093 406021 41.709 307.653 173.868 261.161 160.725 217.126 18.061 171336.6 30.951247 54.559168 159.57429 679.78992 297.72 2921.653 116.866 2939.356 117.574 38.884 109.847 63.045 31.007 57.38 497.09 303107200 20.8747749 562.359884 0.39175 21326667 16176667 16316667 5.099 10.233 22.953315 126.969 27.07 67.91 35.37 86.57 240.35 83.1975 67.9033 1.84 1856 36085 65725 2196 41278 76785 17.2986 16.8089 19.6434 17.0993 17.0484 166.152 60911 24.522 11.172 114.95 23.18 46.96 46.98 196.423 194.547 67.209 6.672 12.355 4.861 3.581 0.853 27.54 19.87 52.59 55.01 55.98 6.280 11.564 53.51 595.92 74625.61 0.60 218.07 146.53 5276.11 11.97 2902.57 10.90 2602.27 12.21 309.82 103.12 8840.41 7.10 1229.16 51.96 977.07 32.67 1444.9067 22.1179 322.2813 99.1066 484.7326 65.9305 36.7555 855.7340 219.9119 145.1134 221.3018 144.1447 36.9206 855.9304 64.0361 495.1442 484.8697 65.9139 3821.9241 8.3560 46.8734 674.2608 696.5146 45.8818 50.27 45.13 36.56 20716.5 39663.3 20699.5 20541.8 20720.8 4376.0 OpenBenchmarking.org
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 5K 10K 15K 20K 25K SE +/- 8.66, N = 3 SE +/- 24.99, N = 3 4510.7 25217.0 1. (CC) gcc options: -pthread -m64 -O3 -ldl
Apache HTTP Server This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.
Apache IoTDB Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 11M 22M 33M 44M 55M SE +/- 524500.20, N = 3 SE +/- 567305.29, N = 3 38712263 51396698
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 11M 22M 33M 44M 55M SE +/- 325879.11, N = 8 SE +/- 524236.55, N = 3 38176456 50396884
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 175336.13, N = 3 SE +/- 262330.46, N = 3 53759940 86530272
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 507374.51, N = 3 SE +/- 286751.08, N = 3 52849125 86278348
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 85534.45, N = 3 SE +/- 102156.43, N = 3 58774090 104173220
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 470631.65, N = 3 SE +/- 417895.46, N = 3 59444334 103397638
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 200 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 15M 30M 45M 60M 75M SE +/- 405601.66, N = 3 SE +/- 89839.93, N = 3 46049589 67937226
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 200 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 15M 30M 45M 60M 75M SE +/- 330018.02, N = 3 SE +/- 292969.25, N = 3 45921074 67712944
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 197187.95, N = 3 SE +/- 349666.50, N = 3 56993811 99531709
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 20M 40M 60M 80M 100M SE +/- 596691.26, N = 5 SE +/- 1905882.84, N = 12 57037441 91869780
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 16M 32M 48M 64M 80M SE +/- 295389.20, N = 3 SE +/- 1143156.28, N = 12 58403756 73031288
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 15M 30M 45M 60M 75M SE +/- 138623.83, N = 3 SE +/- 878468.51, N = 3 60237625 70984166
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 10 20 30 40 50 44.45 30.95 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 50 100 150 200 250 231.74 159.57 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 600 1200 1800 2400 3000 SE +/- 0.94, N = 3 SE +/- 5.71, N = 6 356.12 2921.65 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 600 1200 1800 2400 3000 SE +/- 0.27, N = 3 SE +/- 2.46, N = 3 355.50 2939.36 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
FFmpeg This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 12 24 36 48 60 SE +/- 0.00, N = 3 SE +/- 0.20, N = 3 3.82 53.51 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 16K 32K 48K 64K 80K SE +/- 3.60, N = 3 SE +/- 275.21, N = 3 13011.10 74625.61 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 50 100 150 200 250 SE +/- 0.09, N = 3 SE +/- 0.21, N = 3 43.91 218.07 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 1100 2200 3300 4400 5500 SE +/- 0.02, N = 3 SE +/- 6.62, N = 3 377.40 5276.11 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 600 1200 1800 2400 3000 SE +/- 0.35, N = 3 SE +/- 5.14, N = 3 295.80 2902.57 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 600 1200 1800 2400 3000 SE +/- 1.63, N = 3 SE +/- 1.69, N = 3 471.38 2602.27 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 70 140 210 280 350 SE +/- 0.10, N = 3 SE +/- 0.68, N = 3 41.32 309.82 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 2K 4K 6K 8K 10K SE +/- 0.26, N = 3 SE +/- 14.71, N = 3 877.49 8840.41 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 300 600 900 1200 1500 SE +/- 0.70, N = 3 SE +/- 0.72, N = 3 189.05 1229.16 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 200 400 600 800 1000 SE +/- 0.14, N = 3 SE +/- 2.89, N = 3 149.49 977.07 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Neural Magic DeepSparse This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 300 600 900 1200 1500 SE +/- 1.56, N = 3 SE +/- 0.50, N = 3 260.25 1444.91
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 70 140 210 280 350 SE +/- 0.85, N = 6 SE +/- 0.25, N = 3 90.66 322.28
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 100 200 300 400 500 SE +/- 0.49, N = 3 SE +/- 0.29, N = 3 144.68 484.73
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 10.75 36.76
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 50 100 150 200 250 SE +/- 0.21, N = 3 SE +/- 0.36, N = 3 70.34 219.91
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 50 100 150 200 250 SE +/- 0.11, N = 3 SE +/- 0.29, N = 3 70.56 221.30
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 8 16 24 32 40 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 10.81 36.92
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 14 28 42 56 70 SE +/- 0.0191, N = 3 SE +/- 0.1281, N = 3 9.2992 64.0361
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 100 200 300 400 500 SE +/- 1.66, N = 3 SE +/- 0.12, N = 3 142.80 484.87
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 800 1600 2400 3200 4000 SE +/- 0.49, N = 3 SE +/- 7.35, N = 3 155.70 3821.92
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 13.26 46.87
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 150 300 450 600 750 SE +/- 1.27, N = 3 SE +/- 1.12, N = 3 131.76 696.51
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
Meta Performance Per Watts OpenBenchmarking.org Performance Per Watts, More Is Better Meta Performance Per Watts Performance Per Watts Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 500 1000 1500 2000 2500 2562.10 180.52
CPU Power Consumption Monitor OpenBenchmarking.org Watts CPU Power Consumption Monitor Phoronix Test Suite System Monitoring Zen 1 - EPYC 7601 Zen 4C - EPYC 8534PN 140 280 420 560 700 Min: 134.19 / Avg: 579.81 / Max: 789.58 Min: 7.2 / Avg: 112.32 / Max: 180
Zen 1 - EPYC 7601 Processor: AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads), Motherboard: TYAN B8026T70AE24HR (V1.02.B10 BIOS), Chipset: AMD 17h, Memory: 128GB, Disk: 280GB INTEL SSDPE21D280GA + 1000GB INTEL SSDPE2KX010T8, Graphics: llvmpipe, Monitor: VE228, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.6.9-060609-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 256 bits), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x800126eJava Notes: OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 6 January 2024 22:07 by user phoronix.
Zen 4C - EPYC 8534PN Processor: AMD EPYC 8534PN 64-Core @ 2.00GHz (64 Cores / 128 Threads), Motherboard: AMD Cinnabar (RCB1009C BIOS), Chipset: AMD Device 14a4, Memory: 6 x 32 GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG, Disk: 1000GB INTEL SSDPE2KX010T8, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.6.9-060609-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 256 bits), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212Java Notes: OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)Python Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 8 January 2024 13:44 by user phoronix.