9684x-march Benchmarks - OpenBenchmarking.org

Tests for a future article. 2 x AMD EPYC 9684X 96-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

PRE

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

a

Processor: 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe

OS: Ubuntu 23.10, Kernel: 6.5.0-25-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 640x480

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

Timed Mesa Compilation

This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

49 Results Shown

PyTorch:
CPU - 1 - ResNet-50
CPU - 1 - ResNet-152
CPU - 16 - ResNet-50
CPU - 32 - ResNet-50
CPU - 64 - ResNet-50
CPU - 16 - ResNet-152
CPU - 256 - ResNet-50
CPU - 32 - ResNet-152
CPU - 512 - ResNet-50
CPU - 64 - ResNet-152
CPU - 256 - ResNet-152
CPU - 512 - ResNet-152
CPU - 1 - Efficientnet_v2_l
CPU - 16 - Efficientnet_v2_l
CPU - 32 - Efficientnet_v2_l
CPU - 64 - Efficientnet_v2_l
CPU - 256 - Efficientnet_v2_l
CPU - 512 - Efficientnet_v2_l
TensorFlow:
CPU - 1 - AlexNet
CPU - 16 - AlexNet
CPU - 32 - AlexNet
CPU - 64 - AlexNet
CPU - 1 - GoogLeNet
CPU - 1 - ResNet-50
CPU - 256 - AlexNet
CPU - 512 - AlexNet
CPU - 16 - GoogLeNet
CPU - 16 - ResNet-50
CPU - 32 - GoogLeNet
CPU - 32 - ResNet-50
CPU - 64 - GoogLeNet
CPU - 64 - ResNet-50
CPU - 256 - GoogLeNet
CPU - 256 - ResNet-50
CPU - 512 - GoogLeNet
CPU - 512 - ResNet-50
RocksDB:
Overwrite
Rand Read
Update Rand
Read While Writing
Read Rand Write Rand
BRL-CAD
Timed Mesa Compilation
Blender:
BMW27 - CPU-Only
Junkshop - CPU-Only
Classroom - CPU-Only
Fishy Cat - CPU-Only
Barbershop - CPU-Only
Pabellon Barcelona - CPU-Only

PRE

Testing initiated at 27 March 2024 00:24 by user phoronix.

a

OS: Ubuntu 23.10, Kernel: 6.5.0-25-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 640x480

Testing initiated at 27 March 2024 02:05 by user phoronix.

9684x-march

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

PRE

a