2305100-PTS-SPRAVX1234 Intel Xeon w9-3495X testing with a Supermicro X13SWA-TF v1.01 (1.1a BIOS) and NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2305109-NE-2305100PT25 .
2305100-PTS-SPRAVX1234 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI Intel Xeon w9-3495X @ 4.60GHz (56 Cores / 112 Threads) Supermicro X13SWA-TF v1.01 (1.1a BIOS) Intel Alder Lake-S PCH 8 x 32 GB DDR5-4800MT/s Adata AD5R480032G20-BSSB 6401GB Micron_9300_MTFDHAL6T4TDR + 1000GB INTEL SSDSC2KW01 NVIDIA GeForce RTX 4090 24GB Realtek ALC888-VD BenQ PD2720U Intel I210 + Aquantia Device 14c0 Ubuntu 22.10 6.0.0-060000-generic (x86_64) GNOME Shell 43.1 X Server 1.21.1.4 NVIDIA 530.41.03 4.6.0 OpenCL 3.0 CUDA 12.1.98 1.3.236 GCC 12.2.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x2b000390 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
2305100-PTS-SPRAVX1234 onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 2.45351 0.948574 0.791395 OpenBenchmarking.org
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.7 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 0.552 1.104 1.656 2.208 2.76 SE +/- 0.05862, N = 12 2.45351 MIN: 0.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
oneDNN CPU Power Consumption Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 59.4 247.9 301.4 OpenBenchmarking.org Watts, Fewer Is Better oneDNN 2.7 CPU Power Consumption Monitor 80 160 240 320 400
oneDNN CPU Temperature Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 24.0 28.4 32.0 OpenBenchmarking.org Celsius, Fewer Is Better oneDNN 2.7 CPU Temperature Monitor 9 18 27 36 45
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.7 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 0.2134 0.4268 0.6402 0.8536 1.067 SE +/- 0.170798, N = 12 0.948574 MIN: 0.11 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
oneDNN CPU Power Consumption Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 81.0 250.8 362.3 OpenBenchmarking.org Watts, Fewer Is Better oneDNN 2.7 CPU Power Consumption Monitor 100 200 300 400 500
oneDNN CPU Temperature Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 23.0 28.7 33.0 OpenBenchmarking.org Celsius, Fewer Is Better oneDNN 2.7 CPU Temperature Monitor 10 20 30 40 50
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.7 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 0.1781 0.3562 0.5343 0.7124 0.8905 SE +/- 0.222968, N = 12 0.791395 MIN: 0.14 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
oneDNN CPU Power Consumption Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 61.6 256.0 377.0 OpenBenchmarking.org Watts, Fewer Is Better oneDNN 2.7 CPU Power Consumption Monitor 100 200 300 400 500
oneDNN CPU Temperature Monitor Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 22.0 29.4 34.0 OpenBenchmarking.org Celsius, Fewer Is Better oneDNN 2.7 CPU Temperature Monitor 10 20 30 40 50
CPU Power Consumption Monitor Phoronix Test Suite System Monitoring Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 59.4 247.5 377.0 OpenBenchmarking.org Watts CPU Power Consumption Monitor Phoronix Test Suite System Monitoring 100 200 300 400 500
CPU Temperature Monitor Phoronix Test Suite System Monitoring Min Avg Max Xeon w9-3495X F32 VS AVX512 VS AVX512 VNNI 22.0 28.6 34.0 OpenBenchmarking.org Celsius CPU Temperature Monitor Phoronix Test Suite System Monitoring 10 20 30 40 50
Phoronix Test Suite v10.8.4