onednn 7980X
AMD Ryzen Threadripper 7980X 64-Cores testing with a System76 Thelio Major (FA Z5 BIOS) and AMD Radeon Pro W7900 45GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2403013-PTS-ONEDNN7954&grr .
onednn 7980X system details (configurations a, b, c, d)

Processor: AMD Ryzen Threadripper 7980X 64-Cores @ 7.79GHz (64 Cores / 128 Threads)
Motherboard: System76 Thelio Major (FA Z5 BIOS)
Chipset: AMD Device 14a4
Memory: 4 x 32GB DRAM-4800MT/s Micron MTC20F1045S1RC48BA2
Disk: 1000GB CT1000T700SSD5
Graphics: AMD Radeon Pro W7900 45GB (1760/1124MHz)
Audio: AMD Device 14cc
Monitor: DELL P2415Q
Network: Aquantia AQC113C NBase-T/IEEE + Realtek RTL8125 2.5GbE + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Ubuntu 23.10
Kernel: 6.5.0-21-generic (x86_64)
Desktop: GNOME Shell 45.2
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 23.2.1-1ubuntu3.1 (LLVM 15.0.7 DRM 3.54)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xa108105

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Mitigation of safe RET
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected
onednn 7980X results summary (onednn, Engine: CPU)

Test                                         a         b         c         d
Recurrent Neural Network Training      554.177   553.537   556.932   555.069
Recurrent Neural Network Inference     338.714   339.080   338.742   340.213
IP Shapes 1D                          0.636815  2.093539  0.614081  0.612614
Deconvolution Batch shapes_1d          7.78722   7.80352   7.71667   7.69495
IP Shapes 3D                          0.363055  0.355889  0.357966  0.359911
Convolution Batch Shapes Auto         0.563429  0.560154  0.554875  0.559625
Deconvolution Batch shapes_3d          1.03074   1.02765   1.02734   1.02871
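One way to read the summary above is to ask how far apart the four runs are for each test. The following is a minimal sketch, not part of the Phoronix Test Suite: the names `results_ms`, `spread_pct`, and `spreads` are hypothetical, and the numbers are copied from the summary in the order a, b, c, d.

```python
# Mean times per run (a, b, c, d), copied from the results summary above.
# All names in this sketch are illustrative, not Phoronix Test Suite APIs.
results_ms = {
    "Recurrent Neural Network Training": [554.177, 553.537, 556.932, 555.069],
    "Recurrent Neural Network Inference": [338.714, 339.080, 338.742, 340.213],
    "IP Shapes 1D": [0.636815, 2.093539, 0.614081, 0.612614],
    "Deconvolution Batch shapes_1d": [7.78722, 7.80352, 7.71667, 7.69495],
    "IP Shapes 3D": [0.363055, 0.355889, 0.357966, 0.359911],
    "Convolution Batch Shapes Auto": [0.563429, 0.560154, 0.554875, 0.559625],
    "Deconvolution Batch shapes_3d": [1.03074, 1.02765, 1.02734, 1.02871],
}


def spread_pct(values):
    """Gap between the slowest and fastest run, as a percentage of the fastest."""
    fastest, slowest = min(values), max(values)
    return 100.0 * (slowest - fastest) / fastest


def spreads(results):
    """Map each test name to its run-to-run spread in percent."""
    return {name: spread_pct(values) for name, values in results.items()}


if __name__ == "__main__":
    # Print tests from the largest spread to the smallest.
    for name, pct in sorted(spreads(results_ms).items(), key=lambda kv: -kv[1]):
        print(f"{name}: {pct:.2f}%")
```

Under this measure, IP Shapes 1D stands apart from the other tests because of run b, while the remaining tests differ by a few percent or less between runs.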
oneDNN 3.4
Harness: Recurrent Neural Network Training - Engine: CPU
ms, Fewer Is Better
a: 554.18 (SE +/- 1.23, N = 3, MIN: 546.15)
b: 553.54 (SE +/- 0.36, N = 3, MIN: 546.98)
c: 556.93 (SE +/- 2.35, N = 3, MIN: 546.81)
d: 555.07 (SE +/- 0.89, N = 3, MIN: 547.93)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN 3.4
Harness: Recurrent Neural Network Inference - Engine: CPU
ms, Fewer Is Better
a: 338.71 (SE +/- 0.23, N = 3, MIN: 334.29)
b: 339.08 (SE +/- 0.52, N = 3, MIN: 332.75)
c: 338.74 (SE +/- 0.64, N = 3, MIN: 332.03)
d: 340.21 (SE +/- 1.69, N = 3, MIN: 332.65)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN 3.4
Harness: IP Shapes 1D - Engine: CPU
ms, Fewer Is Better
a: 0.636815 (SE +/- 0.004979, N = 10, MIN: 0.58)
b: 2.093539 (SE +/- 1.475161, N = 12, MIN: 0.58)
c: 0.614081 (SE +/- 0.001677, N = 3, MIN: 0.58)
d: 0.612614 (SE +/- 0.001370, N = 3, MIN: 0.58)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

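The IP Shapes 1D figures above report a standard error (SE) alongside each mean, which makes it possible to check how noisy each run was. The sketch below expresses SE as a percentage of the mean and flags runs above a cutoff. The 5% cutoff and the names `ip_shapes_1d`, `relative_standard_error`, and `flag_noisy` are assumptions chosen for illustration, not something the Phoronix Test Suite defines.

```python
# IP Shapes 1D results copied from the report: (mean in ms, standard error, sample count N).
ip_shapes_1d = {
    "a": (0.636815, 0.004979, 10),
    "b": (2.093539, 1.475161, 12),
    "c": (0.614081, 0.001677, 3),
    "d": (0.612614, 0.001370, 3),
}


def relative_standard_error(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


def flag_noisy(results, threshold_pct=5.0):
    """Return the run labels whose relative standard error exceeds threshold_pct.

    The default threshold is an assumed cutoff for this sketch only.
    """
    return [
        label
        for label, (mean, se, _n) in results.items()
        if relative_standard_error(mean, se) > threshold_pct
    ]


if __name__ == "__main__":
    for label, (mean, se, n) in ip_shapes_1d.items():
        rse = relative_standard_error(mean, se)
        print(f"{label}: {mean:.6f} ms, relative SE {rse:.2f}% (N = {n})")
    print("noisy runs:", flag_noisy(ip_shapes_1d))
```

With these inputs only run b is flagged. Its reported minimum of 0.58 ms matches the other three runs, so the high mean appears to come from a small number of slow samples rather than a consistently slower configuration.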
oneDNN 3.4
Harness: Deconvolution Batch shapes_1d - Engine: CPU
ms, Fewer Is Better
a: 7.78722 (SE +/- 0.02714, N = 3, MIN: 6.61)
b: 7.80352 (SE +/- 0.04618, N = 3, MIN: 6.64)
c: 7.71667 (SE +/- 0.03540, N = 3, MIN: 6.76)
d: 7.69495 (SE +/- 0.01791, N = 3, MIN: 6.54)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN 3.4
Harness: IP Shapes 3D - Engine: CPU
ms, Fewer Is Better
a: 0.363055 (SE +/- 0.004163, N = 3, MIN: 0.32)
b: 0.355889 (SE +/- 0.003768, N = 5, MIN: 0.31)
c: 0.357966 (SE +/- 0.000771, N = 3, MIN: 0.33)
d: 0.359911 (SE +/- 0.002346, N = 3, MIN: 0.33)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN 3.4
Harness: Convolution Batch Shapes Auto - Engine: CPU
ms, Fewer Is Better
a: 0.563429 (SE +/- 0.002902, N = 3, MIN: 0.51)
b: 0.560154 (SE +/- 0.001571, N = 3, MIN: 0.52)
c: 0.554875 (SE +/- 0.001122, N = 3, MIN: 0.51)
d: 0.559625 (SE +/- 0.002332, N = 3, MIN: 0.52)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN 3.4
Harness: Deconvolution Batch shapes_3d - Engine: CPU
ms, Fewer Is Better
a: 1.03074 (SE +/- 0.00321, N = 3, MIN: 0.96)
b: 1.02765 (SE +/- 0.00014, N = 3, MIN: 0.97)
c: 1.02734 (SE +/- 0.00208, N = 3, MIN: 0.97)
d: 1.02871 (SE +/- 0.00031, N = 3, MIN: 0.97)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Phoronix Test Suite v10.8.5