2024-10-17-pytorch AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS) and AMD Radeon RX 6900 XT 16GB on Fedora Linux 40 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2410178-ENB-2024101780&grr .
2024-10-17-pytorch Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Compiler File-System Screen Resolution AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS) AMD Starship/Matisse 128GB 2 x 2000GB Sabrent Rocket Q AMD Radeon RX 6900 XT 16GB AMD Navi 21/23 MPCP28UHD + MP Monitor Aquantia AQtion AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Fedora Linux 40 6.11.3-200.fc40.x86_64 (x86_64) GNOME Shell 46.5 X Server 1.20.14 + Wayland 4.6 Mesa 24.1.7 (LLVM 18.1.6 DRM 3.59) OpenCL 2.1 AMD-APP (3614.0) + OpenCL 3.0 PoCL 5.0 Linux RELOC SPIR LLVM 17.0.6 SLEEF DISTRO POCL_DEBUG GCC 14.2.1 20240912 + Clang 18.1.8 + LLVM 18.1.8 btrfs 7680x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107c - Python 3.12.6 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2024-10-17-pytorch pytorch: CPU - 32 - Efficientnet_v2_l pytorch: CPU - 16 - Efficientnet_v2_l pytorch: CPU - 64 - Efficientnet_v2_l pytorch: CPU - 512 - Efficientnet_v2_l pytorch: CPU - 256 - Efficientnet_v2_l pytorch: CPU - 512 - ResNet-152 pytorch: CPU - 256 - ResNet-152 pytorch: CPU - 64 - ResNet-152 pytorch: CPU - 32 - ResNet-152 pytorch: CPU - 16 - ResNet-152 pytorch: CPU - 1 - Efficientnet_v2_l pytorch: CPU - 1 - ResNet-152 pytorch: CPU - 512 - ResNet-50 pytorch: CPU - 256 - ResNet-50 pytorch: CPU - 64 - ResNet-50 pytorch: CPU - 32 - ResNet-50 pytorch: CPU - 16 - ResNet-50 pytorch: CPU - 1 - ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 4.59 4.62 4.60 4.64 4.65 8.71 8.79 8.83 8.80 8.95 6.02 11.96 21.69 21.97 22.06 22.44 23.06 30.08 OpenBenchmarking.org
PyTorch Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 1.0328 2.0656 3.0984 4.1312 5.164 SE +/- 0.01, N = 3 4.59 MIN: 4.4 / MAX: 4.69
PyTorch Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 1.0395 2.079 3.1185 4.158 5.1975 SE +/- 0.00, N = 3 4.62 MIN: 4.27 / MAX: 4.71
PyTorch Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 1.035 2.07 3.105 4.14 5.175 SE +/- 0.03, N = 3 4.60 MIN: 4.34 / MAX: 4.73
PyTorch Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 1.044 2.088 3.132 4.176 5.22 SE +/- 0.01, N = 3 4.64 MIN: 4.09 / MAX: 4.75
PyTorch Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 1.0463 2.0926 3.1389 4.1852 5.2315 SE +/- 0.03, N = 3 4.65 MIN: 4.46 / MAX: 4.78
PyTorch Device: CPU - Batch Size: 512 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 512 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 2 4 6 8 10 SE +/- 0.03, N = 3 8.71 MIN: 8.21 / MAX: 8.86
PyTorch Device: CPU - Batch Size: 256 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 256 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 2 4 6 8 10 SE +/- 0.03, N = 3 8.79 MIN: 7.68 / MAX: 8.99
PyTorch Device: CPU - Batch Size: 64 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 64 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 2 4 6 8 10 SE +/- 0.05, N = 3 8.83 MIN: 8.42 / MAX: 9.04
PyTorch Device: CPU - Batch Size: 32 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 32 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 2 4 6 8 10 SE +/- 0.03, N = 3 8.80 MIN: 8.03 / MAX: 9.02
PyTorch Device: CPU - Batch Size: 16 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 16 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 3 6 9 12 15 SE +/- 0.04, N = 3 8.95 MIN: 8.14 / MAX: 9.16
PyTorch Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 2 4 6 8 10 SE +/- 0.06, N = 3 6.02 MIN: 5.63 / MAX: 6.26
PyTorch Device: CPU - Batch Size: 1 - Model: ResNet-152 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-152 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 3 6 9 12 15 SE +/- 0.01, N = 3 11.96 MIN: 11.32 / MAX: 12.19
PyTorch Device: CPU - Batch Size: 512 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 512 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 5 10 15 20 25 SE +/- 0.14, N = 3 21.69 MIN: 19.79 / MAX: 22.43
PyTorch Device: CPU - Batch Size: 256 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 5 10 15 20 25 SE +/- 0.13, N = 3 21.97 MIN: 20.5 / MAX: 22.73
PyTorch Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 5 10 15 20 25 SE +/- 0.25, N = 3 22.06 MIN: 21.02 / MAX: 22.92
PyTorch Device: CPU - Batch Size: 32 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 32 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 5 10 15 20 25 SE +/- 0.03, N = 3 22.44 MIN: 21.03 / MAX: 22.92
PyTorch Device: CPU - Batch Size: 16 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 16 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 6 12 18 24 30 SE +/- 0.17, N = 3 23.06 MIN: 22.01 / MAX: 23.78
PyTorch Device: CPU - Batch Size: 1 - Model: ResNet-50 OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX 7 14 21 28 35 SE +/- 0.08, N = 3 30.08 MIN: 26.37 / MAX: 30.64
Phoronix Test Suite v10.8.5