2024-10-17-pytorch

AMD Ryzen Threadripper 3960X 24-Core testing with a ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS) and AMD Radeon RX 6900 XT 16GB on Fedora Linux 40 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2410178-ENB-2024101780&gru.

2024-10-17-pytorchProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RXAMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)ASUS ROG ZENITH II EXTREME ALPHA (0902 BIOS)AMD Starship/Matisse128GB2 x 2000GB Sabrent Rocket QAMD Radeon RX 6900 XT 16GBAMD Navi 21/23MPCP28UHD + MP MonitorAquantia AQtion AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200Fedora Linux 406.11.3-200.fc40.x86_64 (x86_64)GNOME Shell 46.5X Server 1.20.14 + Wayland4.6 Mesa 24.1.7 (LLVM 18.1.6 DRM 3.59)OpenCL 2.1 AMD-APP (3614.0) + OpenCL 3.0 PoCL 5.0 Linux RELOC SPIR LLVM 17.0.6 SLEEF DISTRO POCL_DEBUGGCC 14.2.1 20240912 + Clang 18.1.8 + LLVM 18.1.8btrfs7680x2160OpenBenchmarking.org- Transparent Huge Pages: madvise- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107c- Python 3.12.6- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

2024-10-17-pytorchpytorch: CPU - 1 - ResNet-50pytorch: CPU - 1 - ResNet-152pytorch: CPU - 16 - ResNet-50pytorch: CPU - 32 - ResNet-50pytorch: CPU - 64 - ResNet-50pytorch: CPU - 16 - ResNet-152pytorch: CPU - 256 - ResNet-50pytorch: CPU - 32 - ResNet-152pytorch: CPU - 512 - ResNet-50pytorch: CPU - 64 - ResNet-152pytorch: CPU - 256 - ResNet-152pytorch: CPU - 512 - ResNet-152pytorch: CPU - 1 - Efficientnet_v2_lpytorch: CPU - 16 - Efficientnet_v2_lpytorch: CPU - 32 - Efficientnet_v2_lpytorch: CPU - 64 - Efficientnet_v2_lpytorch: CPU - 256 - Efficientnet_v2_lpytorch: CPU - 512 - Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX30.0811.9623.0622.4422.068.9521.978.8021.698.838.798.716.024.624.594.604.654.64OpenBenchmarking.org

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX714212835SE +/- 0.08, N = 330.08MIN: 26.37 / MAX: 30.64

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX3691215SE +/- 0.01, N = 311.96MIN: 11.32 / MAX: 12.19

PyTorch

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 16 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX612182430SE +/- 0.17, N = 323.06MIN: 22.01 / MAX: 23.78

PyTorch

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 32 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.03, N = 322.44MIN: 21.03 / MAX: 22.92

PyTorch

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 64 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.25, N = 322.06MIN: 21.02 / MAX: 22.92

PyTorch

Device: CPU - Batch Size: 16 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 16 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX3691215SE +/- 0.04, N = 38.95MIN: 8.14 / MAX: 9.16

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.13, N = 321.97MIN: 20.5 / MAX: 22.73

PyTorch

Device: CPU - Batch Size: 32 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 32 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.03, N = 38.80MIN: 8.03 / MAX: 9.02

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 512 - Model: ResNet-50AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX510152025SE +/- 0.14, N = 321.69MIN: 19.79 / MAX: 22.43

PyTorch

Device: CPU - Batch Size: 64 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 64 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.05, N = 38.83MIN: 8.42 / MAX: 9.04

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.03, N = 38.79MIN: 7.68 / MAX: 8.99

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 512 - Model: ResNet-152AMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.03, N = 38.71MIN: 8.21 / MAX: 8.86

PyTorch

Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX246810SE +/- 0.06, N = 36.02MIN: 5.63 / MAX: 6.26

PyTorch

Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1.03952.0793.11854.1585.1975SE +/- 0.00, N = 34.62MIN: 4.27 / MAX: 4.71

PyTorch

Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1.03282.06563.09844.13125.164SE +/- 0.01, N = 34.59MIN: 4.4 / MAX: 4.69

PyTorch

Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1.0352.073.1054.145.175SE +/- 0.03, N = 34.60MIN: 4.34 / MAX: 4.73

PyTorch

Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1.04632.09263.13894.18525.2315SE +/- 0.03, N = 34.65MIN: 4.46 / MAX: 4.78

PyTorch

Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.2.1Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_lAMD Ryzen Threadripper 3960X 24-Core - AMD Radeon RX1.0442.0883.1324.1765.22SE +/- 0.01, N = 34.64MIN: 4.09 / MAX: 4.75


Phoronix Test Suite v10.8.5