pyt AMD Ryzen Threadripper PRO 7995WX 96-Cores testing with a HP 8B24 (U65 Ver. 01.01.04 BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 23.10 via the Phoronix Test Suite. a: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 b: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 c: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 d: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 PyTorch 2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 batches/sec > Higher Is Better a . 50.01 |=================================================================== b . 50.01 |=================================================================== c . 50.46 |==================================================================== d . 49.60 |=================================================================== PyTorch 2.1 Device: CPU - Batch Size: 1 - Model: ResNet-152 batches/sec > Higher Is Better a . 18.70 |================================================================== b . 18.57 |================================================================== c . 18.79 |=================================================================== d . 19.15 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 16 - Model: ResNet-50 batches/sec > Higher Is Better a . 40.83 |==================================================================== b . 40.69 |==================================================================== c . 40.54 |==================================================================== d . 40.40 |=================================================================== PyTorch 2.1 Device: CPU - Batch Size: 32 - Model: ResNet-50 batches/sec > Higher Is Better a . 40.61 |==================================================================== b . 40.33 |=================================================================== c . 40.59 |==================================================================== d . 40.72 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 batches/sec > Higher Is Better a . 40.79 |==================================================================== b . 40.64 |=================================================================== c . 40.39 |=================================================================== d . 40.97 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 16 - Model: ResNet-152 batches/sec > Higher Is Better a . 16.18 |=================================================================== b . 15.75 |================================================================== c . 15.87 |================================================================== d . 16.32 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 batches/sec > Higher Is Better a . 40.82 |==================================================================== b . 40.98 |==================================================================== c . 40.93 |==================================================================== d . 40.35 |=================================================================== PyTorch 2.1 Device: CPU - Batch Size: 32 - Model: ResNet-152 batches/sec > Higher Is Better a . 16.10 |=================================================================== b . 16.40 |==================================================================== c . 16.17 |=================================================================== d . 16.41 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 512 - Model: ResNet-50 batches/sec > Higher Is Better a . 40.33 |=================================================================== b . 40.44 |=================================================================== c . 40.74 |==================================================================== d . 39.99 |=================================================================== PyTorch 2.1 Device: CPU - Batch Size: 64 - Model: ResNet-152 batches/sec > Higher Is Better a . 16.05 |=================================================================== b . 15.88 |================================================================== c . 16.16 |=================================================================== d . 16.31 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 256 - Model: ResNet-152 batches/sec > Higher Is Better a . 16.09 |================================================================== b . 15.74 |================================================================= c . 15.87 |================================================================= d . 16.51 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 512 - Model: ResNet-152 batches/sec > Higher Is Better a . 16.20 |==================================================================== b . 16.24 |==================================================================== c . 16.17 |==================================================================== d . 15.96 |=================================================================== PyTorch 2.1 Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 11.16 |================================================================== b . 11.38 |==================================================================== c . 11.21 |=================================================================== d . 11.42 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 6.24 |===================================================================== b . 6.28 |===================================================================== c . 6.24 |===================================================================== d . 6.25 |===================================================================== PyTorch 2.1 Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 6.28 |===================================================================== b . 6.23 |==================================================================== c . 6.28 |===================================================================== d . 6.25 |===================================================================== PyTorch 2.1 Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 6.25 |==================================================================== b . 6.23 |==================================================================== c . 6.35 |===================================================================== d . 6.23 |==================================================================== PyTorch 2.1 Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 6.27 |==================================================================== b . 6.19 |=================================================================== c . 6.19 |=================================================================== d . 6.36 |===================================================================== PyTorch 2.1 Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 6.24 |===================================================================== b . 6.25 |===================================================================== c . 6.26 |===================================================================== d . 6.27 |===================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50 batches/sec > Higher Is Better a . 361.97 |=================================================================== b . 357.01 |================================================================== c . 355.60 |================================================================== d . 353.30 |================================================================= PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152 batches/sec > Higher Is Better a . 132.98 |================================================================== b . 134.62 |=================================================================== c . 132.86 |================================================================== d . 133.46 |================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-50 batches/sec > Higher Is Better a . 288.42 |================================================================== b . 291.63 |=================================================================== c . 291.60 |=================================================================== d . 291.81 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-50 batches/sec > Higher Is Better a . 287.50 |================================================================== b . 291.74 |=================================================================== c . 291.63 |=================================================================== d . 292.06 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-50 batches/sec > Higher Is Better a . 287.15 |================================================================== b . 291.97 |=================================================================== c . 291.52 |=================================================================== d . 291.76 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-152 batches/sec > Higher Is Better a . 127.01 |================================================================== b . 125.19 |================================================================= c . 128.92 |=================================================================== d . 129.27 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-50 batches/sec > Higher Is Better a . 286.04 |================================================================== b . 289.02 |=================================================================== c . 288.89 |=================================================================== d . 289.30 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-152 batches/sec > Higher Is Better a . 126.84 |================================================================== b . 129.44 |=================================================================== c . 129.52 |=================================================================== d . 125.41 |================================================================= PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50 batches/sec > Higher Is Better a . 285.74 |================================================================== b . 288.50 |=================================================================== c . 288.57 |=================================================================== d . 288.51 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-152 batches/sec > Higher Is Better a . 126.04 |================================================================= b . 129.28 |=================================================================== c . 126.00 |================================================================= d . 127.10 |================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-152 batches/sec > Higher Is Better a . 126.55 |=================================================================== b . 125.39 |================================================================== c . 127.26 |=================================================================== d . 126.60 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152 batches/sec > Higher Is Better a . 126.58 |=================================================================== b . 126.94 |=================================================================== c . 123.41 |================================================================= d . 125.05 |================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 70.38 |=================================================================== b . 69.60 |================================================================== c . 71.80 |==================================================================== d . 71.36 |==================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 67.48 |==================================================================== b . 67.86 |==================================================================== c . 66.79 |=================================================================== d . 66.63 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 66.65 |================================================================== b . 68.69 |==================================================================== c . 67.12 |================================================================== d . 67.59 |=================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 67.18 |=================================================================== b . 68.07 |==================================================================== c . 67.30 |=================================================================== d . 67.83 |==================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 66.57 |=================================================================== b . 66.27 |=================================================================== c . 67.12 |==================================================================== d . 67.11 |==================================================================== PyTorch 2.1 Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l batches/sec > Higher Is Better a . 65.94 |=================================================================== b . 66.63 |==================================================================== c . 66.20 |==================================================================== d . 66.20 |====================================================================