opencl benchmark test AMD Ryzen Threadripper PRO 7995WX 96-Cores testing with a HP 8B24 (U65 Ver. 01.01.04 BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 23.10 via the Phoronix Test Suite. a: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 b: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 c: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 d: Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads), Motherboard: HP 8B24 (U65 Ver. 01.01.04 BIOS), Chipset: AMD Device 14a4, Memory: 128GB, Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1, Graphics: NVIDIA RTX A4000 16GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS VP28U, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 23.10, Kernel: 6.5.0-14-generic (x86_64), Desktop: GNOME Shell 45.0, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160 ProjectPhysX OpenCL-Benchmark 1.2 Operation: FP64 Compute TFLOPs/s > Higher Is Better a . 0.355 |==================================================================== b . 0.354 |==================================================================== c . 0.353 |==================================================================== d . 0.353 |==================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: FP32 Compute TFLOPs/s > Higher Is Better a . 22.05 |==================================================================== b . 22.00 |==================================================================== c . 22.05 |==================================================================== d . 22.00 |==================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT64 Compute TIOPs/s > Higher Is Better a . 2.857 |==================================================================== b . 2.833 |=================================================================== c . 2.859 |==================================================================== d . 2.854 |==================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT32 Compute TIOPs/s > Higher Is Better a . 10.38 |==================================================================== b . 10.25 |=================================================================== c . 10.34 |==================================================================== d . 10.34 |==================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT16 Compute TIOPs/s > Higher Is Better a . 8.703 |==================================================================== b . 8.625 |=================================================================== c . 8.599 |=================================================================== d . 8.533 |=================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: INT8 Compute TIOPs/s > Higher Is Better a . 8.244 |==================================================================== b . 8.118 |=================================================================== c . 8.218 |==================================================================== d . 8.069 |=================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: Memory Bandwidth Coalesced Read GB/s > Higher Is Better a . 399.34 |=================================================================== b . 399.33 |=================================================================== c . 399.36 |=================================================================== d . 399.26 |=================================================================== ProjectPhysX OpenCL-Benchmark 1.2 Operation: Memory Bandwidth Coalesced Write GB/s > Higher Is Better a . 406.12 |=================================================================== b . 406.20 |=================================================================== c . 406.19 |=================================================================== d . 406.33 |===================================================================