ngc

AMD Ryzen Threadripper PRO 7995WX 96-Cores testing with a HP 8B24 (U65 Ver. 01.01.04 BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 23.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2401055-PTS-NGC5442844
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
a
January 05
  15 Hours, 58 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ngc AMD Ryzen Threadripper PRO 7995WX 96-Cores testing with a HP 8B24 (U65 Ver. 01.01.04 BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 23.10 via the Phoronix Test Suite. ,,"a" Processor,,AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads) Motherboard,,HP 8B24 (U65 Ver. 01.01.04 BIOS) Chipset,,AMD Device 14a4 Memory,,128GB Disk,,2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1 Graphics,,NVIDIA RTX A4000 16GB Audio,,NVIDIA GA104 HD Audio Monitor,,ASUS VP28U Network,,Realtek RTL8111/8168/8411 OS,,Ubuntu 23.10 Kernel,,6.5.0-14-generic (x86_64) Desktop,,GNOME Shell 45.0 Display Server,,X Server 1.21.1.7 Display Driver,,NVIDIA 535.129.03 OpenGL,,4.6.0 OpenCL,,OpenCL 3.0 CUDA 12.2.147 Compiler,,GCC 13.2.0 File-System,,ext4 Screen Resolution,,3840x2160 ,,"a" "ArrayFire - Test: Conjugate Gradient OpenCL (ms)",LIB,2.300 "Betsy GPU Compressor - Codec: ETC1 - Quality: Highest (sec)",LIB, "Betsy GPU Compressor - Codec: ETC2 RGB - Quality: Highest (sec)",LIB, "Blender - Blend File: BMW27 - Compute: NVIDIA OptiX (sec)",LIB,11.32 "Blender - Blend File: Classroom - Compute: NVIDIA OptiX (sec)",LIB,28.80 "Blender - Blend File: Fishy Cat - Compute: NVIDIA OptiX (sec)",LIB,20.25 "Blender - Blend File: Barbershop - Compute: NVIDIA OptiX (sec)",LIB,100.99 "Blender - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX (sec)",LIB,32.20 "Caffe - Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 (ms)",LIB, "Caffe - Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 (ms)",LIB, "Caffe - Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 (ms)",LIB, "Caffe - Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 (ms)",LIB, "Caffe - Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 (ms)",LIB, "Caffe - Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 (ms)",LIB, "Chaos Group V-RAY - Mode: NVIDIA RTX GPU (vrays)",HIB,1790 "Chaos Group V-RAY - Mode: NVIDIA CUDA GPU (vpaths)",HIB,1182 "cl-mem - Benchmark: Copy (GB/s)",HIB,273.0 "cl-mem - Benchmark: Read (GB/s)",HIB,366.1 "cl-mem - Benchmark: Write (GB/s)",HIB,348.6 "clpeak - OpenCL Test: Integer Compute INT (GIOPS)",HIB,9116.63 "clpeak - OpenCL Test: Single-Precision Float (GFLOPS)",HIB,17983.79 "clpeak - OpenCL Test: Double-Precision Double (GFLOPS)",HIB,347.10 "clpeak - OpenCL Test: Global Memory Bandwidth (GBPS)",HIB,361.29 "FAHBench - (Ns/Day)",HIB,224.5663 "FinanceBench - Benchmark: Black-Scholes OpenCL (ms)",LIB,10.628 "GROMACS - Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare (Ns/Day)",HIB, "Hashcat - Benchmark: MD5 (H/s)",HIB,33124266667 "Hashcat - Benchmark: SHA1 (H/s)",HIB,10834766667 "Hashcat - Benchmark: 7-Zip (H/s)",HIB,518600 "Hashcat - Benchmark: SHA-512 (H/s)",HIB,1351300000 "Hashcat - Benchmark: TrueCrypt RIPEMD160 + XTS (H/s)",HIB,370460 "IndigoBench - Acceleration: OpenCL GPU - Scene: Bedroom (M samples/s)",HIB,11.199 "IndigoBench - Acceleration: OpenCL GPU - Scene: Supercar (M samples/s)",HIB,31.766 "LeelaChessZero - Backend: OpenCL (Nodes/s)",HIB, "Libplacebo - (FPS)",HIB, "LuxCoreRender - Scene: DLSC - Acceleration: GPU (M samples/sec)",HIB,6.28 "LuxCoreRender - Scene: Danish Mood - Acceleration: GPU (M samples/sec)",HIB,5.02 "LuxCoreRender - Scene: Orange Juice - Acceleration: GPU (M samples/sec)",HIB,6.51 "LuxCoreRender - Scene: LuxCore Benchmark - Acceleration: GPU (M samples/sec)",HIB,6.12 "LuxCoreRender - Scene: Rainbow Colors and Prism - Acceleration: GPU (M samples/sec)",HIB,18.87 "MandelGPU - OpenCL Device: GPU (Samples/sec)",HIB,346801977.5 "Mixbench - Backend: OpenCL - Benchmark: Integer ()",HIB, "Mixbench - Backend: NVIDIA CUDA - Benchmark: Integer ()",HIB, "Mixbench - Backend: OpenCL - Benchmark: Double Precision ()",HIB, "Mixbench - Backend: OpenCL - Benchmark: Single Precision ()",HIB, "Mixbench - Backend: NVIDIA CUDA - Benchmark: Half Precision ()",HIB, "Mixbench - Backend: NVIDIA CUDA - Benchmark: Double Precision ()",HIB, "Mixbench - Backend: NVIDIA CUDA - Benchmark: Single Precision ()",HIB, "NAMD CUDA - ATPase Simulation - 327,506 Atoms (days/ns)",LIB,0.12721 "NCNN - Target: Vulkan GPU - Model: mobilenet (ms)",LIB,20.74 "NCNN - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms)",LIB,12.35 "NCNN - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms)",LIB,13.71 "NCNN - Target: Vulkan GPU - Model: shufflenet-v2 (ms)",LIB,16.76 "NCNN - Target: Vulkan GPU - Model: mnasnet (ms)",LIB,11.73 "NCNN - Target: Vulkan GPU - Model: efficientnet-b0 (ms)",LIB,16.93 "NCNN - Target: Vulkan GPU - Model: blazeface (ms)",LIB,7.46 "NCNN - Target: Vulkan GPU - Model: googlenet (ms)",LIB,25.46 "NCNN - Target: Vulkan GPU - Model: vgg16 (ms)",LIB,35.86 "NCNN - Target: Vulkan GPU - Model: resnet18 (ms)",LIB,12.54 "NCNN - Target: Vulkan GPU - Model: alexnet (ms)",LIB,7.15 "NCNN - Target: Vulkan GPU - Model: resnet50 (ms)",LIB,23.60 "NCNN - Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 (ms)",LIB,20.74 "NCNN - Target: Vulkan GPU - Model: yolov4-tiny (ms)",LIB,34.43 "NCNN - Target: Vulkan GPU - Model: squeezenet_ssd (ms)",LIB,26.63 "NCNN - Target: Vulkan GPU - Model: regnety_400m (ms)",LIB,56.77 "NCNN - Target: Vulkan GPU - Model: vision_transformer (ms)",LIB,44.17 "NCNN - Target: Vulkan GPU - Model: FastestDet (ms)",LIB,18.23 "NeatBench - Acceleration: GPU (FPS)",HIB, "OctaneBench - Total Score (Score)",HIB,358.152832 "PlaidML - FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL (Examples/sec)",HIB, "PlaidML - FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL (Examples/sec)",HIB, "PlaidML - FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL (Examples/sec)",HIB, "PlaidML - FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL (Examples/sec)",HIB, "PlaidML - FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL (Examples/sec)",HIB, "RealSR-NCNN - Scale: 4x - TAA: No (sec)",LIB,10.013 "RealSR-NCNN - Scale: 4x - TAA: Yes (sec)",LIB,59.851 "RedShift Demo - (sec)",LIB, "Rodinia - Test: OpenCL Particle Filter (sec)",LIB,6.554 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: S3D (GFLOPS)",HIB,209.572 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Triad (GB/s)",HIB,24.8470 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: FFT SP (GFLOPS)",HIB,1093.92 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: MD5 Hash (GHash/s)",HIB,22.1461 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Reduction (GB/s)",HIB,324.049 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS)",HIB,3391.66 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS)",HIB,20821.2 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Download (GB/s)",HIB,26.8298 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s)",HIB,27.0843 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s)",HIB,2005.17 "ViennaCL - Test: CPU BLAS - sCOPY (GB/s)",HIB,601 "ViennaCL - Test: CPU BLAS - sAXPY (GB/s)",HIB,792 "ViennaCL - Test: CPU BLAS - sDOT (GB/s)",HIB,261 "ViennaCL - Test: CPU BLAS - dCOPY (GB/s)",HIB,769 "ViennaCL - Test: CPU BLAS - dAXPY (GB/s)",HIB,1272 "ViennaCL - Test: CPU BLAS - dDOT (GB/s)",HIB,558 "ViennaCL - Test: CPU BLAS - dGEMV-N (GB/s)",HIB,43.6 "ViennaCL - Test: CPU BLAS - dGEMV-T (GB/s)",HIB,156.3 "ViennaCL - Test: CPU BLAS - dGEMM-NN (GFLOPs/s)",HIB,127 "ViennaCL - Test: CPU BLAS - dGEMM-NT (GFLOPs/s)",HIB,121 "ViennaCL - Test: CPU BLAS - dGEMM-TN (GFLOPs/s)",HIB,134 "ViennaCL - Test: CPU BLAS - dGEMM-TT (GFLOPs/s)",HIB,129 "ViennaCL - Test: OpenCL BLAS - sCOPY (GB/s)",HIB,259 "ViennaCL - Test: OpenCL BLAS - sAXPY (GB/s)",HIB,325 "ViennaCL - Test: OpenCL BLAS - sDOT (GB/s)",HIB,298 "ViennaCL - Test: OpenCL BLAS - dCOPY (GB/s)",HIB,332 "ViennaCL - Test: OpenCL BLAS - dAXPY (GB/s)",HIB,364 "ViennaCL - Test: OpenCL BLAS - dDOT (GB/s)",HIB,368 "ViennaCL - Test: OpenCL BLAS - dGEMV-N (GB/s)",HIB,163 "ViennaCL - Test: OpenCL BLAS - dGEMV-T (GB/s)",HIB,302 "ViennaCL - Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s)",HIB,322 "ViennaCL - Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s)",HIB,323 "ViennaCL - Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s)",HIB,322 "ViennaCL - Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s)",HIB,322 "VkFFT - Test: FFT + iFFT R2C / C2R (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C 1D batched in half precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C Bluestein in single precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C 1D batched in double precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C 1D batched in single precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C multidimensional in single precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C Bluestein benchmark in double precision (Benchmark Score)",HIB, "VkFFT - Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling (Benchmark Score)",HIB, "vkpeak - fp32-scalar (GFLOPS)",HIB,11276.77 "vkpeak - fp32-vec4 (GFLOPS)",HIB,14596.88 "vkpeak - fp16-scalar (GFLOPS)",HIB,11037.49 "vkpeak - fp16-vec4 (GFLOPS)",HIB,21947.85 "vkpeak - fp64-scalar (GFLOPS)",HIB,356.21 "vkpeak - fp64-vec4 (GFLOPS)",HIB,356.28 "vkpeak - int32-scalar (GIOPS)",HIB,11249.11 "vkpeak - int32-vec4 (GIOPS)",HIB,10999.52 "vkpeak - int16-scalar (GIOPS)",HIB,7189.38 "vkpeak - int16-vec4 (GIOPS)",HIB,8646.42 "VkResample - Upscale: 2x - Precision: Double (ms)",LIB,500.008 "VkResample - Upscale: 2x - Precision: Single (ms)",LIB,22.096 "Waifu2x-NCNN Vulkan - Scale: 2x - Denoise: 3 - TAA: No (sec)",LIB, "Waifu2x-NCNN Vulkan - Scale: 2x - Denoise: 3 - TAA: Yes (sec)",LIB,4.889