test1.txt 2 x Intel Xeon Silver 4110 testing with a Dell 0RN4PJ (1.9.0 BIOS) and NVIDIA GV100 [TITAN V] 12GB on Ubuntu 18.04 via the Phoronix Test Suite. firstTest: Processor: 2 x Intel Xeon Silver 4110 @ 2.10GHz (16 Cores / 32 Threads), Motherboard: Dell 0RN4PJ (1.9.0 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 32GB, Disk: 1024GB SK hynix PC401 NVMe 1TB, Graphics: NVIDIA GV100 [TITAN V] 12GB, Audio: Realtek ALC3234, Network: Intel I219-LM + Intel I210 + 2 x Intel 10G X550T OS: Ubuntu 18.04, Kernel: 6.6.7preempt-rt (x86_64), Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA, Vulkan: 1.3.242, Compiler: GCC 7.5.0 + Clang 7.1.0-svn353565-1~exp1~20190408084827.60 + CUDA 11.8, File-System: ext4, Screen Resolution: 1920x1080 ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better firstTest . 3197 |============================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better firstTest . 3260 |============================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better firstTest . 3313 |============================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better firstTest . 3290 |============================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better firstTest . 292 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better firstTest . 162 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better firstTest . 455 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better firstTest . 496 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better firstTest . 406 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better firstTest . 266 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better firstTest . 315 |============================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better firstTest . 241 |============================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better firstTest . 28.7 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better firstTest . 30.0 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better firstTest . 28.9 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better firstTest . 29.6 |============================================================= FAHBench 2.3.2 Ns Per Day > Higher Is Better firstTest . 202.90 |=========================================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision GFLOPS > Higher Is Better firstTest . 11814.10 |========================================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision GFLOPS > Higher Is Better firstTest . 5688.04 |========================================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision GFLOPS > Higher Is Better firstTest . 18147.30 |========================================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision GFLOPS > Higher Is Better firstTest . 11577.39 |========================================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision GFLOPS > Higher Is Better firstTest . 5921.07 |========================================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer GIOPS > Higher Is Better firstTest . 10000.87 |========================================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer GIOPS > Higher Is Better firstTest . 11558.64 |========================================================= NeatBench 5 Acceleration: GPU FPS > Higher Is Better firstTest . 44.4 |============================================================= MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 4.0 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 4.0 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better firstTest . 22.13 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better firstTest . 264.70 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better firstTest . 64.88 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better firstTest . 52.37 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better firstTest . 89.17 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better firstTest . 82.56 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better firstTest . 21.90 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better firstTest . 35.61 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better firstTest . 160.55 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better firstTest . 68.19 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better firstTest . 7.87 |============================================================= NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better firstTest . 44.08 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better firstTest . 27.69 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better firstTest . 21.72 |============================================================ NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better firstTest . 38.44 |============================================================ NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better firstTest . 35.50 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better firstTest . 87.84 |============================================================ Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better firstTest . 41.8 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better firstTest . 32.2 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better firstTest . 48.2 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better firstTest . 48.3 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better firstTest . 33.5 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better firstTest . 42.9 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better firstTest . 47.3 |============================================================= ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better firstTest . 31.64 |============================================================ FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better ArrayFire 3.9 Test: Conjugate Gradient OpenCL LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better LeelaChessZero 0.30 Backend: OpenCL Nodes Per Second > Higher Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth clpeak 1.1.2 OpenCL Test: Double-Precision Double clpeak 1.1.2 OpenCL Test: Single-Precision Float clpeak 1.1.2 OpenCL Test: Integer Compute INT RedShift Demo 3.0 Seconds < Lower Is Better cl-mem 2017-01-13 Benchmark: Write cl-mem 2017-01-13 Benchmark: Read cl-mem 2017-01-13 Benchmark: Copy SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better