inititialpassthrough052223 QEMU testing on Ubuntu 22.04 via the Phoronix Test Suite. passthrough: Processor: 4 x QEMU Virtual 2.5+ (16 Cores), Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (3.20230228-2 BIOS), Chipset: Intel 82G33/G31/P35/P31 + ICH9, Memory: 3 x 16 GB RAM QEMU, Disk: 35GB QEMU HDD + 107GB QEMU HDD, Graphics: NVIDIA GA102GL [RTX A5000] 24GB, Audio: Intel 82801I, Monitor: BenQ PD3200U, Network: Red Hat Virtio device OS: Ubuntu 22.04, Kernel: 5.15.0-72-generic (x86_64), Vulkan: 1.3.224, Compiler: GCC 11.3.0, File-System: ext4, Screen Resolution: 3840x2160, System Layer: QEMU passthrough_: Processor: 4 x QEMU Virtual 2.5+ (16 Cores), Motherboard: QEMU Standard PC (Q35 + ICH9 2009) (3.20230228-2 BIOS), Chipset: Intel 82G33/G31/P35/P31 + ICH9, Memory: 3 x 16 GB RAM QEMU, Disk: 35GB QEMU HDD + 107GB QEMU HDD, Graphics: NVIDIA GA102GL [RTX A5000] 24GB, Audio: Intel 82801I, Network: Red Hat Virtio device OS: Ubuntu 22.04, Kernel: 5.15.0-72-generic (x86_64), Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.0.151, Vulkan: 1.3.224, Compiler: GCC 11.3.0 + CUDA 11.5, File-System: ext4, Screen Resolution: 1280x800, System Layer: QEMU Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better passthrough_ . 48149900000 |=================================================== Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better passthrough_ . 15720500000 |=================================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better passthrough_ . 765033 |======================================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better passthrough_ . 2248000000 |==================================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better passthrough_ . 574300 |======================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better passthrough_ . 340.77 |======================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better passthrough_ . 12.46 |========================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better passthrough_ . 1914.83 |======================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better passthrough_ . 31.06 |========================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better passthrough_ . 357.64 |======================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better passthrough_ . 6355.18 |======================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better passthrough_ . 30024.3 |======================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better passthrough_ . 12.56 |========================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better passthrough_ . 13.18 |========================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better passthrough_ . 2028.93 |======================================================= cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better passthrough_ . 337.1 |========================================================= cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better passthrough_ . 662.9 |========================================================= cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better passthrough_ . 657.0 |========================================================= RedShift Demo 3.0 Seconds < Lower Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better passthrough_ . 269.36 |======================================================== clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better passthrough_ . 13752.12 |====================================================== clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better passthrough_ . 26927.38 |====================================================== clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better passthrough_ . 488.04 |======================================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better passthrough_ . 658.16 |======================================================== LeelaChessZero 0.28 Backend: OpenCL Nodes Per Second > Higher Is Better passthrough_ . 19392 |========================================================= Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better passthrough_ . 4.690 |========================================================= ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better passthrough_ . 1.749 |========================================================= LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better passthrough_ . 11.48 |========================================================= LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better passthrough_ . 6.10 |========================================================== LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better passthrough_ . 10.02 |========================================================= LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better passthrough_ . 8.57 |========================================================== LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better passthrough_ . 30.10 |========================================================= FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better passthrough_ . 7.608 |========================================================= ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better passthrough_ . 516 |=========================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better passthrough_ . 269 |=========================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better passthrough_ . 126 |=========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better passthrough_ . 12.5 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better passthrough_ . 18.6 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better passthrough_ . 15.8 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better passthrough_ . 35.2 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better passthrough_ . 48.4 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better passthrough_ . 41.6 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better passthrough_ . 38.7 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better passthrough_ . 45.5 |========================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better passthrough_ . 41.9 |========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better passthrough_ . 325 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better passthrough_ . 439 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better passthrough_ . 324 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better passthrough_ . 523 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better passthrough_ . 605 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better passthrough_ . 553 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better passthrough_ . 171 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better passthrough_ . 338 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better passthrough_ . 447 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better passthrough_ . 451 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better passthrough_ . 450 |=========================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better passthrough_ . 450 |=========================================================== GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better NCNN 20220729 Target: Vulkan GPU ms < Lower Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better Blender 3.5 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better passthrough_ . 7.46 |========================================================== Blender 3.5 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better passthrough_ . 17.44 |========================================================= Blender 3.5 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better passthrough_ . 14.52 |========================================================= Blender 3.5 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better passthrough_ . 67.79 |========================================================= Blender 3.5 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better passthrough_ . 19.57 |========================================================= MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better NeatBench 5 Acceleration: GPU FPS > Higher Is Better passthrough_ . 56.5 |==========================================================