opncl a100-demo demmo: Processor: 14 x Intel Xeon Gold 6342 (14 Cores), Motherboard: Nutanix AHV (nutanix-ahv-2.20220304.0.2619.el7 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 4 x 16384 MB RAM, Disk: 428GB VDISK, Graphics: NVIDIA A100 80GB PCIe, Network: Red Hat Virtio device OS: Ubuntu 20.04, Kernel: 5.4.0-172-generic (x86_64), Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.242, Compiler: GCC 9.4.0 + CUDA 12.3, File-System: ext4, Screen Resolution: 1024x768, System Layer: KVM SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better demmo . 816.95 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better demmo . 24.73 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better demmo . 4417.74 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better demmo . 42.79 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better demmo . 241.91 |=============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better demmo . 13477.3 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better demmo . 19375.4 |============================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better demmo . 25.30 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better demmo . 26.40 |================================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better demmo . 1581.62 |============================================================== cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better demmo . 234.8 |================================================================ cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better demmo . 796.1 |================================================================ cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better demmo . 1406.0 |=============================================================== FluidX3D 2.9 Test: FP32-FP32 MLUPs/s > Higher Is Better demmo . 9647 |================================================================= FluidX3D 2.9 Test: FP32-FP16C MLUPs/s > Higher Is Better demmo . 10790 |================================================================ FluidX3D 2.9 Test: FP32-FP16S MLUPs/s > Higher Is Better demmo . 17862 |================================================================ clpeak 1.1.2 OpenCL Test: Kernel Latency us < Lower Is Better demmo . 5.03 |================================================================= clpeak 1.1.2 OpenCL Test: Integer Compute GIOPS > Higher Is Better demmo . 19227.04 |============================================================= clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute GIOPS > Higher Is Better demmo . 19275.03 |============================================================= clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better demmo . 1495.26 |============================================================== clpeak 1.1.2 OpenCL Test: Double-Precision Compute GFLOPS > Higher Is Better demmo . 9694.43 |============================================================== clpeak 1.1.2 OpenCL Test: Single-Precision Compute GFLOPS > Higher Is Better demmo . 19300.02 |============================================================= clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer GBPS > Higher Is Better demmo . 9.72 |================================================================= clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer GBPS > Higher Is Better demmo . 8.32 |================================================================= Parboil 2.5 Test: OpenCL BFS Seconds < Lower Is Better Parboil 2.5 Test: OpenCL LBM Seconds < Lower Is Better Parboil 2.5 Test: OpenCL Histo Seconds < Lower Is Better Parboil 2.5 Test: OpenCL TPACF Seconds < Lower Is Better Parboil 2.5 Test: OpenCL MRI Gridding Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL LavaMD Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Myocyte Seconds < Lower Is Better demmo . 26.46 |================================================================ Rodinia 3.1 Test: OpenCL Heartwall Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Leukocyte Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better demmo . 153.8 |================================================================ ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better demmo . 179 |================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better demmo . 84.2 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better demmo . 73.5 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better demmo . 119 |================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better demmo . 109 |================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better demmo . 97.5 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better demmo . 76.4 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better demmo . 26.4 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better demmo . 26.2 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better demmo . 26.0 |================================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better demmo . 26.5 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better demmo . 232 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better demmo . 312 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better demmo . 224 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better demmo . 440 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better demmo . 573 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better demmo . 435 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better demmo . 68.2 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better demmo . 243 |================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better demmo . 4257 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better demmo . 4653 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better demmo . 4223 |================================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better demmo . 4280 |================================================================= Darktable 3.0.1 Test: Boat - Acceleration: OpenCL Seconds < Lower Is Better demmo . 1.373 |================================================================ Darktable 3.0.1 Test: Masskrug - Acceleration: OpenCL Seconds < Lower Is Better demmo . 3.897 |================================================================ Darktable 3.0.1 Test: Server Rack - Acceleration: OpenCL Seconds < Lower Is Better demmo . 0.191 |================================================================ Darktable 3.0.1 Test: Server Room - Acceleration: OpenCL Seconds < Lower Is Better demmo . 1.034 |================================================================ Xsbench OpenCL 2017-07-06 Lookups/s > Higher Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better MandelGPU 1.3pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Caustic Samples/sec > Higher Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Cornell Samples/sec > Higher Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Caustic3 Samples/sec > Higher Is Better LuxMark 3.1 OpenCL Device: CPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU - Scene: Luxball HDR Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR Score > Higher Is Better LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Luxball HDR Score > Higher Is Better LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Luxball HDR Score > Higher Is Better OpenDwarfs 2013-11-06 Test: LU Decomposition ms < Lower Is Better OpenDwarfs 2013-11-06 Test: Compressed Sparse Row ms < Lower Is Better OpenDwarfs 2013-11-06 Test: Cyclic Redundancy Check ms < Lower Is Better Lulesh OpenCL 2017-07-06 z/s > Higher Is Better demmo . 5465.16 |==============================================================