Intel Core i9-9900K testing with a ASUS PRIME Z390-A (0506 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Ubuntu 18.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1810313-SK-OPENCLCLV15 opencl-clvk-benchmark-test - Phoronix Test Suite opencl-clvk-benchmark-test Intel Core i9-9900K testing with a ASUS PRIME Z390-A (0506 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Ubuntu 18.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1810313-SK-OPENCLCLV15&rdt&export=txt&gru .
opencl-clvk-benchmark-test Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Compiler File-System Screen Resolution CLVK 20181031 NVIDIA 410.73 Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads) ASUS PRIME Z390-A (0506 BIOS) Intel Cannon Lake PCH Shared SRAM 16384MB Samsung SSD 970 EVO 250GB eVGA NVIDIA GeForce RTX 2070 8GB (1410/7000MHz) Realtek ALC1220 Acer B286HK Intel Connection Ubuntu 18.10 4.18.0-10-generic (x86_64) GNOME Shell 3.30.1 X Server 1.20.1 NVIDIA 410.73 4.6.0 OpenCL 1.2 clvk GCC 8.2.0 ext4 3840x2160 OpenCL 1.2 CUDA 10.0.185 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave OpenCL Details - GPU Compute Cores: 2304 Security Details - __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp
opencl-clvk-benchmark-test shoc: OpenCL - Triad shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer shoc: OpenCL - FFT SP shoc: OpenCL - Max SP Flops clpeak: Single-Precision Float clpeak: Double-Precision Double shoc: OpenCL - MD5 Hash clpeak: Integer Compute INT clpeak: Kernel Latency CLVK 20181031 NVIDIA 410.73 0.47 0.48 0.48 62.80 12.57 13.00 13.17 1106.70 330.37 395.40 317.07 368.01 11.32 12.59 989.36 8759 8679 275.80 18.79 8367 3.62 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad CLVK 20181031 NVIDIA 410.73 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 0.47 12.57 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download CLVK 20181031 NVIDIA 410.73 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.48 13.00 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback CLVK 20181031 NVIDIA 410.73 3 6 9 12 15 SE +/- 0.01, N = 12 SE +/- 0.00, N = 3 0.48 13.17 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth NVIDIA 410.73 200 400 600 800 1000 SE +/- 0.96, N = 3 1106.70 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy NVIDIA 410.73 70 140 210 280 350 SE +/- 0.12, N = 3 330.37 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read NVIDIA 410.73 90 180 270 360 450 SE +/- 0.10, N = 3 395.40 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write NVIDIA 410.73 70 140 210 280 350 SE +/- 0.63, N = 3 317.07 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth NVIDIA 410.73 80 160 240 320 400 SE +/- 0.52, N = 3 368.01
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer NVIDIA 410.73 3 6 9 12 15 SE +/- 0.00, N = 3 11.32
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer NVIDIA 410.73 3 6 9 12 15 SE +/- 0.00, N = 3 12.59
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP CLVK 20181031 NVIDIA 410.73 200 400 600 800 1000 SE +/- 0.02, N = 3 SE +/- 0.44, N = 3 62.80 989.36 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops NVIDIA 410.73 2K 4K 6K 8K 10K SE +/- 0.15, N = 3 8759 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float NVIDIA 410.73 2K 4K 6K 8K 10K SE +/- 90.96, N = 12 8679
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double NVIDIA 410.73 60 120 180 240 300 SE +/- 0.14, N = 3 275.80
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash NVIDIA 410.73 5 10 15 20 25 SE +/- 0.02, N = 3 18.79 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT NVIDIA 410.73 2K 4K 6K 8K 10K SE +/- 157.16, N = 11 8367
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency NVIDIA 410.73 0.8145 1.629 2.4435 3.258 4.0725 SE +/- 0.04, N = 3 3.62
Phoronix Test Suite v10.8.4