opencl_shoc Intel Xeon E-2278GE testing with a Neousys Nuvo-8108GC NVS-8108 (Build220607 BIOS) and NVIDIA GeForce RTX 3080 10GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408150-NE-OPENCLSHO07&grr .
opencl_shoc Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution NVIDIA GeForce RTX 3080 Intel Xeon E-2278GE @ 4.70GHz (8 Cores / 16 Threads) Neousys Nuvo-8108GC NVS-8108 (Build220607 BIOS) Intel Cannon Lake PCH 32GB 960GB 2.5" SATA SSD 3T + 62GB SanDisk 3.2Gen1 NVIDIA GeForce RTX 3080 10GB IDT 92HD73C1X5 DELL U2721DE Intel I219-LM + Intel I210 + 4 x Realtek RTL8125 2.5GbE + Realtek RTL8822CE 802.11ac PCIe Ubuntu 20.04 5.15.0-101-generic (x86_64) GNOME Shell 3.36.9 X Server 1.20.13 NVIDIA 535.171.04 4.6.0 OpenCL 3.0 CUDA 12.2.148 1.3.242 GCC 9.4.0 ext4 2560x1440 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-9QDOt0/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xfa - Thermald 1.9.1 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.71.40.83 - GPU Compute Cores: 8704 - gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
opencl_shoc shoc: OpenCL - Max SP Flops shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Triad shoc: OpenCL - Bus Speed Download shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash NVIDIA GeForce RTX 3080 32641.4 2159.49 6.5825 6.3599 6.4179 561.507 30.6395 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops NVIDIA GeForce RTX 3080 7K 14K 21K 28K 35K SE +/- 68.99, N = 3 32641.4 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth NVIDIA GeForce RTX 3080 500 1000 1500 2000 2500 SE +/- 4.09, N = 3 2159.49 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback NVIDIA GeForce RTX 3080 2 4 6 8 10 SE +/- 0.0022, N = 3 6.5825 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad NVIDIA GeForce RTX 3080 2 4 6 8 10 SE +/- 0.0059, N = 3 6.3599 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download NVIDIA GeForce RTX 3080 2 4 6 8 10 SE +/- 0.0008, N = 3 6.4179 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP NVIDIA GeForce RTX 3080 120 240 360 480 600 SE +/- 5.97, N = 3 561.51 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash NVIDIA GeForce RTX 3080 7 14 21 28 35 SE +/- 0.07, N = 3 30.64 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Phoronix Test Suite v10.8.5