gpu_test AMD Ryzen 7 3700X 8-Core testing with a ASRock X570 Phantom Gaming 4 (P2.20 BIOS) and Gigabyte NVIDIA GeForce GTX 750 Ti 4GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2307272-NE-GPUTEST4415&grr .
gpu_test Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution GPU test on Linux AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads) ASRock X570 Phantom Gaming 4 (P2.20 BIOS) AMD Starship/Matisse 32GB 4001GB Seagate ST4000NM0033-9ZM + 480GB Toshiba MKNSSDE3480GB Gigabyte NVIDIA GeForce GTX 750 Ti 4GB NVIDIA GM107 HD Audio Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 22.04 5.19.0-46-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.3 NVIDIA OpenCL 3.0 CUDA 12.1.68 1.3.236 GCC 11.3.0 + Clang 14.0.0-1ubuntu1 + CUDA 12.1 ext4 800x600 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8701013 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 82.07.55.00.b5 - Python 3.10.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
gpu_test blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - NVIDIA OptiX shoc: OpenCL - Max SP Flops gromacs: NVIDIA CUDA GPU - water_GMX50_bare blender: Fishy Cat - NVIDIA OptiX blender: Classroom - NVIDIA OptiX lczero: OpenCL blender: BMW27 - NVIDIA OptiX luxcorerender: LuxCore Benchmark - GPU luxcorerender: DLSC - GPU fahbench: luxcorerender: Orange Juice - GPU luxcorerender: Danish Mood - GPU luxcorerender: Rainbow Colors and Prism - GPU rodinia: OpenCL Particle Filter cl-mem: Copy viennacl: OpenCL BLAS - dGEMM-TT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sCOPY viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dGEMM-TT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sCOPY cl-mem: Write cl-mem: Read shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Texture Read Bandwidth hashcat: MD5 clpeak: Double-Precision Double shoc: OpenCL - MD5 Hash hashcat: SHA-512 hashcat: SHA1 shoc: OpenCL - Reduction mixbench: OpenCL - Integer arrayfire: Conjugate Gradient OpenCL shoc: OpenCL - FFT SP shoc: OpenCL - Bus Speed Readback clpeak: Global Memory Bandwidth financebench: Black-Scholes OpenCL shoc: OpenCL - S3D shoc: OpenCL - Bus Speed Download clpeak: Single-Precision Float clpeak: Integer Compute INT shoc: OpenCL - Triad mixbench: OpenCL - Single Precision mixbench: OpenCL - Double Precision neatbench: GPU redshift: GPU test on Linux 1932.07 1177.60 1523.82 1.143 497.95 475.54 2196 239.44 0.36 0.44 29.4139 0.56 0.29 2.59 34.002 63.8 46.1 22.5 47.1 47.2 69.6 62.2 76.8 71.3 68.1 69.5 67.7 63.3 17.7 45.3 48.7 43.1 45.0 28.6 27.6 27.9 26.5 27.8 26.5 17.6 72.4 73.5 549.387 112.915 3843300000 47.52 1.9098 153933333 1300933333 73.8980 403.41 11.78 171.272 3.3019 72.45 53.789467 38.7254 3.3145 980.75 446.15 3.1471 1339.43 47.78 750 OpenBenchmarking.org
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: NVIDIA OptiX GPU test on Linux 400 800 1200 1600 2000 SE +/- 0.88, N = 3 1932.07
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX GPU test on Linux 300 600 900 1200 1500 SE +/- 0.27, N = 3 1177.60
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GPU test on Linux 300 600 900 1200 1500 SE +/- 0.03, N = 3 1523.82 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
GROMACS Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare GPU test on Linux 0.2572 0.5144 0.7716 1.0288 1.286 SE +/- 0.004, N = 3 1.143 1. (CXX) g++ options: -O3
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: NVIDIA OptiX GPU test on Linux 110 220 330 440 550 SE +/- 0.46, N = 3 497.95
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: NVIDIA OptiX GPU test on Linux 100 200 300 400 500 SE +/- 0.21, N = 3 475.54
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: OpenCL GPU test on Linux 500 1000 1500 2000 2500 SE +/- 25.10, N = 3 2196 1. (CXX) g++ options: -flto -pthread
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: NVIDIA OptiX GPU test on Linux 50 100 150 200 250 SE +/- 0.44, N = 3 239.44
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU GPU test on Linux 0.081 0.162 0.243 0.324 0.405 SE +/- 0.00, N = 3 0.36 MIN: 0.09 / MAX: 0.46
LuxCoreRender Scene: DLSC - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU GPU test on Linux 0.099 0.198 0.297 0.396 0.495 SE +/- 0.00, N = 3 0.44 MIN: 0.4 / MAX: 0.45
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 GPU test on Linux 7 14 21 28 35 SE +/- 0.02, N = 3 29.41
LuxCoreRender Scene: Orange Juice - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU GPU test on Linux 0.126 0.252 0.378 0.504 0.63 SE +/- 0.00, N = 3 0.56 MIN: 0.08 / MAX: 0.69
LuxCoreRender Scene: Danish Mood - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU GPU test on Linux 0.0653 0.1306 0.1959 0.2612 0.3265 SE +/- 0.00, N = 3 0.29 MIN: 0.07 / MAX: 0.38
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU GPU test on Linux 0.5828 1.1656 1.7484 2.3312 2.914 SE +/- 0.00, N = 3 2.59 MIN: 2.18 / MAX: 2.68
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter GPU test on Linux 8 16 24 32 40 SE +/- 0.11, N = 3 34.00 1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy GPU test on Linux 14 28 42 56 70 SE +/- 0.00, N = 3 63.8 1. (CC) gcc options: -O2 -flto -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GPU test on Linux 10 20 30 40 50 SE +/- 0.00, N = 3 46.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GPU test on Linux 5 10 15 20 25 SE +/- 0.00, N = 3 22.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GPU test on Linux 11 22 33 44 55 SE +/- 0.00, N = 3 47.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GPU test on Linux 11 22 33 44 55 SE +/- 0.00, N = 3 47.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GPU test on Linux 15 30 45 60 75 SE +/- 0.00, N = 3 69.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GPU test on Linux 14 28 42 56 70 SE +/- 0.00, N = 3 62.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GPU test on Linux 20 40 60 80 100 SE +/- 0.00, N = 3 76.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GPU test on Linux 16 32 48 64 80 SE +/- 0.00, N = 3 71.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GPU test on Linux 15 30 45 60 75 SE +/- 0.00, N = 3 68.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GPU test on Linux 15 30 45 60 75 SE +/- 0.03, N = 3 69.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GPU test on Linux 15 30 45 60 75 SE +/- 0.00, N = 3 67.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GPU test on Linux 14 28 42 56 70 SE +/- 0.03, N = 3 63.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GPU test on Linux 4 8 12 16 20 17.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GPU test on Linux 10 20 30 40 50 SE +/- 0.12, N = 3 45.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GPU test on Linux 11 22 33 44 55 SE +/- 0.03, N = 3 48.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GPU test on Linux 10 20 30 40 50 SE +/- 0.58, N = 3 43.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GPU test on Linux 10 20 30 40 50 SE +/- 0.44, N = 3 45.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GPU test on Linux 7 14 21 28 35 SE +/- 0.03, N = 3 28.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GPU test on Linux 6 12 18 24 30 SE +/- 0.06, N = 3 27.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT GPU test on Linux 7 14 21 28 35 SE +/- 0.05, N = 2 27.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GPU test on Linux 6 12 18 24 30 SE +/- 0.10, N = 2 26.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT GPU test on Linux 7 14 21 28 35 SE +/- 0.07, N = 3 27.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GPU test on Linux 6 12 18 24 30 SE +/- 0.03, N = 3 26.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GPU test on Linux 4 8 12 16 20 SE +/- 0.07, N = 3 17.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write GPU test on Linux 16 32 48 64 80 SE +/- 0.00, N = 3 72.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read GPU test on Linux 16 32 48 64 80 SE +/- 0.00, N = 3 73.5 1. (CC) gcc options: -O2 -flto -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GPU test on Linux 120 240 360 480 600 SE +/- 1.16, N = 3 549.39 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GPU test on Linux 30 60 90 120 150 SE +/- 0.09, N = 3 112.92 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: MD5 GPU test on Linux 800M 1600M 2400M 3200M 4000M SE +/- 901849.95, N = 3 3843300000
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Double GPU test on Linux 11 22 33 44 55 SE +/- 0.14, N = 3 47.52 1. (CXX) g++ options: -O3
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GPU test on Linux 0.4297 0.8594 1.2891 1.7188 2.1485 SE +/- 0.0001, N = 3 1.9098 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 GPU test on Linux 30M 60M 90M 120M 150M SE +/- 66666.67, N = 3 153933333
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 GPU test on Linux 300M 600M 900M 1200M 1500M SE +/- 533333.33, N = 3 1300933333
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GPU test on Linux 16 32 48 64 80 SE +/- 0.01, N = 3 73.90 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Mixbench Backend: OpenCL - Benchmark: Integer OpenBenchmarking.org GIOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer GPU test on Linux 90 180 270 360 450 SE +/- 0.11, N = 3 403.41 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL GPU test on Linux 3 6 9 12 15 SE +/- 0.02, N = 3 11.78 1. (CXX) g++ options: -rdynamic
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GPU test on Linux 40 80 120 160 200 SE +/- 0.03, N = 3 171.27 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GPU test on Linux 0.7429 1.4858 2.2287 2.9716 3.7145 SE +/- 0.0000, N = 3 3.3019 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GPU test on Linux 16 32 48 64 80 SE +/- 0.00, N = 3 72.45 1. (CXX) g++ options: -O3
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL GPU test on Linux 12 24 36 48 60 SE +/- 0.53, N = 15 53.79 1. (CXX) g++ options: -O3 -march=native -fopenmp
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GPU test on Linux 9 18 27 36 45 SE +/- 0.00, N = 3 38.73 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GPU test on Linux 0.7458 1.4916 2.2374 2.9832 3.729 SE +/- 0.0000, N = 3 3.3145 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Float GPU test on Linux 200 400 600 800 1000 SE +/- 8.16, N = 8 980.75 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute INT GPU test on Linux 100 200 300 400 500 SE +/- 4.74, N = 5 446.15 1. (CXX) g++ options: -O3
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GPU test on Linux 0.7081 1.4162 2.1243 2.8324 3.5405 SE +/- 0.0002, N = 3 3.1471 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Mixbench Backend: OpenCL - Benchmark: Single Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision GPU test on Linux 300 600 900 1200 1500 SE +/- 0.09, N = 3 1339.43 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: OpenCL - Benchmark: Double Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision GPU test on Linux 11 22 33 44 55 SE +/- 0.02, N = 3 47.78 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU GPU test on Linux 160 320 480 640 800 SE +/- 0.00, N = 3 750
Phoronix Test Suite v10.8.5