20230509 nvidia gpu compute AMD Ryzen 7 3700X 8-Core testing with a ASRock X570 Phantom Gaming-ITX/TB3 (P2.80 BIOS) and Zotac NVIDIA GeForce RTX 2060 SUPER 8GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2305109-NE-20230509N80 .
20230509 nvidia gpu compute Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution 20230509 nvidia gpu compute AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads) ASRock X570 Phantom Gaming-ITX/TB3 (P2.80 BIOS) AMD Starship/Matisse 32GB 500GB Western Digital WDS500G3X0C-00SJG0 + 4 x 2000GB Seagate ST2000LM015-2E81 Zotac NVIDIA GeForce RTX 2060 SUPER 8GB NVIDIA TU106 HD Audio ASUS VH196 Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 22.04 5.15.0-71-generic (x86_64) X Server 1.20.13 NVIDIA OpenCL 3.0 CUDA 12.0.151 1.3.224 GCC 11.3.0 btrfs 640x480 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8701021 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.44.00.79 - Python 3.10.9 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
20230509 nvidia gpu compute hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write fahbench: clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth lczero: OpenCL rodinia: OpenCL Particle Filter arrayfire: Conjugate Gradient OpenCL luxcorerender: DLSC - GPU luxcorerender: Danish Mood - GPU luxcorerender: Orange Juice - GPU luxcorerender: LuxCore Benchmark - GPU luxcorerender: Rainbow Colors and Prism - GPU financebench: Black-Scholes OpenCL viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - NVIDIA OptiX neatbench: GPU 20230509 nvidia gpu compute 28808466667 9078833333 463100 1286666667 335500 188.958 12.6492 986.039 17.8827 316.042 3010.10 8256.45 13.1151 13.2025 1098.48 284.7 398.0 358.1 199.6371 6310.84 6819.55 260.82 370.67 21074 8.038 2.089 3.90 2.28 4.40 3.33 13.66 15.480 21.5 31.0 32.0 21.3 29.3 32.2 33.7 34.9 43.7 42.2 46.4 44.4 275 324 243 371 389 404 271 325 245 246 12.25 32.77 27.41 136.65 40.28 2060 OpenBenchmarking.org
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: MD5 20230509 nvidia gpu compute 6000M 12000M 18000M 24000M 30000M SE +/- 68616867.06, N = 3 28808466667
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 20230509 nvidia gpu compute 2000M 4000M 6000M 8000M 10000M SE +/- 30916087.15, N = 3 9078833333
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: 7-Zip 20230509 nvidia gpu compute 100K 200K 300K 400K 500K SE +/- 1457.17, N = 3 463100
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 20230509 nvidia gpu compute 300M 600M 900M 1200M 1500M SE +/- 1617954.81, N = 3 1286666667
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS 20230509 nvidia gpu compute 70K 140K 210K 280K 350K SE +/- 152.75, N = 3 335500
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D 20230509 nvidia gpu compute 40 80 120 160 200 SE +/- 0.08, N = 3 188.96 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad 20230509 nvidia gpu compute 3 6 9 12 15 SE +/- 0.00, N = 3 12.65 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP 20230509 nvidia gpu compute 200 400 600 800 1000 SE +/- 4.52, N = 3 986.04 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash 20230509 nvidia gpu compute 4 8 12 16 20 SE +/- 0.01, N = 3 17.88 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction 20230509 nvidia gpu compute 70 140 210 280 350 SE +/- 0.05, N = 3 316.04 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N 20230509 nvidia gpu compute 600 1200 1800 2400 3000 SE +/- 13.38, N = 3 3010.10 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops 20230509 nvidia gpu compute 2K 4K 6K 8K 10K SE +/- 36.00, N = 3 8256.45 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download 20230509 nvidia gpu compute 3 6 9 12 15 SE +/- 0.00, N = 3 13.12 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback 20230509 nvidia gpu compute 3 6 9 12 15 SE +/- 0.00, N = 3 13.20 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth 20230509 nvidia gpu compute 200 400 600 800 1000 SE +/- 3.57, N = 3 1098.48 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy 20230509 nvidia gpu compute 60 120 180 240 300 SE +/- 0.06, N = 3 284.7 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read 20230509 nvidia gpu compute 90 180 270 360 450 SE +/- 0.03, N = 3 398.0 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write 20230509 nvidia gpu compute 80 160 240 320 400 SE +/- 0.84, N = 3 358.1 1. (CC) gcc options: -O2 -flto -lOpenCL
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 20230509 nvidia gpu compute 40 80 120 160 200 SE +/- 0.83, N = 3 199.64
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute INT 20230509 nvidia gpu compute 1400 2800 4200 5600 7000 SE +/- 64.23, N = 5 6310.84 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Float 20230509 nvidia gpu compute 1500 3000 4500 6000 7500 SE +/- 33.66, N = 3 6819.55 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Double 20230509 nvidia gpu compute 60 120 180 240 300 SE +/- 0.02, N = 3 260.82 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth 20230509 nvidia gpu compute 80 160 240 320 400 SE +/- 0.03, N = 3 370.67 1. (CXX) g++ options: -O3
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: OpenCL 20230509 nvidia gpu compute 5K 10K 15K 20K 25K SE +/- 95.62, N = 3 21074 1. (CXX) g++ options: -flto -pthread
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter 20230509 nvidia gpu compute 2 4 6 8 10 SE +/- 0.080, N = 6 8.038 1. (CXX) g++ options: -O2 -lOpenCL
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL 20230509 nvidia gpu compute 0.47 0.94 1.41 1.88 2.35 SE +/- 0.001, N = 3 2.089 1. (CXX) g++ options: -rdynamic
LuxCoreRender Scene: DLSC - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU 20230509 nvidia gpu compute 0.8775 1.755 2.6325 3.51 4.3875 SE +/- 0.02, N = 3 3.90 MIN: 3.61 / MAX: 4.15
LuxCoreRender Scene: Danish Mood - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU 20230509 nvidia gpu compute 0.513 1.026 1.539 2.052 2.565 SE +/- 0.03, N = 3 2.28 MIN: 0.56 / MAX: 2.95
LuxCoreRender Scene: Orange Juice - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU 20230509 nvidia gpu compute 0.99 1.98 2.97 3.96 4.95 SE +/- 0.01, N = 3 4.40 MIN: 3.69 / MAX: 4.84
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU 20230509 nvidia gpu compute 0.7493 1.4986 2.2479 2.9972 3.7465 SE +/- 0.02, N = 3 3.33 MIN: 0.82 / MAX: 4.24
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU 20230509 nvidia gpu compute 4 8 12 16 20 SE +/- 0.02, N = 3 13.66 MIN: 13.02 / MAX: 14.19
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL 20230509 nvidia gpu compute 4 8 12 16 20 SE +/- 0.05, N = 3 15.48 1. (CXX) g++ options: -O3 -march=native -fopenmp
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY 20230509 nvidia gpu compute 5 10 15 20 25 SE +/- 0.18, N = 3 21.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY 20230509 nvidia gpu compute 7 14 21 28 35 SE +/- 0.41, N = 3 31.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT 20230509 nvidia gpu compute 7 14 21 28 35 SE +/- 0.90, N = 3 32.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY 20230509 nvidia gpu compute 5 10 15 20 25 SE +/- 0.32, N = 3 21.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY 20230509 nvidia gpu compute 7 14 21 28 35 SE +/- 1.44, N = 3 29.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT 20230509 nvidia gpu compute 7 14 21 28 35 SE +/- 0.20, N = 3 32.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N 20230509 nvidia gpu compute 8 16 24 32 40 SE +/- 0.60, N = 3 33.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T 20230509 nvidia gpu compute 8 16 24 32 40 SE +/- 0.15, N = 3 34.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN 20230509 nvidia gpu compute 10 20 30 40 50 SE +/- 0.70, N = 3 43.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT 20230509 nvidia gpu compute 10 20 30 40 50 SE +/- 0.95, N = 3 42.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN 20230509 nvidia gpu compute 11 22 33 44 55 SE +/- 0.47, N = 3 46.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT 20230509 nvidia gpu compute 10 20 30 40 50 SE +/- 0.24, N = 3 44.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY 20230509 nvidia gpu compute 60 120 180 240 300 SE +/- 0.88, N = 3 275 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY 20230509 nvidia gpu compute 70 140 210 280 350 SE +/- 0.33, N = 3 324 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT 20230509 nvidia gpu compute 50 100 150 200 250 SE +/- 0.00, N = 3 243 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY 20230509 nvidia gpu compute 80 160 240 320 400 SE +/- 0.00, N = 3 371 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY 20230509 nvidia gpu compute 80 160 240 320 400 SE +/- 0.00, N = 3 389 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT 20230509 nvidia gpu compute 90 180 270 360 450 SE +/- 0.00, N = 3 404 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N 20230509 nvidia gpu compute 60 120 180 240 300 SE +/- 0.33, N = 3 271 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T 20230509 nvidia gpu compute 70 140 210 280 350 SE +/- 0.33, N = 3 325 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN 20230509 nvidia gpu compute 50 100 150 200 250 SE +/- 0.33, N = 3 245 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT 20230509 nvidia gpu compute 50 100 150 200 250 SE +/- 0.33, N = 3 246 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.5 Blend File: BMW27 - Compute: NVIDIA OptiX 20230509 nvidia gpu compute 3 6 9 12 15 SE +/- 0.12, N = 12 12.25
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.5 Blend File: Classroom - Compute: NVIDIA OptiX 20230509 nvidia gpu compute 8 16 24 32 40 SE +/- 0.04, N = 3 32.77
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.5 Blend File: Fishy Cat - Compute: NVIDIA OptiX 20230509 nvidia gpu compute 6 12 18 24 30 SE +/- 0.28, N = 5 27.41
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.5 Blend File: Barbershop - Compute: NVIDIA OptiX 20230509 nvidia gpu compute 30 60 90 120 150 SE +/- 0.04, N = 3 136.65
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.5 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX 20230509 nvidia gpu compute 9 18 27 36 45 SE +/- 0.02, N = 3 40.28
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU 20230509 nvidia gpu compute 400 800 1200 1600 2000 SE +/- 0.00, N = 3 2060
Phoronix Test Suite v10.8.4