rx3080-ocl AMD Ryzen 9 5900X 12-Core testing with a ASRock X570 Steel Legend (P5.63 BIOS) and MSI NVIDIA GeForce RTX 3080 12GB on Ubuntu 24.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2412265-NE-RX3080OCL41&grt .
rx3080-ocl Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Compiler File-System Screen Resolution MSI NVIDIA GeForce RTX 3080 AMD Ryzen 9 5900X 12-Core @ 4.95GHz (12 Cores / 24 Threads) ASRock X570 Steel Legend (P5.63 BIOS) AMD Starship/Matisse 2 x 16GB DDR4-3600MT/s TEAMGROUP-UD4-3200 1000GB Western Digital WDS100T3X0C-00SJG0 + 1000GB Western Digital WD Blue SN580 1TB + 2000GB Seagate ST2000DX001-1CM1 MSI NVIDIA GeForce RTX 3080 12GB NVIDIA GA102 HD Audio DELL S2721QS Intel I211 + Intel Dual Band-AC 3168NGW Ubuntu 24.10 6.11.0-13-generic (x86_64) GNOME Shell 47.0 X Server + Wayland NVIDIA 565.57.01 4.6.0 OpenCL 3.0 CUDA 12.7.33 GCC 14.2.0 + Clang 19.1.1 + CUDA 12.6 ext4 2560x1440 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance) - CPU Microcode: 0xa20102b - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.85.00.40 - GPU Compute Cores: 8960 - Python 3.12.7 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
rx3080-ocl cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Kernel Latency clpeak: Integer Compute clpeak: Integer 24-bit Compute clpeak: Global Memory Bandwidth clpeak: Double-Precision Compute clpeak: Single-Precision Compute clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL fluidx3d: FP32-FP32 fluidx3d: FP32-FP16C fluidx3d: FP32-FP16S lulesh-cl: rodinia: OpenCL Particle Filter smallpt-gpu: GPU - 2560 x 1440 - Caustic smallpt-gpu: GPU - 2560 x 1440 - Cornell smallpt-gpu: GPU - 2560 x 1440 - Caustic3 viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT xsbench-cl: MSI NVIDIA GeForce RTX 3080 366.0 826.7 786.3 4.66 15426.63 15474.84 815.86 553.48 30467.68 15.04 16.36 1.932 2.320 0.170 1.203 5346 8311 10075 5526.7331 4.331 1735229886 1735230023 1735230161 107 161 194 40.8 61.2 74.5 91.0 98.6 46.2 44.1 48.8 46.4 368 504 374 606 722 658 189 378 515 517 516 516 OpenBenchmarking.org
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy MSI NVIDIA GeForce RTX 3080 80 160 240 320 400 SE +/- 0.27, N = 3 366.0 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read MSI NVIDIA GeForce RTX 3080 200 400 600 800 1000 SE +/- 0.17, N = 3 826.7 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write MSI NVIDIA GeForce RTX 3080 200 400 600 800 1000 SE +/- 0.26, N = 3 786.3 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency MSI NVIDIA GeForce RTX 3080 1.0485 2.097 3.1455 4.194 5.2425 SE +/- 0.01, N = 3 4.66 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute MSI NVIDIA GeForce RTX 3080 3K 6K 9K 12K 15K SE +/- 9.82, N = 3 15426.63 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute MSI NVIDIA GeForce RTX 3080 3K 6K 9K 12K 15K SE +/- 9.26, N = 3 15474.84 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth MSI NVIDIA GeForce RTX 3080 200 400 600 800 1000 SE +/- 0.05, N = 3 815.86 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute MSI NVIDIA GeForce RTX 3080 120 240 360 480 600 SE +/- 0.40, N = 3 553.48 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute MSI NVIDIA GeForce RTX 3080 7K 14K 21K 28K 35K SE +/- 342.26, N = 3 30467.68 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer MSI NVIDIA GeForce RTX 3080 4 8 12 16 20 SE +/- 0.10, N = 3 15.04 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer MSI NVIDIA GeForce RTX 3080 4 8 12 16 20 SE +/- 0.08, N = 3 16.36 1. (CXX) g++ options: -O3
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Boat - Acceleration: OpenCL MSI NVIDIA GeForce RTX 3080 0.4347 0.8694 1.3041 1.7388 2.1735 SE +/- 0.005, N = 3 1.932
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Masskrug - Acceleration: OpenCL MSI NVIDIA GeForce RTX 3080 0.522 1.044 1.566 2.088 2.61 SE +/- 0.006, N = 3 2.320
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Server Rack - Acceleration: OpenCL MSI NVIDIA GeForce RTX 3080 0.0383 0.0766 0.1149 0.1532 0.1915 SE +/- 0.000, N = 3 0.170
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.8.1 Test: Server Room - Acceleration: OpenCL MSI NVIDIA GeForce RTX 3080 0.2707 0.5414 0.8121 1.0828 1.3535 SE +/- 0.001, N = 3 1.203
FluidX3D Test: FP32-FP32 OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 3.0 Test: FP32-FP32 MSI NVIDIA GeForce RTX 3080 1100 2200 3300 4400 5500 SE +/- 0.00, N = 3 5346
FluidX3D Test: FP32-FP16C OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 3.0 Test: FP32-FP16C MSI NVIDIA GeForce RTX 3080 2K 4K 6K 8K 10K SE +/- 26.03, N = 3 8311
FluidX3D Test: FP32-FP16S OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 3.0 Test: FP32-FP16S MSI NVIDIA GeForce RTX 3080 2K 4K 6K 8K 10K SE +/- 2.40, N = 3 10075
Lulesh OpenCL OpenBenchmarking.org z/s, More Is Better Lulesh OpenCL 2017-07-06 MSI NVIDIA GeForce RTX 3080 1200 2400 3600 4800 6000 SE +/- 21.38, N = 3 5526.73 1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter MSI NVIDIA GeForce RTX 3080 0.9745 1.949 2.9235 3.898 4.8725 SE +/- 0.042, N = 6 4.331 1. (CXX) g++ options: -O2 -lOpenCL
SmallPT GPU OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic MSI NVIDIA GeForce RTX 3080 400M 800M 1200M 1600M 2000M SE +/- 25.12, N = 3 1735229886 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Cornell MSI NVIDIA GeForce RTX 3080 400M 800M 1200M 1600M 2000M SE +/- 24.83, N = 3 1735230023 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic3 MSI NVIDIA GeForce RTX 3080 400M 800M 1200M 1600M 2000M SE +/- 25.12, N = 3 1735230161 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY MSI NVIDIA GeForce RTX 3080 20 40 60 80 100 SE +/- 1.15, N = 3 107 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY MSI NVIDIA GeForce RTX 3080 40 80 120 160 200 SE +/- 1.45, N = 3 161 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT MSI NVIDIA GeForce RTX 3080 40 80 120 160 200 SE +/- 1.33, N = 3 194 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY MSI NVIDIA GeForce RTX 3080 9 18 27 36 45 SE +/- 0.00, N = 3 40.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY MSI NVIDIA GeForce RTX 3080 14 28 42 56 70 SE +/- 0.03, N = 3 61.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT MSI NVIDIA GeForce RTX 3080 20 40 60 80 100 SE +/- 0.32, N = 3 74.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N MSI NVIDIA GeForce RTX 3080 20 40 60 80 100 SE +/- 0.09, N = 3 91.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T MSI NVIDIA GeForce RTX 3080 20 40 60 80 100 SE +/- 0.06, N = 3 98.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN MSI NVIDIA GeForce RTX 3080 10 20 30 40 50 SE +/- 0.00, N = 3 46.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT MSI NVIDIA GeForce RTX 3080 10 20 30 40 50 SE +/- 0.03, N = 3 44.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN MSI NVIDIA GeForce RTX 3080 11 22 33 44 55 SE +/- 0.00, N = 3 48.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT MSI NVIDIA GeForce RTX 3080 11 22 33 44 55 SE +/- 0.00, N = 3 46.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY MSI NVIDIA GeForce RTX 3080 80 160 240 320 400 SE +/- 0.88, N = 3 368 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 0.67, N = 3 504 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT MSI NVIDIA GeForce RTX 3080 80 160 240 320 400 SE +/- 0.67, N = 3 374 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY MSI NVIDIA GeForce RTX 3080 130 260 390 520 650 SE +/- 0.58, N = 3 606 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY MSI NVIDIA GeForce RTX 3080 160 320 480 640 800 SE +/- 0.33, N = 3 722 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT MSI NVIDIA GeForce RTX 3080 140 280 420 560 700 SE +/- 0.67, N = 3 658 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N MSI NVIDIA GeForce RTX 3080 40 80 120 160 200 SE +/- 0.00, N = 3 189 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T MSI NVIDIA GeForce RTX 3080 80 160 240 320 400 SE +/- 0.33, N = 3 378 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 1.73, N = 3 515 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 1.45, N = 3 517 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 1.73, N = 3 516 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 1.76, N = 3 516 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.5