rx3080-ocl

AMD Ryzen 9 5900X 12-Core testing with a ASRock X570 Steel Legend (P5.63 BIOS) and MSI NVIDIA GeForce RTX 3080 12GB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2412265-NE-RX3080OCL41&grt.

rx3080-oclProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionMSI NVIDIA GeForce RTX 3080AMD Ryzen 9 5900X 12-Core @ 4.95GHz (12 Cores / 24 Threads)ASRock X570 Steel Legend (P5.63 BIOS)AMD Starship/Matisse2 x 16GB DDR4-3600MT/s TEAMGROUP-UD4-32001000GB Western Digital WDS100T3X0C-00SJG0 + 1000GB Western Digital WD Blue SN580 1TB + 2000GB Seagate ST2000DX001-1CM1MSI NVIDIA GeForce RTX 3080 12GBNVIDIA GA102 HD AudioDELL S2721QSIntel I211 + Intel Dual Band-AC 3168NGWUbuntu 24.106.11.0-13-generic (x86_64)GNOME Shell 47.0X Server + WaylandNVIDIA 565.57.014.6.0OpenCL 3.0 CUDA 12.7.33GCC 14.2.0 + Clang 19.1.1 + CUDA 12.6ext42560x1440OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance) - CPU Microcode: 0xa20102b- BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.85.00.40- GPU Compute Cores: 8960- Python 3.12.7- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; IBRS_FW; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

rx3080-oclcl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLfluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Slulesh-cl: rodinia: OpenCL Particle Filtersmallpt-gpu: GPU - 2560 x 1440 - Causticsmallpt-gpu: GPU - 2560 x 1440 - Cornellsmallpt-gpu: GPU - 2560 x 1440 - Caustic3viennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTxsbench-cl: MSI NVIDIA GeForce RTX 3080366.0826.7786.34.6615426.6315474.84815.86553.4830467.6815.0416.361.9322.3200.1701.20353468311100755526.73314.33117352298861735230023173523016110716119440.861.274.591.098.646.244.148.846.4368504374606722658189378515517516516OpenBenchmarking.org

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyMSI NVIDIA GeForce RTX 308080160240320400SE +/- 0.27, N = 3366.01. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadMSI NVIDIA GeForce RTX 30802004006008001000SE +/- 0.17, N = 3826.71. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteMSI NVIDIA GeForce RTX 30802004006008001000SE +/- 0.26, N = 3786.31. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyMSI NVIDIA GeForce RTX 30801.04852.0973.14554.1945.2425SE +/- 0.01, N = 34.661. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeMSI NVIDIA GeForce RTX 30803K6K9K12K15KSE +/- 9.82, N = 315426.631. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeMSI NVIDIA GeForce RTX 30803K6K9K12K15KSE +/- 9.26, N = 315474.841. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthMSI NVIDIA GeForce RTX 30802004006008001000SE +/- 0.05, N = 3815.861. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeMSI NVIDIA GeForce RTX 3080120240360480600SE +/- 0.40, N = 3553.481. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeMSI NVIDIA GeForce RTX 30807K14K21K28K35KSE +/- 342.26, N = 330467.681. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferMSI NVIDIA GeForce RTX 308048121620SE +/- 0.10, N = 315.041. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferMSI NVIDIA GeForce RTX 308048121620SE +/- 0.08, N = 316.361. (CXX) g++ options: -O3

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Boat - Acceleration: OpenCLMSI NVIDIA GeForce RTX 30800.43470.86941.30411.73882.1735SE +/- 0.005, N = 31.932

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Masskrug - Acceleration: OpenCLMSI NVIDIA GeForce RTX 30800.5221.0441.5662.0882.61SE +/- 0.006, N = 32.320

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Server Rack - Acceleration: OpenCLMSI NVIDIA GeForce RTX 30800.03830.07660.11490.15320.1915SE +/- 0.000, N = 30.170

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.8.1Test: Server Room - Acceleration: OpenCLMSI NVIDIA GeForce RTX 30800.27070.54140.81211.08281.3535SE +/- 0.001, N = 31.203

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP32MSI NVIDIA GeForce RTX 308011002200330044005500SE +/- 0.00, N = 35346

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP16CMSI NVIDIA GeForce RTX 30802K4K6K8K10KSE +/- 26.03, N = 38311

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 3.0Test: FP32-FP16SMSI NVIDIA GeForce RTX 30802K4K6K8K10KSE +/- 2.40, N = 310075

Lulesh OpenCL

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06MSI NVIDIA GeForce RTX 308012002400360048006000SE +/- 21.38, N = 35526.731. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterMSI NVIDIA GeForce RTX 30800.97451.9492.92353.8984.8725SE +/- 0.042, N = 64.3311. (CXX) g++ options: -O2 -lOpenCL

SmallPT GPU

OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: CausticMSI NVIDIA GeForce RTX 3080400M800M1200M1600M2000MSE +/- 25.12, N = 317352298861. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: CornellMSI NVIDIA GeForce RTX 3080400M800M1200M1600M2000MSE +/- 24.83, N = 317352300231. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 2560 x 1440 - Scene: Caustic3MSI NVIDIA GeForce RTX 3080400M800M1200M1600M2000MSE +/- 25.12, N = 317352301611. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYMSI NVIDIA GeForce RTX 308020406080100SE +/- 1.15, N = 31071. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYMSI NVIDIA GeForce RTX 30804080120160200SE +/- 1.45, N = 31611. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTMSI NVIDIA GeForce RTX 30804080120160200SE +/- 1.33, N = 31941. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYMSI NVIDIA GeForce RTX 3080918273645SE +/- 0.00, N = 340.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYMSI NVIDIA GeForce RTX 30801428425670SE +/- 0.03, N = 361.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTMSI NVIDIA GeForce RTX 308020406080100SE +/- 0.32, N = 374.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NMSI NVIDIA GeForce RTX 308020406080100SE +/- 0.09, N = 391.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TMSI NVIDIA GeForce RTX 308020406080100SE +/- 0.06, N = 398.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNMSI NVIDIA GeForce RTX 30801020304050SE +/- 0.00, N = 346.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTMSI NVIDIA GeForce RTX 30801020304050SE +/- 0.03, N = 344.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNMSI NVIDIA GeForce RTX 30801122334455SE +/- 0.00, N = 348.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTMSI NVIDIA GeForce RTX 30801122334455SE +/- 0.00, N = 346.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYMSI NVIDIA GeForce RTX 308080160240320400SE +/- 0.88, N = 33681. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 0.67, N = 35041. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTMSI NVIDIA GeForce RTX 308080160240320400SE +/- 0.67, N = 33741. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYMSI NVIDIA GeForce RTX 3080130260390520650SE +/- 0.58, N = 36061. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYMSI NVIDIA GeForce RTX 3080160320480640800SE +/- 0.33, N = 37221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTMSI NVIDIA GeForce RTX 3080140280420560700SE +/- 0.67, N = 36581. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NMSI NVIDIA GeForce RTX 30804080120160200SE +/- 0.00, N = 31891. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TMSI NVIDIA GeForce RTX 308080160240320400SE +/- 0.33, N = 33781. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 1.73, N = 35151. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 1.45, N = 35171. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 1.73, N = 35161. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 1.76, N = 35161. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5