RDNA3 vs. NVIDIA OpenCL Compute

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212153-NE-RDNA3OPEN09
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RX 6800 XT
December 13 2022
  37 Minutes
RX 7900 XT
December 13 2022
  37 Minutes
RX 7900 XTX
December 14 2022
  45 Minutes
RX 6800
December 14 2022
  56 Minutes
RX 5700 XT
December 14 2022
  1 Hour, 17 Minutes
Radeon VII
December 14 2022
  59 Minutes
RTX 3080
December 14 2022
  36 Minutes
RTX 3090
December 14 2022
  33 Minutes
RTX 3080 Ti
December 14 2022
  33 Minutes
Invert Behavior (Only Show Selected Data)
  46 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


RDNA3 vs. NVIDIA OpenCL ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionDisplay DriverRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0805 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 2000GBAMD Radeon RX 6800 XT 16GB (2575/1000MHz)AMD Navi 21 HDMI AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.045.15.0-56-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.3 + Wayland4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)1.3.232GCC 11.3.0ext43840x2160AMD Radeon RX 7900 XT 20GB (3125/1249MHz)AMD Device ab30AMD Radeon RX 7900 XTX 24GB (3220/1249MHz)AMD Radeon RX 6800 16GB (2475/1000MHz)AMD Navi 21 HDMI AudioAMD Radeon RX 5700 XT 8GB (2100/875MHz)AMD Navi 10 HDMI AudioAMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioNVIDIA GeForce RTX 3080 10GBNVIDIA GA102 HD AudioX Server 1.21.1.3NVIDIA 525.60.114.6.0OpenCL 3.0 CUDA 12.0.89NVIDIA GeForce RTX 3090 24GBNVIDIA GeForce RTX 3080 Ti 12GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101- RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00- RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102- RX 6800: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101- RX 5700 XT: BAR1 / Visible vRAM Size: 8176 MB - vBIOS Version: 113-D1820501-101- Radeon VII: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D3600200-106- RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02- RTX 3080 Ti: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected OpenCL Details- RTX 3080: GPU Compute Cores: 8704- RTX 3090: GPU Compute Cores: 10496- RTX 3080 Ti: GPU Compute Cores: 10240

RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiResult OverviewPhoronix Test Suite100%208%316%424%LuxCoreRenderHashcatclpeakcl-mem

RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiPer Watt Result OverviewPhoronix Test Suite100%201%301%402%503%clpeakMeta Performance Per WattsLuxCoreRenderHashcatcl-memP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

RDNA3 vs. NVIDIA OpenCL Computeshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - S3Dluxcorerender: DLSC - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Orange Juice - GPUluxcorerender: Danish Mood - GPUhashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTScl-mem: Readcl-mem: Writecl-mem: Copyclpeak: Global Memory Bandwidthclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Transfer Bandwidth enqueueWriteBufferviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti1092.711687.677352.0028.3906469.75493.39785.3717.826.492.315.79519288000002041820000023679500001075600577387470.4437.1361.1457.1222446.785626.2344.6840655681056482355714753211971220117312101890.492263.167770.8936.2169591.196312.1426.9623.118.062.857.61613827285712636810000029825166671258680806578715.3702.2431.4619.3626238.367276.9758.955965647995475606431143597877847767742213.052735.9410295.6940.3631609.816367.9208.1327.639.693.138.87697026142863029187142934208833331499830900640864.8842.3462.4716.4428891.628208.5258.61677602867590658692118404917926891913875.7191673.765534.1821.9934432.25189.57834.2315.505.042.104.5042083242857167006833331926533333885600487667459.9442.7345.5375.6615416.604237.9044.134115177525277855531374749199319049281.938.892.292400266666796979500001105850000507411299400403.7390.1326.7342.409373.502504.8844.72449.6252395.175905.3416.9376312.235289.6371.7313.532.0912.0235047283333122868000001454946667628500430329251.2673.9302.6793.5213144.966602.0744.4047329637727457648462.62661483154795214032181.511762.966372.3737.1537393.661340.3849.5426.759.629.257.99597558857141885801666723725666671001244704678672.6645.5354.3662.2829425.4415440.1125.485563584743756306041913754974994984982200.192103.608222.3043.8949397.817430.05411.4631.5211.5510.689.41712442833332254543333328251833331163033826657825.4749.3364.2813.4935185.0818086.1225.366093655023837246671893766076116086062134.292016.258131.9941.6772386.996418.60610.9331.1211.0110.168.94674527500002134551666726780500001107056786900804.0737.1355.7794.6033704.7917437.5013.07593359493369705652185367564566565560OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII5001000150020002500SE +/- 1.08, N = 3SE +/- 0.66, N = 3SE +/- 1.96, N = 3SE +/- 2.26, N = 8SE +/- 2.10, N = 8SE +/- 14.48, N = 10SE +/- 5.22, N = 10SE +/- 1.27, N = 42181.512134.292200.19875.721092.711890.492213.05449.631. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: Texture Read Bandwidth

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII6001200180024003000SE +/- 0.13, N = 13SE +/- 0.12, N = 13SE +/- 0.16, N = 13SE +/- 0.69, N = 13SE +/- 0.93, N = 13SE +/- 1.13, N = 13SE +/- 3.26, N = 13SE +/- 2.34, N = 121762.962016.252103.601673.761687.672263.162735.942395.171. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: FFT SP

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2K4K6K8K10KSE +/- 7.36, N = 11SE +/- 16.35, N = 11SE +/- 15.79, N = 12SE +/- 85.06, N = 15SE +/- 97.24, N = 15SE +/- 121.67, N = 15SE +/- 165.23, N = 15SE +/- 3.93, N = 96372.378131.998222.305534.187352.007770.8910295.695905.341. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: GEMM SGEMM_N

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII1020304050SE +/- 0.00, N = 14SE +/- 0.00, N = 14SE +/- 0.05, N = 14SE +/- 0.00, N = 13SE +/- 0.00, N = 13SE +/- 0.09, N = 13SE +/- 0.04, N = 14SE +/- 0.00, N = 1237.1541.6843.8921.9928.3936.2240.3616.941. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: MD5 Hash

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII130260390520650SE +/- 0.07, N = 13SE +/- 0.06, N = 13SE +/- 0.08, N = 13SE +/- 2.73, N = 12SE +/- 2.96, N = 13SE +/- 1.06, N = 13SE +/- 0.99, N = 13SE +/- 0.19, N = 12393.66387.00397.82432.25469.75591.20609.82312.241. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: Reduction

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII90180270360450SE +/- 0.06, N = 13SE +/- 0.06, N = 13SE +/- 0.32, N = 13SE +/- 0.40, N = 3SE +/- 1.05, N = 3SE +/- 3.49, N = 4SE +/- 3.65, N = 15SE +/- 0.58, N = 3340.38418.61430.0589.5893.40312.14367.92289.641. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: S3D

RX 5700 XT: The test run did not produce a result.

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII3691215SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 12SE +/- 0.12, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 129.5410.9311.461.934.235.376.968.131.73MIN: 9.35 / MAX: 9.8MIN: 10.5 / MAX: 11.09MIN: 10.5 / MAX: 11.82MIN: 0.37 / MAX: 2.07MIN: 0.82 / MAX: 4.59MIN: 5.23 / MAX: 5.65MIN: 6.78 / MAX: 7.21MIN: 7.94 / MAX: 8.48MIN: 0.33 / MAX: 1.88

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII714212835SE +/- 0.11, N = 6SE +/- 0.08, N = 6SE +/- 0.16, N = 6SE +/- 0.03, N = 3SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.07, N = 5SE +/- 0.00, N = 6SE +/- 0.08, N = 426.7531.1231.528.8915.5017.8223.1127.6313.53MIN: 23.62 / MAX: 29.35MIN: 28.53 / MAX: 34.16MIN: 28.71 / MAX: 34.92MIN: 8.58 / MAX: 9.11MIN: 14.54 / MAX: 15.99MIN: 16.51 / MAX: 18.34MIN: 21.35 / MAX: 24MIN: 25.23 / MAX: 29.44MIN: 10.02 / MAX: 15

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII3691215SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 6SE +/- 0.12, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 129.6211.0111.552.295.046.498.069.692.09MIN: 3.12 / MAX: 11.01MIN: 3.57 / MAX: 12.61MIN: 4.91 / MAX: 13.17MIN: 0.32 / MAX: 2.7MIN: 0.83 / MAX: 5.84MIN: 2.8 / MAX: 7.35MIN: 3.48 / MAX: 9.15MIN: 3.96 / MAX: 11.01MIN: 0.29 / MAX: 2.48

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII3691215SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 39.2510.1610.682.102.312.853.131.00MIN: 7.35 / MAX: 11.9MIN: 8.2 / MAX: 13.69MIN: 8.65 / MAX: 14.18MIN: 1.91 / MAX: 2.69MIN: 2.1 / MAX: 2.99MIN: 2.67 / MAX: 3.45MIN: 2.81 / MAX: 3.76

Scene: Orange Juice - Acceleration: GPU

RX 5700 XT: The test quit with a non-zero exit status. E: RUNTIME ERROR: OpenCL driver API error (code: -6, file:/home/vsts/work/1/s/LinuxCompile/LuxCore-sdk/src/luxrays/devices/ocldevice.cpp, line: 143): CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII3691215SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 37.998.949.414.505.797.618.872.02MIN: 2.96 / MAX: 9.11MIN: 2.99 / MAX: 10.44MIN: 3.47 / MAX: 10.92MIN: 1.74 / MAX: 5.11MIN: 2.39 / MAX: 6.64MIN: 2.96 / MAX: 8.76MIN: 3.4 / MAX: 10.2MIN: 0.75 / MAX: 2.29

Scene: Danish Mood - Acceleration: GPU

RX 5700 XT: The test quit with a non-zero exit status. E: RUNTIME ERROR: OpenCL driver API error (code: -6, file:/home/vsts/work/1/s/LinuxCompile/LuxCore-sdk/src/luxrays/devices/ocldevice.cpp, line: 143): CL_OUT_OF_HOST_MEMORY

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII15000M30000M45000M60000M75000MSE +/- 83944276.95, N = 7SE +/- 75676635.54, N = 6SE +/- 90108517.60, N = 6SE +/- 29219339.11, N = 6SE +/- 34450211.54, N = 7SE +/- 43793302.38, N = 7SE +/- 92800753.57, N = 7SE +/- 84711702.76, N = 7SE +/- 4912936.44, N = 6597558857146745275000071244283333240026666674208324285751928800000613827285716970261428635047283333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII6000M12000M18000M24000M30000MSE +/- 27656770.32, N = 6SE +/- 20586814.82, N = 6SE +/- 17806771.50, N = 6SE +/- 8939341.88, N = 6SE +/- 43663954.75, N = 6SE +/- 20754437.28, N = 6SE +/- 24335461.09, N = 6SE +/- 31736944.93, N = 7SE +/- 848920.88, N = 618858016667213455166672254543333396979500001670068333320418200000263681000003029187142912286800000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII700M1400M2100M2800M3500MSE +/- 1482040.64, N = 6SE +/- 2908120.81, N = 6SE +/- 3397589.01, N = 6SE +/- 1462589.03, N = 6SE +/- 1232522.26, N = 6SE +/- 889475.50, N = 6SE +/- 4470825.18, N = 6SE +/- 2918608.88, N = 6SE +/- 23473202.28, N = 15237256666726780500002825183333110585000019265333332367950000298251666734208833331454946667

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII300K600K900K1200K1500KSE +/- 608.30, N = 9SE +/- 456.16, N = 9SE +/- 618.24, N = 9SE +/- 835.40, N = 9SE +/- 1045.23, N = 9SE +/- 867.95, N = 9SE +/- 2104.38, N = 10SE +/- 2789.55, N = 10SE +/- 754.75, N = 8100124411070561163033507411885600107560012586801499830628500

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII200K400K600K800K1000KSE +/- 1635.86, N = 9SE +/- 5315.17, N = 8SE +/- 4427.88, N = 7SE +/- 1606.13, N = 8SE +/- 452.46, N = 9SE +/- 7271.22, N = 15SE +/- 3079.13, N = 9SE +/- 976.30, N = 10SE +/- 1846.07, N = 7704678786900826657299400487667577387806578900640430329

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2004006008001000SE +/- 0.08, N = 9SE +/- 0.10, N = 9SE +/- 0.11, N = 10SE +/- 0.71, N = 8SE +/- 1.14, N = 8SE +/- 0.10, N = 8SE +/- 1.73, N = 10SE +/- 0.18, N = 10SE +/- 0.24, N = 8672.6804.0825.4403.7459.9470.4715.3864.8251.21. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2004006008001000SE +/- 0.31, N = 9SE +/- 0.32, N = 10SE +/- 0.23, N = 10SE +/- 2.21, N = 8SE +/- 0.26, N = 8SE +/- 1.35, N = 8SE +/- 0.16, N = 10SE +/- 0.37, N = 10SE +/- 0.62, N = 8645.5737.1749.3390.1442.7437.1702.2842.3673.91. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII100200300400500SE +/- 0.05, N = 9SE +/- 0.24, N = 10SE +/- 0.06, N = 10SE +/- 0.24, N = 8SE +/- 0.10, N = 8SE +/- 0.04, N = 8SE +/- 0.03, N = 10SE +/- 0.37, N = 10SE +/- 0.41, N = 8354.3355.7364.2326.7345.5361.1431.4462.4302.61. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2004006008001000SE +/- 0.03, N = 14SE +/- 0.04, N = 13SE +/- 0.02, N = 14SE +/- 3.16, N = 7SE +/- 0.66, N = 9SE +/- 0.02, N = 9SE +/- 0.78, N = 9SE +/- 0.32, N = 10SE +/- 0.65, N = 9662.28794.60813.49342.40375.66457.12619.36716.44793.521. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII8K16K24K32K40KSE +/- 82.27, N = 13SE +/- 22.78, N = 14SE +/- 24.18, N = 14SE +/- 11.77, N = 9SE +/- 22.92, N = 9SE +/- 2.29, N = 9SE +/- 37.62, N = 9SE +/- 75.34, N = 9SE +/- 5.01, N = 829425.4433704.7935185.089373.5015416.6022446.7826238.3628891.6213144.961. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII4K8K12K16K20KSE +/- 103.36, N = 15SE +/- 118.19, N = 15SE +/- 124.81, N = 13SE +/- 0.72, N = 7SE +/- 1.65, N = 8SE +/- 0.25, N = 7SE +/- 7.19, N = 8SE +/- 7.38, N = 8SE +/- 2.35, N = 815440.1117437.5018086.122504.884237.905626.237276.978208.526602.071. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII1326395265SE +/- 0.01, N = 4SE +/- 0.00, N = 3SE +/- 0.01, N = 7SE +/- 0.29, N = 5SE +/- 0.22, N = 6SE +/- 0.27, N = 6SE +/- 0.25, N = 7SE +/- 0.22, N = 8SE +/- 0.23, N = 625.4813.0725.3644.7244.1344.6858.9558.6144.401. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII150300450600750SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 1.45, N = 3SE +/- 4.36, N = 3SE +/- 3.67, N = 3SE +/- 3.84, N = 3SE +/- 3.18, N = 35565936094114065966774731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dCOPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII130260390520650SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 6.51, N = 3SE +/- 6.17, N = 3SE +/- 8.21, N = 3SE +/- 1.33, N = 33563593655175565646022961. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sCOPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2004006008001000SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 9.00, N = 3SE +/- 8.69, N = 3SE +/- 5.84, N = 3SE +/- 2.19, N = 34744935027528107998673771. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sAXPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII130260390520650SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 14.00, N = 2SE +/- 9.35, N = 3SE +/- 10.68, N = 3SE +/- 3.33, N = 33753693835275645475902741. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sDOT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII2004006008001000SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 2.52, N = 3SE +/- 5.46, N = 3SE +/- 2.00, N = 2SE +/- 6.17, N = 3SE +/- 1.53, N = 36307057247858235606585761. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dAXPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII150300450600750SE +/- 0.58, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 4.37, N = 3SE +/- 4.37, N = 3SE +/- 5.51, N = 3SE +/- 18.41, N = 3SE +/- 2.52, N = 36046526675535576436924841. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dDOT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII4080120160200SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 1.67, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3191.0185.0189.0137.0147.0114.0118.062.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMV-N

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII120240360480600SE +/- 0.00, N = 2SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 14.95, N = 3SE +/- 3.84, N = 3SE +/- 1.67, N = 3SE +/- 3.67, N = 3SE +/- 0.33, N = 33753673764745323594042661. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMV-T

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII30060090012001500SE +/- 1.20, N = 3SE +/- 1.33, N = 3SE +/- 0.58, N = 3SE +/- 2.19, N = 3SE +/- 3.33, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 3.33, N = 3497564607919119778791714831. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-NN

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII30060090012001500SE +/- 1.00, N = 3SE +/- 1.20, N = 3SE +/- 0.67, N = 3SE +/- 5.77, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 3.33, N = 3499566611931122078492615471. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-NT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII30060090012001500SE +/- 1.50, N = 2SE +/- 1.33, N = 3SE +/- 0.50, N = 2SE +/- 1.00, N = 3SE +/- 3.33, N = 3SE +/- 0.58, N = 3SE +/- 1.15, N = 3SE +/- 2.60, N = 349856560890411737768919521. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-TN

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRTX 3080RTX 3080 TiRTX 3090RX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII30060090012001500SE +/- 1.00, N = 3SE +/- 0.00, N = 2SE +/- 0.33, N = 3SE +/- 2.03, N = 3SE +/- 3.33, N = 3498560606928121077491314031. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-TT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII60120180240300Min: 15.6 / Avg: 153.55 / Max: 320.04Min: 18.98 / Avg: 185.58 / Max: 350.33Min: 16.47 / Avg: 184.74 / Max: 349.64Min: 7 / Avg: 67.46 / Max: 207Min: 5 / Avg: 90.04 / Max: 203Min: 6 / Avg: 92.22 / Max: 256Min: 11 / Avg: 110.62 / Max: 267Min: 12 / Avg: 118.81 / Max: 297Min: 21 / Avg: 85.96 / Max: 358

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII1530456075Min: 47 / Avg: 62.12 / Max: 77Min: 30 / Avg: 62.32 / Max: 79Min: 42 / Avg: 57.28 / Max: 70Min: 36 / Avg: 54.67 / Max: 69Min: 48 / Avg: 60.6 / Max: 68Min: 49 / Avg: 62.56 / Max: 78Min: 25 / Avg: 49.03 / Max: 65Min: 27 / Avg: 57.6 / Max: 73Min: 41 / Avg: 58.82 / Max: 72

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRTX 3080RTX 3080 TiRTX 3090RX 5700 XTRX 6800RX 6800 XTRX 7900 XTRX 7900 XTXRadeon VII3K6K9K12K15K4351.035252.585608.1512500.003147.233724.234894.915627.902752.76