RDNA3 vs. NVIDIA OpenCL Compute

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212153-NE-RDNA3OPEN09
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RX 6800 XT
December 13 2022
  37 Minutes
RX 7900 XT
December 13 2022
  37 Minutes
RX 7900 XTX
December 14 2022
  45 Minutes
RX 6800
December 14 2022
  56 Minutes
RX 5700 XT
December 14 2022
  1 Hour, 17 Minutes
Radeon VII
December 14 2022
  59 Minutes
RTX 3080
December 14 2022
  36 Minutes
RTX 3090
December 14 2022
  33 Minutes
RTX 3080 Ti
December 14 2022
  33 Minutes
Invert Behavior (Only Show Selected Data)
  46 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


RDNA3 vs. NVIDIA OpenCL ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionDisplay DriverRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0805 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 2000GBAMD Radeon RX 6800 XT 16GB (2575/1000MHz)AMD Navi 21 HDMI AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.045.15.0-56-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.3 + Wayland4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)1.3.232GCC 11.3.0ext43840x2160AMD Radeon RX 7900 XT 20GB (3125/1249MHz)AMD Device ab30AMD Radeon RX 7900 XTX 24GB (3220/1249MHz)AMD Radeon RX 6800 16GB (2475/1000MHz)AMD Navi 21 HDMI AudioAMD Radeon RX 5700 XT 8GB (2100/875MHz)AMD Navi 10 HDMI AudioAMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioNVIDIA GeForce RTX 3080 10GBNVIDIA GA102 HD AudioX Server 1.21.1.3NVIDIA 525.60.114.6.0OpenCL 3.0 CUDA 12.0.89NVIDIA GeForce RTX 3090 24GBNVIDIA GeForce RTX 3080 Ti 12GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101- RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00- RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102- RX 6800: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101- RX 5700 XT: BAR1 / Visible vRAM Size: 8176 MB - vBIOS Version: 113-D1820501-101- Radeon VII: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D3600200-106- RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02- RTX 3080 Ti: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected OpenCL Details- RTX 3080: GPU Compute Cores: 8704- RTX 3090: GPU Compute Cores: 10496- RTX 3080 Ti: GPU Compute Cores: 10240

RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiResult OverviewPhoronix Test Suite100%208%316%424%LuxCoreRenderHashcatclpeakcl-mem

RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 TiPer Watt Result OverviewPhoronix Test Suite100%201%301%402%503%clpeakMeta Performance Per WattsLuxCoreRenderHashcatcl-memP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

RDNA3 vs. NVIDIA OpenCL Computecl-mem: Readcl-mem: Writecl-mem: Copyclpeak: Global Memory Bandwidthclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Transfer Bandwidth enqueueWriteBufferhashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTSluxcorerender: DLSC - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Orange Juice - GPUluxcorerender: Danish Mood - GPUshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - S3Dviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti470.4437.1361.1457.1222446.785626.2344.685192880000020418200000236795000010756005773875.3717.826.492.315.791092.711687.677352.0028.3906469.75493.39784065568105648235571475321197122011731210715.3702.2431.4619.3626238.367276.9758.956138272857126368100000298251666712586808065786.9623.118.062.857.611890.492263.167770.8936.2169591.196312.142596564799547560643114359787784776774864.8842.3462.4716.4428891.628208.5258.616970261428630291871429342088333314998309006408.1327.639.693.138.872213.052735.9410295.6940.3631609.816367.920677602867590658692118404917926891913459.9442.7345.5375.6615416.604237.9044.13420832428571670068333319265333338856004876674.2315.505.042.104.50875.7191673.765534.1821.9934432.25189.5783411517752527785553137474919931904928403.7390.1326.7342.409373.502504.8844.7224002666667969795000011058500005074112994001.938.892.29251.2673.9302.6793.5213144.966602.0744.40350472833331228680000014549466676285004303291.7313.532.0912.02449.6252395.175905.3416.9376312.235289.63747329637727457648462.6266148315479521403672.6645.5354.3662.2829425.4415440.1125.485975588571418858016667237256666710012447046789.5426.759.629.257.992181.511762.966372.3737.1537393.661340.384556358474375630604191375497499498498825.4749.3364.2813.4935185.0818086.1225.3671244283333225454333332825183333116303382665711.4631.5211.5510.689.412200.192103.608222.3043.8949397.817430.054609365502383724667189376607611608606804.0737.1355.7794.6033704.7917437.5013.0767452750000213455166672678050000110705678690010.9331.1211.0110.168.942134.292016.258131.9941.6772386.996418.606593359493369705652185367564566565560OpenBenchmarking.org

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti2004006008001000SE +/- 0.10, N = 8SE +/- 1.73, N = 10SE +/- 0.18, N = 10SE +/- 1.14, N = 8SE +/- 0.71, N = 8SE +/- 0.24, N = 8SE +/- 0.08, N = 9SE +/- 0.11, N = 10SE +/- 0.10, N = 9470.4715.3864.8459.9403.7251.2672.6825.4804.01. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti2004006008001000SE +/- 1.35, N = 8SE +/- 0.16, N = 10SE +/- 0.37, N = 10SE +/- 0.26, N = 8SE +/- 2.21, N = 8SE +/- 0.62, N = 8SE +/- 0.31, N = 9SE +/- 0.23, N = 10SE +/- 0.32, N = 10437.1702.2842.3442.7390.1673.9645.5749.3737.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti100200300400500SE +/- 0.04, N = 8SE +/- 0.03, N = 10SE +/- 0.37, N = 10SE +/- 0.10, N = 8SE +/- 0.24, N = 8SE +/- 0.41, N = 8SE +/- 0.05, N = 9SE +/- 0.06, N = 10SE +/- 0.24, N = 10361.1431.4462.4345.5326.7302.6354.3364.2355.71. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti2004006008001000SE +/- 0.02, N = 9SE +/- 0.78, N = 9SE +/- 0.32, N = 10SE +/- 0.66, N = 9SE +/- 3.16, N = 7SE +/- 0.65, N = 9SE +/- 0.03, N = 14SE +/- 0.02, N = 14SE +/- 0.04, N = 13457.12619.36716.44375.66342.40793.52662.28813.49794.601. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti8K16K24K32K40KSE +/- 2.29, N = 9SE +/- 37.62, N = 9SE +/- 75.34, N = 9SE +/- 22.92, N = 9SE +/- 11.77, N = 9SE +/- 5.01, N = 8SE +/- 82.27, N = 13SE +/- 24.18, N = 14SE +/- 22.78, N = 1422446.7826238.3628891.6215416.609373.5013144.9629425.4435185.0833704.791. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti4K8K12K16K20KSE +/- 0.25, N = 7SE +/- 7.19, N = 8SE +/- 7.38, N = 8SE +/- 1.65, N = 8SE +/- 0.72, N = 7SE +/- 2.35, N = 8SE +/- 103.36, N = 15SE +/- 124.81, N = 13SE +/- 118.19, N = 155626.237276.978208.524237.902504.886602.0715440.1118086.1217437.501. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti1326395265SE +/- 0.27, N = 6SE +/- 0.25, N = 7SE +/- 0.22, N = 8SE +/- 0.22, N = 6SE +/- 0.29, N = 5SE +/- 0.23, N = 6SE +/- 0.01, N = 4SE +/- 0.01, N = 7SE +/- 0.00, N = 344.6858.9558.6144.1344.7244.4025.4825.3613.071. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti15000M30000M45000M60000M75000MSE +/- 43793302.38, N = 7SE +/- 92800753.57, N = 7SE +/- 84711702.76, N = 7SE +/- 34450211.54, N = 7SE +/- 29219339.11, N = 6SE +/- 4912936.44, N = 6SE +/- 83944276.95, N = 7SE +/- 90108517.60, N = 6SE +/- 75676635.54, N = 6519288000006138272857169702614286420832428572400266666735047283333597558857147124428333367452750000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti6000M12000M18000M24000M30000MSE +/- 20754437.28, N = 6SE +/- 24335461.09, N = 6SE +/- 31736944.93, N = 7SE +/- 43663954.75, N = 6SE +/- 8939341.88, N = 6SE +/- 848920.88, N = 6SE +/- 27656770.32, N = 6SE +/- 17806771.50, N = 6SE +/- 20586814.82, N = 620418200000263681000003029187142916700683333969795000012286800000188580166672254543333321345516667

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti700M1400M2100M2800M3500MSE +/- 889475.50, N = 6SE +/- 4470825.18, N = 6SE +/- 2918608.88, N = 6SE +/- 1232522.26, N = 6SE +/- 1462589.03, N = 6SE +/- 23473202.28, N = 15SE +/- 1482040.64, N = 6SE +/- 3397589.01, N = 6SE +/- 2908120.81, N = 6236795000029825166673420883333192653333311058500001454946667237256666728251833332678050000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti300K600K900K1200K1500KSE +/- 867.95, N = 9SE +/- 2104.38, N = 10SE +/- 2789.55, N = 10SE +/- 1045.23, N = 9SE +/- 835.40, N = 9SE +/- 754.75, N = 8SE +/- 608.30, N = 9SE +/- 618.24, N = 9SE +/- 456.16, N = 9107560012586801499830885600507411628500100124411630331107056

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti200K400K600K800K1000KSE +/- 7271.22, N = 15SE +/- 3079.13, N = 9SE +/- 976.30, N = 10SE +/- 452.46, N = 9SE +/- 1606.13, N = 8SE +/- 1846.07, N = 7SE +/- 1635.86, N = 9SE +/- 4427.88, N = 7SE +/- 5315.17, N = 8577387806578900640487667299400430329704678826657786900

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 12SE +/- 0.06, N = 12SE +/- 0.05, N = 12SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 35.376.968.134.231.931.739.5411.4610.93MIN: 5.23 / MAX: 5.65MIN: 6.78 / MAX: 7.21MIN: 7.94 / MAX: 8.48MIN: 0.82 / MAX: 4.59MIN: 0.37 / MAX: 2.07MIN: 0.33 / MAX: 1.88MIN: 9.35 / MAX: 9.8MIN: 10.5 / MAX: 11.82MIN: 10.5 / MAX: 11.09

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti714212835SE +/- 0.00, N = 5SE +/- 0.07, N = 5SE +/- 0.00, N = 6SE +/- 0.00, N = 5SE +/- 0.03, N = 3SE +/- 0.08, N = 4SE +/- 0.11, N = 6SE +/- 0.16, N = 6SE +/- 0.08, N = 617.8223.1127.6315.508.8913.5326.7531.5231.12MIN: 16.51 / MAX: 18.34MIN: 21.35 / MAX: 24MIN: 25.23 / MAX: 29.44MIN: 14.54 / MAX: 15.99MIN: 8.58 / MAX: 9.11MIN: 10.02 / MAX: 15MIN: 23.62 / MAX: 29.35MIN: 28.71 / MAX: 34.92MIN: 28.53 / MAX: 34.16

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 12SE +/- 0.12, N = 6SE +/- 0.05, N = 12SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.498.069.695.042.292.099.6211.5511.01MIN: 2.8 / MAX: 7.35MIN: 3.48 / MAX: 9.15MIN: 3.96 / MAX: 11.01MIN: 0.83 / MAX: 5.84MIN: 0.32 / MAX: 2.7MIN: 0.29 / MAX: 2.48MIN: 3.12 / MAX: 11.01MIN: 4.91 / MAX: 13.17MIN: 3.57 / MAX: 12.61

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 32.312.853.132.101.009.2510.6810.16MIN: 2.1 / MAX: 2.99MIN: 2.67 / MAX: 3.45MIN: 2.81 / MAX: 3.76MIN: 1.91 / MAX: 2.69MIN: 7.35 / MAX: 11.9MIN: 8.65 / MAX: 14.18MIN: 8.2 / MAX: 13.69

Scene: Orange Juice - Acceleration: GPU

RX 5700 XT: The test quit with a non-zero exit status. E: RUNTIME ERROR: OpenCL driver API error (code: -6, file:/home/vsts/work/1/s/LinuxCompile/LuxCore-sdk/src/luxrays/devices/ocldevice.cpp, line: 143): CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti3691215SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 35.797.618.874.502.027.999.418.94MIN: 2.39 / MAX: 6.64MIN: 2.96 / MAX: 8.76MIN: 3.4 / MAX: 10.2MIN: 1.74 / MAX: 5.11MIN: 0.75 / MAX: 2.29MIN: 2.96 / MAX: 9.11MIN: 3.47 / MAX: 10.92MIN: 2.99 / MAX: 10.44

Scene: Danish Mood - Acceleration: GPU

RX 5700 XT: The test quit with a non-zero exit status. E: RUNTIME ERROR: OpenCL driver API error (code: -6, file:/home/vsts/work/1/s/LinuxCompile/LuxCore-sdk/src/luxrays/devices/ocldevice.cpp, line: 143): CL_OUT_OF_HOST_MEMORY

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti5001000150020002500SE +/- 2.10, N = 8SE +/- 14.48, N = 10SE +/- 5.22, N = 10SE +/- 2.26, N = 8SE +/- 1.27, N = 4SE +/- 1.08, N = 3SE +/- 1.96, N = 3SE +/- 0.66, N = 31092.711890.492213.05875.72449.632181.512200.192134.291. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: Texture Read Bandwidth

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti6001200180024003000SE +/- 0.93, N = 13SE +/- 1.13, N = 13SE +/- 3.26, N = 13SE +/- 0.69, N = 13SE +/- 2.34, N = 12SE +/- 0.13, N = 13SE +/- 0.16, N = 13SE +/- 0.12, N = 131687.672263.162735.941673.762395.171762.962103.602016.251. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: FFT SP

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti2K4K6K8K10KSE +/- 97.24, N = 15SE +/- 121.67, N = 15SE +/- 165.23, N = 15SE +/- 85.06, N = 15SE +/- 3.93, N = 9SE +/- 7.36, N = 11SE +/- 15.79, N = 12SE +/- 16.35, N = 117352.007770.8910295.695534.185905.346372.378222.308131.991. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: GEMM SGEMM_N

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti1020304050SE +/- 0.00, N = 13SE +/- 0.09, N = 13SE +/- 0.04, N = 14SE +/- 0.00, N = 13SE +/- 0.00, N = 12SE +/- 0.00, N = 14SE +/- 0.05, N = 14SE +/- 0.00, N = 1428.3936.2240.3621.9916.9437.1543.8941.681. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: MD5 Hash

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti130260390520650SE +/- 2.96, N = 13SE +/- 1.06, N = 13SE +/- 0.99, N = 13SE +/- 2.73, N = 12SE +/- 0.19, N = 12SE +/- 0.07, N = 13SE +/- 0.08, N = 13SE +/- 0.06, N = 13469.75591.20609.82432.25312.24393.66397.82387.001. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: Reduction

RX 5700 XT: The test run did not produce a result.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti90180270360450SE +/- 1.05, N = 3SE +/- 3.49, N = 4SE +/- 3.65, N = 15SE +/- 0.40, N = 3SE +/- 0.58, N = 3SE +/- 0.06, N = 13SE +/- 0.32, N = 13SE +/- 0.06, N = 1393.40312.14367.9289.58289.64340.38430.05418.611. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Target: OpenCL - Benchmark: S3D

RX 5700 XT: The test run did not produce a result.

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti150300450600750SE +/- 4.36, N = 3SE +/- 3.67, N = 3SE +/- 3.84, N = 3SE +/- 1.45, N = 3SE +/- 3.18, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 34065966774114735576095931. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dCOPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti60120180240300Min: 6 / Avg: 92.22 / Max: 256Min: 11 / Avg: 110.62 / Max: 267Min: 12 / Avg: 118.81 / Max: 297Min: 5 / Avg: 90.04 / Max: 203Min: 7 / Avg: 67.46 / Max: 207Min: 21 / Avg: 85.96 / Max: 358Min: 15.6 / Avg: 153.55 / Max: 320.04Min: 16.47 / Avg: 184.74 / Max: 349.64Min: 18.98 / Avg: 185.58 / Max: 350.33

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti1530456075Min: 49 / Avg: 62.56 / Max: 78Min: 25 / Avg: 49.03 / Max: 65Min: 27 / Avg: 57.6 / Max: 73Min: 48 / Avg: 60.6 / Max: 68Min: 36 / Avg: 54.67 / Max: 69Min: 41 / Avg: 58.82 / Max: 72Min: 47 / Avg: 62.12 / Max: 77Min: 42 / Avg: 57.28 / Max: 70Min: 30 / Avg: 62.32 / Max: 79

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti130260390520650SE +/- 6.51, N = 3SE +/- 6.17, N = 3SE +/- 8.21, N = 3SE +/- 0.33, N = 3SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 1.20, N = 35565646025172963563653591. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sCOPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti2004006008001000SE +/- 9.00, N = 3SE +/- 8.69, N = 3SE +/- 5.84, N = 3SE +/- 0.58, N = 3SE +/- 2.19, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 38107998677523774765024931. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sAXPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti130260390520650SE +/- 14.00, N = 2SE +/- 9.35, N = 3SE +/- 10.68, N = 3SE +/- 0.33, N = 3SE +/- 3.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 35645475905272743773833691. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - sDOT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti2004006008001000SE +/- 5.46, N = 3SE +/- 2.00, N = 2SE +/- 6.17, N = 3SE +/- 2.52, N = 3SE +/- 1.53, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 38235606587855766317247051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dAXPY

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti150300450600750SE +/- 4.37, N = 3SE +/- 5.51, N = 3SE +/- 18.41, N = 3SE +/- 4.37, N = 3SE +/- 2.52, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 35576436925534846056676521. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dDOT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti4080120160200SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 1.67, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3147.0114.0118.0137.062.6192.0189.0185.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMV-N

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti120240360480600SE +/- 3.84, N = 3SE +/- 1.67, N = 3SE +/- 3.67, N = 3SE +/- 14.95, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 35323594044742663763763671. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMV-T

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti30060090012001500SE +/- 3.33, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 2.19, N = 3SE +/- 3.33, N = 3SE +/- 1.00, N = 3SE +/- 0.58, N = 3SE +/- 1.33, N = 3119778791791914835006075641. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-NN

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti30060090012001500SE +/- 5.77, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 3.33, N = 3SE +/- 1.00, N = 3SE +/- 1.20, N = 3122078492693115475026115661. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-NT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti30060090012001500SE +/- 3.33, N = 3SE +/- 0.58, N = 3SE +/- 1.15, N = 3SE +/- 1.00, N = 3SE +/- 2.60, N = 3SE +/- 0.50, N = 2SE +/- 1.33, N = 311737768919049525006085651. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-TN

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800Radeon VIIRTX 3080RTX 3090RTX 3080 Ti30060090012001500SE +/- 0.00, N = 2SE +/- 0.33, N = 3SE +/- 2.03, N = 3SE +/- 1.00, N = 3SE +/- 3.33, N = 3121077491392814034986065601. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Test: OpenCL BLAS - dGEMM-TT

RX 5700 XT: The test quit with a non-zero exit status. E: what(): ViennaCL: FATAL ERROR: CL_OUT_OF_HOST_MEMORY

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRX 6800 XTRX 7900 XTRX 7900 XTXRX 6800RX 5700 XTRadeon VIIRTX 3080RTX 3090RTX 3080 Ti3K6K9K12K15K3724.234894.915627.903147.2312500.002752.764351.035608.155252.58