RDNA3 OpenCL Compute

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG CROSSHAIR X670E HERO (0805 BIOS) and AMD Radeon RX 6800 XT 16GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212135-NE-RDNA3OPEN49
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 6 Tests
OpenCL 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
RX 7900 XTX
December 13 2022
  54 Minutes
RX 7900 XT
December 13 2022
  1 Hour, 3 Minutes
RX 6800 XT
December 13 2022
  45 Minutes
Radeon RX 6800 XT
December 13 2022
  37 Minutes
Invert Hiding All Results Option
  50 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


RDNA3 OpenCL ComputeOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0805 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 2000GBAMD Radeon RX 7900 XTX 24GB (3220/1249MHz)AMD Radeon RX 7900 XT 20GB (3125/1249MHz)AMD Radeon RX 6800 XT 16GB (2575/1000MHz)AMD Device ab30AMD Navi 21 HDMI AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.045.15.0-56-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.3 + Wayland4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)1.3.232GCC 11.3.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudiosMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRDNA3 OpenCL Compute BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203 - RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102 - RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00- RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101 - Radeon RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XTResult OverviewPhoronix Test Suite100%119%138%157%176%SHOC Scalable HeterOgeneous ComputingViennaCLcl-memLuxCoreRenderHashcatclpeak

RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XTPer Watt Result OverviewPhoronix Test Suite100%111%122%133%SHOC Scalable HeterOgeneous Computingcl-memHashcatclpeakLuxCoreRenderP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

RDNA3 OpenCL Computecl-mem: Readcl-mem: Writecl-mem: Copyclpeak: Global Memory Bandwidthclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Transfer Bandwidth enqueueWriteBufferhashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTSluxcorerender: DLSC - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Orange Juice - GPUluxcorerender: Danish Mood - GPUshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - S3Dviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT864.5839.9458.0716.1228203.847750.9659.876994674285730341983333342201666714492119394937.9327.719.483.128.892196.612720.069471.3439.1395610.739368.910673714.3702.8431.4618.3726458.907245.4059.166147880000026333283333298605000012915447993936.8221.867.912.827.611887.692249.227899.1736.1749568.366297.509590470.4437.1360.9457.1522435.765627.0345.225199725714320466766667236825000010729445883405.2117.826.332.315.881098.311688.707318.2128.3924481.60496.2093406470.4437.1361.1457.1222446.785626.2344.685192880000020418200000236795000010756005773875.3717.826.492.315.791092.711687.677352.0028.3906469.75493.39784065568105648235571475321197122011731210OpenBenchmarking.org

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT2004006008001000SE +/- 0.14, N = 10SE +/- 1.27, N = 10SE +/- 0.05, N = 8SE +/- 0.10, N = 8864.5714.3470.4470.41. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT2004006008001000SE +/- 0.42, N = 10SE +/- 0.28, N = 10SE +/- 0.90, N = 8SE +/- 1.35, N = 8839.9702.8437.1437.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT100200300400500SE +/- 0.08, N = 10SE +/- 0.06, N = 10SE +/- 0.03, N = 8SE +/- 0.04, N = 8458.0431.4360.9361.11. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT150300450600750SE +/- 0.24, N = 10SE +/- 0.06, N = 9SE +/- 0.03, N = 9SE +/- 0.02, N = 9716.12618.37457.15457.121. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT6K12K18K24K30KSE +/- 35.23, N = 9SE +/- 31.08, N = 9SE +/- 2.35, N = 9SE +/- 2.29, N = 928203.8426458.9022435.7622446.781. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT17003400510068008500SE +/- 9.52, N = 8SE +/- 3.17, N = 8SE +/- 1.16, N = 7SE +/- 0.25, N = 77750.967245.405627.035626.231. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT1326395265SE +/- 0.20, N = 8SE +/- 0.23, N = 7SE +/- 0.05, N = 6SE +/- 0.27, N = 659.8759.1645.2244.681. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT15000M30000M45000M60000M75000MSE +/- 70406594.18, N = 7SE +/- 65065410.31, N = 7SE +/- 41792377.17, N = 7SE +/- 43793302.38, N = 769946742857614788000005199725714351928800000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT6000M12000M18000M24000M30000MSE +/- 28637323.16, N = 6SE +/- 72880913.90, N = 6SE +/- 25962661.22, N = 6SE +/- 20754437.28, N = 630341983333263332833332046676666720418200000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT700M1400M2100M2800M3500MSE +/- 1515127.42, N = 6SE +/- 3462633.87, N = 6SE +/- 2310519.42, N = 6SE +/- 889475.50, N = 63422016667298605000023682500002367950000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT300K600K900K1200K1500KSE +/- 3388.56, N = 9SE +/- 3684.81, N = 9SE +/- 1688.29, N = 9SE +/- 867.95, N = 91449211129154410729441075600

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT200K400K600K800K1000KSE +/- 6523.02, N = 15SE +/- 6516.91, N = 15SE +/- 5009.06, N = 15SE +/- 7271.22, N = 15939493799393588340577387

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT246810SE +/- 0.23, N = 12SE +/- 0.19, N = 12SE +/- 0.14, N = 12SE +/- 0.01, N = 37.936.825.215.37MIN: 1.51 / MAX: 8.49MIN: 1.3 / MAX: 7.31MIN: 1.06 / MAX: 5.65MIN: 5.23 / MAX: 5.65

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT714212835SE +/- 0.09, N = 6SE +/- 0.40, N = 12SE +/- 0.01, N = 5SE +/- 0.00, N = 527.7121.8617.8217.82MIN: 25.23 / MAX: 29.43MIN: 17.21 / MAX: 24.64MIN: 16.51 / MAX: 18.35MIN: 16.51 / MAX: 18.34

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT3691215SE +/- 0.23, N = 12SE +/- 0.19, N = 12SE +/- 0.16, N = 12SE +/- 0.01, N = 39.487.916.336.49MIN: 1.49 / MAX: 11.05MIN: 1.28 / MAX: 9.28MIN: 1.04 / MAX: 7.35MIN: 2.8 / MAX: 7.35

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT0.7021.4042.1062.8083.51SE +/- 0.00, N = 3SE +/- 0.02, N = 9SE +/- 0.00, N = 3SE +/- 0.01, N = 33.122.822.312.31MIN: 2.8 / MAX: 3.75MIN: 0.38 / MAX: 3.43MIN: 2.12 / MAX: 2.97MIN: 2.1 / MAX: 2.99

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT246810SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 38.897.615.885.79MIN: 3.78 / MAX: 10.22MIN: 3.26 / MAX: 8.78MIN: 2.29 / MAX: 6.67MIN: 2.39 / MAX: 6.64

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT5001000150020002500SE +/- 5.29, N = 10SE +/- 13.26, N = 12SE +/- 2.95, N = 8SE +/- 2.10, N = 82196.611887.691098.311092.711. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT6001200180024003000SE +/- 2.63, N = 13SE +/- 1.32, N = 12SE +/- 0.97, N = 13SE +/- 0.93, N = 132720.062249.221688.701687.671. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT2K4K6K8K10KSE +/- 175.40, N = 15SE +/- 149.72, N = 15SE +/- 113.03, N = 15SE +/- 97.24, N = 159471.347899.177318.217352.001. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT918273645SE +/- 0.04, N = 13SE +/- 0.07, N = 13SE +/- 0.00, N = 13SE +/- 0.00, N = 1339.1436.1728.3928.391. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT130260390520650SE +/- 0.76, N = 13SE +/- 1.16, N = 13SE +/- 3.19, N = 14SE +/- 2.96, N = 13610.74568.37481.60469.751. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT80160240320400SE +/- 2.80, N = 15SE +/- 2.42, N = 15SE +/- 1.12, N = 3SE +/- 1.05, N = 3368.91297.5196.2193.401. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT150300450600750SE +/- 5.24, N = 3SE +/- 3.00, N = 3SE +/- 2.19, N = 3SE +/- 4.36, N = 36735904064061. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT50100150200250Min: 12 / Avg: 171.27 / Max: 300Min: 11 / Avg: 64.72 / Max: 258Min: 5 / Avg: 112.02 / Max: 256Min: 6 / Avg: 92.22 / Max: 256

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT1530456075Min: 37 / Avg: 61.92 / Max: 76Min: 32 / Avg: 47.51 / Max: 61Min: 47 / Avg: 65.3 / Max: 79Min: 49 / Avg: 62.56 / Max: 78

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRadeon RX 6800 XT120240360480600SE +/- 6.51, N = 35561. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRadeon RX 6800 XT2004006008001000SE +/- 9.00, N = 38101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRadeon RX 6800 XT120240360480600SE +/- 14.00, N = 25641. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRadeon RX 6800 XT2004006008001000SE +/- 5.46, N = 38231. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRadeon RX 6800 XT120240360480600SE +/- 4.37, N = 35571. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRadeon RX 6800 XT306090120150SE +/- 0.67, N = 31471. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRadeon RX 6800 XT120240360480600SE +/- 3.84, N = 35321. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRadeon RX 6800 XT30060090012001500SE +/- 3.33, N = 311971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRadeon RX 6800 XT30060090012001500SE +/- 5.77, N = 312201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRadeon RX 6800 XT30060090012001500SE +/- 3.33, N = 311731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRadeon RX 6800 XT30060090012001500SE +/- 0.00, N = 212101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRadeon RX 6800 XT80016002400320040003724.23