RDNA3 OpenCL Compute

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG CROSSHAIR X670E HERO (0805 BIOS) and AMD Radeon RX 6800 XT 16GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212135-NE-RDNA3OPEN49
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 6 Tests
OpenCL 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
RX 7900 XTX
December 13 2022
  54 Minutes
RX 7900 XT
December 13 2022
  1 Hour, 3 Minutes
RX 6800 XT
December 13 2022
  45 Minutes
Radeon RX 6800 XT
December 13 2022
  37 Minutes
Invert Hiding All Results Option
  50 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


RDNA3 OpenCL ComputeOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (0805 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GB + 2000GBAMD Radeon RX 7900 XTX 24GB (3220/1249MHz)AMD Radeon RX 7900 XT 20GB (3125/1249MHz)AMD Radeon RX 6800 XT 16GB (2575/1000MHz)AMD Device ab30AMD Navi 21 HDMI AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.045.15.0-56-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.3 + Wayland4.6 Mesa 22.3.0-devel (LLVM 15.0.3 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)1.3.232GCC 11.3.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudiosMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRDNA3 OpenCL Compute BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa601203 - RX 7900 XTX: BAR1 / Visible vRAM Size: 24560 MB - vBIOS Version: 113-D7020100-102 - RX 7900 XT: BAR1 / Visible vRAM Size: 20464 MB - vBIOS Version: 113-D70401-00- RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101 - Radeon RX 6800 XT: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120500-101 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XTResult OverviewPhoronix Test Suite100%119%138%157%176%SHOC Scalable HeterOgeneous ComputingViennaCLcl-memLuxCoreRenderHashcatclpeak

RX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XTPer Watt Result OverviewPhoronix Test Suite100%111%122%133%SHOC Scalable HeterOgeneous Computingcl-memHashcatclpeakLuxCoreRenderP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

RDNA3 OpenCL Computecl-mem: Readcl-mem: Writecl-mem: Copyclpeak: Global Memory Bandwidthclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Transfer Bandwidth enqueueWriteBufferhashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTSluxcorerender: DLSC - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Orange Juice - GPUluxcorerender: Danish Mood - GPUshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - S3Dviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTRX 7900 XTXRX 7900 XTRX 6800 XTRadeon RX 6800 XT864.5839.9458.0716.1228203.847750.9659.876994674285730341983333342201666714492119394937.9327.719.483.128.892196.612720.069471.3439.1395610.739368.910673714.3702.8431.4618.3726458.907245.4059.166147880000026333283333298605000012915447993936.8221.867.912.827.611887.692249.227899.1736.1749568.366297.509590470.4437.1360.9457.1522435.765627.0345.225199725714320466766667236825000010729445883405.2117.826.332.315.881098.311688.707318.2128.3924481.60496.2093406470.4437.1361.1457.1222446.785626.2344.685192880000020418200000236795000010756005773875.3717.826.492.315.791092.711687.677352.0028.3906469.75493.39784065568105648235571475321197122011731210OpenBenchmarking.org

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT2004006008001000SE +/- 0.10, N = 8SE +/- 0.14, N = 10SE +/- 1.27, N = 10SE +/- 0.05, N = 8470.4864.5714.3470.41. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT2004006008001000SE +/- 1.35, N = 8SE +/- 0.42, N = 10SE +/- 0.28, N = 10SE +/- 0.90, N = 8437.1839.9702.8437.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT100200300400500SE +/- 0.04, N = 8SE +/- 0.08, N = 10SE +/- 0.06, N = 10SE +/- 0.03, N = 8361.1458.0431.4360.91. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT150300450600750SE +/- 0.02, N = 9SE +/- 0.24, N = 10SE +/- 0.06, N = 9SE +/- 0.03, N = 9457.12716.12618.37457.151. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT6K12K18K24K30KSE +/- 2.29, N = 9SE +/- 35.23, N = 9SE +/- 31.08, N = 9SE +/- 2.35, N = 922446.7828203.8426458.9022435.761. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT17003400510068008500SE +/- 0.25, N = 7SE +/- 9.52, N = 8SE +/- 3.17, N = 8SE +/- 1.16, N = 75626.237750.967245.405627.031. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT1326395265SE +/- 0.27, N = 6SE +/- 0.20, N = 8SE +/- 0.23, N = 7SE +/- 0.05, N = 644.6859.8759.1645.221. (CXX) g++ options: -O3 -rdynamic -lOpenCL

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT50100150200250Min: 6 / Avg: 92.22 / Max: 256Min: 12 / Avg: 171.27 / Max: 300Min: 11 / Avg: 64.72 / Max: 258Min: 5 / Avg: 112.02 / Max: 256

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT1530456075Min: 49 / Avg: 62.56 / Max: 78Min: 37 / Avg: 61.92 / Max: 76Min: 32 / Avg: 47.51 / Max: 61Min: 47 / Avg: 65.3 / Max: 79

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5Radeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT15000M30000M45000M60000M75000MSE +/- 43793302.38, N = 7SE +/- 70406594.18, N = 7SE +/- 65065410.31, N = 7SE +/- 41792377.17, N = 751928800000699467428576147880000051997257143

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1Radeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT6000M12000M18000M24000M30000MSE +/- 20754437.28, N = 6SE +/- 28637323.16, N = 6SE +/- 72880913.90, N = 6SE +/- 25962661.22, N = 620418200000303419833332633328333320466766667

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512Radeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT700M1400M2100M2800M3500MSE +/- 889475.50, N = 6SE +/- 1515127.42, N = 6SE +/- 3462633.87, N = 6SE +/- 2310519.42, N = 62367950000342201666729860500002368250000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT300K600K900K1200K1500KSE +/- 867.95, N = 9SE +/- 3388.56, N = 9SE +/- 3684.81, N = 9SE +/- 1688.29, N = 91075600144921112915441072944

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT200K400K600K800K1000KSE +/- 7271.22, N = 15SE +/- 6523.02, N = 15SE +/- 6516.91, N = 15SE +/- 5009.06, N = 15577387939493799393588340

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT246810SE +/- 0.01, N = 3SE +/- 0.23, N = 12SE +/- 0.19, N = 12SE +/- 0.14, N = 125.377.936.825.21MIN: 5.23 / MAX: 5.65MIN: 1.51 / MAX: 8.49MIN: 1.3 / MAX: 7.31MIN: 1.06 / MAX: 5.65

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT714212835SE +/- 0.00, N = 5SE +/- 0.09, N = 6SE +/- 0.40, N = 12SE +/- 0.01, N = 517.8227.7121.8617.82MIN: 16.51 / MAX: 18.34MIN: 25.23 / MAX: 29.43MIN: 17.21 / MAX: 24.64MIN: 16.51 / MAX: 18.35

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT3691215SE +/- 0.01, N = 3SE +/- 0.23, N = 12SE +/- 0.19, N = 12SE +/- 0.16, N = 126.499.487.916.33MIN: 2.8 / MAX: 7.35MIN: 1.49 / MAX: 11.05MIN: 1.28 / MAX: 9.28MIN: 1.04 / MAX: 7.35

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT0.7021.4042.1062.8083.51SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 9SE +/- 0.00, N = 32.313.122.822.31MIN: 2.1 / MAX: 2.99MIN: 2.8 / MAX: 3.75MIN: 0.38 / MAX: 3.43MIN: 2.12 / MAX: 2.97

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT246810SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 35.798.897.615.88MIN: 2.39 / MAX: 6.64MIN: 3.78 / MAX: 10.22MIN: 3.26 / MAX: 8.78MIN: 2.29 / MAX: 6.67

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRadeon RX 6800 XT80016002400320040003724.23

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT5001000150020002500SE +/- 2.10, N = 8SE +/- 5.29, N = 10SE +/- 13.26, N = 12SE +/- 2.95, N = 81092.712196.611887.691098.311. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT6001200180024003000SE +/- 0.93, N = 13SE +/- 2.63, N = 13SE +/- 1.32, N = 12SE +/- 0.97, N = 131687.672720.062249.221688.701. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT2K4K6K8K10KSE +/- 97.24, N = 15SE +/- 175.40, N = 15SE +/- 149.72, N = 15SE +/- 113.03, N = 157352.009471.347899.177318.211. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT918273645SE +/- 0.00, N = 13SE +/- 0.04, N = 13SE +/- 0.07, N = 13SE +/- 0.00, N = 1328.3939.1436.1728.391. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT130260390520650SE +/- 2.96, N = 13SE +/- 0.76, N = 13SE +/- 1.16, N = 13SE +/- 3.19, N = 14469.75610.74568.37481.601. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT80160240320400SE +/- 1.05, N = 3SE +/- 2.80, N = 15SE +/- 2.42, N = 15SE +/- 1.12, N = 393.40368.91297.5196.211. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRadeon RX 6800 XTRX 7900 XTXRX 7900 XTRX 6800 XT150300450600750SE +/- 4.36, N = 3SE +/- 5.24, N = 3SE +/- 3.00, N = 3SE +/- 2.19, N = 34066735904061. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRadeon RX 6800 XT120240360480600SE +/- 6.51, N = 35561. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRadeon RX 6800 XT2004006008001000SE +/- 9.00, N = 38101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRadeon RX 6800 XT120240360480600SE +/- 14.00, N = 25641. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRadeon RX 6800 XT2004006008001000SE +/- 5.46, N = 38231. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRadeon RX 6800 XT120240360480600SE +/- 4.37, N = 35571. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRadeon RX 6800 XT306090120150SE +/- 0.67, N = 31471. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRadeon RX 6800 XT120240360480600SE +/- 3.84, N = 35321. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRadeon RX 6800 XT30060090012001500SE +/- 3.33, N = 311971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRadeon RX 6800 XT30060090012001500SE +/- 5.77, N = 312201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRadeon RX 6800 XT30060090012001500SE +/- 3.33, N = 311731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRadeon RX 6800 XT30060090012001500SE +/- 0.00, N = 212101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL