OpenCL ROCm AMD

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG CROSSHAIR X670E HERO (9922 BIOS) and Gigabyte AMD Radeon RX 6600 8GB on Ubuntu 23.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2303092-NE-OPENCLROC28
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
RX 6600
March 09 2023
  1 Hour, 17 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL ROCm AMDOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (9922 BIOS)AMD Device 14d82 x 16 GB DDR5-6000MT/s F5-6000J3038F16GWestern Digital WD_BLACK SN850X 1000GB + 2000GBGigabyte AMD Radeon RX 6600 8GB (2750/875MHz)AMD Navi 21/23ASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.046.2.2-060202-generic (x86_64)GNOME Shell 43.2X Server 1.21.1.64.6 Mesa 23.1.0-devel (git-5f5e30b 2023-03-09 lunar-oibaf-ppa) (LLVM 15.0.7 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)GCC 12.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionOpenCL ROCm AMD BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-EzbZRD/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-EzbZRD/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- BAR1 / Visible vRAM Size: 8176 MB - vBIOS Version: 113-D53201-R66E- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

OpenCL ROCm AMDshoc: OpenCL - Triadshoc: OpenCL - Reductionshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBuffershoc: OpenCL - S3Dclpeak: Double-Precision Computeclpeak: Single-Precision Computeshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - MD5 Hashclpeak: Integer Computeclpeak: Integer 24-bit Computefluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Slczero: OpenCLclpeak: Kernel LatencyRX 66006.4413207.8987.13087.0694589.766191.314.9823.0380.3312570.898117.451779.1911.90762165.477891.46965184418221460712.03OpenBenchmarking.org

GPU Power Consumption Monitor

MinAvgMaxRX 66003.09.4100.0OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System Monitoring20406080100

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadRX 6600246810SE +/- 0.0504, N = 46.44131. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRX 660050100150200250SE +/- 0.16, N = 4207.901. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRX 6600246810SE +/- 0.0002, N = 47.13081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackRX 6600246810SE +/- 0.0005, N = 47.06941. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRX 6600130260390520650SE +/- 4.68, N = 9589.771. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRX 66004080120160200SE +/- 0.40, N = 5191.311. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferRX 66001.12052.2413.36154.4825.6025SE +/- 0.05, N = 34.981. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferRX 6600612182430SE +/- 0.16, N = 1523.031. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 6600369121511.91

clpeak

OpenBenchmarking.orgGFLOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRX 660081624324035.39

OpenBenchmarking.orgGFLOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRX 66002004006008001000856.49

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 66002040608010082.16

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 660020406080100SE +/- 1.16, N = 380.331. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRX 6600120240360480600SE +/- 0.52, N = 6570.891. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRX 66002K4K6K8K10KSE +/- 29.84, N = 78117.451. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 6600400800120016002000SE +/- 4.57, N = 41779.191. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 66003691215SE +/- 0.01, N = 411.911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGIOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRX 660020040060080010001112.68

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRX 66005001000150020002500SE +/- 5.37, N = 72165.471. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRX 66002K4K6K8K10KSE +/- 31.77, N = 77891.461. (CXX) g++ options: -O3

FluidX3D

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP32RX 66004812162016.11

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP16CRX 660051015202522.69

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP16SRX 660071421283528.22

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP32RX 66002004006008001000SE +/- 0.33, N = 3965

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16CRX 6600400800120016002000SE +/- 2.19, N = 31844

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16SRX 6600400800120016002000SE +/- 1.45, N = 31822

LeelaChessZero

OpenBenchmarking.orgNodes Per Second Per Watt, More Is BetterLeelaChessZero 0.28Backend: OpenCLRX 6600306090120150156.10

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLRX 66003K6K9K12K15KSE +/- 28.17, N = 3146071. (CXX) g++ options: -flto -pthread

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRX 6600112233445550.55

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Test: Boat - Acceleration: OpenCL

RX 6600: The test quit with a non-zero exit status. The test run did not produce a result. E: sh: 1: exec: ./darktable: not found

Test: Masskrug - Acceleration: OpenCL

RX 6600: The test run did not produce a result. E: sh: 1: exec: ./darktable: not found

Test: Server Rack - Acceleration: OpenCL

RX 6600: The test run did not produce a result. E: sh: 1: exec: ./darktable: not found

Test: Server Room - Acceleration: OpenCL

RX 6600: The test run did not produce a result. E: sh: 1: exec: ./darktable: not found

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyRX 66003691215SE +/- 0.13, N = 1512.031. (CXX) g++ options: -O3