OpenCL ROCm AMD

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG CROSSHAIR X670E HERO (9922 BIOS) and Gigabyte AMD Radeon RX 6600 8GB on Ubuntu 23.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2303092-NE-OPENCLROC28.

OpenCL ROCm AMDProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionRX 6600AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR X670E HERO (9922 BIOS)AMD Device 14d82 x 16 GB DDR5-6000MT/s F5-6000J3038F16GWestern Digital WD_BLACK SN850X 1000GB + 2000GBGigabyte AMD Radeon RX 6600 8GB (2750/875MHz)AMD Navi 21/23ASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.046.2.2-060202-generic (x86_64)GNOME Shell 43.2X Server 1.21.1.64.6 Mesa 23.1.0-devel (git-5f5e30b 2023-03-09 lunar-oibaf-ppa) (LLVM 15.0.7 DRM 3.49)OpenCL 2.1 AMD-APP (3513.0)GCC 12.2.0ext43840x2160OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-EzbZRD/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-EzbZRD/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203- BAR1 / Visible vRAM Size: 8176 MB - vBIOS Version: 113-D53201-R66E- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

OpenCL ROCm AMDshoc: OpenCL - S3Dfluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sclpeak: Kernel Latencyclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferlczero: OpenCLshoc: OpenCL - Triadshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read BandwidthRX 660080.33129651844182212.032165.477891.46191.31570.898117.454.9823.03146076.441311.9076207.8981779.197.13087.0694589.766OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 660020406080100SE +/- 1.16, N = 380.331. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRX 6600369121511.91

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.06.715.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor510152025

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP32RX 66002004006008001000SE +/- 0.33, N = 3965

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP32RX 66004812162016.11

FluidX3D

GPU Power Consumption Monitor

MinAvgMaxRX 66003.059.965.0OpenBenchmarking.orgWatts, Fewer Is BetterFluidX3D 2.3GPU Power Consumption Monitor20406080100

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16CRX 6600400800120016002000SE +/- 2.19, N = 31844

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP16CRX 660051015202522.69

FluidX3D

GPU Power Consumption Monitor

MinAvgMaxRX 66003.081.390.0OpenBenchmarking.orgWatts, Fewer Is BetterFluidX3D 2.3GPU Power Consumption Monitor20406080100

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16SRX 6600400800120016002000SE +/- 1.45, N = 31822

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.3Test: FP32-FP16SRX 660071421283528.22

FluidX3D

GPU Power Consumption Monitor

MinAvgMaxRX 66003.064.672.0OpenBenchmarking.orgWatts, Fewer Is BetterFluidX3D 2.3GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyRX 66003691215SE +/- 0.13, N = 1512.031. (CXX) g++ options: -O3

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.04.319.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor612182430

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRX 66005001000150020002500SE +/- 5.37, N = 72165.471. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRX 6600306090120150147.13

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.014.7100.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRX 66002K4K6K8K10KSE +/- 31.77, N = 77891.461. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRX 660020040060080010001112.68

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.07.1100.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRX 66004080120160200SE +/- 0.40, N = 5191.311. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRX 66002468106.868

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.027.972.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRX 6600120240360480600SE +/- 0.52, N = 6570.891. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRX 660081624324035.39

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.016.182.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRX 66002K4K6K8K10KSE +/- 29.84, N = 78117.451. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRX 66002004006008001000856.49

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.09.595.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor20406080100

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferRX 66001.12052.2413.36154.4825.6025SE +/- 0.05, N = 34.981. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferRX 66000.36860.73721.10581.47441.8431.638

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.03.04.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor1.1252.253.3754.55.625

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferRX 6600612182430SE +/- 0.16, N = 1523.031. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferRX 66002468107.529

clpeak

GPU Power Consumption Monitor

MinAvgMaxRX 66003.03.113.0OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor48121620

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLRX 66003K6K9K12K15KSE +/- 28.17, N = 3146071. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second Per Watt, More Is BetterLeelaChessZero 0.28Backend: OpenCLRX 6600306090120150156.10

LeelaChessZero

GPU Power Consumption Monitor

MinAvgMaxRX 66003.093.6100.0OpenBenchmarking.orgWatts, Fewer Is BetterLeelaChessZero 0.28GPU Power Consumption Monitor20406080100

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadRX 6600246810SE +/- 0.0504, N = 46.44131. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadRX 66000.25630.51260.76891.02521.28151.139

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.05.724.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor816243240

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 66003691215SE +/- 0.01, N = 411.911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRX 66000.27520.55040.82561.10081.3761.223

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.09.7100.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor20406080100

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRX 660050100150200250SE +/- 0.16, N = 4207.901. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRX 660051015202520.85

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.010.065.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor20406080100

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 6600400800120016002000SE +/- 4.57, N = 41779.191. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRX 66002040608010082.16

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.021.797.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor20406080100

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRX 6600246810SE +/- 0.0002, N = 47.13081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRX 66000.23490.46980.70470.93961.17451.044

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.06.830.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor918273645

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackRX 6600246810SE +/- 0.0005, N = 47.06941. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackRX 66000.25040.50080.75121.00161.2521.113

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.06.429.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor918273645

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRX 6600130260390520650SE +/- 4.68, N = 9589.771. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRX 660081624324035.50

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRX 66003.016.682.0OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor20406080100

Meta Performance Per Watts

Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRX 6600112233445550.55

GPU Power Consumption Monitor

Phoronix Test Suite System Monitoring

MinAvgMaxRX 66003.09.4100.0OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System Monitoring20406080100


Phoronix Test Suite v10.8.5