opencl_works_20221029 AMD Ryzen 9 6900HS testing with a ASUS GA402RJ v1.0 (GA402RJ.315 BIOS) and ASUS AMD Radeon RX 6650 XT 8GB on openSUSE 20221027 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2210296-NE-OPENCLWOR07&grs .
opencl_works_20221029 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenCL Vulkan Compiler File-System Screen Resolution opencl_works_20221029 AMD Ryzen 9 6900HS @ 3.30GHz (8 Cores / 16 Threads) ASUS GA402RJ v1.0 (GA402RJ.315 BIOS) AMD Device 14b5 8 GB + 32 GB DDR5-4800MT/s 1024GB Micron_2450_MTFDKBA1T0TFK ASUS AMD Radeon RX 6650 XT 8GB (2400/1000MHz) AMD Navi 21/23 MEDIATEK Device 7922 openSUSE 20221027 6.0.3-1-default (x86_64) KDE Plasma X Server 1.21.1.4 OpenCL 2.1 AMD-APP (3486.0) 1.3.224 GCC 12.2.1 20221020 [revision 0aaef83351473e8f4eb774f8f999bbe87a4866d7] + Clang 15.0.2 + LLVM 7.0.1 btrfs 2560x1600 OpenBenchmarking.org - amdgpu.ppfeaturemask=0xffffffff - Transparent Huge Pages: always - DRI_PRIME=1 - --build=x86_64-suse-linux --disable-libcc1 --disable-libssp --disable-libstdcxx-pch --disable-libvtv --disable-werror --enable-cet=auto --enable-checking=release --enable-gnu-indirect-function --enable-host-shared --enable-languages=c,c++,objc,fortran,obj-c++,ada,go,d,jit --enable-libphobos --enable-libstdcxx-allocator=new --enable-link-mutex --enable-linux-futex --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa, --enable-plugin --enable-ssp --enable-version-specific-runtime-libs --host=x86_64-suse-linux --mandir=/usr/share/man --with-arch-32=x86-64 --with-build-config=bootstrap-lto-lean --with-gcc-major-version-only --with-slibdir=/lib64 --with-tune=generic --without-cuda-driver --without-system-libunwind - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - Platform Profile: balanced - CPU Microcode: 0xa404102 - ACPI Profile: balanced - GLAMOR - vBIOS Version: 113-REMBRANDT-X37 - Python 2.7.18 + Python 3.10.7 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
opencl_works_20221029 clpeak: Transfer Bandwidth enqueueWriteBuffer clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Global Memory Bandwidth clpeak: Double-Precision Double clpeak: Single-Precision Float clpeak: Integer Compute INT clpeak: Kernel Latency darktable: Server Room - OpenCL darktable: Server Rack - OpenCL darktable: Masskrug - OpenCL darktable: Boat - OpenCL rodinia: OpenCL Leukocyte rodinia: OpenCL Myocyte cl-mem: Write cl-mem: Read cl-mem: Copy shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download shoc: OpenCL - MD5 Hash shoc: OpenCL - FFT SP shoc: OpenCL - Triad shoc: OpenCL - Max SP Flops opencl_works_20221029 49.16 7.65 201.05 533.47 8312.79 2102.77 10.24 0.782 0.445 3.099 2.586 4.056 45.731 190.6 204.5 177.3 562.987 14.0571 14.3230 10.7147 572.166 12.3894 15925100 OpenBenchmarking.org
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer opencl_works_20221029 11 22 33 44 55 SE +/- 0.61, N = 4 49.16 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer opencl_works_20221029 2 4 6 8 10 SE +/- 0.08, N = 3 7.65 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth opencl_works_20221029 40 80 120 160 200 SE +/- 0.03, N = 3 201.05 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double opencl_works_20221029 120 240 360 480 600 SE +/- 0.14, N = 3 533.47 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float opencl_works_20221029 2K 4K 6K 8K 10K SE +/- 30.16, N = 3 8312.79 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT opencl_works_20221029 500 1000 1500 2000 2500 SE +/- 1.46, N = 3 2102.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency opencl_works_20221029 3 6 9 12 15 SE +/- 0.12, N = 3 10.24 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.1 Test: Server Room - Acceleration: OpenCL opencl_works_20221029 0.176 0.352 0.528 0.704 0.88 SE +/- 0.000, N = 3 0.782
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.1 Test: Server Rack - Acceleration: OpenCL opencl_works_20221029 0.1001 0.2002 0.3003 0.4004 0.5005 SE +/- 0.002, N = 3 0.445
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.1 Test: Masskrug - Acceleration: OpenCL opencl_works_20221029 0.6973 1.3946 2.0919 2.7892 3.4865 SE +/- 0.006, N = 3 3.099
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.1 Test: Boat - Acceleration: OpenCL opencl_works_20221029 0.5819 1.1638 1.7457 2.3276 2.9095 SE +/- 0.018, N = 3 2.586
Rodinia Test: OpenCL Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Leukocyte opencl_works_20221029 0.9126 1.8252 2.7378 3.6504 4.563 SE +/- 0.022, N = 3 4.056 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Myocyte opencl_works_20221029 10 20 30 40 50 SE +/- 0.11, N = 3 45.73 1. (CXX) g++ options: -O2 -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write opencl_works_20221029 40 80 120 160 200 SE +/- 0.17, N = 3 190.6 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read opencl_works_20221029 40 80 120 160 200 SE +/- 0.12, N = 3 204.5 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy opencl_works_20221029 40 80 120 160 200 SE +/- 0.00, N = 3 177.3 1. (CC) gcc options: -O2 -flto -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth opencl_works_20221029 120 240 360 480 600 SE +/- 6.67, N = 4 562.99 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback opencl_works_20221029 4 8 12 16 20 SE +/- 0.01, N = 3 14.06 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download opencl_works_20221029 4 8 12 16 20 SE +/- 0.00, N = 3 14.32 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash opencl_works_20221029 3 6 9 12 15 SE +/- 0.00, N = 3 10.71 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP opencl_works_20221029 120 240 360 480 600 SE +/- 0.41, N = 3 572.17 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad opencl_works_20221029 3 6 9 12 15 SE +/- 0.09, N = 3 12.39 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops opencl_works_20221029 3M 6M 9M 12M 15M SE +/- 1219971.29, N = 6 15925100 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
Phoronix Test Suite v10.8.4