20240412b AMD Ryzen 7 7840HS testing with a Shenzhen Meigao Electronic Equipment F7BSD v1.1 (1.05 BIOS) and amdgpudrmfb on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2404121-NE-20240412B25 .
20240412b Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenCL Vulkan Compiler File-System Screen Resolution amdgpudrmfb - AMD Ryzen 7 7840HS AMD Ryzen 7 7840HS @ 5.14GHz (8 Cores / 16 Threads) Shenzhen Meigao Electronic Equipment F7BSD v1.1 (1.05 BIOS) AMD Device 14e8 80GB 4097GB SPCC M.2 PCIe SSD amdgpudrmfb (2493/1124MHz) AMD Device ab30 DELL U2713HM 2 x Realtek RTL8125 2.5GbE + Intel Wi-Fi 6 AX210/AX211/AX411 Ubuntu 22.04 6.5.0-27-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.3 OpenCL 2.1 AMD-APP (3602.0) 1.3.255 GCC 11.4.0 ext4 1920x1080 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: amd-pstate-epp powersave (EPP: performance) - CPU Microcode: 0xa704104 - 115-D754BP0-101 - Python 3.10.12 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
20240412b shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write fluidx3d: FP32-FP32 fluidx3d: FP32-FP16C fluidx3d: FP32-FP16S clpeak: Kernel Latency clpeak: Integer Compute clpeak: Integer 24-bit Compute clpeak: Global Memory Bandwidth clpeak: Double-Precision Compute clpeak: Single-Precision Compute clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer rodinia: OpenCL Myocyte rodinia: OpenCL Leukocyte viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL lulesh-cl: amdgpudrmfb - AMD Ryzen 7 7840HS 82.5253 6.3472 772.371 12.6004 259.784 2288.87 7.1376 7.2444 824.504 227.7 258.6 256.5 1188 2255 2265 16.36 1831.23 7548.81 226.33 349.09 8611.45 5.94 20.79 7.424 3.576 39.5 59.4 46.9 35.4 53.5 42.4 41.9 42.6 48.2 47.0 52.4 50.4 209 220 213 230 191 219 88.3 236 316 353 352 358 2.422 2.899 0.461 0.934 2472.4343 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D amdgpudrmfb - AMD Ryzen 7 7840HS 20 40 60 80 100 SE +/- 0.88, N = 4 82.53 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad amdgpudrmfb - AMD Ryzen 7 7840HS 2 4 6 8 10 SE +/- 0.0057, N = 3 6.3472 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP amdgpudrmfb - AMD Ryzen 7 7840HS 170 340 510 680 850 SE +/- 0.83, N = 3 772.37 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash amdgpudrmfb - AMD Ryzen 7 7840HS 3 6 9 12 15 SE +/- 0.01, N = 3 12.60 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction amdgpudrmfb - AMD Ryzen 7 7840HS 60 120 180 240 300 SE +/- 0.43, N = 3 259.78 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N amdgpudrmfb - AMD Ryzen 7 7840HS 500 1000 1500 2000 2500 SE +/- 6.99, N = 3 2288.87 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download amdgpudrmfb - AMD Ryzen 7 7840HS 2 4 6 8 10 SE +/- 0.0004, N = 3 7.1376 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback amdgpudrmfb - AMD Ryzen 7 7840HS 2 4 6 8 10 SE +/- 0.0003, N = 3 7.2444 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth amdgpudrmfb - AMD Ryzen 7 7840HS 200 400 600 800 1000 SE +/- 5.80, N = 3 824.50 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.09, N = 3 227.7 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read amdgpudrmfb - AMD Ryzen 7 7840HS 60 120 180 240 300 SE +/- 0.54, N = 3 258.6 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write amdgpudrmfb - AMD Ryzen 7 7840HS 60 120 180 240 300 SE +/- 0.07, N = 3 256.5 1. (CC) gcc options: -O2 -flto -lOpenCL
FluidX3D Test: FP32-FP32 OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP32 amdgpudrmfb - AMD Ryzen 7 7840HS 300 600 900 1200 1500 SE +/- 1.76, N = 3 1188
FluidX3D Test: FP32-FP16C OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP16C amdgpudrmfb - AMD Ryzen 7 7840HS 500 1000 1500 2000 2500 SE +/- 2.52, N = 3 2255
FluidX3D Test: FP32-FP16S OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.9 Test: FP32-FP16S amdgpudrmfb - AMD Ryzen 7 7840HS 500 1000 1500 2000 2500 SE +/- 24.01, N = 3 2265
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak 1.1.2 OpenCL Test: Kernel Latency amdgpudrmfb - AMD Ryzen 7 7840HS 4 8 12 16 20 SE +/- 0.12, N = 15 16.36 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute amdgpudrmfb - AMD Ryzen 7 7840HS 400 800 1200 1600 2000 SE +/- 1.18, N = 3 1831.23 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Integer 24-bit Compute OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute amdgpudrmfb - AMD Ryzen 7 7840HS 1600 3200 4800 6400 8000 SE +/- 90.31, N = 4 7548.81 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.02, N = 3 226.33 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute amdgpudrmfb - AMD Ryzen 7 7840HS 80 160 240 320 400 SE +/- 0.29, N = 3 349.09 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Compute amdgpudrmfb - AMD Ryzen 7 7840HS 2K 4K 6K 8K 10K SE +/- 6.35, N = 3 8611.45 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer amdgpudrmfb - AMD Ryzen 7 7840HS 1.3365 2.673 4.0095 5.346 6.6825 SE +/- 0.05, N = 15 5.94 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer amdgpudrmfb - AMD Ryzen 7 7840HS 5 10 15 20 25 SE +/- 0.24, N = 15 20.79 1. (CXX) g++ options: -O3
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Myocyte amdgpudrmfb - AMD Ryzen 7 7840HS 2 4 6 8 10 SE +/- 0.058, N = 9 7.424 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Leukocyte amdgpudrmfb - AMD Ryzen 7 7840HS 0.8046 1.6092 2.4138 3.2184 4.023 SE +/- 0.039, N = 3 3.576 1. (CXX) g++ options: -O2 -lOpenCL
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY amdgpudrmfb - AMD Ryzen 7 7840HS 9 18 27 36 45 SE +/- 0.52, N = 3 39.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY amdgpudrmfb - AMD Ryzen 7 7840HS 13 26 39 52 65 SE +/- 0.53, N = 3 59.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT amdgpudrmfb - AMD Ryzen 7 7840HS 11 22 33 44 55 SE +/- 0.15, N = 3 46.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY amdgpudrmfb - AMD Ryzen 7 7840HS 8 16 24 32 40 SE +/- 0.03, N = 3 35.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY amdgpudrmfb - AMD Ryzen 7 7840HS 12 24 36 48 60 SE +/- 0.03, N = 3 53.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT amdgpudrmfb - AMD Ryzen 7 7840HS 10 20 30 40 50 SE +/- 0.06, N = 3 42.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N amdgpudrmfb - AMD Ryzen 7 7840HS 10 20 30 40 50 SE +/- 0.12, N = 3 41.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T amdgpudrmfb - AMD Ryzen 7 7840HS 10 20 30 40 50 SE +/- 0.20, N = 3 42.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN amdgpudrmfb - AMD Ryzen 7 7840HS 11 22 33 44 55 SE +/- 0.15, N = 3 48.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT amdgpudrmfb - AMD Ryzen 7 7840HS 11 22 33 44 55 SE +/- 0.18, N = 3 47.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN amdgpudrmfb - AMD Ryzen 7 7840HS 12 24 36 48 60 SE +/- 0.10, N = 3 52.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT amdgpudrmfb - AMD Ryzen 7 7840HS 11 22 33 44 55 SE +/- 0.07, N = 3 50.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.33, N = 3 209 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.00, N = 3 220 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.33, N = 3 213 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 0.33, N = 3 230 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY amdgpudrmfb - AMD Ryzen 7 7840HS 40 80 120 160 200 SE +/- 18.27, N = 3 191 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 19.75, N = 3 219 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N amdgpudrmfb - AMD Ryzen 7 7840HS 20 40 60 80 100 SE +/- 7.35, N = 3 88.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T amdgpudrmfb - AMD Ryzen 7 7840HS 50 100 150 200 250 SE +/- 4.37, N = 3 236 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN amdgpudrmfb - AMD Ryzen 7 7840HS 70 140 210 280 350 SE +/- 0.67, N = 3 316 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT amdgpudrmfb - AMD Ryzen 7 7840HS 80 160 240 320 400 SE +/- 0.33, N = 3 353 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN amdgpudrmfb - AMD Ryzen 7 7840HS 80 160 240 320 400 SE +/- 0.33, N = 3 352 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT amdgpudrmfb - AMD Ryzen 7 7840HS 80 160 240 320 400 SE +/- 0.00, N = 3 358 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Boat - Acceleration: OpenCL amdgpudrmfb - AMD Ryzen 7 7840HS 0.545 1.09 1.635 2.18 2.725 SE +/- 0.006, N = 3 2.422
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Masskrug - Acceleration: OpenCL amdgpudrmfb - AMD Ryzen 7 7840HS 0.6523 1.3046 1.9569 2.6092 3.2615 SE +/- 0.007, N = 3 2.899
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Server Rack - Acceleration: OpenCL amdgpudrmfb - AMD Ryzen 7 7840HS 0.1037 0.2074 0.3111 0.4148 0.5185 SE +/- 0.003, N = 15 0.461
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.8.1 Test: Server Room - Acceleration: OpenCL amdgpudrmfb - AMD Ryzen 7 7840HS 0.2102 0.4204 0.6306 0.8408 1.051 SE +/- 0.006, N = 3 0.934
Lulesh OpenCL OpenBenchmarking.org z/s, More Is Better Lulesh OpenCL 2017-07-06 amdgpudrmfb - AMD Ryzen 7 7840HS 500 1000 1500 2000 2500 SE +/- 6.89, N = 3 2472.43 1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
Phoronix Test Suite v10.8.5