gpu+cpu_opencl

Run after me figuring out how linux works with getty services, kmscon.

HTML result view exported from: https://openbenchmarking.org/result/2401194-NE-GPUCPUOPE60.

gpu+cpu_openclProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionASUS AMD Radeon RX 6700 XTAMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASRock X570 Pro4 (P5.01 BIOS)AMD Starship/Matisse128GB2 x 1000GB Western Digital WDS100T1X0E-00AFY0 + 750GB Western Digital WD7500BPKT-0 + 120GB Seagate ST9120822ASASUS AMD Radeon RX 6700 XT 12GBAMD RV710/7303 x VZ249 + DELL P2214HIntel I211Fedora Linux 396.6.11-200.fc39.x86_64 (x86_64)GNOME Shell 45.3X Server4.6 Mesa 23.3.3 (LLVM 17.0.6 DRM 3.54)Clang 17.0.6 + LLVM 17.0.6btrfs5760x1080OpenBenchmarking.org- amdgpu.tmz=1 - Transparent Huge Pages: madvise- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201025- BAR1 / Visible vRAM Size: 256 MB- Python 3.12.1- SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

gpu+cpu_openclshoc: OpenCL - S3Dshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writefluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingrodinia: OpenMP LavaMDrodinia: OpenCL Myocyterodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLluxmark: CPU+GPU - Hotelluxmark: CPU+GPU - Microphoneluxmark: CPU+GPU - Luxball HDRlulesh-cl: ASUS AMD Radeon RX 6700 XT111.6500.00010.00010.00470.00020.0002291.5346.0312.61412294829442538.5911383.36326.06802.6812694.245.6020.4291.4092331.0099729.16028491.563314111.92241.94364.81365.02112.21712.8164366053022732973141032957107377047332.5302.0230.1260.965426830768396062881.2497OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DASUS AMD Radeon RX 6700 XT20406080100SE +/- 1.67, N = 3111.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPASUS AMD Radeon RX 6700 XTSE +/- 0.0000, N = 30.00011. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashASUS AMD Radeon RX 6700 XTSE +/- 0.0000, N = 30.00011. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NASUS AMD Radeon RX 6700 XT0.00110.00220.00330.00440.0055SE +/- 0.0000, N = 30.00471. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsASUS AMD Radeon RX 6700 XTSE +/- 0.0000, N = 30.00021. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthASUS AMD Radeon RX 6700 XTSE +/- 0.0000, N = 30.00021. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyASUS AMD Radeon RX 6700 XT60120180240300SE +/- 0.07, N = 3291.51. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadASUS AMD Radeon RX 6700 XT80160240320400SE +/- 1.42, N = 3346.01. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteASUS AMD Radeon RX 6700 XT70140210280350SE +/- 1.10, N = 3312.61. (CC) gcc options: -O2 -flto -lOpenCL

FluidX3D

Test: FP32-FP32

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32ASUS AMD Radeon RX 6700 XT30060090012001500SE +/- 2.19, N = 31412

FluidX3D

Test: FP32-FP16C

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16CASUS AMD Radeon RX 6700 XT6001200180024003000SE +/- 22.33, N = 32948

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16SASUS AMD Radeon RX 6700 XT6001200180024003000SE +/- 25.89, N = 32944

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeASUS AMD Radeon RX 6700 XT5001000150020002500SE +/- 5.11, N = 32538.591. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeASUS AMD Radeon RX 6700 XT2K4K6K8K10KSE +/- 132.51, N = 1511383.361. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthASUS AMD Radeon RX 6700 XT70140210280350SE +/- 0.17, N = 3326.061. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeASUS AMD Radeon RX 6700 XT2004006008001000SE +/- 1.26, N = 3802.681. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeASUS AMD Radeon RX 6700 XT3K6K9K12K15KSE +/- 16.59, N = 312694.241. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferASUS AMD Radeon RX 6700 XT1.262.523.785.046.3SE +/- 0.26, N = 155.601. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferASUS AMD Radeon RX 6700 XT510152025SE +/- 0.09, N = 320.421. (CXX) g++ options: -O3

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMASUS AMD Radeon RX 6700 XT20406080100SE +/- 0.05, N = 391.411. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPASUS AMD Radeon RX 6700 XT0.22720.45440.68160.90881.136SE +/- 0.011361, N = 31.0099721. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilASUS AMD Radeon RX 6700 XT3691215SE +/- 0.142881, N = 159.1602841. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingASUS AMD Radeon RX 6700 XT20406080100SE +/- 0.13, N = 391.561. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDASUS AMD Radeon RX 6700 XT306090120150SE +/- 0.94, N = 3111.921. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteASUS AMD Radeon RX 6700 XT1020304050SE +/- 4.42, N = 1541.941. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DASUS AMD Radeon RX 6700 XT1428425670SE +/- 0.88, N = 364.811. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteASUS AMD Radeon RX 6700 XT1530456075SE +/- 0.27, N = 365.021. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverASUS AMD Radeon RX 6700 XT3691215SE +/- 0.16, N = 512.221. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterASUS AMD Radeon RX 6700 XT3691215SE +/- 0.22, N = 1412.821. (CXX) g++ options: -O2 -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYASUS AMD Radeon RX 6700 XT90180270360450SE +/- 4.62, N = 34361. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYASUS AMD Radeon RX 6700 XT130260390520650SE +/- 8.19, N = 36051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTASUS AMD Radeon RX 6700 XT70140210280350SE +/- 43.64, N = 33021. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYASUS AMD Radeon RX 6700 XT60120180240300SE +/- 0.88, N = 32731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYASUS AMD Radeon RX 6700 XT60120180240300SE +/- 0.58, N = 32971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTASUS AMD Radeon RX 6700 XT70140210280350SE +/- 0.58, N = 33141. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NASUS AMD Radeon RX 6700 XT20406080100SE +/- 0.33, N = 31031. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TASUS AMD Radeon RX 6700 XT60120180240300SE +/- 0.67, N = 32951. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNASUS AMD Radeon RX 6700 XT150300450600750SE +/- 0.58, N = 37101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTASUS AMD Radeon RX 6700 XT160320480640800SE +/- 0.67, N = 37371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNASUS AMD Radeon RX 6700 XT150300450600750SE +/- 0.33, N = 37041. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTASUS AMD Radeon RX 6700 XT160320480640800SE +/- 0.58, N = 37331. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Boat - Acceleration: OpenCLASUS AMD Radeon RX 6700 XT0.56931.13861.70792.27722.8465SE +/- 0.003, N = 32.530

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Masskrug - Acceleration: OpenCLASUS AMD Radeon RX 6700 XT0.45520.91041.36561.82082.276SE +/- 0.016, N = 32.023

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Server Rack - Acceleration: OpenCLASUS AMD Radeon RX 6700 XT0.02840.05680.08520.11360.142SE +/- 0.003, N = 150.126

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Server Room - Acceleration: OpenCLASUS AMD Radeon RX 6700 XT0.21710.43420.65130.86841.0855SE +/- 0.010, N = 30.965

LuxMark

OpenCL Device: CPU+GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: HotelASUS AMD Radeon RX 6700 XT9001800270036004500SE +/- 11.68, N = 34268

LuxMark

OpenCL Device: CPU+GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: MicrophoneASUS AMD Radeon RX 6700 XT7K14K21K28K35KSE +/- 179.01, N = 330768

LuxMark

OpenCL Device: CPU+GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRASUS AMD Radeon RX 6700 XT8K16K24K32K40KSE +/- 115.67, N = 339606

Lulesh OpenCL

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06ASUS AMD Radeon RX 6700 XT6001200180024003000SE +/- 69.10, N = 152881.251. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm


Phoronix Test Suite v10.8.4