opencl-set0-yoda-prehw

Intel Core i7-7700 testing with a ASUS PRIME H270M-PLUS (0809 BIOS) and Sapphire AMD Radeon RX 6700 XT 12GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2309274-BILL-OPENCLS98&grr.

opencl-set0-yoda-prehwProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution20230927-trialIntel Core i7-7700 @ 4.20GHz (4 Cores / 8 Threads)ASUS PRIME H270M-PLUS (0809 BIOS)Intel Xeon E3-1200 v6/7th + H27032GBSamsung SSD 960 EVO 250GB + 1000GB Samsung SSD 970 EVO Plus 1TB + 3001GB Western Digital WD30EFRX-68ESapphire AMD Radeon RX 6700 XT 12GB (2725/1000MHz)Realtek ALC887-VDPB248Intel I219-V + Intel Wi-Fi 6 AX200Ubuntu 22.045.15.0-84-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.34.6 Mesa 23.2.0-devel (LLVM 16.0.6 DRM 3.54)OpenCL 2.1 AMD-APP (3590.0)1.3.252GCC 11.4.0 + LLVM 14.0.0ext4 (ecryptfs)1920x1200OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9- BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05- Python 3.10.12- gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled

opencl-set0-yoda-prehwluxmark: GPU - Microphoneluxmark: CPU+GPU - Microphoneviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYfluidx3d: FP32-FP16Srodinia: OpenCL Myocytelulesh-cl: smallpt-gpu: GPU - Caustic3parboil: OpenMP Stencilmandelbulbgpu: CPU+GPUjuliagpu: CPU+GPUjuliagpu: GPUdarktable: Masskrug - OpenCLdarktable: Masskrug - CPU-onlycl-mem: Copyclpeak: Double-Precision Computemandelgpu: CPU+GPUshoc: OpenCL - Triadsmallpt-gpu: CPU - Caustic320230927-trial300632979222172971073270517375.8187201322620413268912.0632953.3839169583453523.16534979519746.9135315754.4137240757.55.2167.894281.3808.92266133232.111.1138OpenBenchmarking.org

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Microphone20230927-trial6K12K18K24K30KSE +/- 72.19, N = 330063

LuxMark

OpenCL Device: CPU+GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Microphone20230927-trial6K12K18K24K30KSE +/- 177.68, N = 329792

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPY20230927-trial50100150200250SE +/- 2.94, N = 112211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TT20230927-trial160320480640800SE +/- 0.26, N = 127291. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TN20230927-trial150300450600750SE +/- 0.29, N = 127101. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NT20230927-trial160320480640800SE +/- 0.29, N = 127321. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NN20230927-trial150300450600750SE +/- 0.36, N = 127051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-T20230927-trial4080120160200SE +/- 12.99, N = 111731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-N20230927-trial20406080100SE +/- 1.09, N = 1275.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOT20230927-trial4080120160200SE +/- 11.40, N = 121871. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPY20230927-trial4080120160200SE +/- 5.02, N = 122011. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOT20230927-trial70140210280350SE +/- 3.17, N = 123221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPY20230927-trial130260390520650SE +/- 0.69, N = 126201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPY20230927-trial90180270360450SE +/- 19.45, N = 124131. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16S20230927-trial6001200180024003000SE +/- 6.17, N = 32689

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte20230927-trial3691215SE +/- 3.48, N = 1512.061. (CXX) g++ options: -O2 -lOpenCL

Lulesh OpenCL

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-0620230927-trial6001200180024003000SE +/- 47.73, N = 152953.381. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

SmallPT GPU

OpenCL Device: GPU - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic320230927-trial400M800M1200M1600M2000MSE +/- 25.12, N = 316958345351. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil20230927-trial612182430SE +/- 0.15, N = 323.171. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

MandelbulbGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: CPU+GPU20230927-trial20M40M60M80M100MSE +/- 1033260.89, N = 779519746.91. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: CPU+GPU20230927-trial30M60M90M120M150MSE +/- 343734.88, N = 3135315754.41. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPU20230927-trial30M60M90M120M150MSE +/- 251640.79, N = 3137240757.51. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Masskrug - Acceleration: OpenCL20230927-trial1.17362.34723.52084.69445.868SE +/- 0.051, N = 35.216

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Masskrug - Acceleration: CPU-only20230927-trial246810SE +/- 0.007, N = 37.894

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copy20230927-trial60120180240300SE +/- 0.09, N = 3281.31. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision Compute20230927-trial2004006008001000SE +/- 0.21, N = 3808.921. (CXX) g++ options: -O3

MandelGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: CPU+GPU20230927-trial60M120M180M240M300MSE +/- 1702516.89, N = 3266133232.11. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad20230927-trial3691215SE +/- 0.16, N = 511.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi


Phoronix Test Suite v10.8.5