VIENNACL CL BLAS

Intel Core i5-4300M testing with a Dell 0VWNW8 (A26 BIOS) and Intel HD 4600 HSW GT2 2GB on cachyos rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402277-EIRI-231129974&sro&grr.

VIENNACL CL BLASProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel Core i5-4300M @ 3.30GHz (2 Cores / 4 Threads)Dell 0VWNW8 (A26 BIOS)Intel Xeon E3-1200 v3/4th8GB128GB SAMSUNG SSD PM85AMD Radeon HD 8790M (1250MHz)Intel Xeon E3-1200 v3/4thIntel I217-LM + Intel Centrino Ultimate-N 6300cachyos rolling6.6.2-4-cachyos-lto (x86_64)GNOME Shell 45.1X Server 1.21.1.94.6 Mesa 24.0.0-devel (git-023fa0aa5d) (LLVM 16.0.6 DRM 3.54)OpenCL 1.1 Mesa 24.0.0-devel (git-023fa0aa5d)GCC 13.2.1 20231110 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3xfs1920x1080Intel HD 4600 HSW GT2 2GB (1250MHz)6.7.6-1-cachyos-rt-bore-lto (x86_64)KDE Plasma 5.27.10X Server 1.21.1.114.6 Mesa 24.0.1-arch1.1OpenCL 2.0 beignet 1.4 (git-f72309a5)GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6OpenBenchmarking.orgKernel Details- Radeon HD 8790M: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 - Transparent Huge Pages: alwaysEnvironment Details- Radeon HD 8790M: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"Compiler Details- Radeon HD 8790M: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - IntelR HD Graphics 4600 HSW GT2 0x416: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details- Radeon HD 8790M: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28- IntelR HD Graphics 4600 HSW GT2 0x416: Scaling Governor: intel_cpufreq powersave - CPU Microcode: 0x28Security Details- Radeon HD 8790M: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - IntelR HD Graphics 4600 HSW GT2 0x416: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

VIENNACL CL BLASviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dCOPYRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x41631.129.527.338.537.739.739.022.734.544.740.635.514.214.216.1OpenBenchmarking.org

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYIntelR HD Graphics 4600 HSW GT2 0x416Radeon HD 8790M714212835SE +/- 0.30, N = 14SE +/- 0.03, N = 313.231.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYIntelR HD Graphics 4600 HSW GT2 0x416Radeon HD 8790M714212835SE +/- 0.12, N = 7SE +/- 0.07, N = 313.629.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTIntelR HD Graphics 4600 HSW GT2 0x416Radeon HD 8790M612182430SE +/- 0.53, N = 14SE +/- 0.03, N = 314.6127.301. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRadeon HD 8790M918273645SE +/- 0.17, N = 338.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRadeon HD 8790M918273645SE +/- 0.06, N = 337.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRadeon HD 8790M918273645SE +/- 0.13, N = 339.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRadeon HD 8790M918273645SE +/- 0.07, N = 339.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRadeon HD 8790M510152025SE +/- 0.07, N = 322.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRadeon HD 8790M816243240SE +/- 0.28, N = 334.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRadeon HD 8790M1020304050SE +/- 0.10, N = 344.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRadeon HD 8790M918273645SE +/- 0.03, N = 340.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRadeon HD 8790M816243240SE +/- 0.09, N = 335.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5