VIENNACL CL BLAS

Intel Core i5-4300M testing with a Dell 0VWNW8 (A26 BIOS) and Intel HD 4600 HSW GT2 2GB on cachyos rolling via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2403070-EIRI-240227747
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Radeon HD 8790M
November 29 2023
  19 Minutes
IntelR HD Graphics 4600 HSW GT2 0x416
February 27
  2 Minutes
Intel HD Graphics 4600 HSW GT2 CLANG70
March 08
  5 Minutes
Invert Hiding All Results Option
  9 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


VIENNACL CL BLASProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLCompilerFile-SystemScreen ResolutionRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70Intel Core i5-4300M @ 3.30GHz (2 Cores / 4 Threads)Dell 0VWNW8 (A26 BIOS)Intel Xeon E3-1200 v3/4th8GB128GB SAMSUNG SSD PM85AMD Radeon HD 8790M (1250MHz)Intel Xeon E3-1200 v3/4thIntel I217-LM + Intel Centrino Ultimate-N 6300cachyos rolling6.6.2-4-cachyos-lto (x86_64)GNOME Shell 45.1X Server 1.21.1.94.6 Mesa 24.0.0-devel (git-023fa0aa5d) (LLVM 16.0.6 DRM 3.54)OpenCL 1.1 Mesa 24.0.0-devel (git-023fa0aa5d)GCC 13.2.1 20231110 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3xfs1920x1080Intel HD 4600 HSW GT2 2GB (1250MHz)6.7.6-1-cachyos-rt-bore-lto (x86_64)KDE Plasma 5.27.10X Server 1.21.1.114.6 Mesa 24.0.1-arch1.1OpenCL 2.0 beignet 1.4 (git-f72309a5)GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.66.7.9-1-cachyos-rt-bore-lto (x86_64)KDE Plasma 6.0.14.6 Mesa 24.0.2-arch1.2Clang 17.0.6 + GCC 13.2.1 20230801 + LLVM 17.0.6OpenBenchmarking.orgKernel Details- Radeon HD 8790M: cfg80211.cfg80211_disable_40mhz_24ghz=1 mac80211.minstrel_vht_only=1 - Transparent Huge Pages: alwaysEnvironment Details- Radeon HD 8790M: DRI_PRIME=1 NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"Compiler Details- Radeon HD 8790M: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - IntelR HD Graphics 4600 HSW GT2 0x416: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details- Radeon HD 8790M: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28- IntelR HD Graphics 4600 HSW GT2 0x416: Scaling Governor: intel_cpufreq powersave - CPU Microcode: 0x28- Intel HD Graphics 4600 HSW GT2 CLANG70: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28Security Details- Radeon HD 8790M: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected - IntelR HD Graphics 4600 HSW GT2 0x416: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected - Intel HD Graphics 4600 HSW GT2 CLANG70: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

VIENNACL CL BLASviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG7029.531.127.335.540.644.734.522.739.039.737.738.514.214.216.114.413.315.3OpenBenchmarking.org

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70714212835SE +/- 0.07, N = 3SE +/- 0.11, N = 15SE +/- 0.13, N = 329.514.414.7
OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70714212835Min: 29.4 / Avg: 29.47 / Max: 29.6Min: 13.6 / Avg: 14.39 / Max: 15.1Min: 14.4 / Avg: 14.67 / Max: 14.8

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70714212835SE +/- 0.03, N = 3SE +/- 0.16, N = 14SE +/- 0.06, N = 331.113.613.4
OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70714212835Min: 31 / Avg: 31.07 / Max: 31.1Min: 11.9 / Avg: 13.64 / Max: 14.2Min: 13.3 / Avg: 13.4 / Max: 13.5

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70612182430SE +/- 0.03, N = 3SE +/- 0.14, N = 15SE +/- 0.06, N = 327.3015.6015.10
OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRadeon HD 8790MIntelR HD Graphics 4600 HSW GT2 0x416Intel HD Graphics 4600 HSW GT2 CLANG70612182430Min: 27.3 / Avg: 27.33 / Max: 27.4Min: 14.4 / Avg: 15.57 / Max: 16.2Min: 15 / Avg: 15.1 / Max: 15.2

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRadeon HD 8790M816243240SE +/- 0.09, N = 335.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRadeon HD 8790M918273645SE +/- 0.03, N = 340.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRadeon HD 8790M1020304050SE +/- 0.10, N = 344.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRadeon HD 8790M816243240SE +/- 0.28, N = 334.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRadeon HD 8790M510152025SE +/- 0.07, N = 322.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRadeon HD 8790M918273645SE +/- 0.07, N = 339.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRadeon HD 8790M918273645SE +/- 0.13, N = 339.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRadeon HD 8790M918273645SE +/- 0.06, N = 337.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRadeon HD 8790M918273645SE +/- 0.17, N = 338.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL