opencl-set0-yoda-prehw

Test after swapping in the new HW, CPU set to performance.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310204-BILL-230928918
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 3 Tests
HPC - High Performance Computing 3 Tests
Multi-Core 2 Tests
NVIDIA GPU Compute 6 Tests
OpenCL 15 Tests
OpenMPI Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
20230927-trial
September 27 2023
  1 Hour, 28 Minutes
20230928_preswitch
September 28 2023
  41 Minutes
20231020_postswitchperf
October 20 2023
  50 Minutes
Invert Hiding All Results Option
  1 Hour

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


opencl-set0-yoda-prehw - Phoronix Test Suite

opencl-set0-yoda-prehw

Test after swapping in the new HW, CPU set to performance.

HTML result view exported from: https://openbenchmarking.org/result/2310204-BILL-230928918&sor&grw.

opencl-set0-yoda-prehwProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution20230927-trial20230928_preswitch20231020_postswitchperfIntel Core i7-7700 @ 4.20GHz (4 Cores / 8 Threads)ASUS PRIME H270M-PLUS (0809 BIOS)Intel Xeon E3-1200 v6/7th + H27032GBSamsung SSD 960 EVO 250GB + 1000GB Samsung SSD 970 EVO Plus 1TB + 3001GB Western Digital WD30EFRX-68ESapphire AMD Radeon RX 6700 XT 12GB (2725/1000MHz)Realtek ALC887-VDPB248Intel I219-V + Intel Wi-Fi 6 AX200Ubuntu 22.045.15.0-84-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.34.6 Mesa 23.2.0-devel (LLVM 16.0.6 DRM 3.54)OpenCL 2.1 AMD-APP (3590.0)1.3.252GCC 11.4.0 + LLVM 14.0.0ext4 (ecryptfs)1920x1200AMD Ryzen 9 7950X3D 16-Core @ 4.20GHz (16 Cores / 32 Threads)ASUS TUF GAMING B650M-PLUS WIFI (0823 BIOS)AMD Device 14d862GB1000GB Samsung SSD 970 EVO Plus 1TB + Samsung SSD 970 EVO Plus 500GB + 3001GB Western Digital WD30EFRX-68EAMD Navi 21 HDMI AudioRealtek RTL8125 2.5GbE + Realtek Device b8525.15.0-86-generic (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- 20230927-trial: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9- 20230928_preswitch: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9- 20231020_postswitchperf: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- 20230927-trial: BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05- 20230928_preswitch: BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05- 20231020_postswitchperf: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5122200-S05Python Details- Python 3.10.12Security Details- 20230927-trial: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled - 20230928_preswitch: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled - 20231020_postswitchperf: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

opencl-set0-yoda-prehwdarktable: Masskrug - OpenCLdarktable: Masskrug - CPU-onlyshoc: OpenCL - Triadparboil: OpenMP Stencilrodinia: OpenCL Myocytecl-mem: Copyclpeak: Double-Precision Computemandelgpu: CPU+GPUviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dAXPYfluidx3d: FP32-FP16Sjuliagpu: GPUjuliagpu: CPU+GPUlulesh-cl: luxmark: GPU - Microphoneluxmark: CPU+GPU - Microphonemandelbulbgpu: CPU+GPUsmallpt-gpu: GPU - Caustic3xsbench-cl: 20230927-trial20230928_preswitch20231020_postswitchperf5.2167.89411.113823.16534912.063281.3808.92266133232.141362032220118775.81737057327107292212689137240757.5135315754.42953.3839300632979279519746.916958345355.3308.00810.924722.1601718.657281.2808.68268569163.242959830519421076.11997127337087302232716142637305.6139394174.92913.8920297973013279087731.316958780161.3821.83313.51054.73826247.627281.5788.75330203056.34225903352703261022907007227017163022620428957016.6411077431.43122.50703598235363222793824.31697804201OpenBenchmarking.org

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Masskrug - Acceleration: OpenCL20231020_postswitchperf20230927-trial20230928_preswitch1.19932.39863.59794.79725.9965SE +/- 0.019, N = 3SE +/- 0.051, N = 3SE +/- 0.020, N = 31.3825.2165.330

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.4.2Test: Masskrug - Acceleration: CPU-only20231020_postswitchperf20230927-trial20230928_preswitch246810SE +/- 0.003, N = 3SE +/- 0.007, N = 3SE +/- 0.016, N = 31.8337.8948.008

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad20231020_postswitchperf20230927-trial20230928_preswitch3691215SE +/- 0.14, N = 15SE +/- 0.16, N = 5SE +/- 0.22, N = 313.5111.1110.921. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil20231020_postswitchperf20230928_preswitch20230927-trial612182430SE +/- 0.054079, N = 3SE +/- 0.024061, N = 3SE +/- 0.148343, N = 34.73826222.16017123.1653491. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte20230928_preswitch20230927-trial20231020_postswitchperf1122334455SE +/- 0.142, N = 4SE +/- 3.477, N = 15SE +/- 11.383, N = 158.65712.06347.6271. (CXX) g++ options: -O2 -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copy20231020_postswitchperf20230927-trial20230928_preswitch60120180240300SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3281.5281.3281.21. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision Compute20230927-trial20230928_preswitch20231020_postswitchperf2004006008001000SE +/- 0.21, N = 3SE +/- 0.28, N = 3SE +/- 0.44, N = 3808.92808.68788.751. (CXX) g++ options: -O3

MandelGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: CPU+GPU20231020_postswitchperf20230928_preswitch20230927-trial70M140M210M280M350MSE +/- 688999.96, N = 3SE +/- 403315.78, N = 3SE +/- 1702516.89, N = 3330203056.3268569163.2266133232.11. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPY20230928_preswitch20231020_postswitchperf20230927-trial90180270360450SE +/- 3.21, N = 3SE +/- 5.77, N = 3SE +/- 19.45, N = 124294224131. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPY20230927-trial20230928_preswitch20231020_postswitchperf130260390520650SE +/- 0.69, N = 12SE +/- 1.45, N = 3SE +/- 8.08, N = 36205985901. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOT20231020_postswitchperf20230927-trial20230928_preswitch70140210280350SE +/- 5.55, N = 3SE +/- 3.17, N = 12SE +/- 4.62, N = 33353223051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPY20231020_postswitchperf20230927-trial20230928_preswitch60120180240300SE +/- 0.00, N = 3SE +/- 5.02, N = 12SE +/- 11.57, N = 32702011941. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOT20231020_postswitchperf20230928_preswitch20230927-trial70140210280350SE +/- 1.33, N = 3SE +/- 28.01, N = 3SE +/- 11.40, N = 123262101871. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-N20231020_postswitchperf20230928_preswitch20230927-trial20406080100SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 1.09, N = 12102.076.175.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-T20231020_postswitchperf20230928_preswitch20230927-trial60120180240300SE +/- 1.00, N = 3SE +/- 27.91, N = 3SE +/- 12.99, N = 112901991731. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NN20230928_preswitch20230927-trial20231020_postswitchperf150300450600750SE +/- 0.33, N = 3SE +/- 0.36, N = 12SE +/- 0.58, N = 37127057001. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NT20230928_preswitch20230927-trial20231020_postswitchperf160320480640800SE +/- 0.58, N = 3SE +/- 0.29, N = 12SE +/- 0.58, N = 37337327221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TN20230927-trial20230928_preswitch20231020_postswitchperf150300450600750SE +/- 0.29, N = 12SE +/- 0.67, N = 3SE +/- 0.67, N = 37107087011. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TT20230928_preswitch20230927-trial20231020_postswitchperf160320480640800SE +/- 0.58, N = 3SE +/- 0.26, N = 12SE +/- 2.33, N = 37307297161. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPY20231020_postswitchperf20230928_preswitch20230927-trial70140210280350SE +/- 0.33, N = 3SE +/- 5.04, N = 3SE +/- 2.94, N = 113022232211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

FluidX3D

Test: FP32-FP16S

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.3Test: FP32-FP16S20230928_preswitch20230927-trial20231020_postswitchperf6001200180024003000SE +/- 2.85, N = 3SE +/- 6.17, N = 3SE +/- 5.04, N = 3271626892620

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPU20231020_postswitchperf20230928_preswitch20230927-trial90M180M270M360M450MSE +/- 6247483.81, N = 3SE +/- 2175742.71, N = 5SE +/- 251640.79, N = 3428957016.6142637305.6137240757.51. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

JuliaGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: CPU+GPU20231020_postswitchperf20230928_preswitch20230927-trial90M180M270M360M450MSE +/- 10623925.13, N = 15SE +/- 1203139.51, N = 3SE +/- 343734.88, N = 3411077431.4139394174.9135315754.41. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Lulesh OpenCL

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-0620231020_postswitchperf20230927-trial20230928_preswitch7001400210028003500SE +/- 42.68, N = 15SE +/- 47.73, N = 15SE +/- 36.57, N = 153122.512953.382913.891. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Microphone20231020_postswitchperf20230927-trial20230928_preswitch8K16K24K32K40KSE +/- 593.35, N = 3SE +/- 72.19, N = 3SE +/- 174.17, N = 3359823006329797

LuxMark

OpenCL Device: CPU+GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Microphone20231020_postswitchperf20230928_preswitch20230927-trial8K16K24K32K40KSE +/- 9.74, N = 3SE +/- 6.11, N = 3SE +/- 177.68, N = 3353633013229792

MandelbulbGPU

OpenCL Device: CPU+GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: CPU+GPU20231020_postswitchperf20230927-trial20230928_preswitch50M100M150M200M250MSE +/- 4252385.89, N = 15SE +/- 1033260.89, N = 7SE +/- 359470.33, N = 3222793824.379519746.979087731.31. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic320231020_postswitchperf20230928_preswitch20230927-trial400M800M1200M1600M2000MSE +/- 24.54, N = 3SE +/- 25.12, N = 3SE +/- 25.12, N = 31697804201169587801616958345351. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4