Test after swapping in the new HW, CPU set to performance.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
    phoronix-test-suite benchmark 2310204-BILL-230928918

Result file: opencl-set0-yoda-prehw (Phoronix Test Suite)
HTML result view exported from: https://openbenchmarking.org/result/2310204-BILL-230928918&grw&sor
opencl-set0-yoda-prehw: System Details

20230927-trial, 20230928_preswitch:
  Processor:         Intel Core i7-7700 @ 4.20GHz (4 Cores / 8 Threads)
  Motherboard:       ASUS PRIME H270M-PLUS (0809 BIOS)
  Chipset:           Intel Xeon E3-1200 v6/7th + H270
  Memory:            32GB
  Disk:              Samsung SSD 960 EVO 250GB + 1000GB Samsung SSD 970 EVO Plus 1TB + 3001GB Western Digital WD30EFRX-68E
  Graphics:          Sapphire AMD Radeon RX 6700 XT 12GB (2725/1000MHz)
  Audio:             Realtek ALC887-VD
  Monitor:           PB248
  Network:           Intel I219-V + Intel Wi-Fi 6 AX200
  OS:                Ubuntu 22.04
  Kernel:            5.15.0-84-generic (x86_64)
  Desktop:           GNOME Shell 42.9
  Display Server:    X Server 1.21.1.3
  OpenGL:            4.6 Mesa 23.2.0-devel (LLVM 16.0.6 DRM 3.54)
  OpenCL:            OpenCL 2.1 AMD-APP (3590.0)
  Vulkan:            1.3.252
  Compiler:          GCC 11.4.0 + LLVM 14.0.0
  File-System:       ext4 (ecryptfs)
  Screen Resolution: 1920x1200

20231020_postswitchperf (only the fields that differ from the above):
  Processor:         AMD Ryzen 9 7950X3D 16-Core @ 4.20GHz (16 Cores / 32 Threads)
  Motherboard:       ASUS TUF GAMING B650M-PLUS WIFI (0823 BIOS)
  Chipset:           AMD Device 14d8
  Memory:            62GB
  Disk:              1000GB Samsung SSD 970 EVO Plus 1TB + Samsung SSD 970 EVO Plus 500GB + 3001GB Western Digital WD30EFRX-68E
  Audio:             AMD Navi 21 HDMI Audio
  Network:           Realtek RTL8125 2.5GbE + Realtek Device b852
  Kernel:            5.15.0-86-generic (x86_64)

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - 20230927-trial: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9
  - 20230928_preswitch: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9
  - 20231020_postswitchperf: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203

Graphics Details
  - 20230927-trial: BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05
  - 20230928_preswitch: BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05
  - 20231020_postswitchperf: BAR1 / Visible vRAM Size: 12272 MB - vBIOS Version: 113-D5122200-S05

Python Details
  - Python 3.10.12

Security Details
  - 20230927-trial, 20230928_preswitch (identical):
      gather_data_sampling: Mitigation of Microcode
      itlb_multihit: KVM: Mitigation of VMX disabled
      l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
      mds: Mitigation of Clear buffers; SMT vulnerable
      meltdown: Mitigation of PTI
      mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable
      retbleed: Mitigation of IBRS
      spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
      spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
      spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected
      srbds: Mitigation of Microcode
      tsx_async_abort: Mitigation of TSX disabled
  - 20231020_postswitchperf:
      gather_data_sampling: Not affected
      itlb_multihit: Not affected
      l1tf: Not affected
      mds: Not affected
      meltdown: Not affected
      mmio_stale_data: Not affected
      retbleed: Not affected
      spec_rstack_overflow: Mitigation of safe RET no microcode
      spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
      spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
      spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
      srbds: Not affected
      tsx_async_abort: Not affected
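The Processor Details record a different scaling governor for the post-switch run (acpi-cpufreq performance) than for the two earlier runs (intel_pstate powersave). As a minimal sketch for checking the active governor before re-running the comparison, the snippet below reads the standard Linux cpufreq sysfs files. The function name `read_governors` and its `sysfs_root` parameter are hypothetical helpers introduced for illustration; they are not part of the Phoronix Test Suite.

```python
from pathlib import Path


def read_governors(sysfs_root="/sys/devices/system/cpu"):
    """Return {cpu_name: governor} for every CPU exposing a cpufreq governor.

    sysfs_root is a parameter only so the function can be pointed at a
    test directory; on a real Linux system the default path is the usual one.
    """
    governors = {}
    pattern = "cpu[0-9]*/cpufreq/scaling_governor"
    for gov_file in sorted(Path(sysfs_root).glob(pattern)):
        cpu_name = gov_file.parent.parent.name  # e.g. "cpu0"
        governors[cpu_name] = gov_file.read_text().strip()
    return governors


if __name__ == "__main__":
    # Prints nothing on systems without cpufreq support.
    for cpu, governor in read_governors().items():
        print(f"{cpu}: {governor}")
```

If every CPU reports `performance`, the setup matches what the description line says about the post-switch run, at least as far as the governor is concerned.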
opencl-set0-yoda-prehw: Results Overview

Test                              20230927-trial  20230928_preswitch  20231020_postswitchperf
darktable: Masskrug - OpenCL      5.216           5.330               1.382
darktable: Masskrug - CPU-only    7.894           8.008               1.833
shoc: OpenCL - Triad              11.1138         10.9247             13.5105
parboil: OpenMP Stencil           23.165349       22.160171           4.738262
rodinia: OpenCL Myocyte           12.063          8.657               47.627
cl-mem: Copy                      281.3           281.2               281.5
clpeak: Double-Precision Compute  808.92          808.68              788.75
mandelgpu: CPU+GPU                266133232.1     268569163.2         330203056.3
viennacl: OpenCL BLAS - sCOPY     413             429                 422
viennacl: OpenCL BLAS - sAXPY     620             598                 590
viennacl: OpenCL BLAS - sDOT      322             305                 335
viennacl: OpenCL BLAS - dCOPY     201             194                 270
viennacl: OpenCL BLAS - dDOT      187             210                 326
viennacl: OpenCL BLAS - dGEMV-N   75.8            76.1                102
viennacl: OpenCL BLAS - dGEMV-T   173             199                 290
viennacl: OpenCL BLAS - dGEMM-NN  705             712                 700
viennacl: OpenCL BLAS - dGEMM-NT  732             733                 722
viennacl: OpenCL BLAS - dGEMM-TN  710             708                 701
viennacl: OpenCL BLAS - dGEMM-TT  729             730                 716
viennacl: OpenCL BLAS - dAXPY     221             223                 302
fluidx3d: FP32-FP16S              2689            2716                2620
juliagpu: GPU                     137240757.5     142637305.6         428957016.6
juliagpu: CPU+GPU                 135315754.4     139394174.9         411077431.4
lulesh-cl:                        2953.3839       2913.8920           3122.5070
luxmark: GPU - Microphone         30063           29797               35982
luxmark: CPU+GPU - Microphone     29792           30132               35363
mandelbulbgpu: CPU+GPU            79519746.9      79087731.3          222793824.3
smallpt-gpu: GPU - Caustic3       1695834535      1695878016          1697804201
xsbench-cl:                       (no values in the export)
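The overview mixes "fewer is better" and "more is better" metrics, so raw values are hard to compare at a glance. A minimal sketch of turning a handful of the figures above into speedup ratios follows. The selection of tests, the `RESULTS` table layout, and the `speedup` helper are illustrative choices, not part of the export; the numbers are copied from the 20230928_preswitch and 20231020_postswitchperf runs.

```python
# (preswitch value, postswitchperf value, higher_is_better)
# Values copied from the results overview; the choice of tests is arbitrary.
RESULTS = {
    "darktable: Masskrug - OpenCL": (5.330, 1.382, False),
    "parboil: OpenMP Stencil": (22.160171, 4.738262, False),
    "rodinia: OpenCL Myocyte": (8.657, 47.627, False),
    "clpeak: Double-Precision Compute": (808.68, 788.75, True),
    "juliagpu: GPU": (142637305.6, 428957016.6, True),
}


def speedup(before, after, higher_is_better):
    """Return a ratio above 1.0 when the 'after' run is the better one."""
    if higher_is_better:
        return after / before
    return before / after


if __name__ == "__main__":
    for name, (pre, post, higher_is_better) in RESULTS.items():
        ratio = speedup(pre, post, higher_is_better)
        print(f"{name:34s} {ratio:6.2f}x")
```

A ratio below 1.0 marks a regression on the new hardware, which is the case here for the Rodinia Myocyte and clpeak entries.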
Darktable 4.4.2 - Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
  20231020_postswitchperf: 1.382 (SE +/- 0.019, N = 3)
  20230927-trial:          5.216 (SE +/- 0.051, N = 3)
  20230928_preswitch:      5.330 (SE +/- 0.020, N = 3)
Darktable 4.4.2 - Test: Masskrug - Acceleration: CPU-only (Seconds, Fewer Is Better)
  20231020_postswitchperf: 1.833 (SE +/- 0.003, N = 3)
  20230927-trial:          7.894 (SE +/- 0.007, N = 3)
  20230928_preswitch:      8.008 (SE +/- 0.016, N = 3)
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
  20231020_postswitchperf: 13.51 (SE +/- 0.14, N = 15)
  20230927-trial:          11.11 (SE +/- 0.16, N = 5)
  20230928_preswitch:      10.92 (SE +/- 0.22, N = 3)
  1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Parboil 2.5 - Test: OpenMP Stencil (Seconds, Fewer Is Better)
  20231020_postswitchperf: 4.738262 (SE +/- 0.054079, N = 3)
  20230928_preswitch:      22.160171 (SE +/- 0.024061, N = 3)
  20230927-trial:          23.165349 (SE +/- 0.148343, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia 3.1 - Test: OpenCL Myocyte (Seconds, Fewer Is Better)
  20230928_preswitch:      8.657 (SE +/- 0.142, N = 4)
  20230927-trial:          12.063 (SE +/- 3.477, N = 15)
  20231020_postswitchperf: 47.627 (SE +/- 11.383, N = 15)
  1. (CXX) g++ options: -O2 -lOpenCL
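The Rodinia Myocyte figures above stand out for their large SE values, so it is worth checking how noisy each run was before reading much into the averages. The sketch below assumes that the reported "SE" is the standard error of the mean (sample standard deviation divided by the square root of N); that interpretation, and the helper names `std_dev_from_se` and `relative_se`, are assumptions made for illustration.

```python
import math

# (mean seconds, SE, N) copied from the Rodinia OpenCL Myocyte result.
MYOCYTE = {
    "20230928_preswitch": (8.657, 0.142, 4),
    "20230927-trial": (12.063, 3.477, 15),
    "20231020_postswitchperf": (47.627, 11.383, 15),
}


def std_dev_from_se(se, n):
    """Recover the sample standard deviation, assuming SE = SD / sqrt(N)."""
    return se * math.sqrt(n)


def relative_se(mean, se):
    """Standard error as a fraction of the mean."""
    return se / mean


if __name__ == "__main__":
    for run, (mean, se, n) in MYOCYTE.items():
        sd = std_dev_from_se(se, n)
        rel = relative_se(mean, se)
        print(f"{run:24s} mean={mean:7.3f}s  SD~{sd:6.2f}s  SE/mean={rel:.1%}")
```

Under that assumption, the post-switch run has a standard deviation close to its own mean, so the 47.627-second average is a weak basis for concluding that the new hardware is slower on this test.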
cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
  20231020_postswitchperf: 281.5 (SE +/- 0.17, N = 3)
  20230927-trial:          281.3 (SE +/- 0.09, N = 3)
  20230928_preswitch:      281.2 (SE +/- 0.09, N = 3)
  1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak 1.1.2 - OpenCL Test: Double-Precision Compute (GFLOPS, More Is Better)
  20230927-trial:          808.92 (SE +/- 0.21, N = 3)
  20230928_preswitch:      808.68 (SE +/- 0.28, N = 3)
  20231020_postswitchperf: 788.75 (SE +/- 0.44, N = 3)
  1. (CXX) g++ options: -O3
MandelGPU 1.3pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
  20231020_postswitchperf: 330203056.3 (SE +/- 688999.96, N = 3)
  20230928_preswitch:      268569163.2 (SE +/- 403315.78, N = 3)
  20230927-trial:          266133232.1 (SE +/- 1702516.89, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
ViennaCL 1.7.1 - Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
  20230928_preswitch:      429 (SE +/- 3.21, N = 3)
  20231020_postswitchperf: 422 (SE +/- 5.77, N = 3)
  20230927-trial:          413 (SE +/- 19.45, N = 12)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
  20230927-trial:          620 (SE +/- 0.69, N = 12)
  20230928_preswitch:      598 (SE +/- 1.45, N = 3)
  20231020_postswitchperf: 590 (SE +/- 8.08, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
  20231020_postswitchperf: 335 (SE +/- 5.55, N = 3)
  20230927-trial:          322 (SE +/- 3.17, N = 12)
  20230928_preswitch:      305 (SE +/- 4.62, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
  20231020_postswitchperf: 270 (SE +/- 0.00, N = 3)
  20230927-trial:          201 (SE +/- 5.02, N = 12)
  20230928_preswitch:      194 (SE +/- 11.57, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
  20231020_postswitchperf: 326 (SE +/- 1.33, N = 3)
  20230928_preswitch:      210 (SE +/- 28.01, N = 3)
  20230927-trial:          187 (SE +/- 11.40, N = 12)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
  20231020_postswitchperf: 102.0 (SE +/- 0.00, N = 3)
  20230928_preswitch:      76.1 (SE +/- 0.33, N = 3)
  20230927-trial:          75.8 (SE +/- 1.09, N = 12)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
  20231020_postswitchperf: 290 (SE +/- 1.00, N = 3)
  20230928_preswitch:      199 (SE +/- 27.91, N = 3)
  20230927-trial:          173 (SE +/- 12.99, N = 11)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
  20230928_preswitch:      712 (SE +/- 0.33, N = 3)
  20230927-trial:          705 (SE +/- 0.36, N = 12)
  20231020_postswitchperf: 700 (SE +/- 0.58, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
  20230928_preswitch:      733 (SE +/- 0.58, N = 3)
  20230927-trial:          732 (SE +/- 0.29, N = 12)
  20231020_postswitchperf: 722 (SE +/- 0.58, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
  20230927-trial:          710 (SE +/- 0.29, N = 12)
  20230928_preswitch:      708 (SE +/- 0.67, N = 3)
  20231020_postswitchperf: 701 (SE +/- 0.67, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
  20230928_preswitch:      730 (SE +/- 0.58, N = 3)
  20230927-trial:          729 (SE +/- 0.26, N = 12)
  20231020_postswitchperf: 716 (SE +/- 2.33, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL 1.7.1 - Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
  20231020_postswitchperf: 302 (SE +/- 0.33, N = 3)
  20230928_preswitch:      223 (SE +/- 5.04, N = 3)
  20230927-trial:          221 (SE +/- 2.94, N = 11)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
FluidX3D 2.3 - Test: FP32-FP16S (MLUPs/s, More Is Better)
  20230928_preswitch:      2716 (SE +/- 2.85, N = 3)
  20230927-trial:          2689 (SE +/- 6.17, N = 3)
  20231020_postswitchperf: 2620 (SE +/- 5.04, N = 3)
JuliaGPU 1.2pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
  20231020_postswitchperf: 428957016.6 (SE +/- 6247483.81, N = 3)
  20230928_preswitch:      142637305.6 (SE +/- 2175742.71, N = 5)
  20230927-trial:          137240757.5 (SE +/- 251640.79, N = 3)
  1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
JuliaGPU 1.2pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
  20231020_postswitchperf: 411077431.4 (SE +/- 10623925.13, N = 15)
  20230928_preswitch:      139394174.9 (SE +/- 1203139.51, N = 3)
  20230927-trial:          135315754.4 (SE +/- 343734.88, N = 3)
  1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
Lulesh OpenCL 2017-07-06 (z/s, More Is Better)
  20231020_postswitchperf: 3122.51 (SE +/- 42.68, N = 15)
  20230927-trial:          2953.38 (SE +/- 47.73, N = 15)
  20230928_preswitch:      2913.89 (SE +/- 36.57, N = 15)
  1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
LuxMark 3.1 - OpenCL Device: GPU - Scene: Microphone (Score, More Is Better)
  20231020_postswitchperf: 35982 (SE +/- 593.35, N = 3)
  20230927-trial:          30063 (SE +/- 72.19, N = 3)
  20230928_preswitch:      29797 (SE +/- 174.17, N = 3)
LuxMark 3.1 - OpenCL Device: CPU+GPU - Scene: Microphone (Score, More Is Better)
  20231020_postswitchperf: 35363 (SE +/- 9.74, N = 3)
  20230928_preswitch:      30132 (SE +/- 6.11, N = 3)
  20230927-trial:          29792 (SE +/- 177.68, N = 3)
MandelbulbGPU 1.0pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
  20231020_postswitchperf: 222793824.3 (SE +/- 4252385.89, N = 15)
  20230927-trial:          79519746.9 (SE +/- 1033260.89, N = 7)
  20230928_preswitch:      79087731.3 (SE +/- 359470.33, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Scene: Caustic3 (Samples/sec, More Is Better)
  20231020_postswitchperf: 1697804201 (SE +/- 24.54, N = 3)
  20230928_preswitch:      1695878016 (SE +/- 25.12, N = 3)
  20230927-trial:          1695834535 (SE +/- 25.12, N = 3)
  1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Phoronix Test Suite v10.8.4