Test before swapping MB and CPU.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2309289-BILL-230927441 opencl-set0-yoda-prehw - Phoronix Test Suite opencl-set0-yoda-prehw Test before swapping MB and CPU.
HTML result view exported from: https://openbenchmarking.org/result/2309289-BILL-230927441&grt&sor .
opencl-set0-yoda-prehw Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 20230927-trial 20230928_preswitch Intel Core i7-7700 @ 4.20GHz (4 Cores / 8 Threads) ASUS PRIME H270M-PLUS (0809 BIOS) Intel Xeon E3-1200 v6/7th + H270 32GB Samsung SSD 960 EVO 250GB + 1000GB Samsung SSD 970 EVO Plus 1TB + 3001GB Western Digital WD30EFRX-68E Sapphire AMD Radeon RX 6700 XT 12GB (2725/1000MHz) Realtek ALC887-VD PB248 Intel I219-V + Intel Wi-Fi 6 AX200 Ubuntu 22.04 5.15.0-84-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.3 4.6 Mesa 23.2.0-devel (LLVM 16.0.6 DRM 3.54) OpenCL 2.1 AMD-APP (3590.0) 1.3.252 GCC 11.4.0 + LLVM 14.0.0 ext4 (ecryptfs) 1920x1200 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xf4 - Thermald 2.4.9 Graphics Details - BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D5122200-S05 Python Details - Python 3.10.12 Security Details - gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
opencl-set0-yoda-prehw cl-mem: Copy clpeak: Double-Precision Compute darktable: Masskrug - OpenCL darktable: Masskrug - CPU-only fluidx3d: FP32-FP16S juliagpu: GPU juliagpu: CPU+GPU lulesh-cl: luxmark: GPU - Microphone luxmark: CPU+GPU - Microphone mandelbulbgpu: CPU+GPU mandelgpu: CPU+GPU parboil: OpenMP Stencil rodinia: OpenCL Myocyte shoc: OpenCL - Triad smallpt-gpu: GPU - Caustic3 viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT viennacl: OpenCL BLAS - dAXPY xsbench-cl: 20230927-trial 20230928_preswitch 281.3 808.92 5.216 7.894 2689 137240757.5 135315754.4 2953.3839 30063 29792 79519746.9 266133232.1 23.165349 12.063 11.1138 1695834535 413 620 322 201 187 75.8 173 705 732 710 729 221 281.2 808.68 5.330 8.008 2716 142637305.6 139394174.9 2913.8920 29797 30132 79087731.3 268569163.2 22.160171 8.657 10.9247 1695878016 429 598 305 194 210 76.1 199 712 733 708 730 223 OpenBenchmarking.org
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy 20230927-trial 20230928_preswitch 60 120 180 240 300 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 281.3 281.2 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Double-Precision Compute OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Compute 20230927-trial 20230928_preswitch 200 400 600 800 1000 SE +/- 0.21, N = 3 SE +/- 0.28, N = 3 808.92 808.68 1. (CXX) g++ options: -O3
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Masskrug - Acceleration: OpenCL 20230927-trial 20230928_preswitch 1.1993 2.3986 3.5979 4.7972 5.9965 SE +/- 0.051, N = 3 SE +/- 0.020, N = 3 5.216 5.330
Darktable Test: Masskrug - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Masskrug - Acceleration: CPU-only 20230927-trial 20230928_preswitch 2 4 6 8 10 SE +/- 0.007, N = 3 SE +/- 0.016, N = 3 7.894 8.008
FluidX3D Test: FP32-FP16S OpenBenchmarking.org MLUPs/s, More Is Better FluidX3D 2.3 Test: FP32-FP16S 20230928_preswitch 20230927-trial 600 1200 1800 2400 3000 SE +/- 2.85, N = 3 SE +/- 6.17, N = 3 2716 2689
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU 20230928_preswitch 20230927-trial 30M 60M 90M 120M 150M SE +/- 2175742.71, N = 5 SE +/- 251640.79, N = 3 142637305.6 137240757.5 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
JuliaGPU OpenCL Device: CPU+GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: CPU+GPU 20230928_preswitch 20230927-trial 30M 60M 90M 120M 150M SE +/- 1203139.51, N = 3 SE +/- 343734.88, N = 3 139394174.9 135315754.4 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
Lulesh OpenCL OpenBenchmarking.org z/s, More Is Better Lulesh OpenCL 2017-07-06 20230927-trial 20230928_preswitch 600 1200 1800 2400 3000 SE +/- 47.73, N = 15 SE +/- 36.57, N = 15 2953.38 2913.89 1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
LuxMark OpenCL Device: GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone 20230927-trial 20230928_preswitch 6K 12K 18K 24K 30K SE +/- 72.19, N = 3 SE +/- 174.17, N = 3 30063 29797
LuxMark OpenCL Device: CPU+GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Microphone 20230928_preswitch 20230927-trial 6K 12K 18K 24K 30K SE +/- 6.11, N = 3 SE +/- 177.68, N = 3 30132 29792
MandelbulbGPU OpenCL Device: CPU+GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: CPU+GPU 20230927-trial 20230928_preswitch 20M 40M 60M 80M 100M SE +/- 1033260.89, N = 7 SE +/- 359470.33, N = 3 79519746.9 79087731.3 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
MandelGPU OpenCL Device: CPU+GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: CPU+GPU 20230928_preswitch 20230927-trial 60M 120M 180M 240M 300M SE +/- 403315.78, N = 3 SE +/- 1702516.89, N = 3 268569163.2 266133232.1 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil 20230928_preswitch 20230927-trial 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.15, N = 3 22.16 23.17 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Myocyte 20230928_preswitch 20230927-trial 3 6 9 12 15 SE +/- 0.142, N = 4 SE +/- 3.477, N = 15 8.657 12.063 1. (CXX) g++ options: -O2 -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad 20230927-trial 20230928_preswitch 3 6 9 12 15 SE +/- 0.16, N = 5 SE +/- 0.22, N = 3 11.11 10.92 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SmallPT GPU OpenCL Device: GPU - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Caustic3 20230928_preswitch 20230927-trial 400M 800M 1200M 1600M 2000M SE +/- 25.12, N = 3 SE +/- 25.12, N = 3 1695878016 1695834535 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY 20230928_preswitch 20230927-trial 90 180 270 360 450 SE +/- 3.21, N = 3 SE +/- 19.45, N = 12 429 413 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY 20230927-trial 20230928_preswitch 130 260 390 520 650 SE +/- 0.69, N = 12 SE +/- 1.45, N = 3 620 598 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT 20230927-trial 20230928_preswitch 70 140 210 280 350 SE +/- 3.17, N = 12 SE +/- 4.62, N = 3 322 305 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY 20230927-trial 20230928_preswitch 40 80 120 160 200 SE +/- 5.02, N = 12 SE +/- 11.57, N = 3 201 194 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT 20230928_preswitch 20230927-trial 50 100 150 200 250 SE +/- 28.01, N = 3 SE +/- 11.40, N = 12 210 187 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N 20230928_preswitch 20230927-trial 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 1.09, N = 12 76.1 75.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T 20230928_preswitch 20230927-trial 40 80 120 160 200 SE +/- 27.91, N = 3 SE +/- 12.99, N = 11 199 173 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN 20230928_preswitch 20230927-trial 150 300 450 600 750 SE +/- 0.33, N = 3 SE +/- 0.36, N = 12 712 705 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT 20230928_preswitch 20230927-trial 160 320 480 640 800 SE +/- 0.58, N = 3 SE +/- 0.29, N = 12 733 732 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN 20230927-trial 20230928_preswitch 150 300 450 600 750 SE +/- 0.29, N = 12 SE +/- 0.67, N = 3 710 708 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT 20230928_preswitch 20230927-trial 160 320 480 640 800 SE +/- 0.58, N = 3 SE +/- 0.26, N = 12 730 729 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY 20230928_preswitch 20230927-trial 50 100 150 200 250 SE +/- 5.04, N = 3 SE +/- 2.94, N = 11 223 221 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4