opencl-set0-yoda-prehw

Test run before swapping the motherboard and CPU.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2309289-BILL-230927441

Run Management

Result Identifier: 20230927-trial
Date Run: September 27 2023
Test Duration: 1 Hour, 28 Minutes

Result Identifier: 20230928_preswitch
Date Run: September 28 2023
Test Duration: 41 Minutes

Average Test Duration: 1 Hour, 5 Minutes

opencl-set0-yoda-prehw System Details

Processor: Intel Core i7-7700 @ 4.20GHz (4 Cores / 8 Threads)
Motherboard: ASUS PRIME H270M-PLUS (0809 BIOS)
Chipset: Intel Xeon E3-1200 v6/7th + H270
Memory: 32GB
Disk: Samsung SSD 960 EVO 250GB + 1000GB Samsung SSD 970 EVO Plus 1TB + 3001GB Western Digital WD30EFRX-68E
Graphics: Sapphire AMD Radeon RX 6700 XT 12GB (2725/1000MHz)
Audio: Realtek ALC887-VD
Monitor: PB248
Network: Intel I219-V + Intel Wi-Fi 6 AX200
OS: Ubuntu 22.04
Kernel: 5.15.0-84-generic (x86_64)
Desktop: GNOME Shell 42.9
Display Server: X Server 1.21.1.3
OpenGL: 4.6 Mesa 23.2.0-devel (LLVM 16.0.6 DRM 3.54)
OpenCL: OpenCL 2.1 AMD-APP (3590.0)
Vulkan: 1.3.252
Compiler: GCC 11.4.0 + LLVM 14.0.0
File-System: ext4 (ecryptfs)
Screen Resolution: 1920x1200

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: intel_pstate powersave (EPP: balance_performance)
- CPU Microcode: 0xf4
- Thermald 2.4.9
- BAR1 / Visible vRAM Size: 256 MB
- vBIOS Version: 113-D5122200-S05
- Python 3.10.12
- gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of IBRS IBPB: conditional STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled

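20230927-trial vs. 20230928_preswitch Comparison

Relative difference per test; the run in parentheses is the one with the better result.

Rodinia - OpenCL Myocyte: 39.3% (20230928_preswitch)
ViennaCL - OpenCL BLAS - dGEMV-T: 15% (20230928_preswitch)
ViennaCL - OpenCL BLAS - dDOT: 12.3% (20230928_preswitch)
ViennaCL - OpenCL BLAS - sDOT: 5.6% (20230927-trial)
Parboil - OpenMP Stencil: 4.5% (20230928_preswitch)
JuliaGPU - GPU: 3.9% (20230928_preswitch)
ViennaCL - OpenCL BLAS - sCOPY: 3.9% (20230928_preswitch)
ViennaCL - OpenCL BLAS - sAXPY: 3.7% (20230927-trial)
ViennaCL - OpenCL BLAS - dCOPY: 3.6% (20230927-trial)
JuliaGPU - CPU+GPU: 3% (20230928_preswitch)
Darktable - Masskrug - OpenCL: 2.2% (20230927-trial)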
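
The percentages in this comparison appear to be the ratio of the better result to the worse one, minus one. A minimal Python sketch under that assumption reproduces several of the figures from the per-test results on this page. The function name `relative_gap` is illustrative and not part of the Phoronix Test Suite.

```python
def relative_gap(a: float, b: float) -> float:
    """Return how far apart two positive results are, in percent.

    Assumption: the gap is (larger / smaller - 1) * 100. The formula is
    symmetric, so it works for "More Is Better" and "Fewer Is Better"
    tests alike; which run wins must be read from the test's direction.
    """
    larger, smaller = max(a, b), min(a, b)
    return (larger / smaller - 1.0) * 100.0


# (test, 20230927-trial, 20230928_preswitch), values copied from this page
results = [
    ("Rodinia - OpenCL Myocyte (seconds)", 12.063, 8.657),
    ("ViennaCL - dGEMV-T (GB/s)", 173, 199),
    ("ViennaCL - dDOT (GB/s)", 187, 210),
    ("ViennaCL - sDOT (GB/s)", 322, 305),
]

for name, trial, preswitch in results:
    print(f"{name}: {relative_gap(trial, preswitch):.1f}%")
```

Rounded to one decimal place, these four come out at 39.3, 15.0, 12.3 and 5.6, matching the chart.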

opencl-set0-yoda-prehw Result Overview

Test                                20230927-trial  20230928_preswitch
shoc: OpenCL - Triad                11.1138         10.9247
cl-mem: Copy                        281.3           281.2
fluidx3d: FP32-FP16S                2689            2716
clpeak: Double-Precision Compute    808.92          808.68
parboil: OpenMP Stencil             23.165349       22.160171
rodinia: OpenCL Myocyte             12.063          8.657
viennacl: OpenCL BLAS - sCOPY       413             429
viennacl: OpenCL BLAS - sAXPY       620             598
viennacl: OpenCL BLAS - sDOT        322             305
viennacl: OpenCL BLAS - dCOPY       201             194
viennacl: OpenCL BLAS - dDOT        187             210
viennacl: OpenCL BLAS - dGEMV-N     75.8            76.1
viennacl: OpenCL BLAS - dGEMV-T     173             199
viennacl: OpenCL BLAS - dGEMM-NN    705             712
viennacl: OpenCL BLAS - dGEMM-NT    732             733
viennacl: OpenCL BLAS - dGEMM-TN    710             708
viennacl: OpenCL BLAS - dGEMM-TT    729             730
viennacl: OpenCL BLAS - dAXPY       221             223
darktable: Masskrug - OpenCL        5.216           5.330
darktable: Masskrug - CPU-only      7.894           8.008
juliagpu: GPU                       137240757.5     142637305.6
juliagpu: CPU+GPU                   135315754.4     139394174.9
mandelbulbgpu: CPU+GPU              79519746.9      79087731.3
mandelgpu: CPU+GPU                  266133232.1     268569163.2
smallpt-gpu: GPU - Caustic3         1695834535      1695878016
luxmark: GPU - Microphone           30063           29797
luxmark: CPU+GPU - Microphone       29792           30132
lulesh-cl:                          2953.38         2913.89

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
20230927-trial: 11.11 (SE +/- 0.16, N = 5; Min: 10.77 / Avg: 11.11 / Max: 11.71)
20230928_preswitch: 10.92 (SE +/- 0.22, N = 3; Min: 10.7 / Avg: 10.92 / Max: 11.36)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
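
Whether a gap such as the one in the SHOC Triad result exceeds run-to-run noise can be roughly gauged by comparing it with the reported standard errors. The sketch below is a heuristic only. It assumes the two runs are independent so their standard errors combine in quadrature. The function name is illustrative.

```python
import math


def gap_in_standard_errors(mean_a: float, se_a: float,
                           mean_b: float, se_b: float) -> float:
    """Difference between two means in units of the combined standard error.

    Assumption: the runs are independent, so the combined standard error is
    sqrt(se_a^2 + se_b^2). This is a rough screen, not a formal test.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# SHOC Triad (GB/s): 20230927-trial vs. 20230928_preswitch
triad = gap_in_standard_errors(11.11, 0.16, 10.92, 0.22)
# FluidX3D FP32-FP16S (MLUPs/s), using the reported averages
fluidx3d = gap_in_standard_errors(2688.67, 6.17, 2716.33, 2.85)

print(f"SHOC Triad: {triad:.2f} combined standard errors apart")
print(f"FluidX3D:   {fluidx3d:.2f} combined standard errors apart")
```

With these inputs the Triad gap is less than one combined standard error, so it is plausibly noise, while the FluidX3D gap is several times larger than its combined standard error.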

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
20230927-trial: 281.3 (SE +/- 0.09, N = 3; Min: 281.2 / Avg: 281.33 / Max: 281.5)
20230928_preswitch: 281.2 (SE +/- 0.09, N = 3; Min: 281 / Avg: 281.17 / Max: 281.3)
1. (CC) gcc options: -O2 -flto -lOpenCL
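
For runs with N = 3, the three individual samples can be recovered approximately from Min / Avg / Max, which makes it possible to check how the SE figure relates to them. The sketch below assumes SE is the sample standard deviation divided by the square root of N. That definition is an assumption, although it reproduces the 0.09 reported for both cl-mem runs. The helper names are illustrative.

```python
import math
import statistics


def samples_from_summary(minimum: float, average: float, maximum: float) -> list:
    """Recover the three samples of an N = 3 run from Min / Avg / Max.

    Only valid for N = 3, where the middle sample is 3 * avg - min - max.
    The reported average is rounded, so the middle sample is approximate.
    """
    middle = 3 * average - minimum - maximum
    return [minimum, middle, maximum]


def standard_error(samples: list) -> float:
    """Sample standard deviation divided by sqrt(N) (assumed definition)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


trial = samples_from_summary(281.2, 281.33, 281.5)
preswitch = samples_from_summary(281.0, 281.17, 281.3)

print(round(standard_error(trial), 2))      # 0.09
print(round(standard_error(preswitch), 2))  # 0.09
```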

FluidX3D

FluidX3D is a fast and memory-efficient lattice Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and is free for non-commercial use. This test profile measures the system's OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

FluidX3D 2.3 - Test: FP32-FP16S (MLUPs/s, More Is Better)
20230927-trial: 2689 (SE +/- 6.17, N = 3; Min: 2682 / Avg: 2688.67 / Max: 2701)
20230928_preswitch: 2716 (SE +/- 2.85, N = 3; Min: 2713 / Avg: 2716.33 / Max: 2722)

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak 1.1.2 - OpenCL Test: Double-Precision Compute (GFLOPS, More Is Better)
20230927-trial: 808.92 (SE +/- 0.21, N = 3; Min: 808.49 / Avg: 808.92 / Max: 809.13)
20230928_preswitch: 808.68 (SE +/- 0.28, N = 3; Min: 808.27 / Avg: 808.68 / Max: 809.22)
1. (CXX) g++ options: -O3

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP Stencil (Seconds, Fewer Is Better)
20230927-trial: 23.17 (SE +/- 0.15, N = 3; Min: 22.88 / Avg: 23.17 / Max: 23.37)
20230928_preswitch: 22.16 (SE +/- 0.02, N = 3; Min: 22.12 / Avg: 22.16 / Max: 22.2)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenCL Myocyte (Seconds, Fewer Is Better)
20230927-trial: 12.063 (SE +/- 3.477, N = 15; Min: 6.95 / Avg: 12.06 / Max: 60.58)
20230928_preswitch: 8.657 (SE +/- 0.142, N = 4; Min: 8.34 / Avg: 8.66 / Max: 8.97)
1. (CXX) g++ options: -O2 -lOpenCL
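
The 20230927-trial Myocyte average sits far above its minimum and includes a 60.58-second maximum, so it is worth checking how much a single slow run contributes. The sketch below removes one known sample from a reported average. It assumes that exactly one of the 15 runs took the reported maximum, which the page does not state. The function name is illustrative.

```python
def mean_without_one_sample(average: float, n: int, sample: float) -> float:
    """Average of the remaining n - 1 samples after dropping one known sample."""
    if n < 2:
        raise ValueError("need at least two samples to drop one")
    return (average * n - sample) / (n - 1)


# 20230927-trial, Rodinia OpenCL Myocyte: Avg 12.063 s over N = 15, Max 60.58 s.
# Assumption: exactly one run hit the reported maximum.
trimmed = mean_without_one_sample(12.063, 15, 60.58)
print(f"Average without the slowest run: {trimmed:.2f} s")
```

Under that assumption the remaining 14 runs average roughly 8.6 seconds, close to the 8.657 seconds of 20230928_preswitch, which suggests that most of the 39.3% gap for this test comes from one outlier rather than a consistent difference between the runs.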

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.7.1 - Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
20230927-trial: 413 (SE +/- 19.45, N = 12; Min: 211 / Avg: 413 / Max: 448)
20230928_preswitch: 429 (SE +/- 3.21, N = 3; Min: 424 / Avg: 429 / Max: 435)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
20230927-trial: 620 (SE +/- 0.69, N = 12; Min: 616 / Avg: 619.75 / Max: 624)
20230928_preswitch: 598 (SE +/- 1.45, N = 3; Min: 595 / Avg: 597.67 / Max: 600)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
20230927-trial: 322 (SE +/- 3.17, N = 12; Min: 302 / Avg: 321.83 / Max: 338)
20230928_preswitch: 305 (SE +/- 4.62, N = 3; Min: 297 / Avg: 305 / Max: 313)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
20230927-trial: 201 (SE +/- 5.02, N = 12; Min: 174 / Avg: 200.58 / Max: 226)
20230928_preswitch: 194 (SE +/- 11.57, N = 3; Min: 173 / Avg: 193.67 / Max: 213)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
20230927-trial: 187 (SE +/- 11.40, N = 12; Min: 150 / Avg: 186.58 / Max: 283)
20230928_preswitch: 210 (SE +/- 28.01, N = 3; Min: 161 / Avg: 210 / Max: 258)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
20230927-trial: 75.8 (SE +/- 1.09, N = 12; Min: 67.6 / Avg: 75.84 / Max: 82.9)
20230928_preswitch: 76.1 (SE +/- 0.33, N = 3; Min: 75.6 / Avg: 76.07 / Max: 76.7)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
20230927-trial: 173 (SE +/- 12.99, N = 11; Min: 130 / Avg: 172.64 / Max: 236)
20230928_preswitch: 199 (SE +/- 27.91, N = 3; Min: 143 / Avg: 198.67 / Max: 230)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
20230927-trial: 705 (SE +/- 0.36, N = 12; Min: 703 / Avg: 704.92 / Max: 707)
20230928_preswitch: 712 (SE +/- 0.33, N = 3; Min: 711 / Avg: 711.67 / Max: 712)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
20230927-trial: 732 (SE +/- 0.29, N = 12; Min: 731 / Avg: 731.92 / Max: 734)
20230928_preswitch: 733 (SE +/- 0.58, N = 3; Min: 732 / Avg: 733 / Max: 734)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
20230927-trial: 710 (SE +/- 0.29, N = 12; Min: 709 / Avg: 710.08 / Max: 712)
20230928_preswitch: 708 (SE +/- 0.67, N = 3; Min: 707 / Avg: 707.67 / Max: 709)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
20230927-trial: 729 (SE +/- 0.26, N = 12; Min: 728 / Avg: 729.08 / Max: 731)
20230928_preswitch: 730 (SE +/- 0.58, N = 3; Min: 729 / Avg: 730 / Max: 731)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
20230927-trial: 221 (SE +/- 2.94, N = 11; Min: 209 / Avg: 220.64 / Max: 243)
20230928_preswitch: 223 (SE +/- 5.04, N = 3; Min: 216 / Avg: 223.33 / Max: 233)

All ViennaCL tests: 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application. This test uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 4.4.2 - Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
20230927-trial: 5.216 (SE +/- 0.051, N = 3; Min: 5.14 / Avg: 5.22 / Max: 5.31)
20230928_preswitch: 5.330 (SE +/- 0.020, N = 3; Min: 5.31 / Avg: 5.33 / Max: 5.37)

Darktable 4.4.2 - Test: Masskrug - Acceleration: CPU-only (Seconds, Fewer Is Better)
20230927-trial: 7.894 (SE +/- 0.007, N = 3; Min: 7.88 / Avg: 7.89 / Max: 7.91)
20230928_preswitch: 8.008 (SE +/- 0.016, N = 3; Min: 7.99 / Avg: 8.01 / Max: 8.04)

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

20230927-trial: The test quit with a non-zero exit status (message logged three times).

20230928_preswitch: The test quit with a non-zero exit status (message logged three times).

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU

20230927-trial: The test quit with a non-zero exit status (message logged three times).

20230928_preswitch: The test quit with a non-zero exit status (message logged three times).

JuliaGPU 1.2pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
20230927-trial: 137240757.5 (SE +/- 251640.79, N = 3; Min: 136824328.6 / Avg: 137240757.5 / Max: 137693736.1)
20230928_preswitch: 142637305.6 (SE +/- 2175742.71, N = 5; Min: 137103454 / Avg: 142637305.58 / Max: 147904731.8)

JuliaGPU 1.2pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
20230927-trial: 135315754.4 (SE +/- 343734.88, N = 3; Min: 134953568.9 / Avg: 135315754.4 / Max: 136002887.9)
20230928_preswitch: 139394174.9 (SE +/- 1203139.51, N = 3; Min: 137031066.8 / Avg: 139394174.9 / Max: 140968697.3)

Both JuliaGPU tests: 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

MandelbulbGPU 1.0pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
20230927-trial: 79519746.9 (SE +/- 1033260.89, N = 7; Min: 75476191.3 / Avg: 79519746.86 / Max: 83870523.2)
20230928_preswitch: 79087731.3 (SE +/- 359470.33, N = 3; Min: 78435134 / Avg: 79087731.33 / Max: 79675266.1)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1 - OpenCL Device: CPU+GPU (Samples/sec, More Is Better)
20230927-trial: 266133232.1 (SE +/- 1702516.89, N = 3; Min: 262768130.8 / Avg: 266133232.13 / Max: 268266072.8)
20230928_preswitch: 268569163.2 (SE +/- 403315.78, N = 3; Min: 267849654.5 / Avg: 268569163.23 / Max: 269244704.3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Caustic3

20230927-trial: The test quit with a non-zero exit status (message logged three times).

20230928_preswitch: The test quit with a non-zero exit status (message logged three times).

SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Scene: Caustic3 (Samples/sec, More Is Better)
20230927-trial: 1695834535 (SE +/- 25.12, N = 3; Min: 1695834491 / Avg: 1695834534.67 / Max: 1695834578)
20230928_preswitch: 1695878016 (SE +/- 25.12, N = 3; Min: 1695877972 / Avg: 1695878015.67 / Max: 1695878059)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Microphone

20230927-trial: The test run did not produce a result (message logged three times). E: Error: RUNTIME ERROR: No OpenCL device selected or available

20230928_preswitch: The test run did not produce a result (message logged three times). E: Error: RUNTIME ERROR: No OpenCL device selected or available

LuxMark 3.1 - OpenCL Device: GPU - Scene: Microphone (Score, More Is Better)
20230927-trial: 30063 (SE +/- 72.19, N = 3; Min: 29919 / Avg: 30063 / Max: 30144)
20230928_preswitch: 29797 (SE +/- 174.17, N = 3; Min: 29620 / Avg: 29796.67 / Max: 30145)

LuxMark 3.1 - OpenCL Device: CPU+GPU - Scene: Microphone (Score, More Is Better)
20230927-trial: 29792 (SE +/- 177.68, N = 3; Min: 29610 / Avg: 29791.67 / Max: 30147)
20230928_preswitch: 30132 (SE +/- 6.11, N = 3; Min: 30120 / Avg: 30132 / Max: 30140)

OpenCL Device: Hybrid GPU - Scene: Microphone

20230927-trial: The test run did not produce a result (message logged three times).

20230928_preswitch: The test run did not produce a result (message logged three times).

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

Lulesh OpenCL 2017-07-06 (z/s, More Is Better)
20230927-trial: 2953.38 (SE +/- 47.73, N = 15; Min: 2642.98 / Avg: 2953.38 / Max: 3123.86)
20230928_preswitch: 2913.89 (SE +/- 36.57, N = 15; Min: 2646 / Avg: 2913.89 / Max: 3105.55)
1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm