VK-CL

Intel Core i7-9750H testing with a Dell 0F7T8V (1.14.0 BIOS) and Intel UHD 630 CFL GT2 8GB on EndeavourOS rolling via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2304037-EIRI-VKCL84166
Result Identifier: Intel UHD 630
Date Run: April 03 2023
Test Duration: 14 Hours, 41 Minutes


VK-CL system details

Processor: Intel Core i7-9750H @ 4.50GHz (6 Cores / 12 Threads)
Motherboard: Dell 0F7T8V (1.14.0 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 32GB
Disk: 2000GB Samsung SSD 970 EVO Plus 2TB + 1000GB CT1000MX500SSD1
Graphics: Intel UHD 630 CFL GT2 8GB (885/6000MHz)
Audio: Realtek ALC3204
Network: Realtek Device 2502 + Intel-AC 9260
OS: EndeavourOS rolling
Kernel: 6.2.9-arch1-1 (x86_64)
Desktop: KDE Plasma 5.27.3
Display Server: X Server 1.21.1.8
Display Driver: NVIDIA 530.41.03
OpenGL: 4.6 Mesa 23.0.1
OpenCL: OpenCL 3.0 CUDA 12.1.98 + OpenCL 3.0 + OpenCL 3.0 LINUX
Compiler: GCC 12.2.1 20230201 + Clang 15.0.7 + LLVM 15.0.7 + CUDA 12.1
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: always
- NVM_CD_FLAGS=
- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xf0
- BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.2d.00.bb
- GPU Compute Cores: 2304
- Python 3.10.10
- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Vulnerable + retbleed: Vulnerable + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected

VK-CL result overview

Test                                            Intel UHD 630
realsr-ncnn: 4x - Yes                           292.053
vkfft                                           1603
luxmark: CPU+GPU - Hotel                        1570
clpeak: Transfer Bandwidth enqueueWriteBuffer   2.06
clpeak: Transfer Bandwidth enqueueReadBuffer    2.40
parboil: OpenCL LBM                             134.743255
luxmark: CPU+GPU - Microphone                   4267
vkmark: 1920 x 1080 - Mailbox                   952
vkmark: 1920 x 1080 - Immediate                 980
rodinia: OpenCL Myocyte                         176.084
ncnn: Vulkan GPU - FastestDet                   10.61
ncnn: Vulkan GPU - vision_transformer           895.63
ncnn: Vulkan GPU - regnety_400m                 10.30
ncnn: Vulkan GPU - squeezenet_ssd               25.10
ncnn: Vulkan GPU - yolov4-tiny                  43.60
ncnn: Vulkan GPU - resnet50                     44.49
ncnn: Vulkan GPU - alexnet                      18.69
ncnn: Vulkan GPU - resnet18                     21.00
ncnn: Vulkan GPU - vgg16                        105.20
ncnn: Vulkan GPU - googlenet                    18.97
ncnn: Vulkan GPU - blazeface                    4.27
ncnn: Vulkan GPU - efficientnet-b0              14.93
ncnn: Vulkan GPU - mnasnet                      7.83
ncnn: Vulkan GPU - shufflenet-v2                8.69
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3           8.75
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2           7.60
ncnn: Vulkan GPU - mobilenet                    22.17
blender: Classroom - OpenCL                     1255.43
blender: Fishy Cat - OpenCL                     1222.46
blender: Pabellon Barcelona - OpenCL            1129.88
cl-mem: Copy                                    38.4
blender: BMW27 - OpenCL                         558.05
smallpt-gpu: GPU - Complex                      1680557411
vkmark: 800 x 600 - Mailbox                     3609
vkmark: 800 x 600 - Immediate                   3616
luxmark: GPU - Hotel                            912
luxmark: GPU - Microphone                       3025
luxmark: CPU+GPU - Luxball HDR                  4981
luxmark: GPU - Luxball HDR                      4043
cl-mem: Read                                    17.5
cl-mem: Write                                   17.3
rodinia: OpenCL Particle Filter                 92.612
clpeak: Double-Precision Compute                181.06
waifu2x-ncnn: 2x - 3 - Yes                      15.469
smallpt-gpu: GPU - Caustic3                     1680557802
smallpt-gpu: GPU - Cornell                      1680557673
darktable: Boat - OpenCL                        22.940
viennacl: OpenCL BLAS - dGEMM-NT                20.4
viennacl: OpenCL BLAS - dGEMM-NN                20.2
viennacl: OpenCL BLAS - dGEMV-T                 17.6
viennacl: OpenCL BLAS - dGEMV-N                 18.4
viennacl: OpenCL BLAS - dDOT                    18.5
viennacl: OpenCL BLAS - dAXPY                   13
viennacl: OpenCL BLAS - dCOPY                   13.8
viennacl: OpenCL BLAS - sDOT                    17.4
viennacl: OpenCL BLAS - sAXPY                   11.3
viennacl: OpenCL BLAS - sCOPY                   13.6
vkresample: 2x - Single                         200.047
vkresample: 2x - Double                         200.442
lulesh-cl                                       1122.2085
vkpeak: fp32-scalar                             264.83
parboil: OpenCL BFS                             1.489677
realsr-ncnn: 4x - No                            15.394
darktable: Masskrug - OpenCL                    8.724
parboil: OpenCL TPACF                           6.536775
clpeak: Integer 24-bit Compute                  4237.14
clpeak: Integer Compute                         4448.00
clpeak: Single-Precision Compute                3562.99
clpeak: Global Memory Bandwidth                 325.66
darktable: Server Room - OpenCL                 0.813
darktable: Server Rack - OpenCL                 0.757
waifu2x-ncnn: 2x - 3 - No                       2.066
clpeak: Kernel Latency                          6.11
luxmark: Hybrid GPU - Hotel                     (no result)

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

RealSR-NCNN 20200818 - Scale: 4x - TAA: Yes (Seconds, Fewer Is Better)
Intel UHD 630: 292.05 (SE +/- 7.16, N = 9)

VkFFT

VkFFT is a Fast Fourier Transform (FFT) library that is GPU-accelerated by means of the Vulkan API. The VkFFT benchmark measures FFT performance across many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

VkFFT 1.1.1 (Benchmark Score, More Is Better)
Intel UHD 630: 1603 (SE +/- 4.98, N = 3)
1. (CXX) g++ options: -O3

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.1 - OpenCL Device: CPU+GPU - Scene: Hotel (Score, More Is Better)
Intel UHD 630: 1570 (SE +/- 356.90, N = 12)
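The "SE +/-" figures are easier to compare across tests when expressed as a percentage of the reported mean. The following is a minimal sketch, not part of the Phoronix Test Suite: the helper name `relative_se` and the 5% cut-off are assumptions made for illustration, and the numbers are copied from the three results reported so far.

```python
# Reported (mean, standard error, run count), copied from the results above.
results = {
    "RealSR-NCNN 4x, TAA: Yes (seconds)": (292.05, 7.16, 9),
    "VkFFT (score)": (1603.0, 4.98, 3),
    "LuxMark CPU+GPU Hotel (score)": (1570.0, 356.90, 12),
}

# Hypothetical cut-off chosen for this sketch; it is not a PTS setting.
NOISY_THRESHOLD_PCT = 5.0


def relative_se(mean, se):
    """Return the standard error as a percentage of the reported mean."""
    return 100.0 * se / mean


for name, (mean, se, runs) in results.items():
    pct = relative_se(mean, se)
    label = "noisy" if pct > NOISY_THRESHOLD_PCT else "stable"
    print(f"{name}: {pct:.2f}% of mean over {runs} runs ({label})")
```

With these inputs, only the LuxMark CPU+GPU Hotel result exceeds the assumed 5% cut-off, so its score is the one to treat with the most caution.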

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak 1.1.2 - OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
Intel UHD 630: 2.06 (SE +/- 0.19, N = 15)

clpeak 1.1.2 - OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, More Is Better)
Intel UHD 630: 2.40 (SE +/- 0.35, N = 15)
1. (CXX) g++ options: -O3

Parboil

The Parboil Benchmarks from the IMPACT Research Group at the University of Illinois are a set of throughput computing applications for studying computing architectures and compilers. Parboil test cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile only makes use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenCL LBM (Seconds, Fewer Is Better)
Intel UHD 630: 134.74 (SE +/- 6.60, N = 12)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

LuxMark

LuxMark 3.1 - OpenCL Device: CPU+GPU - Scene: Microphone (Score, More Is Better)
Intel UHD 630: 4267 (SE +/- 187.03, N = 12)

VKMark

VKMark is a collection of open-source Vulkan tests / rendering benchmarks. Learn more via the OpenBenchmarking.org test page.

VKMark 2022-05-16 - Resolution: 1920 x 1080 - Present Mode: Mailbox (VKMark Score, More Is Better)
Intel UHD 630: 952 (SE +/- 48.30, N = 12)

VKMark 2022-05-16 - Resolution: 1920 x 1080 - Present Mode: Immediate (VKMark Score, More Is Better)
Intel UHD 630: 980 (SE +/- 45.00, N = 12)
1. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenCL Myocyte (Seconds, Fewer Is Better)
Intel UHD 630: 176.08 (SE +/- 19.11, N = 9)
1. (CXX) g++ options: -O2 -lOpenCL

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20220729 - Target: Vulkan GPU - Model: FastestDet (ms, Fewer Is Better)
Intel UHD 630: 10.61 (SE +/- 0.15, N = 3; MIN: 9.85 / MAX: 14.65)

NCNN 20220729 - Target: Vulkan GPU - Model: vision_transformer (ms, Fewer Is Better)
Intel UHD 630: 895.63 (SE +/- 12.79, N = 3; MIN: 844.64 / MAX: 1281.65)

NCNN 20220729 - Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
Intel UHD 630: 10.30 (SE +/- 0.03, N = 3; MIN: 9.86 / MAX: 13.92)

NCNN 20220729 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Intel UHD 630: 25.10 (SE +/- 0.08, N = 3; MIN: 24.7 / MAX: 28.16)

NCNN 20220729 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
Intel UHD 630: 43.60 (SE +/- 0.15, N = 3; MIN: 42.31 / MAX: 50.32)

NCNN 20220729 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
Intel UHD 630: 44.49 (SE +/- 0.06, N = 3; MIN: 43.77 / MAX: 53.15)

NCNN 20220729 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
Intel UHD 630: 18.69 (SE +/- 0.03, N = 3; MIN: 18.14 / MAX: 19.68)

NCNN 20220729 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
Intel UHD 630: 21.00 (SE +/- 0.04, N = 3; MIN: 20.5 / MAX: 22.3)

NCNN 20220729 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
Intel UHD 630: 105.20 (SE +/- 0.16, N = 3; MIN: 103.46 / MAX: 118.92)

NCNN 20220729 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
Intel UHD 630: 18.97 (SE +/- 0.01, N = 3; MIN: 18.75 / MAX: 19.93)

NCNN 20220729 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
Intel UHD 630: 4.27 (SE +/- 0.22, N = 3; MIN: 2.67 / MAX: 8.5)

NCNN 20220729 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Intel UHD 630: 14.93 (SE +/- 0.03, N = 3; MIN: 14.41 / MAX: 16.28)

NCNN 20220729 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
Intel UHD 630: 7.83 (SE +/- 0.02, N = 3; MIN: 7.25 / MAX: 11)

NCNN 20220729 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Intel UHD 630: 8.69 (SE +/- 0.13, N = 2; MIN: 7.85 / MAX: 11.75)

NCNN 20220729 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Intel UHD 630: 8.75 (SE +/- 0.02, N = 3; MIN: 8.04 / MAX: 9.35)

NCNN 20220729 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Intel UHD 630: 7.60 (SE +/- 0.07, N = 3; MIN: 7.18 / MAX: 8.63)

NCNN 20220729 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
Intel UHD 630: 22.17 (SE +/- 0.08, N = 3; MIN: 21.66 / MAX: 24.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
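The NCNN figures are mean latencies in milliseconds, so a rough throughput estimate follows from taking the reciprocal. The sketch below assumes one inference at a time run back to back, which the result file does not state; the helper name `inferences_per_second` is made up for this illustration, and the latencies are copied from the NCNN results above.

```python
# Mean per-inference latency in milliseconds, copied from the NCNN results above.
latency_ms = {
    "blazeface": 4.27,
    "resnet50": 44.49,
    "vgg16": 105.20,
    "vision_transformer": 895.63,
}


def inferences_per_second(ms):
    """Convert a mean latency in milliseconds to inferences per second.

    Assumes inferences run sequentially with no batching or overlap.
    """
    return 1000.0 / ms


# List models from fastest to slowest.
for model, ms in sorted(latency_ms.items(), key=lambda item: item[1]):
    rate = inferences_per_second(ms)
    print(f"{model}: {ms:.2f} ms -> {rate:.1f} inferences/s")
```

Because the conversion uses the mean only, it says nothing about the MIN/MAX spread reported for each model.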

Blender

Blender 2.79 - Blend File: Classroom - Compute: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 1255.43

Blender 2.79 - Blend File: Fishy Cat - Compute: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 1222.46

Blender 2.79 - Blend File: Pabellon Barcelona - Compute: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 1129.88

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
Intel UHD 630: 38.4 (SE +/- 14.50, N = 15)
1. (CC) gcc options: -O2 -flto -lOpenCL

Blender

Blender 2.79 - Blend File: BMW27 - Compute: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 558.05

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that is run with various PTS changes compared to upstream. Multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Scene: Complex (Samples/sec, More Is Better)
Intel UHD 630: 1680557411 (SE +/- 102.77, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

Parboil

Test: OpenCL Histo

Intel UHD 630: The test quit with a non-zero exit status (reported three times).

VKMark

VKMark 2022-05-16 - Resolution: 800 x 600 - Present Mode: Mailbox (VKMark Score, More Is Better)
Intel UHD 630: 3609 (SE +/- 0.58, N = 3)

VKMark 2022-05-16 - Resolution: 800 x 600 - Present Mode: Immediate (VKMark Score, More Is Better)
Intel UHD 630: 3616 (SE +/- 2.08, N = 3)
1. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

LuxMark

LuxMark 3.1 - OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)
Intel UHD 630: 912 (SE +/- 11.46, N = 3)

LuxMark 3.1 - OpenCL Device: GPU - Scene: Microphone (Score, More Is Better)
Intel UHD 630: 3025 (SE +/- 18.67, N = 3)

LuxMark 3.1 - OpenCL Device: CPU+GPU - Scene: Luxball HDR (Score, More Is Better)
Intel UHD 630: 4981 (SE +/- 34.90, N = 3)

LuxMark 3.1 - OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
Intel UHD 630: 4043 (SE +/- 0.58, N = 3)

cl-mem

cl-mem 2017-01-13 - Benchmark: Read (GB/s, More Is Better)
Intel UHD 630: 17.5 (SE +/- 0.00, N = 3)

cl-mem 2017-01-13 - Benchmark: Write (GB/s, More Is Better)
Intel UHD 630: 17.3 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

Rodinia

Rodinia 3.1 - Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
Intel UHD 630: 92.61 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

clpeak

clpeak 1.1.2 - OpenCL Test: Double-Precision Compute (GFLOPS, More Is Better)
Intel UHD 630: 181.06 (SE +/- 5.83, N = 15)
1. (CXX) g++ options: -O3

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: Yes (Seconds, Fewer Is Better)
Intel UHD 630: 15.47 (SE +/- 1.62, N = 15)

SmallPT GPU

SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Scene: Caustic3 (Samples/sec, More Is Better)
Intel UHD 630: 1680557802 (SE +/- 24.25, N = 3)

SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Scene: Cornell (Samples/sec, More Is Better)
Intel UHD 630: 1680557673 (SE +/- 22.52, N = 3)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

Darktable

Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 4.2.1 - Test: Boat - Acceleration: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 22.94 (SE +/- 0.00, N = 3)

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
Intel UHD 630: 20.4

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
Intel UHD 630: 20.2 (SE +/- 0.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-T (GB/s, More Is Better)
Intel UHD 630: 17.6 (SE +/- 0.03, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-N (GB/s, More Is Better)
Intel UHD 630: 18.4

ViennaCL 1.7.1 - Test: OpenCL BLAS - dDOT (GB/s, More Is Better)
Intel UHD 630: 18.5 (SE +/- 0.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dAXPY (GB/s, More Is Better)
Intel UHD 630: 13 (SE +/- 0.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dCOPY (GB/s, More Is Better)
Intel UHD 630: 13.8 (SE +/- 0.03, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sDOT (GB/s, More Is Better)
Intel UHD 630: 17.4 (SE +/- 0.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sAXPY (GB/s, More Is Better)
Intel UHD 630: 11.3 (SE +/- 0.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sCOPY (GB/s, More Is Better)
Intel UHD 630: 13.6 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

VkResample 1.0 - Upscale: 2x - Precision: Single (ms, Fewer Is Better)
Intel UHD 630: 200.05 (SE +/- 0.01, N = 3)

VkResample 1.0 - Upscale: 2x - Precision: Double (ms, Fewer Is Better)
Intel UHD 630: 200.44 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

Lulesh OpenCL 2017-07-06 (z/s, More Is Better)
Intel UHD 630: 1122.21 (SE +/- 1.39, N = 3)
1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

vkpeak 20210424 - fp32-scalar (GFLOPS, More Is Better)
Intel UHD 630: 264.83 (SE +/- 0.09, N = 3)

Parboil

Parboil 2.5 - Test: OpenCL BFS (Seconds, Fewer Is Better)
Intel UHD 630: 1.489677 (SE +/- 0.167959, N = 15)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

RealSR-NCNN

RealSR-NCNN 20200818 - Scale: 4x - TAA: No (Seconds, Fewer Is Better)
Intel UHD 630: 15.39 (SE +/- 0.11, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blend File: Barbershop - Compute: Intel oneAPI

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: Error: Not freed memory blocks: 6, total unfreed memory 0.000534 MB

Darktable

Darktable 4.2.1 - Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 8.724 (SE +/- 0.006, N = 3)

Parboil

Parboil 2.5 - Test: OpenCL TPACF (Seconds, Fewer Is Better)
Intel UHD 630: 6.536775 (SE +/- 0.037924, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

clpeak

clpeak 1.1.2 - OpenCL Test: Integer 24-bit Compute (GIOPS, More Is Better)
Intel UHD 630: 4237.14 (SE +/- 48.13, N = 15)

clpeak 1.1.2 - OpenCL Test: Integer Compute (GIOPS, More Is Better)
Intel UHD 630: 4448.00 (SE +/- 83.96, N = 15)
1. (CXX) g++ options: -O3

Parboil

Test: OpenCL MRI Gridding

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: :135:57: error: call to 'min' is ambiguous

clpeak

clpeak 1.1.2 - OpenCL Test: Single-Precision Compute (GFLOPS, More Is Better)
Intel UHD 630: 3562.99 (SE +/- 19.00, N = 3)

clpeak 1.1.2 - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
Intel UHD 630: 325.66 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

Darktable

Darktable 4.2.1 - Test: Server Room - Acceleration: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 0.813 (SE +/- 0.002, N = 3)

Darktable 4.2.1 - Test: Server Rack - Acceleration: OpenCL (Seconds, Fewer Is Better)
Intel UHD 630: 0.757 (SE +/- 0.000, N = 3)

Waifu2x-NCNN Vulkan

Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: No (Seconds, Fewer Is Better)
Intel UHD 630: 2.066 (SE +/- 0.009, N = 3)

Blender

Blend File: Fishy Cat - Compute: Intel oneAPI

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: Error: Found no Cycles device of the specified type

clpeak

clpeak 1.1.2 - OpenCL Test: Kernel Latency (us, Fewer Is Better)
Intel UHD 630: 6.11 (SE +/- 0.06, N = 7)
1. (CXX) g++ options: -O3

Blender

Blend File: Pabellon Barcelona - Compute: Intel oneAPI

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: Error: Found no Cycles device of the specified type

Blend File: BMW27 - Compute: Intel oneAPI

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: Error: Found no Cycles device of the specified type

Blend File: Classroom - Compute: Intel oneAPI

Intel UHD 630: The test quit with a non-zero exit status (reported three times). E: Error: Found no Cycles device of the specified type

Blend File: Barbershop - Compute: OpenCL

Intel UHD 630: The test run did not produce a result. E: Error: Cannot read file 'blender-2.79-linux-glibc219-x86_64/../barbershop_interior_gpu.blend': No such file or directory

LuxMark

OpenCL Device: Hybrid GPU - Scene: Luxball HDR

Intel UHD 630: The test run did not produce a result (reported three times).

OpenCL Device: Hybrid GPU - Scene: Microphone

Intel UHD 630: The test run did not produce a result (reported three times).

OpenCL Device: Hybrid GPU - Scene: Hotel

Intel UHD 630: The test run did not produce a result (reported three times).

73 Results Shown

RealSR-NCNN
VkFFT
LuxMark
clpeak:
  Transfer Bandwidth enqueueWriteBuffer
  Transfer Bandwidth enqueueReadBuffer
Parboil
LuxMark
VKMark:
  1920 x 1080 - Mailbox
  1920 x 1080 - Immediate
Rodinia
NCNN:
  Vulkan GPU - FastestDet
  Vulkan GPU - vision_transformer
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
Blender:
  Classroom - OpenCL
  Fishy Cat - OpenCL
  Pabellon Barcelona - OpenCL
cl-mem
Blender
SmallPT GPU
VKMark:
  800 x 600 - Mailbox
  800 x 600 - Immediate
LuxMark:
  GPU - Hotel
  GPU - Microphone
  CPU+GPU - Luxball HDR
  GPU - Luxball HDR
cl-mem:
  Read
  Write
Rodinia
clpeak
Waifu2x-NCNN Vulkan
SmallPT GPU:
  GPU - Caustic3
  GPU - Cornell
Darktable
ViennaCL:
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-NN
  OpenCL BLAS - dGEMV-T
  OpenCL BLAS - dGEMV-N
  OpenCL BLAS - dDOT
  OpenCL BLAS - dAXPY
  OpenCL BLAS - dCOPY
  OpenCL BLAS - sDOT
  OpenCL BLAS - sAXPY
  OpenCL BLAS - sCOPY
VkResample:
  2x - Single
  2x - Double
Lulesh OpenCL
vkpeak
Parboil
RealSR-NCNN
Darktable
Parboil
clpeak:
  Integer 24-bit Compute
  Integer Compute
  Single-Precision Compute
  Global Memory Bandwidth
Darktable:
  Server Room - OpenCL
  Server Rack - OpenCL
Waifu2x-NCNN Vulkan
clpeak