VK-CL

Intel Core i7-9750H testing with a Dell 0F7T8V (1.14.0 BIOS) and Intel UHD 630 CFL GT2 8GB on EndeavourOS rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2304037-EIRI-VKCL84166&grr.

VK-CLProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionIntel UHD 630Intel Core i7-9750H @ 4.50GHz (6 Cores / 12 Threads)Dell 0F7T8V (1.14.0 BIOS)Intel Cannon Lake PCH32GB2000GB Samsung SSD 970 EVO Plus 2TB + 1000GB CT1000MX500SSD1Intel UHD 630 CFL GT2 8GB (885/6000MHz)Realtek ALC3204Realtek Device 2502 + Intel-AC 9260EndeavourOS rolling6.2.9-arch1-1 (x86_64)KDE Plasma 5.27.3X Server 1.21.1.8NVIDIA 530.41.034.6 Mesa 23.0.1OpenCL 3.0 CUDA 12.1.98 + OpenCL 3.0 + OpenCL 3.0 LINUXGCC 12.2.1 20230201 + Clang 15.0.7 + LLVM 15.0.7 + CUDA 12.1ext41920x1080OpenBenchmarking.org- Transparent Huge Pages: always- NVM_CD_FLAGS=- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xf0 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.2d.00.bb- GPU Compute Cores: 2304- Python 3.10.10- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: vulnerable + mds: Vulnerable; SMT vulnerable + meltdown: Vulnerable + mmio_stale_data: Vulnerable + retbleed: Vulnerable + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Vulnerable + tsx_async_abort: Not affected

VK-CLrealsr-ncnn: 4x - Yesvkfft: luxmark: CPU+GPU - Hotelclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Transfer Bandwidth enqueueReadBufferparboil: OpenCL LBMluxmark: CPU+GPU - Microphonevkmark: 1920 x 1080 - Mailboxvkmark: 1920 x 1080 - Immediaterodinia: OpenCL Myocytencnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetblender: Classroom - OpenCLblender: Fishy Cat - OpenCLblender: Pabellon Barcelona - OpenCLcl-mem: Copyblender: BMW27 - OpenCLsmallpt-gpu: GPU - Complexvkmark: 800 x 600 - Mailboxvkmark: 800 x 600 - Immediateluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: CPU+GPU - Luxball HDRluxmark: GPU - Luxball HDRcl-mem: Readcl-mem: Writerodinia: OpenCL Particle Filterclpeak: Double-Precision Computewaifu2x-ncnn: 2x - 3 - Yessmallpt-gpu: GPU - Caustic3smallpt-gpu: GPU - Cornelldarktable: Boat - OpenCLviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYvkresample: 2x - Singlevkresample: 2x - Doublelulesh-cl: vkpeak: fp32-scalarparboil: OpenCL BFSrealsr-ncnn: 4x - Nodarktable: Masskrug - OpenCLparboil: OpenCL TPACFclpeak: Integer 24-bit Computeclpeak: Integer Computeclpeak: Single-Precision Computeclpeak: Global Memory Bandwidthdarktable: Server Room - OpenCLdarktable: Server Rack - OpenCLwaifu2x-ncnn: 2x - 3 - Noclpeak: Kernel Latencyluxmark: Hybrid GPU - HotelIntel UHD 630292.053160315702.062.40134.7432554267952980176.08410.61895.6310.3025.1043.6044.4918.6921.00105.2018.974.2714.937.838.698.757.6022.171255.431222.461129.8838.4558.0516805574113609361691230254981404317.517.392.612181.0615.4691680557802168055767322.94020.420.217.618.418.51313.817.411.313.6200.047200.4421122.2085264.831.48967715.3948.7246.5367754237.144448.003562.99325.660.8130.7572.0666.11OpenBenchmarking.org

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesIntel UHD 63060120180240300SE +/- 7.16, N = 9292.05

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1Intel UHD 63030060090012001500SE +/- 4.98, N = 316031. (CXX) g++ options: -O3

LuxMark

OpenCL Device: CPU+GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: HotelIntel UHD 63030060090012001500SE +/- 356.90, N = 121570

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferIntel UHD 6300.46350.9271.39051.8542.3175SE +/- 0.19, N = 152.061. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferIntel UHD 6300.541.081.622.162.7SE +/- 0.35, N = 152.401. (CXX) g++ options: -O3

Parboil

Test: OpenCL LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBMIntel UHD 630306090120150SE +/- 6.60, N = 12134.741. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

LuxMark

OpenCL Device: CPU+GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: MicrophoneIntel UHD 6309001800270036004500SE +/- 187.03, N = 124267

VKMark

Resolution: 1920 x 1080 - Present Mode: Mailbox

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 1920 x 1080 - Present Mode: MailboxIntel UHD 6302004006008001000SE +/- 48.30, N = 129521. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

VKMark

Resolution: 1920 x 1080 - Present Mode: Immediate

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 1920 x 1080 - Present Mode: ImmediateIntel UHD 6302004006008001000SE +/- 45.00, N = 129801. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteIntel UHD 6304080120160200SE +/- 19.11, N = 9176.081. (CXX) g++ options: -O2 -lOpenCL

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: FastestDetIntel UHD 6303691215SE +/- 0.15, N = 310.61MIN: 9.85 / MAX: 14.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: vision_transformerIntel UHD 6302004006008001000SE +/- 12.79, N = 3895.63MIN: 844.64 / MAX: 1281.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: regnety_400mIntel UHD 6303691215SE +/- 0.03, N = 310.30MIN: 9.86 / MAX: 13.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: squeezenet_ssdIntel UHD 630612182430SE +/- 0.08, N = 325.10MIN: 24.7 / MAX: 28.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: yolov4-tinyIntel UHD 6301020304050SE +/- 0.15, N = 343.60MIN: 42.31 / MAX: 50.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: resnet50Intel UHD 6301020304050SE +/- 0.06, N = 344.49MIN: 43.77 / MAX: 53.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: alexnetIntel UHD 630510152025SE +/- 0.03, N = 318.69MIN: 18.14 / MAX: 19.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: resnet18Intel UHD 630510152025SE +/- 0.04, N = 321.00MIN: 20.5 / MAX: 22.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: vgg16Intel UHD 63020406080100SE +/- 0.16, N = 3105.20MIN: 103.46 / MAX: 118.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: googlenetIntel UHD 630510152025SE +/- 0.01, N = 318.97MIN: 18.75 / MAX: 19.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: blazefaceIntel UHD 6300.96081.92162.88243.84324.804SE +/- 0.22, N = 34.27MIN: 2.67 / MAX: 8.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: efficientnet-b0Intel UHD 63048121620SE +/- 0.03, N = 314.93MIN: 14.41 / MAX: 16.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: mnasnetIntel UHD 630246810SE +/- 0.02, N = 37.83MIN: 7.25 / MAX: 111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: shufflenet-v2Intel UHD 630246810SE +/- 0.13, N = 28.69MIN: 7.85 / MAX: 11.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3Intel UHD 630246810SE +/- 0.02, N = 38.75MIN: 8.04 / MAX: 9.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2Intel UHD 630246810SE +/- 0.07, N = 37.60MIN: 7.18 / MAX: 8.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: mobilenetIntel UHD 630510152025SE +/- 0.08, N = 322.17MIN: 21.66 / MAX: 24.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: Classroom - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: Classroom - Compute: OpenCLIntel UHD 630300600900120015001255.43

Blender

Blend File: Fishy Cat - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: Fishy Cat - Compute: OpenCLIntel UHD 630300600900120015001222.46

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: Pabellon Barcelona - Compute: OpenCLIntel UHD 63020040060080010001129.88

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyIntel UHD 630918273645SE +/- 14.50, N = 1538.41. (CC) gcc options: -O2 -flto -lOpenCL

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: BMW27 - Compute: OpenCLIntel UHD 630120240360480600558.05

SmallPT GPU

OpenCL Device: GPU - Scene: Complex

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: ComplexIntel UHD 630400M800M1200M1600M2000MSE +/- 102.77, N = 316805574111. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

VKMark

Resolution: 800 x 600 - Present Mode: Mailbox

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 800 x 600 - Present Mode: MailboxIntel UHD 6308001600240032004000SE +/- 0.58, N = 336091. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

VKMark

Resolution: 800 x 600 - Present Mode: Immediate

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2022-05-16Resolution: 800 x 600 - Present Mode: ImmediateIntel UHD 6308001600240032004000SE +/- 2.08, N = 336161. (CXX) g++ options: -pthread -ldl -std=c++14 -O0 -MD -MQ -MF

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelIntel UHD 6302004006008001000SE +/- 11.46, N = 3912

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneIntel UHD 6306001200180024003000SE +/- 18.67, N = 33025

LuxMark

OpenCL Device: CPU+GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRIntel UHD 63011002200330044005500SE +/- 34.90, N = 34981

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRIntel UHD 6309001800270036004500SE +/- 0.58, N = 34043

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadIntel UHD 63048121620SE +/- 0.00, N = 317.51. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteIntel UHD 63048121620SE +/- 0.00, N = 317.31. (CC) gcc options: -O2 -flto -lOpenCL

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterIntel UHD 63020406080100SE +/- 0.12, N = 392.611. (CXX) g++ options: -O2 -lOpenCL

clpeak

OpenCL Test: Double-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeIntel UHD 6304080120160200SE +/- 5.83, N = 15181.061. (CXX) g++ options: -O3

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesIntel UHD 63048121620SE +/- 1.62, N = 1515.47

SmallPT GPU

OpenCL Device: GPU - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic3Intel UHD 630400M800M1200M1600M2000MSE +/- 24.25, N = 316805578021. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: CornellIntel UHD 630400M800M1200M1600M2000MSE +/- 22.52, N = 316805576731. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.1Test: Boat - Acceleration: OpenCLIntel UHD 630510152025SE +/- 0.00, N = 322.94

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTIntel UHD 63051015202520.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNIntel UHD 630510152025SE +/- 0.00, N = 320.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TIntel UHD 63048121620SE +/- 0.03, N = 317.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NIntel UHD 63051015202518.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTIntel UHD 630510152025SE +/- 0.00, N = 318.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYIntel UHD 6303691215SE +/- 0.00, N = 3131. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYIntel UHD 63048121620SE +/- 0.03, N = 313.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTIntel UHD 63048121620SE +/- 0.00, N = 317.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYIntel UHD 6303691215SE +/- 0.00, N = 311.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYIntel UHD 6303691215SE +/- 0.00, N = 313.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleIntel UHD 6304080120160200SE +/- 0.01, N = 3200.051. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleIntel UHD 6304080120160200SE +/- 0.01, N = 3200.441. (CXX) g++ options: -O3

Lulesh OpenCL

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06Intel UHD 6302004006008001000SE +/- 1.39, N = 31122.211. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarIntel UHD 63060120180240300SE +/- 0.09, N = 3264.83

Parboil

Test: OpenCL BFS

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFSIntel UHD 6300.33520.67041.00561.34081.676SE +/- 0.167959, N = 151.4896771. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoIntel UHD 63048121620SE +/- 0.11, N = 315.39

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.1Test: Masskrug - Acceleration: OpenCLIntel UHD 630246810SE +/- 0.006, N = 38.724

Parboil

Test: OpenCL TPACF

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL TPACFIntel UHD 630246810SE +/- 0.037924, N = 36.5367751. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

clpeak

OpenCL Test: Integer 24-bit Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeIntel UHD 6309001800270036004500SE +/- 48.13, N = 154237.141. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeIntel UHD 63010002000300040005000SE +/- 83.96, N = 154448.001. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Compute

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeIntel UHD 6308001600240032004000SE +/- 19.00, N = 33562.991. (CXX) g++ options: -O3

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthIntel UHD 63070140210280350SE +/- 0.01, N = 3325.661. (CXX) g++ options: -O3

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.1Test: Server Room - Acceleration: OpenCLIntel UHD 6300.18290.36580.54870.73160.9145SE +/- 0.002, N = 30.813

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.1Test: Server Rack - Acceleration: OpenCLIntel UHD 6300.17030.34060.51090.68120.8515SE +/- 0.000, N = 30.757

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: NoIntel UHD 6300.46490.92981.39471.85962.3245SE +/- 0.009, N = 32.066

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyIntel UHD 630246810SE +/- 0.06, N = 76.111. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5