RTX 4070 SUPER
sudo apt install vulkan-headers vulkan-tools libvulkan-dev
HTML result view exported from: https://openbenchmarking.org/result/2412102-NE-INTELGPU716&grs&sro.
NCNN
Target: Vulkan GPU - Model: googlenet
NCNN
Target: Vulkan GPU - Model: mobilenet
NCNN
Target: Vulkan GPU - Model: vision_transformer
NCNN
Target: Vulkan GPU - Model: resnet18
NCNN
Target: Vulkan GPU - Model: regnety_400m
ProjectPhysX OpenCL-Benchmark
Operation: INT32 Compute
Hashcat
Benchmark: SHA1
Hashcat
Benchmark: 7-Zip
ViennaCL
Test: OpenCL BLAS - sAXPY
clpeak
OpenCL Test: Integer Compute INT
ProjectPhysX OpenCL-Benchmark
Operation: FP32 Compute
Hashcat
Benchmark: SHA-512
clpeak
OpenCL Test: Single-Precision Float
ViennaCL
Test: OpenCL BLAS - sCOPY
cl-mem
Benchmark: Read
Unigine Valley
Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL
VkFFT
Test: FFT + iFFT C2C Bluestein in single precision
Hashcat
Benchmark: TrueCrypt RIPEMD160 + XTS
ViennaCL
Test: OpenCL BLAS - sDOT
IndigoBench
Acceleration: OpenCL GPU - Scene: Supercar
NCNN
Target: Vulkan GPU - Model: vgg16
Hashcat
Benchmark: MD5
ProjectPhysX OpenCL-Benchmark
Operation: Memory Bandwidth Coalesced Read
IndigoBench
Acceleration: OpenCL GPU - Scene: Bedroom
RealSR-NCNN
Scale: 4x - TAA: Yes
ViennaCL
Test: CPU BLAS - sDOT
NCNN
Target: Vulkan GPU - Model: resnet50
NCNN
Target: Vulkan GPU - Model: squeezenet_ssd
OpenArena
Resolution: 1920 x 1080 - Total Frame Time
Waifu2x-NCNN Vulkan
Scale: 2x - Denoise: 3 - TAA: Yes
VkFFT
Test: FFT + iFFT R2C / C2R
TensorFlow
Device: GPU - Batch Size: 64 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 32 - Model: GoogLeNet
VkFFT
Test: FFT + iFFT C2C multidimensional in single precision
ProjectPhysX OpenCL-Benchmark
Operation: INT8 Compute
TensorFlow
Device: GPU - Batch Size: 16 - Model: VGG-16
TensorFlow
Device: GPU - Batch Size: 32 - Model: VGG-16
ViennaCL
Test: CPU BLAS - sCOPY
TensorFlow
Device: GPU - Batch Size: 16 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 64 - Model: ResNet-50
ProjectPhysX OpenCL-Benchmark
Operation: INT16 Compute
TensorFlow
Device: GPU - Batch Size: 32 - Model: ResNet-50
NCNN
Target: Vulkan GPU - Model: alexnet
cl-mem
Benchmark: Write
TensorFlow
Device: GPU - Batch Size: 16 - Model: ResNet-50
vkpeak
int32-vec4
vkpeak
int32-scalar
vkpeak
fp16-scalar
vkpeak
int16-scalar
vkpeak
fp16-vec4
vkpeak
fp32-vec4
vkpeak
int16-vec4
vkpeak
fp32-scalar
TensorFlow
Device: GPU - Batch Size: 64 - Model: AlexNet
NCNN
Target: Vulkan GPU - Model: yolov4-tiny
TensorFlow
Device: GPU - Batch Size: 32 - Model: AlexNet
ViennaCL
Test: CPU BLAS - sAXPY
cl-mem
Benchmark: Copy
ViennaCL
Test: CPU BLAS - dDOT
TensorFlow
Device: GPU - Batch Size: 16 - Model: AlexNet
TensorFlow
Device: GPU - Batch Size: 1 - Model: VGG-16
VkFFT
Test: FFT + iFFT C2C 1D batched in single precision
ViennaCL
Test: CPU BLAS - dCOPY
TensorFlow
Device: GPU - Batch Size: 1 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 1 - Model: ResNet-50
ViennaCL
Test: CPU BLAS - dGEMM-TN
LuxMark
OpenCL Device: GPU - Scene: Microphone
LuxMark
OpenCL Device: CPU+GPU - Scene: Hotel
LuxMark
OpenCL Device: GPU - Scene: Hotel
LuxMark
OpenCL Device: CPU+GPU - Scene: Microphone
LuxMark
OpenCL Device: CPU+GPU - Scene: Luxball HDR
LuxMark
OpenCL Device: GPU - Scene: Luxball HDR
VkFFT
Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
vkpeak
fp32-scalar
vkpeak
fp16-vec4
vkpeak
fp32-vec4
vkpeak
fp16-scalar
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: MD5 Hash
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Texture Read Bandwidth
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: GEMM SGEMM_N
ProjectPhysX OpenCL-Benchmark
Operation: FP16 Compute
ProjectPhysX OpenCL-Benchmark
Operation: Memory Bandwidth Coalesced Write
TensorFlow
Device: GPU - Batch Size: 1 - Model: AlexNet
Xonotic
Resolution: 1920 x 1080 - Effects Quality: Low
ViennaCL
Test: CPU BLAS - dGEMM-NN
clpeak
OpenCL Test: Global Memory Bandwidth
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: ENERGY-03
VkResample
Upscale: 2x - Precision: Single
OpenArena
Resolution: 1920 x 1080
ParaView
Test: Many Spheres - Resolution: 1920 x 1080
ViennaCL
Test: CPU BLAS - dAXPY
Xonotic
Resolution: 1920 x 1080 - Effects Quality: High
ParaView
Test: Wavelet Contour - Resolution: 1920 x 1080
Xonotic
Resolution: 1920 x 1080 - Effects Quality: Ultra
Unigine Heaven
Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL
ViennaCL
Test: CPU BLAS - dGEMM-TT
ViennaCL
Test: CPU BLAS - dGEMM-NT
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Reduction
NCNN
Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3
VkFFT
Test: FFT + iFFT C2C Bluestein in single precision
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: MEDICAL-O3
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: CATIA-06
Xonotic
Resolution: 1920 x 1080 - Effects Quality: Ultimate
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: MAYA-06
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: S3D
ViennaCL
Test: CPU BLAS - dGEMV-T
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: SOLIDWORKS-07
VkFFT
Test: FFT + iFFT C2C multidimensional in single precision
VkFFT
Test: FFT + iFFT R2C / C2R
IndigoBench
Acceleration: CPU - Scene: Supercar
Darktable
Test: Server Rack - Acceleration: OpenCL
TensorFlow
Device: GPU - Batch Size: 64 - Model: AlexNet
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: SNX-04
Darktable
Test: Server Room - Acceleration: OpenCL
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Triad
SPECViewPerf 2020
Resolution: 1920 x 1080 - Viewset: CREO-03
TensorFlow
Device: GPU - Batch Size: 1 - Model: VGG-16
ViennaCL
Test: CPU BLAS - dGEMV-N
IndigoBench
Acceleration: CPU - Scene: Bedroom
TensorFlow
Device: GPU - Batch Size: 1 - Model: AlexNet
TensorFlow
Device: GPU - Batch Size: 16 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 32 - Model: GoogLeNet
Darktable
Test: Masskrug - Acceleration: OpenCL
Darktable
Test: Server Room - Acceleration: CPU-only
Darktable
Test: Server Rack - Acceleration: CPU-only
Darktable
Test: Boat - Acceleration: OpenCL
Darktable
Test: Boat - Acceleration: CPU-only
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: FFT SP
TensorFlow
Device: GPU - Batch Size: 1 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 32 - Model: VGG-16
TensorFlow
Device: GPU - Batch Size: 1 - Model: ResNet-50
VkFFT
Test: FFT + iFFT C2C 1D batched in single precision
TensorFlow
Device: GPU - Batch Size: 16 - Model: AlexNet
TensorFlow
Device: GPU - Batch Size: 32 - Model: AlexNet
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Bus Speed Readback
VkFFT
Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
Darktable
Test: Masskrug - Acceleration: CPU-only
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Bus Speed Download
TensorFlow
Device: GPU - Batch Size: 64 - Model: ResNet-50
TensorFlow
Device: GPU - Batch Size: 64 - Model: GoogLeNet
TensorFlow
Device: GPU - Batch Size: 32 - Model: ResNet-50
TensorFlow
Device: GPU - Batch Size: 16 - Model: ResNet-50
TensorFlow
Device: GPU - Batch Size: 64 - Model: VGG-16
TensorFlow
Device: GPU - Batch Size: 16 - Model: VGG-16
TensorFlow
Device: GPU - Batch Size: 64 - Model: VGG-16
NeatBench
Acceleration: GPU
Blender
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Blender
Blend File: Barbershop - Compute: NVIDIA OptiX
Blender
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Blender
Blend File: Classroom - Compute: NVIDIA OptiX
Blender
Blend File: BMW27 - Compute: NVIDIA OptiX
ViennaCL
Test: OpenCL BLAS - dGEMM-TT
ViennaCL
Test: OpenCL BLAS - dGEMM-TN
ViennaCL
Test: OpenCL BLAS - dGEMM-NT
ViennaCL
Test: OpenCL BLAS - dGEMM-NN
ViennaCL
Test: OpenCL BLAS - dGEMV-T
ViennaCL
Test: OpenCL BLAS - dGEMV-N
ViennaCL
Test: OpenCL BLAS - dDOT
ViennaCL
Test: OpenCL BLAS - dAXPY
ViennaCL
Test: OpenCL BLAS - dCOPY
clpeak
OpenCL Test: Double-Precision Double
FAHBench
VkResample
Upscale: 2x - Precision: Double
VkFFT
Test: FFT + iFFT C2C Bluestein benchmark in double precision
VkFFT
Test: FFT + iFFT C2C 1D batched in double precision
ProjectPhysX OpenCL-Benchmark
Operation: FP64 Compute
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Max SP Flops
ParaView
Test: Wavelet Contour - Resolution: 1920 x 1080
ParaView
Test: Wavelet Volume - Resolution: 1920 x 1080
ParaView
Test: Wavelet Volume - Resolution: 1920 x 1080
ParaView
Test: Many Spheres - Resolution: 1920 x 1080
VkFFT
Test: FFT + iFFT C2C 1D batched in half precision
NCNN
Target: Vulkan GPU - Model: FastestDet
NCNN
Target: Vulkan GPU - Model: blazeface
NCNN
Target: Vulkan GPU - Model: efficientnet-b0
NCNN
Target: Vulkan GPU - Model: mnasnet
NCNN
Target: Vulkan GPU - Model: shufflenet-v2
NCNN
Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
NCNN
Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
MandelGPU
OpenCL Device: GPU
FinanceBench
Benchmark: Black-Scholes OpenCL
VkFFT
Test: FFT + iFFT C2C 1D batched in half precision
RealSR-NCNN
Scale: 4x - TAA: No
ProjectPhysX OpenCL-Benchmark
Operation: INT64 Compute
Phoronix Test Suite v10.8.5