RTX 4070 SUPER

sudo apt install vulkan-headers vulkan-tools libvulkan-dev

NVIDIA RTX 4070 SUPER

Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: ASUS NVIDIA GeForce RTX 4070 SUPER 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70

OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x11d
Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.c1
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

intel-gpu

Processor: Intel Core i5-10300H @ 4.50GHz (4 Cores / 8 Threads), Motherboard: CML Stonic_CMS (V1.00 BIOS), Chipset: Intel Comet Lake PCH, Memory: 16GB, Disk: 1000GB CT1000P3SSD8 + 256GB Western Digital PC SN530 SDBPNPZ-256G-1014, Graphics: Intel UHD CML GT2 4GB (1350/6000MHz), Audio: Intel Comet Lake PCH cAVS, Network: Realtek Killer E2600 GbE + Intel Comet Lake PCH CNVi WiFi

OS: Ubuntu 24.04, Kernel: 6.8.0-49-generic (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server 1.20.13, Display Driver: NVIDIA 535.183.01, OpenGL: 4.6 Mesa 24.0.9-0ubuntu0.2, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0xfc - Thermald 2.5.6
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.17.4c.00.1d
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

nvidia-gpu

Processor: Intel Core i5-10300H @ 4.50GHz (4 Cores / 8 Threads), Motherboard: CML Stonic_CMS (V1.00 BIOS), Chipset: Intel Comet Lake PCH, Memory: 16GB, Disk: 1000GB CT1000P3SSD8 + 256GB Western Digital PC SN530 SDBPNPZ-256G-1014, Graphics: NVIDIA GeForce GTX 1650 Ti 4GB, Audio: Intel Comet Lake PCH cAVS, Network: Realtek Killer E2600 GbE + Intel Comet Lake PCH CNVi WiFi

OS: Ubuntu 24.04, Kernel: 6.8.0-49-generic (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server 1.20.13, Display Driver: NVIDIA 535.183.01, OpenGL: 4.6.0, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: __GLX_VENDOR_LIBRARY_NAME=nvidia
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0xfc - Thermald 2.5.6
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.17.4c.00.1d
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Intel ARC A770 8Gb

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: MSI MEG Z890 UNIFY-X (MS-7E20) v1.0 (1.A10 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6000MT/s Corsair CMH32GX5M2B6000Z30, Disk: 1024GB Wodposit NVMe SSD, Graphics: MSI Intel Arc A770 DG2 8GB, Audio: Intel DG2 Audio, Monitor: PiKVM V3, Network: Realtek Device 5000 + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.12.1-061201-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.3.1 kisak-mesa PPA, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x110 - Thermald 2.5.8
Python Notes: Python 3.12.7
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Intel ARC A750

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: MSI MEG Z890 UNIFY-X (MS-7E20) v1.0 (1.A10 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6000MT/s Corsair CMH32GX5M2B6000Z30, Disk: 1024GB Wodposit NVMe SSD, Graphics: Intel Arc A750 DG2 8GB, Audio: Intel DG2 Audio, Monitor: PiKVM V3, Network: Realtek Device 5000 + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.12.1-061201-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.3.1 kisak-mesa PPA, OpenCL: OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1920x1080

Intel ARC A580

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: MSI MEG Z890 UNIFY-X (MS-7E20) v1.0 (1.A10 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6000MT/s Corsair CMH32GX5M2B6000Z30, Disk: 1024GB Wodposit NVMe SSD, Graphics: Intel Arc A580 DG2 8GB, Audio: Intel DG2 Audio, Monitor: PiKVM V3, Network: Realtek Device 5000 + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.12.1-061201-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.3.1 kisak-mesa PPA, OpenCL: OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1280x720

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: GPU - Batch Size: 32 - Model: VGG-16

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Device: GPU - Batch Size: 64 - Model: VGG-16

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: UnboundLocalError: cannot access local variable 'decorators' where it is not associated with a value

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Intel ARC A580: The test quit with a non-zero exit status.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: GPU - Batch Size: 16 - Model: VGG-16

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Device: GPU - Batch Size: 64 - Model: ResNet-50

Intel ARC A580: The test quit with a non-zero exit status.

Device: GPU - Batch Size: 64 - Model: ResNet-50

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Intel ARC A580: The test quit with a non-zero exit status.

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

Unigine Heaven

SPECViewPerf 2020

TensorFlow

Device: GPU - Batch Size: 32 - Model: ResNet-50

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Intel ARC A580: The test quit with a non-zero exit status.

SPECViewPerf 2020

TensorFlow

Device: GPU - Batch Size: 32 - Model: ResNet-50

Intel ARC A580: The test quit with a non-zero exit status.

SPECViewPerf 2020

Unigine Valley

This test calculates the average frame-rate within the Valley demo for the Unigine engine, released in February 2013. This engine is extremely demanding on the system's graphics card. Unigine Valley relies upon an OpenGL 3 core profile context. Learn more via the OpenBenchmarking.org test page.

SPECViewPerf 2020

TensorFlow

Device: GPU - Batch Size: 16 - Model: ResNet-50

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

TensorFlow

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

Xonotic

This is a benchmark of Xonotic, which is a fork of the DarkPlaces-based Nexuiz game. Development began in March of 2010 on the Xonotic game for this open-source first person shooter title. Learn more via the OpenBenchmarking.org test page.

SPECViewPerf 2020

TensorFlow

Device: GPU - Batch Size: 64 - Model: GoogLeNet

Intel ARC A580: The test quit with a non-zero exit status.

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

vkpeak

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

TensorFlow

Device: GPU - Batch Size: 32 - Model: GoogLeNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Device: GPU - Batch Size: 64 - Model: GoogLeNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Intel ARC A580: The test quit with a non-zero exit status.

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

TensorFlow

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

TensorFlow

Xonotic

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: GPU - Batch Size: 16 - Model: GoogLeNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: GPU - Batch Size: 64 - Model: AlexNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Intel ARC A580: The test quit with a non-zero exit status.

VkFFT

TensorFlow

Device: GPU - Batch Size: 32 - Model: AlexNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

vkpeak

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

Xonotic

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

TensorFlow

VkFFT

Test: FFT + iFFT R2C / C2R

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

Blender

Xonotic

TensorFlow

Device: GPU - Batch Size: 1 - Model: VGG-16

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

OpenArena

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

Intel ARC A750: The test quit with a non-zero exit status.

Intel ARC A580: The test quit with a non-zero exit status.

TensorFlow

Device: GPU - Batch Size: 16 - Model: AlexNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

ParaView

VkFFT

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

TensorFlow

Blender

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

Intel ARC A770 8Gb: The test run did not produce a result.

Intel ARC A750: The test run did not produce a result.

Intel ARC A580: The test run did not produce a result.

TensorFlow

Device: GPU - Batch Size: 1 - Model: ResNet-50

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./vkfft: 3: ./Vulkan_FFT: not found

Intel ARC A750: The test quit with a non-zero exit status.

Intel ARC A580: The test quit with a non-zero exit status.

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

Operation: INT8 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: Memory Bandwidth Coalesced Read

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: INT16 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: Memory Bandwidth Coalesced Write

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: INT32 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: INT64 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

Operation: FP32 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

SHOC Scalable HeterOgeneous Computing

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

VkFFT

ViennaCL

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Upscale: 2x - Precision: Double

Intel ARC A770 8Gb: The test quit with a non-zero exit status.

Intel ARC A750: The test quit with a non-zero exit status.

Intel ARC A580: The test quit with a non-zero exit status.

VkFFT

RealSR-NCNN

Blender

ViennaCL

Blender

ViennaCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Benchmark: SHA1

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

Benchmark: 7-Zip

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenCL Test: Double-Precision Double

Intel ARC A750: The test run did not produce a result.

Intel ARC A580: The test run did not produce a result.

TensorFlow

VkResample

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Benchmark: SHA-512

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Benchmark: MD5

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

ParaView

TensorFlow

Device: GPU - Batch Size: 1 - Model: GoogLeNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Benchmark: Read

Intel ARC A770 8Gb: The test quit with a non-zero exit status.

Benchmark: Write

Intel ARC A770 8Gb: The test quit with a non-zero exit status.

SHOC Scalable HeterOgeneous Computing

TensorFlow

Device: GPU - Batch Size: 1 - Model: AlexNet

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

Operation: FP64 Compute

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: | Error: There are no OpenCL devices available. Make sure that the OpenCL 1.2 |

SHOC Scalable HeterOgeneous Computing

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Benchmark: Copy

Intel ARC A770 8Gb: The test quit with a non-zero exit status.

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

Intel ARC A750: The test quit with a non-zero exit status.

Intel ARC A580: The test quit with a non-zero exit status.

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Benchmark: TrueCrypt RIPEMD160 + XTS

Intel ARC A770 8Gb: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

Darktable

ParaView

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Darktable

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x7670bcda4450 google::LogMessageFatal::~LogMessageFatal()

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x7b5ea59be450 google::LogMessageFatal::~LogMessageFatal()

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x7d7151816450 google::LogMessageFatal::~LogMessageFatal()

Darktable

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x7dd7c6de3450 google::LogMessageFatal::~LogMessageFatal()

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x73552c3e3450 google::LogMessageFatal::~LogMessageFatal()

Darktable

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: @ 0x74746a490450 google::LogMessageFatal::~LogMessageFatal()

Darktable

SHOC Scalable HeterOgeneous Computing

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

NVIDIA RTX 4070 SUPER: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Intel ARC A770 8Gb: The test run did not produce a result.

Intel ARC A750: The test run did not produce a result.

Intel ARC A580: The test run did not produce a result.

SHOC Scalable HeterOgeneous Computing

Darktable

SHOC Scalable HeterOgeneous Computing

Darktable

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

Acceleration: GPU

Intel ARC A770 8Gb: The test run did not produce a result. E: Failed to load CUDA driver ("/usr/lib64/libcuda.so.1")

Intel ARC A750: The test run did not produce a result.

Intel ARC A580: The test run did not produce a result.

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

Intel ARC A750: The test quit with a non-zero exit status.

Intel ARC A580: The test quit with a non-zero exit status.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Target: Vulkan GPU

NVIDIA RTX 4070 SUPER: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ncnn: line 3: ./benchncnn: No such file or directory

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

Test: Conjugate Gradient OpenCL

NVIDIA RTX 4070 SUPER: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: arrayfire: line 3: ./cg_opencl: No such file or directory

Intel ARC A750: The test run did not produce a result. E: ./arrayfire: 3: ./cg_opencl: not found

Intel ARC A580: The test run did not produce a result. E: ./arrayfire: 3: ./cg_opencl: not found

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Codec: ETC1 - Quality: Highest

Intel ARC A750: The test quit with a non-zero exit status. E: ./betsy: 3: ./betsy: not found

Intel ARC A580: The test quit with a non-zero exit status. E: ./betsy: 3: ./betsy: not found

GLmark2

This is a test of GLmark2, a basic OpenGL and OpenGL ES 2.0 benchmark supporting various windowing/display back-ends. Learn more via the OpenBenchmarking.org test page.

Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

Intel ARC A750: The test quit with a non-zero exit status. E: ./glmark2: 2: ./bin/glmark2: not found

Intel ARC A580: The test quit with a non-zero exit status. E: ./glmark2: 2: ./bin/glmark2: not found

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

Intel ARC A750: The test quit with a non-zero exit status. E: ./betsy: 3: ./betsy: not found

Intel ARC A580: The test quit with a non-zero exit status. E: ./betsy: 3: ./betsy: not found

184 Results Shown

TensorFlow
IndigoBench
TensorFlow:
GPU - 32 - VGG-16:
images/sec
images/sec
GPU - 64 - VGG-16:
images/sec
GPU - 16 - VGG-16:
images/sec
NCNN:
Vulkan GPU - vgg16
Vulkan GPU - FastestDet
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
TensorFlow:
GPU - 16 - VGG-16
GPU - 64 - ResNet-50
GPU - 64 - ResNet-50
SPECViewPerf 2020
Unigine Heaven
SPECViewPerf 2020:
1920 x 1080 - SOLIDWORKS-07
1920 x 1080 - MAYA-06
TensorFlow
SPECViewPerf 2020
TensorFlow:
GPU - 32 - ResNet-50
GPU - 16 - ResNet-50
SPECViewPerf 2020
Unigine Valley
SPECViewPerf 2020
TensorFlow:
GPU - 16 - AlexNet
GPU - 16 - ResNet-50
vkpeak:
int32-scalar
int32-vec4
int16-vec4
int16-scalar
fp16-vec4
fp16-scalar
fp32-vec4
fp32-scalar
TensorFlow
VkFFT
Xonotic
SPECViewPerf 2020
TensorFlow:
GPU - 32 - GoogLeNet
GPU - 64 - GoogLeNet
LuxMark:
GPU - Microphone
GPU - Hotel
CPU+GPU - Microphone
GPU - Luxball HDR
CPU+GPU - Luxball HDR
CPU+GPU - Hotel
TensorFlow:
GPU - 32 - GoogLeNet
GPU - 64 - GoogLeNet
IndigoBench
TensorFlow
VkFFT
TensorFlow
Xonotic
SHOC Scalable HeterOgeneous Computing
TensorFlow:
GPU - 1 - VGG-16
GPU - 16 - GoogLeNet
VkFFT
IndigoBench:
CPU - Bedroom
CPU - Supercar
TensorFlow
VkFFT
TensorFlow
vkpeak:
fp16-vec4
fp16-scalar
fp32-vec4
fp32-scalar
VkFFT
RealSR-NCNN
Blender
VkFFT:
FFT + iFFT C2C 1D batched in single precision, no reshuffling:
Benchmark Score
Benchmark Score
Xonotic
VkFFT
TensorFlow
VkFFT
Blender
Xonotic
TensorFlow
OpenArena
OpenArena
VkFFT
TensorFlow
ParaView:
Many Spheres - 1920 x 1080:
MiPolys / Sec
Frames / Sec
VkFFT
ProjectPhysX OpenCL-Benchmark
TensorFlow
Blender
FAHBench
TensorFlow
VkFFT
ProjectPhysX OpenCL-Benchmark:
INT8 Compute
Memory Bandwidth Coalesced Read
INT16 Compute
Memory Bandwidth Coalesced Write
INT32 Compute
INT64 Compute
FP32 Compute
SHOC Scalable HeterOgeneous Computing
ViennaCL:
CPU BLAS - dGEMM-TT
CPU BLAS - dGEMM-TN
CPU BLAS - dGEMM-NT
CPU BLAS - dGEMM-NN
CPU BLAS - dGEMV-T
CPU BLAS - dGEMV-N
CPU BLAS - dDOT
CPU BLAS - dAXPY
CPU BLAS - dCOPY
CPU BLAS - sDOT
CPU BLAS - sAXPY
CPU BLAS - sCOPY
VkFFT
ViennaCL:
OpenCL BLAS - dGEMM-TT
OpenCL BLAS - dGEMM-TN
OpenCL BLAS - dGEMM-NT
OpenCL BLAS - dGEMM-NN
OpenCL BLAS - dGEMV-T
OpenCL BLAS - dGEMV-N
OpenCL BLAS - dDOT
OpenCL BLAS - dAXPY
OpenCL BLAS - dCOPY
VkResample
VkFFT
RealSR-NCNN
Blender
ViennaCL:
OpenCL BLAS - sCOPY
OpenCL BLAS - sAXPY
Blender
ViennaCL
clpeak:
Single-Precision Float
Integer Compute INT
Hashcat:
SHA1
7-Zip
clpeak
TensorFlow
VkResample
Hashcat
MandelGPU
Hashcat
Darktable
ParaView:
Wavelet Volume - 1920 x 1080:
MiVoxels / Sec
Frames / Sec
TensorFlow
cl-mem:
Read
Write
SHOC Scalable HeterOgeneous Computing
TensorFlow
ProjectPhysX OpenCL-Benchmark
SHOC Scalable HeterOgeneous Computing
Waifu2x-NCNN Vulkan
cl-mem
Hashcat
Darktable
ParaView:
Wavelet Contour - 1920 x 1080:
MiPolys / Sec
Frames / Sec
clpeak
Darktable:
Masskrug - OpenCL
Server Room - OpenCL
FinanceBench
Darktable:
Masskrug - CPU-only
Server Room - CPU-only
SHOC Scalable HeterOgeneous Computing:
OpenCL - Triad
OpenCL - MD5 Hash
OpenCL - Bus Speed Readback
Darktable
SHOC Scalable HeterOgeneous Computing:
OpenCL - Bus Speed Download
OpenCL - FFT SP
OpenCL - S3D
Darktable
NeatBench

NVIDIA RTX 4070 SUPER

Testing initiated at 25 January 2024 21:36 by user test.

intel-gpu

Testing initiated at 5 December 2024 08:22 by user gandharv.

nvidia-gpu

Processor: Intel Core i5-10300H @ 4.50GHz (4 Cores / 8 Threads), Motherboard: CML Stonic_CMS (V1.00 BIOS), Chipset: Intel Comet Lake PCH, Memory: 16GB, Disk: 1000GB CT1000P3SSD8 + 256GB Western Digital PC SN530 SDBPNPZ-256G-1014, Graphics: NVIDIA GeForce GTX 1650 Ti 4GB, Audio: Intel Comet Lake PCH cAVS, Network: Realtek Killer E2600 GbE + Intel Comet Lake PCH CNVi WiFi

OS: Ubuntu 24.04, Kernel: 6.8.0-49-generic (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server 1.20.13, Display Driver: NVIDIA 535.183.01, OpenGL: 4.6.0, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1080

Testing initiated at 5 December 2024 08:38 by user gandharv.

Intel ARC A770 8Gb

Testing initiated at 7 December 2024 09:07 by user test.

Intel ARC A750

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: MSI MEG Z890 UNIFY-X (MS-7E20) v1.0 (1.A10 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6000MT/s Corsair CMH32GX5M2B6000Z30, Disk: 1024GB Wodposit NVMe SSD, Graphics: Intel Arc A750 DG2 8GB, Audio: Intel DG2 Audio, Monitor: PiKVM V3, Network: Realtek Device 5000 + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.12.1-061201-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.3.1 kisak-mesa PPA, OpenCL: OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1920x1080

Testing initiated at 7 December 2024 12:59 by user test.

Intel ARC A580

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: MSI MEG Z890 UNIFY-X (MS-7E20) v1.0 (1.A10 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6000MT/s Corsair CMH32GX5M2B6000Z30, Disk: 1024GB Wodposit NVMe SSD, Graphics: Intel Arc A580 DG2 8GB, Audio: Intel DG2 Audio, Monitor: PiKVM V3, Network: Realtek Device 5000 + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.12.1-061201-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.3.1 kisak-mesa PPA, OpenCL: OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1280x720

Testing initiated at 9 December 2024 22:24 by user test.

RTX 4070 SUPER

View

Statistics

Graph Settings

Additional Graphs

Multi-Way Comparison

Table

Run Management

NVIDIA RTX 4070 SUPER

intel-gpu

nvidia-gpu

Intel ARC A770 8Gb

Intel ARC A750

Intel ARC A580

TensorFlow

IndigoBench

TensorFlow

NCNN

TensorFlow

SPECViewPerf 2020

Unigine Heaven

SPECViewPerf 2020

TensorFlow

SPECViewPerf 2020

TensorFlow

SPECViewPerf 2020

Unigine Valley

SPECViewPerf 2020

TensorFlow

vkpeak

TensorFlow

VkFFT

Xonotic

SPECViewPerf 2020

TensorFlow

LuxMark

vkpeak

TensorFlow

IndigoBench

TensorFlow

VkFFT

TensorFlow

Xonotic

SHOC Scalable HeterOgeneous Computing

TensorFlow

VkFFT

IndigoBench

TensorFlow

VkFFT

TensorFlow

vkpeak

VkFFT

RealSR-NCNN

Blender

VkFFT

Xonotic

VkFFT

TensorFlow

VkFFT

Blender

Xonotic

TensorFlow

OpenArena

VkFFT

TensorFlow

ParaView

VkFFT

ProjectPhysX OpenCL-Benchmark

TensorFlow

Blender

FAHBench

TensorFlow

VkFFT

ProjectPhysX OpenCL-Benchmark

SHOC Scalable HeterOgeneous Computing

ViennaCL

VkFFT

ViennaCL

VkResample

VkFFT