Test

2 x Intel Xeon E5-2682 v4 testing with a Supermicro SYS-7048GR-TR X10DRG-Q v1.10 (3.2 BIOS) and MSI NVIDIA GeForce RTX 3090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

test

Processor: 2 x Intel Xeon E5-2682 v4 @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro X10DRG-Q v1.10 (3.2 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 32GB, Disk: 1000GB Samsung SSD 870 + 4001GB Western Digital WD40EFPX-68C, Graphics: MSI NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC888-VD, Network: 2 x Intel I350

OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: NVM_CD_FLAGS=-q
Processor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0xb000040
Python Notes: Python 3.10.14
Security Notes: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Test

Processor: 2 x Intel Xeon E5-2682 v4 @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro SYS-7048GR-TR X10DRG-Q v1.10 (3.2 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 2 x 16GB DDR4-2400MT/s, Disk: 1000GB Samsung SSD 870 + 4001GB Western Digital WD40EFPX-68C, Graphics: MSI NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC888-VD, Network: 2 x Intel I350

OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.89, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: NVM_CD_FLAGS=-q
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0xb000040
Security Notes: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

MSI NVIDIA GeForce RTX 3090

OS: Ubuntu 22.04, Kernel: 5.15.0-102-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.4.89, Vulkan: 1.3.277, Compiler: GCC 11.4.0 + Clang 14.0.0-1ubuntu1.1 + LLVM 14.0.0 + CUDA 12.4, File-System: ext4, Screen Resolution: 1024x768

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: NVM_CD_FLAGS=-q
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0xb000040
Graphics Notes: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.59.00.8a
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Mode: NVIDIA CUDA GPU

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status.

Mode: NVIDIA RTX GPU

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status.

Blender

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/home/guyi/miniconda3/envs/Structimragh/lib/python3.10/collections/__init__.py)

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: caffe: line 3: ./tools/caffe: No such file or directory

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: Fatal error:

ViennaCL

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

Test: Conjugate Gradient OpenCL

MSI NVIDIA GeForce RTX 3090: The test run did not produce a result. E: arrayfire: line 3: ./cg_opencl: No such file or directory

LuxCoreRender

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Backend: OpenCL

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: lczero: line 4: ./lc0: No such file or directory

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

MSI NVIDIA GeForce RTX 3090: The test quit with a non-zero exit status. E: redshift: line 3: /usr/redshift/bin/redshiftBenchmark: No such file or directory

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

Target: OpenCL - Benchmark: Texture Read Bandwidth