gooxi-ai

KVM testing on Ubuntu 24.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2408042-NE-GOOXIAI6105
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 3 Tests
C++ Boost Tests 2 Tests
CPU Massive 4 Tests
Creator Workloads 3 Tests
HPC - High Performance Computing 8 Tests
Machine Learning 5 Tests
Multi-Core 6 Tests
NVIDIA GPU Compute 20 Tests
OpenCL 6 Tests
OpenMPI Tests 2 Tests
Python Tests 2 Tests
Renderers 2 Tests
Scientific Computing 2 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Nvidia compute
August 03
 
Nvidia Compute
August 03
  10 Hours, 43 Minutes
Invert Hiding All Results Option
  5 Hours, 22 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


gooxi-aiOpenBenchmarking.orgPhoronix Test SuiteIntel Xeon Gold 6226R (32 Cores)QEMU Standard PC (i440FX + PIIX 1996) (rel-1.16.2-0-gea1b7a073390-prebuilt.qemu.org BIOS)Intel 440FX 82441FX PMC16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 16 GB + 14240 MB RAM1100GB QEMU HDDNVIDIA RTX A4000 16GBNVIDIA GA104 HD AudioQEMU MonitorRed Hat Virtio deviceUbuntu 24.046.8.0-39-generic (x86_64)X ServerNVIDIAGCC 13.2.0 + CUDA 12.4ext41280x800KVMProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionSystem LayerGooxi-ai BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - CPU Microcode: 0x5000021- BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.57.00.08- Python 3.12.3- gather_data_sampling: Unknown: Dependent on hypervisor status + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + reg_file_data_sampling: Not affected + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: SW loop KVM: SW loop + srbds: Not affected + tsx_async_abort: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown

gooxi-aincnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3ncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetluxcorerender: DLSC - GPUluxcorerender: LuxCore Benchmark - GPUfahbench: viennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYblender: Junkshop - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA OptiXhashcat: MD5hashcat: SHA1luxcorerender: Orange Juice - GPUblender: Barbershop - NVIDIA OptiXluxcorerender: Danish Mood - GPUblender: BMW27 - NVIDIA OptiXhashcat: TrueCrypt RIPEMD160 + XTShashcat: 7-Ziphashcat: SHA-512blender: Classroom - NVIDIA OptiXclpeak: Double-Precision Doubleblender: Fishy Cat - NVIDIA OptiXluxcorerender: Rainbow Colors and Prism - GPUclpeak: Global Memory Bandwidthclpeak: Integer Compute INTclpeak: Single-Precision Floatmixbench: NVIDIA CUDA - Integermixbench: NVIDIA CUDA - Half Precisionmixbench: NVIDIA CUDA - Double Precisionmixbench: NVIDIA CUDA - Single Precisionmixbench: OpenCL - IntegerNvidia computeNvidia Compute8.6775.1822.2218.6136.4623.4524.688.3912.5650.7820.073.2010.047.167.937.068.4323.4565.4533.02234.623173.373.271.573.391.077.778.078.851.4158194120.422.5318.221764739500005853410625056.9750.3840.417.64403530049603331531120000013.41361.5311.21135.27377.039572.6218537.609803.6721382.24300.5720359.29OpenBenchmarking.org

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

Acceleration: GPU

Nvidia Compute: The test run did not produce a result.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetNvidia Compute246810SE +/- 0.35, N = 128.67MIN: 6.81 / MAX: 11.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformerNvidia Compute20406080100SE +/- 0.66, N = 1275.18MIN: 70.08 / MAX: 106.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mNvidia Compute510152025SE +/- 0.17, N = 1222.22MIN: 20.56 / MAX: 24.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdNvidia Compute510152025SE +/- 0.33, N = 1218.61MIN: 16.16 / MAX: 21.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinyNvidia Compute816243240SE +/- 0.45, N = 1236.46MIN: 31.52 / MAX: 88.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3Nvidia Compute612182430SE +/- 0.37, N = 1223.45MIN: 20.89 / MAX: 60.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50Nvidia Compute612182430SE +/- 0.37, N = 1224.68MIN: 21.62 / MAX: 32.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetNvidia Compute246810SE +/- 0.08, N = 128.39MIN: 7.4 / MAX: 9.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18Nvidia Compute3691215SE +/- 0.22, N = 1212.56MIN: 10.26 / MAX: 13.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16Nvidia Compute1122334455SE +/- 0.41, N = 1250.78MIN: 45.84 / MAX: 60.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetNvidia Compute510152025SE +/- 0.36, N = 1220.07MIN: 16.97 / MAX: 23.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefaceNvidia Compute0.721.442.162.883.6SE +/- 0.08, N = 123.20MIN: 2.6 / MAX: 3.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0Nvidia Compute3691215SE +/- 0.14, N = 1210.04MIN: 9.14 / MAX: 11.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetNvidia Compute246810SE +/- 0.12, N = 127.16MIN: 6.32 / MAX: 8.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2Nvidia Compute246810SE +/- 0.06, N = 127.93MIN: 7.3 / MAX: 12.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3Nvidia Compute246810SE +/- 0.08, N = 127.06MIN: 6.45 / MAX: 7.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2Nvidia Compute246810SE +/- 0.21, N = 128.43MIN: 7.52 / MAX: 11.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetNvidia Compute612182430SE +/- 0.37, N = 1223.45MIN: 20.89 / MAX: 60.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPUNvidia Compute1530456075SE +/- 5.95, N = 1265.45MAX: 75.04

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPUNvidia Compute816243240SE +/- 3.00, N = 1233.02MAX: 50.55

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2Nvidia Compute50100150200250SE +/- 0.37, N = 3234.62

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTNvidia Compute1632486480SE +/- 0.28, N = 1573.31. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNNvidia Compute1632486480SE +/- 0.41, N = 1573.21. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTNvidia Compute1632486480SE +/- 0.28, N = 1571.51. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNNvidia Compute1632486480SE +/- 0.27, N = 1573.31. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TNvidia Compute20406080100SE +/- 0.90, N = 1591.01. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NNvidia Compute20406080100SE +/- 0.38, N = 1577.71. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTNvidia Compute20406080100SE +/- 0.39, N = 1578.01. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYNvidia Compute20406080100SE +/- 0.16, N = 1578.81. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYNvidia Compute1224364860SE +/- 0.26, N = 1551.41. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTNvidia Compute306090120150SE +/- 2.13, N = 151581. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYNvidia Compute4080120160200SE +/- 2.25, N = 151941. (CXX) g++ options: -fopenmp -O3 -rdynamic

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYNvidia Compute306090120150SE +/- 3.29, N = 15120.41. (CXX) g++ options: -fopenmp -O3 -rdynamic

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA OptiXNvidia Compute510152025SE +/- 0.16, N = 1322.53

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

Nvidia Compute: The test quit with a non-zero exit status. E: Fatal error:

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXNvidia Compute48121620SE +/- 0.19, N = 1518.22

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5Nvidia Compute40000M80000M120000M160000M200000MSE +/- 36846448323.63, N = 16176473950000

Benchmark: MD5

Nvidia compute: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1Nvidia Compute13000M26000M39000M52000M65000MSE +/- 12213728795.72, N = 1658534106250

Benchmark: SHA1

Nvidia compute: The test quit with a non-zero exit status. E: ./hashcat: 3: ./hashcat.bin: not found

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPUNvidia Compute1326395265SE +/- 0.55, N = 356.97MIN: 46.42 / MAX: 72.03

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA OptiXNvidia Compute1122334455SE +/- 0.24, N = 350.38

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPUNvidia Compute918273645SE +/- 0.25, N = 340.41MIN: 19.22 / MAX: 52.88

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA OptiXNvidia Compute246810SE +/- 0.13, N = 157.64

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSNvidia Compute900K1800K2700K3600K4500KSE +/- 6018.58, N = 34035300

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipNvidia Compute1.1M2.2M3.3M4.4M5.5MSE +/- 8457.41, N = 34960333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512Nvidia Compute3000M6000M9000M12000M15000MSE +/- 25008065.37, N = 315311200000

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA OptiXNvidia Compute3691215SE +/- 0.01, N = 313.41

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision DoubleNvidia Compute80160240320400SE +/- 0.33, N = 3361.531. (CXX) g++ options: -O3

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA OptiXNvidia Compute3691215SE +/- 0.07, N = 311.21

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPUNvidia Compute306090120150SE +/- 0.66, N = 3135.27MIN: 119.63 / MAX: 156.49

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthNvidia Compute80160240320400SE +/- 0.00, N = 3377.031. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer Compute INTNvidia Compute2K4K6K8K10KSE +/- 84.11, N = 39572.621. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision FloatNvidia Compute4K8K12K16K20KSE +/- 15.25, N = 318537.601. (CXX) g++ options: -O3

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerNvidia Compute2K4K6K8K10KSE +/- 10.08, N = 39803.671. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionNvidia Compute5K10K15K20K25KSE +/- 16.35, N = 321382.241. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionNvidia Compute70140210280350SE +/- 0.01, N = 3300.571. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionNvidia Compute4K8K12K16K20KSE +/- 8.22, N = 320359.291. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./plaidml: line 24: /.local/bin/plaidbench: No such file or directory

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./plaidml: line 24: /.local/bin/plaidbench: No such file or directory

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./plaidml: line 24: /.local/bin/plaidbench: No such file or directory

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./plaidml: line 24: /.local/bin/plaidbench: No such file or directory

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./plaidml: line 24: /.local/bin/plaidbench: No such file or directory

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

Benchmark: Black-Scholes OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./financebench: line 4: ./blackScholesAnalyticEngine.exe: No such file or directory

ArrayFire

Test: Conjugate Gradient OpenCL

Nvidia Compute: The test run did not produce a result. E: ./arrayfire: 7: ./cg_opencl: not found

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU

Nvidia Compute: The test quit with a non-zero exit status. E: ./mandelGPU: error while loading shared libraries: libglut.so.3: cannot open shared object file: No such file or directory

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

Nvidia Compute: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL BLAS

Nvidia Compute: The test quit with a non-zero exit status. E: ./viennacl: 2: ./ViennaCL-1.7.1/build/examples/benchmarks/dense_blas-bench-opencl: not found

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL Particle Filter

Nvidia Compute: The test quit with a non-zero exit status. E: ./rodinia: 49: ./OCL_particlefilter_double: not found

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Backend: OpenCL

Nvidia Compute: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Nvidia Compute: The test quit with a non-zero exit status. E: ./redshift: 3: /usr/redshift/bin/redshiftBenchmark: not found

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Benchmark: Write

Nvidia Compute: The test quit with a non-zero exit status. E: ./cl-mem: 3: ./cl-mem: not found

Benchmark: Read

Nvidia Compute: The test quit with a non-zero exit status. E: ./cl-mem: 3: ./cl-mem: not found

Benchmark: Copy

Nvidia Compute: The test quit with a non-zero exit status. E: ./cl-mem: 3: ./cl-mem: not found

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

Target: OpenCL - Benchmark: Texture Read Bandwidth

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: Bus Speed Readback

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: Bus Speed Download

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: Max SP Flops

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: GEMM SGEMM_N

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: Reduction

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: MD5 Hash

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: FFT SP

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: Triad

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Target: OpenCL - Benchmark: S3D

Nvidia Compute: The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Backend: OpenCL - Benchmark: Single Precision

Nvidia Compute: The test quit with a non-zero exit status. E: ./mixbench: 3: ./mixbench-ocl-ro: not found

Backend: OpenCL - Benchmark: Double Precision

Nvidia Compute: The test quit with a non-zero exit status. E: ./mixbench: 3: ./mixbench-ocl-ro: not found

Backend: OpenCL - Benchmark: Integer

Nvidia Compute: The test quit with a non-zero exit status. E: ./mixbench: 3: ./mixbench-ocl-ro: not found