garuda-unit-test

AMD Ryzen 5 5600X 6-Core testing with a Gigabyte X570 I AORUS PRO WIFI (F36d BIOS) and ASUS NVIDIA GeForce RTX 3070 8GB on Garuda Soaring via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2301034-NE-GARUDAUNI78
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
CantRemember
January 03 2023
  4 Hours, 22 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


garuda-unit-testOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 5 5600X 6-Core @ 3.70GHz (6 Cores / 12 Threads)Gigabyte X570 I AORUS PRO WIFI (F36d BIOS)AMD Starship/Matisse16GB1000GB CT1000P1SSD8 + 2000GB CT2000P3SSD8ASUS NVIDIA GeForce RTX 3070 8GBNVIDIA GA104 HD AudioOdyssey G52AIntel I211 + Intel Wi-Fi 6 AX200Garuda Soaring6.1.1-zen1-1-zen (x86_64)KDE Plasma 5.26.4X Server 1.21.1.6NVIDIA 525.60.114.6.01.3.224GCC 12.2.0btrfs5120x1440ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGaruda-unit-test BenchmarksSystem Logs- Transparent Huge Pages: always- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.3a.40.20- GPU Compute Cores: 5888- Python 3.10.9- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

garuda-unit-testshoc: OpenCL - Max SP Flopsluxmark: CPU+GPU - Hotelrodinia: OpenCL Myocyteviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYluxmark: GPU - Microphoneluxmark: GPU - Hotelluxmark: CPU+GPU - Microphoneluxmark: CPU+GPU - Luxball HDRluxmark: GPU - Luxball HDRfluidx3d: FP32-FP32smallpt-gpu: GPU - 5120 x 1440 - Causticsmallpt-gpu: GPU - 5120 x 1440 - Caustic3fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Ssmallpt-gpu: GPU - 5120 x 1440 - Cornellclpeak: Double-Precision Doubleviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYdarktable: Boat - OpenCLshoc: OpenCL - Texture Read Bandwidthclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Transfer Bandwidth enqueueReadBufferrodinia: OpenCL Leukocyterodinia: OpenCL Particle Filtercl-mem: Copycl-mem: Writecl-mem: Readdarktable: Masskrug - OpenCLlulesh-cl: shoc: OpenCL - S3Dshoc: OpenCL - GEMM SGEMM_Ndarktable: Server Room - OpenCLshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Triadshoc: OpenCL - FFT SPdarktable: Server Rack - OpenCLshoc: OpenCL - Reductionclpeak: Integer Compute INTclpeak: Kernel Latencyclpeak: Global Memory Bandwidthshoc: OpenCL - MD5 Hashshoc: OpenCL - Bus Speed Downloadclpeak: Single-Precision Floatopendwarfs: LU DecompositionCantRemember20986.71114228.60925.527.825.626.940.142.629.830.920.537.737.124.13518311041355875104750404224616727265551672726825443945061672726690358.243603213213283273161683853773113422841.7042061.0824.4514.133.2826.138259.5321.1344.03.3385940.0720217.9793589.400.78027.103924.64561117.170.141327.38010411.515.25390.5825.281926.306320546.88OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsCantRemember4K8K12K16K20KSE +/- 245.20, N = 320986.71. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: HotelCantRemember2K4K6K8K10KSE +/- 102.63, N = 1211142

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteCantRemember714212835SE +/- 0.38, N = 1528.611. (CXX) g++ options: -O2 -lOpenCL

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTCantRemember612182430SE +/- 0.24, N = 1325.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNCantRemember714212835SE +/- 0.36, N = 1527.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTCantRemember612182430SE +/- 0.15, N = 1425.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNCantRemember612182430SE +/- 0.07, N = 1526.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TCantRemember918273645SE +/- 1.19, N = 1340.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NCantRemember1020304050SE +/- 0.70, N = 1542.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTCantRemember714212835SE +/- 0.50, N = 1429.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYCantRemember714212835SE +/- 0.27, N = 1530.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYCantRemember510152025SE +/- 0.25, N = 1520.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTCantRemember918273645SE +/- 1.56, N = 1537.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYCantRemember918273645SE +/- 0.58, N = 1537.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYCantRemember612182430SE +/- 0.79, N = 1524.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneCantRemember8K16K24K32K40KSE +/- 189.69, N = 335183

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelCantRemember2K4K6K8K10KSE +/- 61.92, N = 311041

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: MicrophoneCantRemember8K16K24K32K40KSE +/- 5.61, N = 335587

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRCantRemember11K22K33K44K55KSE +/- 47.99, N = 351047

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRCantRemember11K22K33K44K55KSE +/- 164.85, N = 350404

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP32CantRemember5001000150020002500SE +/- 1.53, N = 32246

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: CausticCantRemember400M800M1200M1600M2000MSE +/- 25.12, N = 316727265551. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: Caustic3CantRemember400M800M1200M1600M2000MSE +/- 24.83, N = 316727268251. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP16CCantRemember10002000300040005000SE +/- 8.84, N = 34439

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP16SCantRemember10002000300040005000SE +/- 1.53, N = 34506

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: CornellCantRemember400M800M1200M1600M2000MSE +/- 24.25, N = 316727266901. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleCantRemember80160240320400SE +/- 0.60, N = 3358.241. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYCantRemember80160240320400SE +/- 4.50, N = 23601. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTCantRemember70140210280350SE +/- 10.50, N = 23211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNCantRemember70140210280350SE +/- 9.21, N = 33211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTCantRemember70140210280350SE +/- 1.45, N = 33281. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNCantRemember70140210280350SE +/- 0.88, N = 33271. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TCantRemember70140210280350SE +/- 3.38, N = 33161. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NCantRemember4080120160200SE +/- 4.26, N = 31681. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTCantRemember80160240320400SE +/- 2.03, N = 33851. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYCantRemember80160240320400SE +/- 3.84, N = 33771. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTCantRemember70140210280350SE +/- 6.56, N = 33111. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYCantRemember70140210280350SE +/- 4.93, N = 33421. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYCantRemember60120180240300SE +/- 0.67, N = 32841. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Boat - Acceleration: OpenCLCantRemember0.38340.76681.15021.53361.917SE +/- 0.014, N = 151.704

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthCantRemember400800120016002000SE +/- 3.84, N = 32061.081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferCantRemember612182430SE +/- 0.19, N = 324.451. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferCantRemember48121620SE +/- 0.05, N = 314.131. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL LeukocyteCantRemember0.73851.4772.21552.9543.6925SE +/- 0.033, N = 63.2821. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterCantRemember246810SE +/- 0.065, N = 36.1381. (CXX) g++ options: -O2 -lOpenCL

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyCantRemember60120180240300SE +/- 0.27, N = 3259.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteCantRemember70140210280350SE +/- 0.07, N = 3321.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadCantRemember70140210280350SE +/- 0.20, N = 3344.01. (CC) gcc options: -O2 -flto -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Masskrug - Acceleration: OpenCLCantRemember0.75111.50222.25333.00443.7555SE +/- 0.025, N = 33.338

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06CantRemember13002600390052006500SE +/- 6.98, N = 35940.071. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DCantRemember50100150200250SE +/- 0.19, N = 3217.981. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NCantRemember8001600240032004000SE +/- 22.79, N = 33589.401. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Server Room - Acceleration: OpenCLCantRemember0.17550.3510.52650.7020.8775SE +/- 0.003, N = 30.780

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackCantRemember612182430SE +/- 0.01, N = 327.101. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadCantRemember612182430SE +/- 0.00, N = 324.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPCantRemember2004006008001000SE +/- 7.28, N = 31117.171. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Server Rack - Acceleration: OpenCLCantRemember0.03170.06340.09510.12680.1585SE +/- 0.001, N = 30.141

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionCantRemember70140210280350SE +/- 0.35, N = 3327.381. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTCantRemember2K4K6K8K10KSE +/- 36.62, N = 310411.511. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyCantRemember1.18132.36263.54394.72525.9065SE +/- 0.01, N = 35.251. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthCantRemember80160240320400SE +/- 0.02, N = 3390.581. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashCantRemember612182430SE +/- 0.04, N = 325.281. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Hotel

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadCantRemember612182430SE +/- 0.01, N = 326.311. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL Heartwall

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatCantRemember4K8K12K16K20KSE +/- 53.88, N = 320546.881. (CXX) g++ options: -O3 -rdynamic -lOpenCL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Microphone

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL LavaMD

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: :129:14: error: variables in the local address space can only be declared in the outermost scope of a kernel function

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Luxball HDR

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenCL Device: Hybrid GPU - Scene: Luxball HDR

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenCL Device: Hybrid GPU - Scene: Microphone

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenCL Device: Hybrid GPU - Scene: Hotel

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL MRI Gridding

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL TPACF

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL Histo

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL LBM

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL BFS

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

Test: Cyclic Redundancy Check

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

Test: Compressed Sparse Row

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: LU Decomposition

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

64 Results Shown

SHOC Scalable HeterOgeneous Computing
LuxMark
Rodinia
ViennaCL:
  CPU BLAS - dGEMM-TT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMV-N
  CPU BLAS - dDOT
  CPU BLAS - dAXPY
  CPU BLAS - dCOPY
  CPU BLAS - sDOT
  CPU BLAS - sAXPY
  CPU BLAS - sCOPY
LuxMark:
  GPU - Microphone
  GPU - Hotel
  CPU+GPU - Microphone
  CPU+GPU - Luxball HDR
  GPU - Luxball HDR
FluidX3D
SmallPT GPU:
  GPU - 5120 x 1440 - Caustic
  GPU - 5120 x 1440 - Caustic3
FluidX3D:
  FP32-FP16C
  FP32-FP16S
SmallPT GPU
clpeak
ViennaCL:
  OpenCL BLAS - dCOPY
  OpenCL BLAS - dGEMM-TT
  OpenCL BLAS - dGEMM-TN
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-NN
  OpenCL BLAS - dGEMV-T
  OpenCL BLAS - dGEMV-N
  OpenCL BLAS - dDOT
  OpenCL BLAS - dAXPY
  OpenCL BLAS - sDOT
  OpenCL BLAS - sAXPY
  OpenCL BLAS - sCOPY
Darktable
SHOC Scalable HeterOgeneous Computing
clpeak:
  Transfer Bandwidth enqueueWriteBuffer
  Transfer Bandwidth enqueueReadBuffer
Rodinia:
  OpenCL Leukocyte
  OpenCL Particle Filter
cl-mem:
  Copy
  Write
  Read
Darktable
Lulesh OpenCL
SHOC Scalable HeterOgeneous Computing:
  OpenCL - S3D
  OpenCL - GEMM SGEMM_N
Darktable
SHOC Scalable HeterOgeneous Computing:
  OpenCL - Bus Speed Readback
  OpenCL - Triad
  OpenCL - FFT SP
Darktable
SHOC Scalable HeterOgeneous Computing
clpeak:
  Integer Compute INT
  Kernel Latency
  Global Memory Bandwidth
SHOC Scalable HeterOgeneous Computing:
  OpenCL - MD5 Hash
  OpenCL - Bus Speed Download
clpeak