garuda-unit-test

AMD Ryzen 5 5600X 6-Core testing with a Gigabyte X570 I AORUS PRO WIFI (F36d BIOS) and ASUS NVIDIA GeForce RTX 3070 8GB on Garuda Soaring via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2301034-NE-GARUDAUNI78
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
CantRemember
January 03
  4 Hours, 22 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):


garuda-unit-testOpenBenchmarking.orgPhoronix Test Suite 10.8.4AMD Ryzen 5 5600X 6-Core @ 3.70GHz (6 Cores / 12 Threads)Gigabyte X570 I AORUS PRO WIFI (F36d BIOS)AMD Starship/Matisse16GB1000GB CT1000P1SSD8 + 2000GB CT2000P3SSD8ASUS NVIDIA GeForce RTX 3070 8GBNVIDIA GA104 HD AudioOdyssey G52AIntel I211 + Intel Wi-Fi 6 AX200Garuda Soaring6.1.1-zen1-1-zen (x86_64)KDE Plasma 5.26.4X Server 1.21.1.6NVIDIA 525.60.114.6.01.3.224GCC 12.2.0btrfs5120x1440ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGaruda-unit-test BenchmarksSystem Logs- Transparent Huge Pages: always- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016 - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.04.3a.40.20- GPU Compute Cores: 5888- Python 3.10.9- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

garuda-unit-testshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writefluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Srodinia: OpenCL Myocyterodinia: OpenCL Leukocyterodinia: OpenCL Particle Filterviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dCOPYdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLsmallpt-gpu: GPU - 5120 x 1440 - Causticsmallpt-gpu: GPU - 5120 x 1440 - Cornellsmallpt-gpu: GPU - 5120 x 1440 - Caustic3luxmark: GPU - Hotelluxmark: CPU+GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRluxmark: CPU+GPU - Microphoneluxmark: CPU+GPU - Luxball HDRclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferlulesh-cl: CantRemember217.97924.64561117.1725.2819327.3803589.4020986.726.306327.10392061.08259.5344.0321.122464439450628.6093.2826.13824.137.137.720.530.929.842.640.126.925.627.825.52843423113773851683163273283213213601.7043.3380.1410.7801672726555167272669016727268251104111142351835040435587510475.2510411.5120546.88358.24390.5814.1324.455940.0720OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DCantRemember50100150200250SE +/- 0.19, N = 3217.981. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadCantRemember612182430SE +/- 0.00, N = 324.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPCantRemember2004006008001000SE +/- 7.28, N = 31117.171. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashCantRemember612182430SE +/- 0.04, N = 325.281. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionCantRemember70140210280350SE +/- 0.35, N = 3327.381. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NCantRemember8001600240032004000SE +/- 22.79, N = 33589.401. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsCantRemember4K8K12K16K20KSE +/- 245.20, N = 320986.71. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadCantRemember612182430SE +/- 0.01, N = 326.311. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackCantRemember612182430SE +/- 0.01, N = 327.101. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthCantRemember400800120016002000SE +/- 3.84, N = 32061.081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyCantRemember60120180240300SE +/- 0.27, N = 3259.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadCantRemember70140210280350SE +/- 0.20, N = 3344.01. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteCantRemember70140210280350SE +/- 0.07, N = 3321.11. (CC) gcc options: -O2 -flto -lOpenCL

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP32CantRemember5001000150020002500SE +/- 1.53, N = 32246

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP16CCantRemember10002000300040005000SE +/- 8.84, N = 34439

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 1.4Test: FP32-FP16SCantRemember10002000300040005000SE +/- 1.53, N = 34506

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL BFS

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL LBM

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL Histo

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL TPACF

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Test: OpenCL MRI Gridding

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: SyntaxError: Missing parentheses in call to 'print'. Did you mean print(...)?

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL LavaMD

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: :129:14: error: variables in the local address space can only be declared in the outermost scope of a kernel function

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteCantRemember714212835SE +/- 0.38, N = 1528.611. (CXX) g++ options: -O2 -lOpenCL

Test: OpenCL Heartwall

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL LeukocyteCantRemember0.73851.4772.21552.9543.6925SE +/- 0.033, N = 63.2821. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterCantRemember246810SE +/- 0.065, N = 36.1381. (CXX) g++ options: -O2 -lOpenCL

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYCantRemember612182430SE +/- 0.79, N = 1524.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYCantRemember918273645SE +/- 0.58, N = 1537.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTCantRemember918273645SE +/- 1.56, N = 1537.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYCantRemember510152025SE +/- 0.25, N = 1520.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYCantRemember714212835SE +/- 0.27, N = 1530.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTCantRemember714212835SE +/- 0.50, N = 1429.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NCantRemember1020304050SE +/- 0.70, N = 1542.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TCantRemember918273645SE +/- 1.19, N = 1340.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNCantRemember612182430SE +/- 0.07, N = 1526.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTCantRemember612182430SE +/- 0.15, N = 1425.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNCantRemember714212835SE +/- 0.36, N = 1527.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTCantRemember612182430SE +/- 0.24, N = 1325.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYCantRemember60120180240300SE +/- 0.67, N = 32841. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYCantRemember70140210280350SE +/- 4.93, N = 33421. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTCantRemember70140210280350SE +/- 6.56, N = 33111. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYCantRemember80160240320400SE +/- 3.84, N = 33771. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTCantRemember80160240320400SE +/- 2.03, N = 33851. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NCantRemember4080120160200SE +/- 4.26, N = 31681. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TCantRemember70140210280350SE +/- 3.38, N = 33161. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNCantRemember70140210280350SE +/- 0.88, N = 33271. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTCantRemember70140210280350SE +/- 1.45, N = 33281. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNCantRemember70140210280350SE +/- 9.21, N = 33211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTCantRemember70140210280350SE +/- 10.50, N = 23211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYCantRemember80160240320400SE +/- 4.50, N = 23601. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Boat - Acceleration: OpenCLCantRemember0.38340.76681.15021.53361.917SE +/- 0.014, N = 151.704

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Masskrug - Acceleration: OpenCLCantRemember0.75111.50222.25333.00443.7555SE +/- 0.025, N = 33.338

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Server Rack - Acceleration: OpenCLCantRemember0.03170.06340.09510.12680.1585SE +/- 0.001, N = 30.141

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.2.0Test: Server Room - Acceleration: OpenCLCantRemember0.17550.3510.52650.7020.8775SE +/- 0.003, N = 30.780

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: CausticCantRemember400M800M1200M1600M2000MSE +/- 25.12, N = 316727265551. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: CornellCantRemember400M800M1200M1600M2000MSE +/- 24.25, N = 316727266901. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 5120 x 1440 - Scene: Caustic3CantRemember400M800M1200M1600M2000MSE +/- 24.83, N = 316727268251. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Hotel

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelCantRemember2K4K6K8K10KSE +/- 61.92, N = 311041

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: HotelCantRemember2K4K6K8K10KSE +/- 102.63, N = 1211142

OpenCL Device: CPU - Scene: Microphone

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneCantRemember8K16K24K32K40KSE +/- 189.69, N = 335183

OpenCL Device: CPU - Scene: Luxball HDR

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRCantRemember11K22K33K44K55KSE +/- 164.85, N = 350404

OpenCL Device: Hybrid GPU - Scene: Hotel

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: MicrophoneCantRemember8K16K24K32K40KSE +/- 5.61, N = 335587

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRCantRemember11K22K33K44K55KSE +/- 47.99, N = 351047

OpenCL Device: Hybrid GPU - Scene: Microphone

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenCL Device: Hybrid GPU - Scene: Luxball HDR

CantRemember: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

Test: LU Decomposition

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Compressed Sparse Row

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Cyclic Redundancy Check

CantRemember: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyCantRemember1.18132.36263.54394.72525.9065SE +/- 0.01, N = 35.251. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTCantRemember2K4K6K8K10KSE +/- 36.62, N = 310411.511. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatCantRemember4K8K12K16K20KSE +/- 53.88, N = 320546.881. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleCantRemember80160240320400SE +/- 0.60, N = 3358.241. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthCantRemember80160240320400SE +/- 0.02, N = 3390.581. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferCantRemember48121620SE +/- 0.05, N = 314.131. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferCantRemember612182430SE +/- 0.19, N = 324.451. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06CantRemember13002600390052006500SE +/- 6.98, N = 35940.071. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

64 Results Shown

SHOC Scalable HeterOgeneous Computing:
  OpenCL - S3D
  OpenCL - Triad
  OpenCL - FFT SP
  OpenCL - MD5 Hash
  OpenCL - Reduction
  OpenCL - GEMM SGEMM_N
  OpenCL - Max SP Flops
  OpenCL - Bus Speed Download
  OpenCL - Bus Speed Readback
  OpenCL - Texture Read Bandwidth
cl-mem:
  Copy
  Read
  Write
FluidX3D:
  FP32-FP32
  FP32-FP16C
  FP32-FP16S
Rodinia:
  OpenCL Myocyte
  OpenCL Leukocyte
  OpenCL Particle Filter
ViennaCL:
  CPU BLAS - sCOPY
  CPU BLAS - sAXPY
  CPU BLAS - sDOT
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-TT
  OpenCL BLAS - sCOPY
  OpenCL BLAS - sAXPY
  OpenCL BLAS - sDOT
  OpenCL BLAS - dAXPY
  OpenCL BLAS - dDOT
  OpenCL BLAS - dGEMV-N
  OpenCL BLAS - dGEMV-T
  OpenCL BLAS - dGEMM-NN
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-TN
  OpenCL BLAS - dGEMM-TT
  OpenCL BLAS - dCOPY
Darktable:
  Boat - OpenCL
  Masskrug - OpenCL
  Server Rack - OpenCL
  Server Room - OpenCL
SmallPT GPU:
  GPU - 5120 x 1440 - Caustic
  GPU - 5120 x 1440 - Cornell
  GPU - 5120 x 1440 - Caustic3
LuxMark:
  GPU - Hotel
  CPU+GPU - Hotel
  GPU - Microphone
  GPU - Luxball HDR
  CPU+GPU - Microphone
  CPU+GPU - Luxball HDR
clpeak:
  Kernel Latency
  Integer Compute INT
  Single-Precision Float
  Double-Precision Double
  Global Memory Bandwidth
  Transfer Bandwidth enqueueReadBuffer
  Transfer Bandwidth enqueueWriteBuffer
Lulesh OpenCL