clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark clpeak.

Project Site

github.com

Test Created

16 January 2019

Last Updated

9 February 2019

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

6 Seconds

Average Run Time

3 Minutes, 37 Seconds

Test Dependencies

C/C++ Compiler Toolchain + OpenCL + CMake

Accolades

20k+ Downloads

Supported Platforms


Public Result UploadsReported Installs*Test Completions*OpenBenchmarking.orgEventsclpeak Popularity Statisticspts/clpeak2019.012019.022019.032019.042019.052019.062019.072019.082019.092019.102019.112019.122020.012020.022020.032020.042020.052020.062020.072020.082020.092020.102020.112020.122021.012021.022021.032021.0430060090012001500
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Sat, 10 Apr 2021 06:20:45 GMT.
Transfer Bandwidth enqueueWriteBuffer11.2%Integer Compute INT21.6%Single-Precision Float23.1%Transfer Bandwidth enqueueReadBuffer10.6%Double-Precision Double21.3%Kernel Latency12.2%OpenCL Test Option PopularityOpenBenchmarking.org

Revision History

pts/clpeak-1.0.1   [View Source]   Sat, 09 Feb 2019 10:28:08 GMT
Always select first platform, per https://github.com/phoronix-test-suite/test-profiles/issues/56

pts/clpeak-1.0.0   [View Source]   Wed, 16 Jan 2019 05:52:26 GMT
Commit of a pts/clpeak test profile that builds clpeak itself rather than system/clpeak relying upon system supplied binary.

Suites Using This Test

OpenCL

NVIDIA GPU Compute


Performance Metrics

Analyze Test Configuration:

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.org metrics for this test profile configuration based on 657 public results since 16 January 2019 with the latest data as of 9 April 2021.

Additional benchmark metrics will come after OpenBenchmarking.org has collected a sufficient data-set.

OpenBenchmarking.orgDistribution Of Public Results - OpenCL Test: Single-Precision Float657 Results Range From 3 To 35205 GFLOPS37081413211828233528423349385643634870537758846391689873105781128311988126931339814103148081551316218169231762818333190381974320448211532185822563232682397324678253832608826793274982820328908296133031831023317283243333138338433454835253306090120150

Based on OpenBenchmarking.org data, the selected test / test configuration (clpeak - OpenCL Test: Single-Precision Float) has an average run-time of 2 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkOpenCL Test: Single-Precision FloatRun-Time246810Min: 1 / Avg: 1 / Max: 2

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsOpenCL Test: Single-Precision FloatDeviation3691215Min: 0 / Avg: 1.98 / Max: 8

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
MOVDQU PUNPCKLQDQ MOVDQA CVTDQ2PS CVTDQ2PD PSHUFD CVTSI2SD CVTSS2SD MOVD
Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VINSERTF128 VEXTRACTF128 VPERM2F128
Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

 
VEXTRACTI128 VPERMQ VINSERTI128 VPBROADCASTD VPBROADCASTQ
The test / benchmark does honor compiler flag changes.
Last automated analysis: 30 January 2021

This test profile binary relies on the shared libraries libOpenCL.so.1, libm.so.6, libc.so.6, libdl.so.2.

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 7 Benchmark Results

ARMv8 Cortex-A57 - NVIDIA Jetson Nano Developer Kit - 4096MB

Ubuntu 18.04 - 4.9.201-tegra - Unity 7.5.0

1 System - 7 Benchmark Results

Rockchip ARMv8 Cortex-A72 - FriendlyElec NanoPi M4 - 2048MB

Ubuntu 20.04 - 4.19.111 - X Server 1.20.8

3 Systems - 99 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.8.0-48-generic - GNOME Shell 3.36.7

1 System - 1 Benchmark Result

Intel Xeon E5-2686 v4 - Xen HVM domU - Intel 440FX 82441FX PMC

Ubuntu 20.04 - 5.4.0-1041-aws - GNOME Shell 3.36.7

3 Systems - 100 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.8.0-48-generic - GNOME Shell 3.36.7

1 System - 1 Benchmark Result

Intel Xeon E5-2686 v4 - Xen HVM domU - Intel 440FX 82441FX PMC

Ubuntu 20.04 - 5.4.0-1041-aws - X Server 1.20.9

1 System - 1 Benchmark Result

Intel Xeon E5-2686 v4 - Xen HVM domU - Intel 440FX 82441FX PMC

Ubuntu 20.04 - 5.4.0-1041-aws - GNOME Shell 3.36.7

1 System - 100 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.8.0-48-generic - GNOME Shell 3.36.7

4 Systems - 138 Benchmark Results

Intel Core i9-11900K - ASUS ROG MAXIMUS XIII HERO - Intel Tiger Lake-H

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

1 System - 20 Benchmark Results

Intel Core i7-5775C - ASUS Z97I-PLUS - Intel Broadwell-U DMI

Ubuntu 20.04 - 5.4.0-67-generic - OpenCL 2.0 beignet 1.3

1 System - 34 Benchmark Results

2 x Intel Xeon E5-2640 v4 - Supermicro X10DRG-O+-CPU v1.00 - Intel Xeon E7 v4

Ubuntu 18.04 - 5.4.0-67-generic - X Server

3 Systems - 33 Benchmark Results

Intel Core i7-4700MQ - HP 1909 - Intel Xeon E3-1200 v3

Ubuntu 20.04 - 5.6.0-1042-oem - GNOME Shell 3.36.4

Most Popular Test Results

OpenBenchmarking.org Results Compare

15 Systems - 128 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 4.20.3-042003-generic - GNOME Shell 3.30.1

14 Systems - 33 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 5.0.0-999-generic - GNOME Shell 3.30.1

2 Systems - 29 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

Ubuntu 18.04 - 4.18.0-18-generic - GNOME Shell 3.28.3

3 Systems - 75 Benchmark Results

Intel Core i9-10980XE - ASRock X299 Steel Legend - Intel Sky Lake-E DMI3 Registers

Ubuntu 20.04 - 5.4.0-48-generic - GNOME Shell 3.36.3

13 Systems - 23 Benchmark Results

AMD Ryzen 9 3950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.4.0-42-generic - GNOME Shell 3.36.4

1 System - 132 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship

Ubuntu 20.04 - 5.4.0-48-generic - GNOME Shell 3.36.4

14 Systems - 124 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 5.0.0-050000rc4-generic - GNOME Shell 3.30.1

8 Systems - 478 Benchmark Results

Intel Core i9-7980XE - ASRock X299 Taichi XE - Intel Sky Lake-E DMI3 Registers

Gentoo - 5.0.17-gentoo - X Server 1.20.5

Find More Test Results