clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark clpeak.

Project Site

github.com

Test Created

16 January 2019

Last Updated

9 February 2019

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

6 Seconds

Average Run Time

17 Minutes, 7 Seconds

Test Dependencies

C/C++ Compiler Toolchain + OpenCL + CMake

Accolades

20k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsclpeak Popularity Statisticspts/clpeak2019.012019.032019.052019.072019.092019.112020.012020.032020.052020.072020.092020.112021.012021.032021.052021.072021.092021.112022.012022.032022.052022.072022.092022.1111002200330044005500
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 26 November 2022.
Transfer Bandwidth enqueueWriteBuffer8.8%Integer Compute INT23.4%Single-Precision Float25.3%Transfer Bandwidth enqueueReadBuffer8.8%Double-Precision Double23.1%Kernel Latency10.6%OpenCL Test Option PopularityOpenBenchmarking.org

Revision History

pts/clpeak-1.0.1   [View Source]   Sat, 09 Feb 2019 10:28:08 GMT
Always select first platform, per https://github.com/phoronix-test-suite/test-profiles/issues/56

pts/clpeak-1.0.0   [View Source]   Wed, 16 Jan 2019 05:52:26 GMT
Commit of a pts/clpeak test profile that builds clpeak itself rather than system/clpeak relying upon system supplied binary.

Suites Using This Test

OpenCL

NVIDIA GPU Compute


Performance Metrics

Analyze Test Configuration:

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.org metrics for this test profile configuration based on 698 public results since 16 January 2019 with the latest data as of 30 October 2022.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
GFLOPS (Average)
75th
6
13513 +/- 229
72nd
3
12932 +/- 3
70th
3
12546 +/- 142
67th
3
11315 +/- 13
66th
18
10737 +/- 244
Median
50th
8219
40th
3
6191 +/- 299
37th
3
5897 +/- 41
Low-Tier
25th
< 4544
18th
9
3675 +/- 144
16th
3
3512 +/- 14
13th
3
1990 +/- 7
11th
5
1487 +/- 13
8th
5
1066 +/- 12
8th
8
1039 +/- 5
OpenBenchmarking.orgDistribution Of Public Results - OpenCL Test: Single-Precision Float698 Results Range From 2 To 81212 GFLOPS2162732524877650281279752113771300214627162521787719502211272275224377260022762729252308773250234127357523737739002406274225243877455024712748752503775200253627552525687758502601276175263377650026662768252698777150273127747527637778002796278125220406080100

Based on OpenBenchmarking.org data, the selected test / test configuration (clpeak - OpenCL Test: Single-Precision Float) has an average run-time of 2 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkOpenCL Test: Single-Precision FloatRun-Time246810Min: 1 / Avg: 1.01 / Max: 3

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1.9%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsOpenCL Test: Single-Precision FloatDeviation3691215Min: 0 / Avg: 1.87 / Max: 9

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
MOVDQU PUNPCKLQDQ MOVDQA CVTDQ2PS CVTDQ2PD PSHUFD CVTSI2SD CVTSS2SD MOVD
Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VINSERTF128 VEXTRACTF128 VPERM2F128
Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

 
VEXTRACTI128 VPERMQ VINSERTI128 VPBROADCASTD VPBROADCASTQ
Advanced Vector Extensions 512 (AVX512)
Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).
 
(ZMM REGISTER USE)
The test / benchmark does honor compiler flag changes.
Last automated analysis: 17 January 2022

This test profile binary relies on the shared libraries libOpenCL.so.1, libc.so.6, libm.so.6.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
ARMv8 64-bit
aarch64
ARMv8 Cortex-A53 4-Core, ARMv8 Cortex-A57 4-Core, ARMv8 Cortex-A73 6-Core, Rockchip ARMv8 Cortex-A72 6-Core

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 107 Benchmark Results

AMD Ryzen 7 5700X 8-Core - MSI PRO B550M-VC WIFI - AMD Starship

Ubuntu 22.04 - 5.15.0-52-generic - KDE Plasma 5.24.6

1 System - 107 Benchmark Results

AMD Ryzen 7 5700X 8-Core - MSI PRO B550M-VC WIFI - AMD Starship

Ubuntu 22.04 - 5.15.0-52-generic - KDE Plasma 5.24.6

1 System - 107 Benchmark Results

AMD Ryzen 7 5700X 8-Core - MSI PRO B550M-VC WIFI - AMD Starship

Ubuntu 22.04 - 5.15.0-52-generic - KDE Plasma 5.24.6

1 System - 42 Benchmark Results

AMD Ryzen 9 6900HS - ASUS GA402RJ v1.0 - AMD Device 14b5

openSUSE 20221027 - 6.0.3-1-default - KDE Plasma

1 System - 124 Benchmark Results

Intel Core i7-9700K - MSI MPG Z390 GAMING PRO CARBON AC - Intel Cannon Lake PCH

Ubuntu 20.04 - 5.4.0-131-generic - KDE Plasma 5.18.8

4 Systems - 177 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS TUF GAMING X670E-PLUS WIFI - AMD Device 14d8

Ubuntu 22.04 - 5.15.0-25-generic - GNOME Shell 42.4

4 Systems - 177 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 21.10 - 5.13.0-22-generic - GNOME Shell 40.5

4 Systems - 177 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 21.10 - 5.13.0-22-generic - GNOME Shell 40.5

1 System - 7 Benchmark Results

Unknown - D2000 Development Board - 8GB

Ubuntu 20.04 - 4.19.115-g349e93465 - GNOME Shell 3.36.1

1 System - 37 Benchmark Results

Intel Core i3-3217U - Intel D33217GKE - Intel 3rd Gen Core DRAM

EndeavourOS rolling - 5.19.4-lqx1-2-lqx - GNOME Shell 42.4

1 System - 21 Benchmark Results

Intel Core i3-3217U - Intel D33217GKE - Intel 3rd Gen Core DRAM

EndeavourOS rolling - 5.19.4-lqx1-2-lqx - GNOME Shell 42.4

1 System - 7 Benchmark Results

Intel Core 2 Duo E4600 - Dell 0RK936 - Intel 82G33

Arch Linux - 5.19.2-arch1-1 - 4.5 Mesa 22.1.7

8 Systems - 236 Benchmark Results

AMD Ryzen 7 5800X3D 8-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 22.04 - 5.18.0-051800-generic - GNOME Shell 42.2

7 Systems - 241 Benchmark Results

Intel Core i9-12900K - ASUS ROG STRIX Z690-E GAMING WIFI - Intel Alder Lake-S PCH

Ubuntu 22.04 - 5.18.0-051800-generic - GNOME Shell 42.2

6 Systems - 41 Benchmark Results

Intel Atom E3815 - Intel DE3815TYKH - Intel Atom Z36xxx

ManjaroLinux 21.3.6 - 5.19.0-xanmod1-1-x64v2 - KDE Plasma 5.24.6

Most Popular Test Results

OpenBenchmarking.org Results Compare

2 Systems - 29 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

Ubuntu 18.04 - 4.18.0-18-generic - GNOME Shell 3.28.3

14 Systems - 33 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 5.0.0-999-generic - GNOME Shell 3.30.1

15 Systems - 128 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 4.20.3-042003-generic - GNOME Shell 3.30.1

3 Systems - 253 Benchmark Results

Intel Core i9-10885H - HP 8736 - Intel Comet Lake PCH

Ubuntu 20.04 - 5.6.0-1034-oem - GNOME Shell 3.36.4

13 Systems - 23 Benchmark Results

AMD Ryzen 9 3950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.4.0-42-generic - GNOME Shell 3.36.4

4 Systems - 138 Benchmark Results

Intel Core i5-10600K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

3 Systems - 100 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.8.0-48-generic - GNOME Shell 3.36.7

3 Systems - 175 Benchmark Results

Intel Core i7-8565U - Dell 0KTW76 - Intel Cannon Point-LP

Ubuntu 20.10 - 5.9.1-050901-generic - GNOME Shell 3.38.1

8 Systems - 478 Benchmark Results

Intel Core i9-7980XE - ASRock X299 Taichi XE - Intel Sky Lake-E DMI3 Registers

Gentoo - 5.0.17-gentoo - X Server 1.20.5

2 Systems - 190 Benchmark Results

Intel Core i7-1065G7 - Dell 06CDVY - Intel Device 34ef

Ubuntu 20.10 - 5.9.1-050901-generic - GNOME Shell 3.38.1

Find More Test Results