cl-mem

A basic OpenCL memory benchmark.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark cl-mem.

Project Site

github.com

Test Created

13 January 2017

Last Updated

27 January 2017

Test Maintainer

Michael Larabel 

Test Type

Graphics

Average Install Time

1 Second

Average Run Time

2 Minutes, 56 Seconds

Test Dependencies

C/C++ Compiler Toolchain + OpenCL

Accolades

70k+ Downloads

Supported Platforms


Public Result UploadsReported Installs*Test Completions*OpenBenchmarking.orgEventscl-mem Popularity Statisticspts/cl-mem2017.012017.032017.052017.072017.092017.112018.012018.032018.052018.072018.092018.112019.012019.032019.052019.072019.092019.112020.012020.032020.052020.072020.092020.112021.012021.036001200180024003000
* Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data current as of Fri, 05 Mar 2021 22:39:47 GMT.
Copy33.6%Read33.2%Write33.2%Benchmark Option PopularityOpenBenchmarking.org

Revision History

pts/cl-mem-1.0.1   [View Source]   Fri, 27 Jan 2017 10:11:26 GMT
Fix re-installation process by ensuring dirs removed

pts/cl-mem-1.0.0   [View Source]   Fri, 13 Jan 2017 16:57:13 GMT
Basic OpenCL memory benchmark addition.

Suites Using This Test

OpenCL

NVIDIA GPU Compute


Performance Metrics

Analyze Test Configuration:

cl-mem 2017-01-13

Benchmark: Copy

OpenBenchmarking.org metrics for this test profile configuration based on 2,213 public results since 13 January 2017 with the latest data as of 17 February 2021.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Matching Public Results
GB/s (Average)
100th
14
348 +/- 13
94th
69
313
Mid-Tier
75th
< 282
62nd
5
214 +/- 10
Median
50th
190
47th
3
180 +/- 7
47th
4
177 +/- 5
42nd
14
144 +/- 1
27th
4
115 +/- 11
Low-Tier
25th
< 111
19th
4
79 +/- 2
OpenBenchmarking.orgDistribution Of Public Results - Benchmark: Copy2179 Results Range From 1 To 15183449 GB/s1303670607339911008121467715183461822015212568424293532733022303669133403603644029394769842513674555036485870551623745466043576971260733816377050668071969843887288057759172678953958199064850273388064029110071941374097174091002107810324747106284161093208511235754115394231184309212146761124504301275409913057768133614371366510613968775142724441457611314879782151834515001000150020002500

Based on OpenBenchmarking.org data, the selected test / test configuration (cl-mem 2017-01-13 - Benchmark: Copy) has an average run-time of 2 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkBenchmark: CopyRun-Time3691215Min: 1 / Avg: 1.09 / Max: 7

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.1%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsBenchmark: CopyDeviation3691215Min: 0 / Avg: 0.11 / Max: 9

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
CVTSI2SD MOVAPD DIVSD ADDSD
The test / benchmark does honor compiler flag changes.
Last automated analysis: 30 January 2021

This test profile binary relies on the shared libraries libOpenCL.so.1, libc.so.6, libdl.so.2.

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 120 Benchmark Results

2 x AMD EPYC 7V12 64-Core - Microsoft Virtual Machine - 434GB

Ubuntu 20.04 - 5.4.0-1039-azure - GNOME Shell 3.36.4

1 System - 74 Benchmark Results

2 x AMD EPYC 7V12 64-Core - Microsoft Virtual Machine - 434GB

Ubuntu 20.04 - 5.4.0-1039-azure - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-64 - 236GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-64 - 236GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 119 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-32 - 118GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-8 - 30GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-16 - 60GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-4 - 16GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-4 - 16GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 124 Benchmark Results

Intel Xeon E5-2690 v4 - Microsoft Virtual Machine v7.0 - Intel 440BX

Ubuntu 20.04 - 5.4.0-1039-azure - X Server 1.20.9

1 System - 118 Benchmark Results

Intel Xeon - Google Compute Engine n1-standard-8 - 30GB

Ubuntu 20.04 - 5.4.0-1036-gcp - GNOME Shell 3.36.4

1 System - 120 Benchmark Results

2 x Intel Xeon Platinum 8168 - Microsoft Virtual Machine - 662GB

Ubuntu 20.04 - 5.4.0-1039-azure - GNOME Shell 3.36.4

Most Popular Test Results

OpenBenchmarking.org Results Compare

1 System - 1004 Benchmark Results

SiFive RISC-V - FriendlyElec NanoPC-T4 - Rockchip RK3399

Ubuntu 18.04 - 4.4.138 - LXDE 0.9.3

17 Systems - 50 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.04 - 4.20.0-042000-generic - GNOME Shell 3.28.3

15 Systems - 47 Benchmark Results

Intel Core i7-7740K - ASUS PRIME X299-A - Intel Device 591f

Ubuntu 16.04 - 4.9.0-kfd-compute-rocm-rel-1.6-77 - Unity 7.4.0

15 Systems - 128 Benchmark Results

Intel Core i9-9900K - ASUS PRIME Z390-A - Intel Cannon Lake PCH Shared SRAM

Ubuntu 18.10 - 4.20.3-042003-generic - GNOME Shell 3.30.1

13 Systems - 14 Benchmark Results

Intel Core i7-7700K - MSI Z270-A PRO - Intel Device 591f + Z270

Ubuntu 17.04 - 4.8.0-040800-generic - modesetting 1.19.3

17 Systems - 48 Benchmark Results

Intel Core i7-7740K - ASUS PRIME X299-A - Intel Device 591f

Ubuntu 16.04 - 4.13.0-999-generic - Unity 7.4.0

3 Systems - 75 Benchmark Results

Intel Core i9-10980XE - ASRock X299 Steel Legend - Intel Sky Lake-E DMI3 Registers

Ubuntu 20.04 - 5.4.0-48-generic - GNOME Shell 3.36.3

16 Systems - 50 Benchmark Results

Intel Core i7-7740K - ASUS PRIME X299-A - Intel Device 591f

Ubuntu 16.04 - 4.9.0-kfd-compute-rocm-rel-1.6-77 - Unity 7.4.0

Find More Test Results