vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark vkpeak.

Project Site

github.com

Source Repository

github.com

Test Created

24 April 2021

Last Updated

1 August 2023

Test Maintainer

Michael Larabel 

Test Type

Graphics

Average Install Time

3 Seconds

Average Run Time

11 Minutes, 9 Seconds

Test Dependencies

Vulkan

Accolades

20k+ Downloads

Supported Platforms

Supported Architectures

x86_64

Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page ViewsOpenBenchmarking.orgEventsvkpeak Popularity Statisticspts/vkpeak2021.042021.062021.082021.102021.122022.022022.042022.062022.082022.102022.122023.022023.042023.062023.082023.102023.122024.022024.042024.062024.082024.102K4K6K8K10K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 19 November 2024.

Revision History

pts/vkpeak-1.1.0   [View Source]   Tue, 01 Aug 2023 11:42:09 GMT
Update against latest upstream.

pts/vkpeak-1.0.2   [View Source]   Sat, 01 May 2021 06:41:01 GMT
Update per https://github.com/phoronix-test-suite/test-profiles/pull/194

pts/vkpeak-1.0.1   [View Source]   Sat, 24 Apr 2021 11:55:54 GMT
Update hashes, add macOS support while at it.

pts/vkpeak-1.0.0   [View Source]   Sat, 24 Apr 2021 11:34:45 GMT
Initial commit of vkpeak test profile, per https://github.com/phoronix-test-suite/test-profiles/issues/192

Suites Using This Test

NVIDIA GPU Compute

Vulkan Compute


Performance Metrics

Analyze Test Configuration:

vkpeak 20230730

fp32-scalar

OpenBenchmarking.org metrics for this test profile configuration based on 317 public results since 1 August 2023 with the latest data as of 19 September 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
GFLOPS (Average)
100th
10
48481 +/- 324
98th
11
45183 +/- 1104
94th
3
25628 +/- 14
93rd
3
25498 +/- 168
89th
6
21600 +/- 603
87th
10
20706 +/- 388
80th
8
16637 +/- 41
Mid-Tier
75th
< 13990
72nd
7
12987 +/- 193
69th
5
12259 +/- 60
66th
7
11521 +/- 4
61st
3
8261 +/- 106
60th
3
7853 +/- 15
Median
50th
4281
48th
3
3287 +/- 19
44th
14
3149 +/- 456
44th
4
2991 +/- 209
36th
3
1639 +/- 1
34th
3
1313 +/- 16
31st
6
1080 +/- 159
Low-Tier
25th
< 567
8th
4
192
4th
5
114 +/- 1
OpenBenchmarking.orgDistribution Of Public Results - fp32-scalar317 Results Range From 20 To 48941 GFLOPS209991978295739364915589468737852883198101078911768127471372614705156841666317642186211960020579215582253723516244952547426453274322841129390303693134832327333063428535264362433722238201391804015941138421174309644075450544603347012479914897020406080100

Based on OpenBenchmarking.org data, the selected test / test configuration (vkpeak 20230730 - fp32-scalar) has an average run-time of 10 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete Benchmarkfp32-scalarRun-Time510152025Min: 3 / Avg: 10.01 / Max: 20

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between Runsfp32-scalarDeviation246810Min: 0 / Avg: 0.2 / Max: 3

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
SSE2 (SSE2)
Used by default on supported hardware.
 
MOVDQA MOVDQU MOVAPD SUBSD COMISD MULSD ADDSD DIVSD MAXSD CVTSI2SD PUNPCKLQDQ PSRLDQ MOVD CVTTSD2SI PSHUFD SQRTSD PADDQ PSUBQ SHUFPD CVTSD2SS ANDPD UCOMISD CVTSS2SD MOVUPD MOVHPD MULPD UNPCKHPD CMPNLESD ANDNPD ORPD XORPD PUNPCKHQDQ MINSD SUBPD
Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VZEROUPPER VINSERTF128 VEXTRACTF128 VPERM2F128
Advanced Vector Extensions 512 (AVX512)
Used by default on supported hardware.
 
(ZMM REGISTER USE)
Last automated analysis: 18 September 2023

This test profile binary relies on the shared libraries libvulkan.so.1, libpthread.so.0, libm.so.6, libc.so.6.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
ARMv8 64-bit
arm64
Apple M1

Recent Test Results

OpenBenchmarking.org Results Compare

9 Systems - 10 Benchmark Results

Intel Core i5-4300M - Dell 0VWNW8 - Intel Xeon E3-1200 v3

cachyos rolling - 6.7.6-1-cachyos-rt-bore-lto - KDE Plasma 5.27.10

8 Systems - 10 Benchmark Results

Intel Core i5-4300M - Dell 0VWNW8 - Intel Xeon E3-1200 v3

cachyos rolling - 6.7.6-1-cachyos-rt-bore-lto - KDE Plasma 5.27.10

1 System - 4 Benchmark Results

AMD Ryzen 9 7945HX - Alienware 0DWD2H - AMD Device 14d8

cachyos rolling - 6.10.10-2-cachyos-lto - GNOME Shell 46.5

7 Systems - 10 Benchmark Results

Intel Core i5-4300M - Dell 0VWNW8 - Intel Xeon E3-1200 v3

cachyos rolling - 6.7.6-1-cachyos-rt-bore-lto - KDE Plasma 5.27.10

6 Systems - 10 Benchmark Results

AMD E2-3800 APU - TOSHIBA Portable PC - AMD 16h Root Complex

cachyos rolling - 6.7.1-1-cachyos - KDE Plasma 5.27.10

1 System - 116 Benchmark Results

AMD Ryzen 7 3700X 8-Core - ASUS TUF GAMING X570-PLUS - AMD Starship

RockyLinux 9.4 - 6.1.102-1.el9.elrepo.x86_64 - KDE Plasma 5.27.11

1 System - 10 Benchmark Results

AMD Ryzen 7 7840U - Framework Laptop 13 - AMD Device 14e8

Ubuntu 24.04 - 6.8.0-35-generic - KDE Plasma 5.27.11

2 Systems - 20 Benchmark Results

AMD PRO A6-8570E R5 6 COMPUTE CORES 2C+4G - LENOVO 30FD - 1 x 4096 MB 2400MHz Samsung M471A5244CB0-CRC

Microsoft Windows 10 Pro Build 19045 - 10.0.19045.4291 - 27.20.1034.6

Find More Test Results