AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms. It relies on the TensorFlow machine learning library.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark ai-benchmark.
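For scripted runs, the command above can be wrapped from Python. The following is a minimal sketch, not part of the test profile: the helper names `build_command` and `run_benchmark` are hypothetical, and only the `phoronix-test-suite benchmark ai-benchmark` command itself comes from this page.

```python
import shutil
import subprocess

PTS_BINARY = "phoronix-test-suite"  # executable name taken from the command above
TEST_PROFILE = "ai-benchmark"       # test profile name taken from the command above


def build_command(profile=TEST_PROFILE):
    """Return the argument list for `phoronix-test-suite benchmark <profile>`."""
    return [PTS_BINARY, "benchmark", profile]


def run_benchmark(profile=TEST_PROFILE):
    """Run the benchmark and return the process exit code.

    Raises FileNotFoundError if the Phoronix Test Suite is not on PATH.
    """
    if shutil.which(PTS_BINARY) is None:
        raise FileNotFoundError(f"{PTS_BINARY} not found on PATH")
    # Output is not captured, since the Phoronix Test Suite may prompt on the terminal.
    return subprocess.run(build_command(profile)).returncode
```

Calling `run_benchmark()` starts the same run as typing the command in a shell; `build_command()` alone is useful for logging or for passing the command to a job scheduler.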

Project Site

ai-benchmark.com

Test Created

8 July 2020

Last Updated

19 July 2022

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

1 Minute, 1 Second

Average Run Time

21 Minutes, 23 Seconds

Test Dependencies

Python

Accolades

50k+ Downloads

Supported Platforms


[Chart: AI Benchmark Alpha Popularity Statistics (pts/ai-benchmark), OpenBenchmarking.org events from 2020.07 to 2024.05. Series: Public Result Uploads *, Reported Installs **, Reported Test Completions **, Test Profile Page Views ***.]
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 23 June 2024.

Revision History

pts/ai-benchmark-1.0.2   [View Source]   Tue, 19 Jul 2022 13:38:31 GMT
Update tensorflow pip dependency so it works on newer distributions.

pts/ai-benchmark-1.0.1   [View Source]   Wed, 11 Nov 2020 18:43:35 GMT
Fix for macOS support.

pts/ai-benchmark-1.0.0   [View Source]   Wed, 08 Jul 2020 14:25:53 GMT
Initial commit of AI Benchmark Alpha.

Suites Using This Test

Machine Learning

HPC - High Performance Computing


Performance Metrics

Analyze Test Configuration:

AI Benchmark Alpha 0.1.2

Device Inference Score

OpenBenchmarking.org metrics for this test profile configuration based on 1,176 public results since 8 July 2020 with the latest data as of 21 June 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that OS configurations can differ vastly, particularly in the Linux/open-source space, so this overview is intended to offer only general guidance on performance expectations.

Component  Percentile Rank  # Compatible Public Results  Score (Average)
           97th             10                           3209 +/- 213
           95th             6                            2434 +/- 39
           93rd             3                            2309 +/- 18
           92nd             11                           2274 +/- 139
           91st             3                            2221 +/- 145
           90th             3                            2139 +/- 247
           90th             5                            2138 +/- 260
           83rd             8                            2002 +/- 102
           83rd             9                            1989 +/- 102
           82nd             6                            1973 +/- 104
           81st             8                            1961 +/- 73
           80th             3                            1950 +/- 2
           80th             25                           1946 +/- 90
           80th             10                           1944 +/- 91
           79th             10                           1935 +/- 95
           79th             14                           1933 +/- 155
           76th             16                           1908 +/- 70
Mid-Tier   75th                                          < 1906
           75th             7                            1898 +/- 39
           74th             8                            1889 +/- 128
           71st             31                           1846 +/- 98
           71st             9                            1845 +/- 43
           70th             10                           1842 +/- 102
           70th             4                            1835 +/- 265
           68th             3                            1818 +/- 245
           64th             12                           1715 +/- 60
           63rd             11                           1667 +/- 121
           62nd             8                            1659 +/- 164
           59th             3                            1608 +/- 65
           57th             10                           1576 +/- 78
           57th             9                            1575 +/- 3
           56th             19                           1549 +/- 93
           54th             16                           1503 +/- 170
           54th             3                            1488 +/- 64
           53rd             8                            1466 +/- 60
Median     50th                                          1436
           50th             10                           1420 +/- 55
           48th             5                            1376 +/- 125
           47th             14                           1373 +/- 43
           47th             5                            1366 +/- 43
           46th             11                           1358 +/- 111
           46th             13                           1343 +/- 185
           45th             6                            1320 +/- 57
           44th             19                           1299 +/- 56
           43rd             8                            1295 +/- 7
           42nd             6                            1270 +/- 25
           40th             12                           1238 +/- 56
           38th             15                           1167 +/- 14
           35th             10                           1116 +/- 40
           34th             6                            1104 +/- 31
           34th             6                            1103 +/- 80
           34th             13                           1099 +/- 5
           33rd             4                            1094 +/- 85
           33rd             4                            1084 +/- 4
           33rd             3                            1081 +/- 10
           31st             10                           1064 +/- 46
           31st             6                            1046 +/- 26
           29th             4                            1022 +/- 2
           28th             3                            1008 +/- 32
           28th             5                            1001 +/- 59
           27th             4                            956 +/- 1
           26th             7                            951 +/- 11
Low-Tier   25th                                          < 942
           25th             4                            941 +/- 52
           24th             3                            889 +/- 4
           24th             4                            888 +/- 6
           24th             3                            878 +/- 14
           24th             3                            865 +/- 4
           23rd             3                            834 +/- 4
           22nd             5                            812 +/- 13
           20th             10                           760 +/- 21
           19th             6                            743 +/- 2
           19th             8                            740 +/- 11
           18th             4                            733 +/- 5
           18th             3                            730 +/- 42
           18th             3                            730 +/- 24
           18th             5                            729 +/- 5
           17th             6                            721 +/- 10
           16th             3                            702 +/- 2
           14th             23                           670 +/- 29
           14th             3                            668 +/- 4
           14th             9                            665 +/- 2
           13th             6                            662 +/- 2
           12th             4                            644 +/- 8
           10th             4                            609 +/- 43
           10th             17                           595 +/- 56
           10th             3                            581 +/- 3
           9th              4                            574 +/- 5
           7th              4                            510 +/- 2
           5th              15                           459 +/- 27
           4th              4                            406 +/- 10
           3rd              5                            251 +/- 19
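
As a rough illustration of how a percentile rank like those in the overview can be read, the sketch below ranks a score by the share of results that scored lower. This definition, the function name `percentile_rank` and the sample scores are assumptions for illustration; the exact method used by OpenBenchmarking.org is not stated on this page.

```python
from bisect import bisect_left


def percentile_rank(score, public_scores):
    """Return the percentage of public_scores that are strictly below score.

    Assumed definition for illustration only.
    """
    ordered = sorted(public_scores)
    if not ordered:
        raise ValueError("public_scores must not be empty")
    below = bisect_left(ordered, score)  # count of entries strictly less than score
    return 100.0 * below / len(ordered)


# Made-up sample scores, not taken from the public results.
sample_scores = [500, 1000, 1500, 2000]
rank = percentile_rank(1500, sample_scores)  # two of four scores are lower
```

Under this definition a higher Device Inference Score always yields a higher rank, which matches the ordering of the overview.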
[Chart: Distribution Of Public Results - Device Inference Score. 1176 results range from a score of 46 to 30395.]

Based on OpenBenchmarking.org data, the selected test / test configuration (AI Benchmark Alpha 0.1.2 - Device Inference Score) has an average run-time of 21 minutes. By default this test profile is set to run at least 1 time, but the run count may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.
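
The idea of adding runs while the results are still too noisy can be sketched as follows. This is an illustrative sketch only, not the Phoronix Test Suite's implementation: the function name `run_until_stable`, the relative-deviation threshold of 5% and the cap of 6 runs are all assumptions.

```python
import statistics


def run_until_stable(run_once, min_runs=1, max_runs=6, max_rel_stdev=0.05):
    """Call run_once() repeatedly and return the list of collected scores.

    Stops once at least min_runs scores exist and their relative sample
    standard deviation is within max_rel_stdev, or when max_runs is reached.
    All thresholds here are hypothetical defaults.
    """
    scores = []
    while len(scores) < max_runs:
        scores.append(run_once())
        if len(scores) < min_runs:
            continue  # minimum run count not reached yet
        if len(scores) < 2:
            break  # a single score has no deviation to check
        mean = statistics.mean(scores)
        rel_stdev = statistics.stdev(scores) / mean if mean else float("inf")
        if rel_stdev <= max_rel_stdev:
            break  # results agree closely enough
    return scores
```

With `min_runs=1` a single run is accepted as-is, because one sample gives no deviation to compare. With `min_runs=2` or more, noisy scores trigger further runs up to `max_runs`.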

[Chart: Time Required To Complete Benchmark - Device Inference Score run-time in minutes. Min: 6 / Avg: 20.06 / Max: 48]

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test and its settings generally scale well with increasing CPU core counts. The data is based on publicly available results for this test and its settings: results are separated by vendor, divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU that has a sufficient number of test samples and statistically significant data.
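
The normalization steps described above can be sketched in a few lines. This is an assumption-laden illustration rather than the actual OpenBenchmarking.org analysis: the function name `relative_core_scaling`, the tuple layout and the sample data are hypothetical, and the filtering for sufficient sample counts and statistical significance is omitted.

```python
from collections import defaultdict
from statistics import mean


def relative_core_scaling(results):
    """Compute per-vendor scaling relative to the smallest core count.

    results: iterable of (vendor, physical_cores, reference_clock_ghz, score).
    Returns {vendor: {cores: relative_scaling}}.
    """
    # Divide each result by the reference clock and group by vendor and core count.
    per_group = defaultdict(list)
    for vendor, cores, clock_ghz, score in results:
        per_group[(vendor, cores)].append(score / clock_ghz)

    # Average the clock-normalized scores within each group.
    by_vendor = defaultdict(dict)
    for (vendor, cores), values in per_group.items():
        by_vendor[vendor][cores] = mean(values)

    # Normalize against the smallest core count seen for each vendor.
    scaling = {}
    for vendor, per_core in by_vendor.items():
        base = per_core[min(per_core)]
        scaling[vendor] = {c: v / base for c, v in sorted(per_core.items())}
    return scaling


# Made-up sample data, not taken from the public results.
sample = [
    ("AMD", 4, 4.0, 800.0),
    ("AMD", 8, 4.0, 1600.0),
    ("AMD", 8, 2.0, 800.0),
    ("Intel", 4, 2.0, 500.0),
    ("Intel", 16, 2.5, 1875.0),
]
scaling = relative_core_scaling(sample)
```

Each vendor's smallest core count maps to 1.0, so the remaining values read directly as a multiple of that baseline.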

[Chart: AI Benchmark Alpha CPU Core Scaling - Device Inference Score. Relative core scaling to base for AMD and Intel across core counts from 4 to 128.]

Tested CPU Architectures

This benchmark has been successfully tested on the architectures listed below. The listed CPU architectures are those for which successful OpenBenchmarking.org result uploads occurred, which helps determine whether a given test is compatible with various alternative CPU architectures.

CPU Architecture        Kernel Identifier  Verified On
Intel / AMD x86 64-bit  x86_64             (Many Processors)
ARMv8 64-bit            arm64              Apple M1
ARMv8 64-bit            aarch64            ARMv8 9-Core, ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 80-Core, ARMv8 Neoverse-V1

Recent Test Results

1 System - 3 Benchmark Results

AMD Ryzen 9 5950X 16-Core - ASRock APEXX A3_02 X570 Creator - AMD Starship

Debian 12 - 6.1.0-21-amd64 - Xfce

2 Systems - 47 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Ubuntu 22.04 - 6.5.0-1020-aws - GCC 11.4.0

1 System - 5 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Ubuntu 22.04 - 6.5.0-1020-aws - GCC 11.4.0

1 System - 3 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Ubuntu 22.04 - 6.5.0-1020-aws - 1.3.255

1 System - 3 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Ubuntu 22.04 - 6.5.0-1017-aws - GCC 11.4.0

1 System - 3 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Ubuntu 22.04 - 6.5.0-1017-aws - GCC 11.4.0

1 System - 6 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Amazon Linux 2023.4.20240611 - 6.1.92-99.174.amzn2023.x86_64 - GCC 11.4.1 20230605

1 System - 6 Benchmark Results

Intel Xeon Platinum 8375C - Amazon EC2 c6i.4xlarge - Intel 440FX 82441FX PMC

Amazon Linux 2023.4.20240611 - 6.1.92-99.174.amzn2023.x86_64 - GCC 11.4.1 20230605

1 System - 56 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASUS ROG STRIX X370-F GAMING - AMD Starship

Linuxmint 21.3 - 6.5.0-41-lowlatency - X Server 1.21.1.4

1 System - 3 Benchmark Results

Intel Core i7-10700K - MSI Z490-A PRO - Intel Comet Lake PCH

Linuxmint 21.2 - 6.5.0-14-generic - Xfce 4.18

1 System - 3 Benchmark Results

Intel Core i7-3630QM - MSI MS-1762 - Intel 3rd Gen Core DRAM

Linuxmint 21.3 - 5.15.0-101-generic - Cinnamon 6.0.4

1 System - 3 Benchmark Results

AMD Ryzen Threadripper PRO 7965WX 24-Cores - ASUS Pro WS WRX90E-SAGE SE - AMD Device 14a4

Ubuntu 22.04 - 6.5.0-26-generic - GNOME Shell 42.9

1 System - 341 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ProArt X670E-CREATOR WIFI - AMD Device 14d8

Pop 22.04 - 6.6.10-76060610-generic - GNOME Shell 42.5

1 System - 3 Benchmark Results

AMD Ryzen 7 7800X3D 8-Core - ASUS ROG STRIX B650E-I GAMING WIFI - AMD Device 14d8

Pop 22.04 - 6.6.6-76060606-generic - GNOME Shell 42.5
