LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark lczero.
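
For scripted runs, the command above can be wrapped in a small helper. This is only a sketch: the function names build_benchmark_command and run_benchmark are hypothetical, and it assumes the phoronix-test-suite executable is already installed and on PATH. It uses only the benchmark subcommand quoted above.

```python
import shutil
import subprocess


def build_benchmark_command(profile="lczero"):
    """Return the argv list for 'phoronix-test-suite benchmark <profile>'."""
    return ["phoronix-test-suite", "benchmark", profile]


def run_benchmark(profile="lczero"):
    """Run the benchmark and return the process exit code.

    Assumes phoronix-test-suite is on PATH; the suite may prompt
    interactively (for example, for the backend option to test).
    """
    if shutil.which("phoronix-test-suite") is None:
        raise FileNotFoundError("phoronix-test-suite was not found on PATH")
    completed = subprocess.run(build_benchmark_command(profile), check=False)
    return completed.returncode
```

Calling run_benchmark() starts the same run as typing the command in a terminal; nothing is executed when the module is merely imported.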

Project Site

lczero.org

Source Repository

github.com

Test Created

12 January 2019

Last Updated

25 August 2021

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 40 Seconds

Average Run Time

4 Minutes, 8 Seconds

Test Dependencies

C/C++ Compiler Toolchain + BLAS (Basic Linear Algebra Sub-Routine) + OpenCL + Meson Build System + Eigen + Zlib

Accolades

50k+ Downloads

Supported Platforms


LeelaChessZero Popularity Statistics (pts/lczero), 2019.01 to 2022.09: Public Result Uploads *, Reported Installs **, Reported Test Completions **, Test Profile Page Views ***
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 27 September 2022.
Backend Option Popularity: BLAS 47.5%, Eigen 45.1%, OpenCL 7.4%

Revision History

pts/lczero-1.6.0   [View Source]   Wed, 25 Aug 2021 14:55:01 GMT
Update against upstream LC0 v0.28 to work around build issues on newer Linux distributions.

pts/lczero-1.5.1   [View Source]   Sun, 27 Sep 2020 17:41:46 GMT
Limit max CPU threads to 64 to work around upstream issue with lc0 bailing out otherwise.

pts/lczero-1.5.0   [View Source]   Sun, 06 Sep 2020 14:18:27 GMT
Update against latest upstream along with updated network, add eigen as possible external dependency.

pts/lczero-1.4.0   [View Source]   Thu, 30 Apr 2020 09:06:47 GMT
Update against lc0 0.25, use new network as old one was removed.

pts/lczero-1.3.0   [View Source]   Fri, 10 Jan 2020 20:28:53 GMT
Update against latest lc0 v0.23.2 upstream, other test improvements.

pts/lczero-1.2.1   [View Source]   Thu, 03 Oct 2019 14:07:11 GMT
Add Windows support.

pts/lczero-1.2.0   [View Source]   Thu, 26 Sep 2019 16:49:44 GMT
Update against lczero upstream.

pts/lczero-1.1.1   [View Source]   Wed, 16 Jan 2019 05:41:40 GMT
Add zlib to external dependency list.

pts/lczero-1.1.0   [View Source]   Tue, 15 Jan 2019 10:11:47 GMT
Set threads option always for CPU testing.

pts/lczero-1.0.1   [View Source]   Sun, 13 Jan 2019 14:22:54 GMT
Allow CUDA and BLAS benchmarking back-end support.

pts/lczero-1.0.0   [View Source]   Sat, 12 Jan 2019 20:10:43 GMT
Initial commit of lc0 / lczero chess benchmark using neural networks with GPU compute.

Suites Using This Test

Chess Test Suite

Machine Learning

CPU Massive

HPC - High Performance Computing

NVIDIA GPU Compute


Performance Metrics

Analyze Test Configuration:

LeelaChessZero 0.28

Backend: BLAS

OpenBenchmarking.org metrics for this test profile configuration based on 637 public results since 25 August 2021 with the latest data as of 28 September 2022.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that, particularly in the Linux/open-source space, OS configurations can differ vastly, so this overview is intended to offer only general guidance on performance expectations.

Component  Percentile Rank  # Compatible Public Results  Nodes Per Second (Average)
           100th            17                           6126 +/- 195
           97th             6                            5321 +/- 144
           95th             5                            4457 +/- 167
           93rd             3                            4129 +/- 75
           91st             4                            3823 +/- 57
           86th             9                            2305 +/- 328
           83rd             3                            2020 +/- 25
           81st             8                            1995 +/- 102
           80th             5                            1965 +/- 46
           78th             15                           1871 +/- 101
           77th             9                            1804 +/- 78
Mid-Tier   75th                                          < 1756
           69th             4                            1607 +/- 89
           69th             4                            1527 +/- 29
           65th             5                            1346 +/- 145
           64th             12                           1342 +/- 32
           64th             9                            1315 +/- 78
           59th             3                            1212 +/- 3
           58th             5                            1190 +/- 19
           57th             7                            1147 +/- 35
           54th             4                            1090 +/- 20
           53rd             3                            1048 +/- 59
           51st             3                            1017 +/- 20
           51st             3                            1006 +/- 17
           51st             4                            1000 +/- 24
Median     50th                                          994
           50th             9                            967 +/- 72
           47th             4                            932 +/- 22
           46th             3                            908 +/- 3
           44th             3                            854 +/- 18
           42nd             3                            806 +/- 11
           42nd             3                            795 +/- 95
           41st             4                            784 +/- 12
           39th             6                            765 +/- 4
           37th             3                            741 +/- 21
           35th             5                            717 +/- 15
           32nd             3                            680 +/- 69
           29th             11                           644 +/- 47
           28th             3                            637 +/- 13
           27th             3                            616 +/- 10
Low-Tier   25th                                          < 601
           25th             4                            600 +/- 6
           22nd             4                            557 +/- 12
           17th             3                            473 +/- 9
           16th             3                            429 +/- 5
           15th             3                            414 +/- 24
           15th             3                            409 +/- 18
           15th             3                            402 +/- 9
           13th             3                            360 +/- 17
           11th             4                            285 +/- 10
           10th             4                            248 +/- 5
           7th              8                            173 +/- 18
           4th              4                            132 +/- 11
Distribution Of Public Results - Backend: BLAS: 606 results range from 1 to 6346 Nodes Per Second.
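
To make the percentile ranks above concrete, here is a minimal sketch of one common definition: the share of results at or below a given value. The exact formula OpenBenchmarking.org uses is not stated on this page, so this definition, the function name percentile_rank, and the sample values are all assumptions for illustration.

```python
def percentile_rank(value, population):
    """Percent of results in `population` that are at or below `value`.

    This is one common definition, assumed here for illustration; it is
    not necessarily the formula used by OpenBenchmarking.org.
    """
    if not population:
        raise ValueError("population must not be empty")
    at_or_below = sum(1 for result in population if result <= value)
    return 100.0 * at_or_below / len(population)


# Illustrative nodes-per-second sample (hypothetical individual results).
sample = [132, 248, 473, 600, 994, 1756, 3823, 6126]
print(percentile_rank(994, sample))   # 5 of 8 results are <= 994
print(percentile_rank(6126, sample))  # the fastest result ranks highest
```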

Based on OpenBenchmarking.org data, the selected test / test configuration (LeelaChessZero 0.28 - Backend: BLAS) has an average run-time of 28 minutes. By default this test profile is set to run at least 3 times, but the run count may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Time Required To Complete Benchmark - Backend: BLAS (run-time in minutes): Min: 12 / Avg: 27 / Max: 60

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.8%.

Average Deviation Between Runs - Backend: BLAS (percent, fewer is better): Min: 0 / Avg: 0.77 / Max: 6
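
The run-count behavior described above (at least 3 runs, more if the deviation between runs is too high) can be sketched roughly as follows. Only the minimum of 3 runs comes from this page; the 3.5% threshold, the cap of 15 runs, and the function names are assumptions chosen for illustration, not the Phoronix Test Suite's actual settings or code.

```python
import statistics


def relative_std_dev(samples):
    """Sample standard deviation as a percentage of the mean.

    Assumes a non-zero mean (true for nodes-per-second results).
    """
    return 100.0 * statistics.stdev(samples) / statistics.mean(samples)


def run_until_stable(run_once, min_runs=3, max_runs=15, threshold_pct=3.5):
    """Collect results until the deviation is acceptable.

    min_runs=3 matches the page; max_runs and threshold_pct are
    hypothetical values, not confirmed defaults.
    """
    if min_runs < 2:
        raise ValueError("at least two runs are needed to compute a deviation")
    samples = [run_once() for _ in range(min_runs)]
    # Keep adding runs while the spread is too large and the cap allows it.
    while len(samples) < max_runs and relative_std_dev(samples) > threshold_pct:
        samples.append(run_once())
    return samples
```

Here run_once stands for any callable that performs one benchmark run and returns its nodes-per-second figure. A perfectly repeatable benchmark stops after 3 runs, while a noisy one keeps running until the cap.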

Does It Scale Well With Increasing Cores?

No. Based on the automated analysis of the collected public benchmark data, this test configuration does not generally scale well with increasing CPU core counts. The analysis uses publicly available results for this test configuration, separated by vendor, with each result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU having a sufficient number of test samples and statistically significant data.

LeelaChessZero CPU Core Scaling - Backend: BLAS: relative core scaling to base for Intel and AMD, across physical core counts from 4 to 64.
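
The normalization steps described above can be sketched as below. This is a simplified illustration, not the OpenBenchmarking.org implementation: the function name core_scaling, the tuple layout, and the sample data are hypothetical, and the filtering for sufficient sample counts and statistical significance is omitted.

```python
from collections import defaultdict
from statistics import mean


def core_scaling(results):
    """Relative scaling per vendor and physical core count.

    `results` holds (vendor, physical_cores, clock_ghz, nodes_per_second)
    tuples; this layout is an assumption made for the sketch.
    """
    # Step 1: divide each result by the CPU clock, grouped by vendor and cores.
    per_group = defaultdict(list)
    for vendor, cores, clock_ghz, score in results:
        per_group[(vendor, cores)].append(score / clock_ghz)

    # Step 2: average the clock-normalized results within each group.
    by_vendor = defaultdict(dict)
    for (vendor, cores), values in per_group.items():
        by_vendor[vendor][cores] = mean(values)

    # Step 3: normalize against the smallest core count of each vendor.
    scaling = {}
    for vendor, by_cores in by_vendor.items():
        base = by_cores[min(by_cores)]
        scaling[vendor] = {c: by_cores[c] / base for c in sorted(by_cores)}
    return scaling


# Hypothetical data: doubling the cores yields only 1.5x per clock.
example = [
    ("Intel", 4, 4.0, 800.0),
    ("Intel", 8, 4.0, 1200.0),
    ("AMD", 8, 3.0, 900.0),
    ("AMD", 16, 3.0, 1350.0),
    ("AMD", 16, 3.0, 1350.0),
]
print(core_scaling(example))
```

A scaling value well below the ratio of core counts, as in this made-up data, is the kind of pattern that would be reported as not scaling well.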

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set: SSE 4.2 (SSE4_2)
Support: Used by default on supported hardware. Found on Intel processors since at least 2010. Found on AMD processors since Bulldozer (2011).
Instructions Detected: POPCNT

Instruction Set: Advanced Vector Extensions (AVX)
Support: Used by default on supported hardware. Found on Intel processors since Sandy Bridge (2011). Found on AMD processors since Bulldozer (2011).
Instructions Detected: VZEROUPPER VBROADCASTSS VEXTRACTF128 VINSERTF128 VPERM2F128 VPERMILPS

Instruction Set: Advanced Vector Extensions 2 (AVX2)
Support: Used by default on supported hardware. Found on Intel processors since Haswell (2013). Found on AMD processors since Excavator (2016).
Instructions Detected: VPBROADCASTQ VINSERTI128 VEXTRACTI128 VPBROADCASTD VPSLLVQ VPBROADCASTW VPERMPS VPERM2I128 VPERMQ VPERMD

Instruction Set: Advanced Vector Extensions 512 (AVX512)
Support: Used by default on supported hardware.
Instructions Detected: (ZMM REGISTER USE)

Instruction Set: FMA (FMA)
Support: Used by default on supported hardware. Found on Intel processors since Haswell (2013). Found on AMD processors since Bulldozer (2011).
Instructions Detected: VFMADD132SD VFMADD231SS VFMADD231SD VFMADD213SD VFNMADD132SD VFMADD132SS VFNMADD213SS VFMADD213SS VFNMSUB132SS VFMSUB132SS VFNMADD132SS VFMSUB231SS VFNMADD213SD VFMADD132PS VFMADD231PS VFNMADD231SS VFMADD213PS

The test / benchmark does honor compiler flag changes.
Last automated analysis: 17 January 2022

This test profile binary relies on the shared libraries libopenblas.so.0, libOpenCL.so.1, libz.so.1, libm.so.6, libc.so.6, libgfortran.so.5, libquadmath.so.0.

Tested CPU Architectures

This benchmark has been successfully tested on the architectures listed below. The listed CPU architectures are those from which successful OpenBenchmarking.org result uploads occurred, which helps determine whether a given test is compatible with various alternative CPU architectures.

CPU Architecture: Intel / AMD x86 64-bit
Kernel Identifier: x86_64
Verified On: (Many Processors)

CPU Architecture: ARMv8 64-bit
Kernel Identifier: aarch64
Verified On: ARMv8 Cortex-A57 4-Core, ARMv8 Cortex-A72, ARMv8 Cortex-A72 16-Core, ARMv8 Cortex-A72 4-Core, ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 64-Core, ARMv8 Neoverse-V1, Ampere ARMv8 Neoverse-N1 128-Core, Ampere ARMv8 Neoverse-N1 160-Core, Ampere ARMv8 Neoverse-N1 256-Core, Ampere Altra ARMv8 Neoverse-N1 160-Core, Ampere Altra ARMv8 Neoverse-N1 80-Core, Ampere eMAG ARMv8 32-Core

Recent Test Results

1 System - 64 Benchmark Results

Intel Core i7-7820HQ - Dell 0X41RR - Intel Xeon E3-1200 v6

Fedora Linux 36 - 5.19.11-200.fc36.x86_64 - GNOME Shell 42.5

4 Systems - 537 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASRock X570 Pro4 - AMD Starship

Ubuntu 22.04 - 5.17.4-051704-generic - GNOME Shell 42.0

4 Systems - 361 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ROG CROSSHAIR X670E HERO - AMD Device 14d8

Ubuntu 22.04 - 6.0.0-060000rc1daily20220820-generic - GNOME Shell 42.2

13 Systems - 333 Benchmark Results

Intel Core i9-12900K - ASUS ROG STRIX Z690-E GAMING WIFI - Intel Device 7aa7

Ubuntu 22.04 - 6.0.0-060000rc1daily20220820-generic - GNOME Shell 42.2

2 Systems - 226 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ROG CROSSHAIR X670E HERO - AMD Device 14d8

Ubuntu 22.04 - 6.0.0-060000rc1daily20220820-generic - GNOME Shell 42.2

1 System - 7 Benchmark Results

AMD Ryzen 7 PRO 4750G - ASRock X300M-STX - 2 x 8192 MB 3200MHz 4ATF1G64HZ-3G2E1

Microsoft Windows 10 Pro Build 19044 - 10.0.19044.2006 - 31.0.12019.9007

1 System - 7 Benchmark Results

Intel Atom D525 - Intel D525MW - Intel Atom D4xx

Debian 11 - 5.10.0-18-amd64 - 1.0.2

1 System - 7 Benchmark Results

Intel Core i7-3517U - ASUS UX31A - 2 x 2048 MB 1600MHz Elpida

Microsoft Windows 10 Home Build 19044 - 10.0.19044.2006 - 10.18.10.4358

1 System - 17 Benchmark Results

Intel Core i7-4790K - MSI B85-G43 GAMING - Intel 4th Gen Core DRAM

Fedora Linux 36 - 5.19.8-200.fc36.x86_64 - KDE Plasma 5.25.5

2 Systems - 705 Benchmark Results

2 x AMD EPYC 7713 64-Core - AMD DAYTONA_X - AMD Starship

Ubuntu 22.04 - 5.15.0-47-generic - GNOME Shell 42.4

1 System - 91 Benchmark Results

Intel Core i7-5820K - ASUS X99-A/USB 3.1 - Intel Xeon E7 v3

Fedora Linux 36 - 5.19.4-200.fc36.x86_64 - X Server 1.20.14

1 System - 86 Benchmark Results

AMD Ryzen 7 PRO 4750U - LENOVO 20UES00L00 - AMD Renoir

Fedora 36 - 5.19.6-200.fc36.x86_64 - GNOME Shell 42.4

1 System - 366 Benchmark Results

2 x AMD EPYC 7713 64-Core - AMD DAYTONA_X - AMD Starship

Ubuntu 22.10 - 6.0.0-060000rc3daily20220904-generic - GNOME Shell

1 System - 5 Benchmark Results

Intel Core i9-10850K - ASRock Z490 Taichi - 2 x 16384 MB 3600MHz

Microsoft Windows 10 Pro Build 19044 - 10.0.19044.1949 - 512.15

1 System - 11 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASRock B450M Pro4-F - 4 x 16384 MB 2800MHz Kingston KHX2666C16

Microsoft Windows 10 Pro Build 19044 - 10.0.19044.1889 - 456.71

Most Popular Test Results

4 Systems - 196 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

Ubuntu 20.04 - 5.9.0-050900rc6daily20200923-generic - GNOME Shell 3.36.4

2 Systems - 16 Benchmark Results

Ampere Altra ARMv8 Neoverse-N1 - WIWYNN Mt.Jade - Ampere Computing LLC Altra PCI Root Complex A

Ubuntu 21.04 - 5.11.0-25-generic - 1.0.2

3 Systems - 531 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASRock X570 Pro4 - AMD Starship

Ubuntu 22.04 - 5.17.4-051704-generic - GNOME Shell 42.0

8 Systems - 364 Benchmark Results

2 x AMD EPYC 7742 64-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 21.10 - 5.13.0-19-generic - GNOME Shell 40.5

4 Systems - 146 Benchmark Results

Intel Core i5-7600K - Gigabyte Z270M-D3H-CF - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.9.0-050900rc7daily20200928-generic - GNOME Shell 3.36.4

3 Systems - 68 Benchmark Results

Intel Xeon E3-1275 v6 - ASUS P10S-M WS - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.9.0-050900rc8daily20201007-generic - X Server 1.20.8

3 Systems - 55 Benchmark Results

2 x AMD EPYC 7763 64-Core - AMD DAYTONA_X - AMD Starship

Ubuntu 20.10 - 5.8.0-29-generic - X Server 1.20.8

13 Systems - 333 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ROG CROSSHAIR X670E HERO - AMD Device 14d8

Ubuntu 22.04 - 6.0.0-060000rc1daily20220820-generic - GNOME Shell 42.2

4 Systems - 66 Benchmark Results

AMD Ryzen 7 5700G - ASUS TUF GAMING B550M-PLUS - AMD Renoir Root Complex

Ubuntu 21.10 - 5.14.0-rc7-amd-pstate-phx - GNOME Shell 40.2

2 Systems - 161 Benchmark Results

Intel Core i3-7100 - Gigabyte B250M-DS3H-CF - Intel Xeon E3-1200 v6

Ubuntu 20.10 - 5.8.0-55-generic - GNOME Shell 3.38.1
