LeelaChessZero Benchmark - OpenBenchmarking.org

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark lczero.

Revision History

pts/lczero-1.8.0 [View Source] Sun, 11 Aug 2024 10:10:22 GMT
Update against lc0 v0.31.1 upstream.

pts/lczero-1.7.0 [View Source] Sat, 16 Dec 2023 06:59:59 GMT
Update against lczero v0.30 upstream.

pts/lczero-1.6.0 [View Source] Wed, 25 Aug 2021 14:55:01 GMT
Update against upstream LC0 v0.28 to workaround build issues on newer Linux distributions.

pts/lczero-1.5.1 [View Source] Sun, 27 Sep 2020 17:41:46 GMT
Limit max CPU threads to 64 to workaround upstream issue with lc0 bailing out otherwise.

pts/lczero-1.5.0 [View Source] Sun, 06 Sep 2020 14:18:27 GMT
Update against latest upstream along with updated network, add eigen as possible external dependency.

pts/lczero-1.4.0 [View Source] Thu, 30 Apr 2020 09:06:47 GMT
Update against lc0 0.25, use new network as old one was removed.

pts/lczero-1.3.0 [View Source] Fri, 10 Jan 2020 20:28:53 GMT
Update against latest lc0 v0.23.2 upstream, other test improvements.

pts/lczero-1.2.1 [View Source] Thu, 03 Oct 2019 14:07:11 GMT
Add Windows support.

pts/lczero-1.2.0 [View Source] Thu, 26 Sep 2019 16:49:44 GMT
Update against lczero upstream

pts/lczero-1.1.1 [View Source] Wed, 16 Jan 2019 05:41:40 GMT
Add zlib to external dependency list.

pts/lczero-1.1.0 [View Source] Tue, 15 Jan 2019 10:11:47 GMT
Set threads option always for CPU testing.

pts/lczero-1.0.1 [View Source] Sun, 13 Jan 2019 14:22:54 GMT
Allow CUDA and BLAS benchmarking back-end support.

pts/lczero-1.0.0 [View Source] Sat, 12 Jan 2019 20:10:43 GMT
Initial commit of lc0 / lczero chess benchmark using neural networks with GPU compute.

Performance Metrics

LeelaChessZero 0.28

Backend: BLAS

OpenBenchmarking.org metrics for this test profile configuration based on 936 public results since 25 August 2021 with the latest data as of 12 December 2023.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Percentile Rank

# Compatible Public Results

Nodes Per Second (Average)

2 x AMD EPYC 9684X 96-Core

100th

12423 ^{+/- 1196}

2 x AMD EPYC 9554 64-Core

100th

11307 ^{+/- 313}

2 x AMD EPYC 9654 96-Core

100th

10200 ^{+/- 1299}

AMD EPYC 9684X 96-Core

99th

9380 ^{+/- 659}

2 x AMD EPYC 9754 128-Core

99th

8486 ^{+/- 96}

2 x AMD EPYC 7573X 32-Core

98th

8108 ^{+/- 160}

2 x AMD EPYC 75F3 32-Core

96th

6126 ^{+/- 195}

AMD EPYC 7773X 64-Core

94th

5414 ^{+/- 161}

2 x AMD EPYC 7713 64-Core

91st

4457 ^{+/- 167}

AMD EPYC 7763 64-Core

90th

4129 ^{+/- 75}

AMD EPYC 7713 64-Core

89th

3823 ^{+/- 57}

2 x Intel Xeon Platinum 8280

89th

3544 ^{+/- 226}

ARMv8 Neoverse-N1 128-Core

85th

2343 ^{+/- 28}

Ampere ARMv8 Neoverse-N1 256-Core

83rd

2184 ^{+/- 70}

Ampere Altra ARMv8 Neoverse-N1 160-Core

83rd

2172 ^{+/- 104}

Intel Xeon Gold 6226R

82nd

2020 ^{+/- 25}

2 x AMD EPYC 7742 64-Core

81st

1995 ^{+/- 102}

AMD EPYC 7373X 16-Core

80th

1962 ^{+/- 38}

AMD Ryzen 9 7950X3D 16-Core

79th

1932 ^{+/- 25}

Mid-Tier

75th

< 1885

AMD Ryzen 9 7900X3D 12-Core

75th

1882 ^{+/- 14}

AMD EPYC 72F3 8-Core

75th

1871 ^{+/- 101}

AMD EPYC 7742 64-Core

74th

1804 ^{+/- 78}

AMD Ryzen 9 7900X 12-Core

74th

1766 ^{+/- 109}

AMD Ryzen 9 7950X 16-Core

73rd

1749 ^{+/- 105}

2 x AMD EPYC 7373X 16-Core

71st

1690 ^{+/- 22}

AMD Ryzen Threadripper 3990X 64-Core

67th

1653 ^{+/- 27}

AMD Ryzen 7 7700X 8-Core

66th

1616 ^{+/- 54}

AMD Ryzen 9 7900 12-Core

66th

1614 ^{+/- 12}

AMD EPYC 7F52 16-Core

66th

1607 ^{+/- 89}

AMD Ryzen 7 7700 8-Core

64th

1548 ^{+/- 30}

AMD Ryzen 5 7600X 6-Core

63rd

1527 ^{+/- 68}

Intel Xeon E5-2678 v3

63rd

1526 ^{+/- 23}

Intel Core i7-9750H

61st

1431 ^{+/- 95}

AMD EPYC 74F3 24-Core

58th

1346 ^{+/- 145}

AMD EPYC 75F3 32-Core

58th

1342 ^{+/- 32}

AMD EPYC 7543 32-Core

57th

1315 ^{+/- 78}

AMD Ryzen Threadripper 3970X 32-Core

56th

1289 ^{+/- 95}

AMD Ryzen 7 5800X3D 8-Core

54th

1256 ^{+/- 18}

Intel Core i9-9900KS

51st

1212 ^{+/- 3}

Intel Core i9-10980XE

51st

1190 ^{+/- 19}

Median

50th

1177

AMD Ryzen Threadripper 3960X 24-Core

50th

1162 ^{+/- 83}

Intel Core i5-8400

49th

1147 ^{+/- 35}

Intel Core i7-8700K

47th

1090 ^{+/- 20}

Intel Core i7-8086K

46th

1048 ^{+/- 59}

Intel Core i9-13900K

45th

1043 ^{+/- 131}

Intel Core i5-13600K

45th

1036 ^{+/- 39}

Intel Core i9-9900K

44th

1017 ^{+/- 20}

Intel Core i7-6800K

43rd

1006 ^{+/- 17}

Intel Core i5-9400F

43rd

1000 ^{+/- 24}

AMD EPYC 7F32 8-Core

42nd

967 ^{+/- 72}

Intel Core i7-12700K

40th

932 ^{+/- 22}

Intel Xeon E3-1275 v6

38th

908 ^{+/- 3}

Intel Core i3-8100

36th

854 ^{+/- 18}

Intel Core i5-13400

35th

819 ^{+/- 2}

Intel Xeon E3-1280 v5

34th

806 ^{+/- 11}

Intel Core i7-5960X

33rd

795 ^{+/- 95}

Intel Xeon E3-1270 v5

33rd

784 ^{+/- 12}

AMD Ryzen 5 3600XT 6-Core

33rd

778 ^{+/- 59}

Intel Core i5-6500

31st

765 ^{+/- 4}

Intel Core i7-10700T

30th

741 ^{+/- 21}

AMD Ryzen 5 5600G

29th

728 ^{+/- 91}

Intel Core i7-1065G7

26th

680 ^{+/- 69}

AMD Ryzen 9 3900XT 12-Core

26th

677 ^{+/- 26}

Low-Tier

25th

< 673

AMD Ryzen 7 5700G

24th

644 ^{+/- 47}

Intel Xeon E3-1235L v5

23rd

637 ^{+/- 13}

Intel Xeon E5-2609 v4

22nd

616 ^{+/- 10}

Intel Core i5-7600K

21st

600 ^{+/- 6}

AMD Ryzen 3 3300X 4-Core

20th

587 ^{+/- 21}

Intel Core i7-4770K

19th

557 ^{+/- 12}

AMD Ryzen 9 3950X 16-Core

17th

535 ^{+/- 24}

AMD Ryzen Threadripper 2970WX 24-Core

15th

476 ^{+/- 22}

Intel Core i7-8565U

15th

473 ^{+/- 9}

AMD Ryzen Threadripper 2990WX 32-Core

15th

459 ^{+/- 40}

Intel Core i9-10900K

14th

429 ^{+/- 5}

AMD Ryzen 5 4500U

14th

414 ^{+/- 24}

AMD Ryzen 7 4700U

13th

409 ^{+/- 18}

Intel Core i5-10600K

13th

402 ^{+/- 9}

AMD Ryzen 5 5500U

12th

360 ^{+/- 17}

AMD Ryzen 3 2200G

10th

285 ^{+/- 10}

Intel Core i7-2700K

9th

248 ^{+/- 5}

Ampere eMAG ARMv8 32-Core

7th

195 ^{+/- 7}

Intel Core i7-1185G7

7th

174 ^{+/- 2}

Intel Core i7-1165G7

7th

173 ^{+/- 18}

AMD FX-8150 Eight-Core

5th

143 ^{+/- 7}

AMD Ryzen 3 3200U

4th

132 ^{+/- 11}

Intel Core i7-7820HQ

4th

111

ARMv8 Cortex-A72 4-Core

2nd

27 ^{+/- 2}

Based on OpenBenchmarking.org data, the selected test / test configuration (LeelaChessZero 0.28 - Backend: BLAS) has an average run-time of 27 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.9%.

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set

Support

Instructions Detected

SSE 4.2 (SSE4_2)

Used by default on supported hardware.
Found on Intel processors since at least 2010.
Found on AMD processors since Bulldozer (2011).

POPCNT

Advanced Vector Extensions (AVX)

Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

VZEROUPPER VBROADCASTSS VEXTRACTF128 VPERMILPS VINSERTF128 VPERM2F128

Advanced Vector Extensions 2 (AVX2)

Used by default on supported hardware.
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

VPBROADCASTQ VINSERTI128 VEXTRACTI128 VPBROADCASTW VPBROADCASTD VPSLLVQ VPERMPS VPERM2I128 VPERMQ VPERMD

Advanced Vector Extensions 512 (AVX512)

Used by default on supported hardware.

(ZMM REGISTER USE)

FMA (FMA)

Used by default on supported hardware.
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

VFMADD132SD VFMADD231SD VFMADD213SD VFNMADD132SD VFMADD132SS VFNMADD132SS VFMSUB132SS VFMADD213SS VFMADD231SS VFNMSUB231SS VFNMADD213SS VFNMSUB132SS VFMSUB231SS VFNMADD213SD VFMADD132PS VFMADD231PS VFMADD213PS VFNMADD231SS

The test / benchmark does honor compiler flag changes.

Last automated analysis: 23 August 2024

This test profile binary relies on the shared libraries libopenblas.so.0, libOpenCL.so.1, libz.so.1, libm.so.6, libc.so.6, libgfortran.so.5, libquadmath.so.0.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

ccta_bio-sci-fin-comp_suite_w10-pro3_R_25nov21 2 Systems - 14 Benchmark Results	AMD Ryzen Threadripper 3960X 24-Core - Intel 440BX - 1 x 16384 MB 0MHz VMW-16384MB Microsoft Windows 10 Pro Build 18362 - 10.0 - 2.0.105.0
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
AMD Ryzen 5800X, 5800X3D, 5950X Linux Benchmarks 5 Systems - 531 Benchmark Results	AMD EPYC 7702 64-Core - Supermicro Super Server H12SSL-NT v1.02 - AMD Starship Debian 12 - 6.8.8-2-pve - NVIDIA

LeelaChessZero

Project Site

Source Repository

Test Created

Last Updated

Test Maintainer

Test Type

Average Install Time

Average Run Time

Test Dependencies

Accolades

Supported Platforms

Revision History

Suites Using This Test

Chess Test Suite

Machine Learning

HPC - High Performance Computing

CPU Massive

NVIDIA GPU Compute

Performance Metrics

LeelaChessZero 0.28

Backend: BLAS

Does It Scale Well With Increasing Cores?

Notable Instruction Set Usage

Tested CPU Architectures

Recent Test Results

Compare

ccta_bio-sci-fin-comp_suite_w10-pro3_R_25nov21

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

AMD Ryzen 5800X, 5800X3D, 5950X Linux Benchmarks