Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark graph500.

Project Site

graph500.org

Source Repository

github.com

Test Created

28 January 2022

Last Updated

6 April 2024

Test Maintainer

Michael Larabel

Test Type

System

Average Install Time

6 Seconds

Average Run Time

34 Minutes, 37 Seconds

Test Dependencies

C/C++ Compiler Toolchain + OpenMPI

Accolades

10k+ Downloads

Supported Platforms

* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 23 October 2024.

Revision History

pts/graph500-1.0.2 [View Source] Sat, 06 Apr 2024 06:42:49 GMT
Fix with newer compilers.

pts/graph500-1.0.1 [View Source] Sat, 29 Jan 2022 11:06:02 GMT
Add PROCS_PER_NODE_NOT_POWER_OF_TWO handling to detect otherwise the program fails to run for non power of 2 CPU core counts.

pts/graph500-1.0.0 [View Source] Fri, 28 Jan 2022 07:16:09 GMT
Initial commit of Graph500, long overdue...

Suites Using This Test

HPC - High Performance Computing

Performance Metrics

Analyze Test Configuration:

Graph500 3.0

Scale: 26

OpenBenchmarking.org metrics for this test profile configuration based on 732 public results since 28 January 2022 with the latest data as of 11 October 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Percentile Rank

# Compatible Public Results

sssp median_TEPS (Average)

2 x AMD EPYC 9755 128-Core

100th

1062140000 ^{+/- 6871594}

2 x AMD EPYC 9754 128-Core

99th

763451889 ^{+/- 34156366}

2 x AMD EPYC 9684X 96-Core

95th

624749680 ^{+/- 32679516}

2 x AMD EPYC 9654 96-Core

94th

599056714 ^{+/- 46353259}

2 x AMD EPYC 9554 64-Core

89th

496178611 ^{+/- 34397377}

2 x INTEL XEON PLATINUM 8592

86th

473044500 ^{+/- 33182461}

AMD EPYC 9654 96-Core

82nd

407990154 ^{+/- 11862849}

INTEL XEON PLATINUM 8592

82nd

407245778 ^{+/- 26388992}

AMD EPYC 9684X 96-Core

79th

383754000

2 x AMD EPYC 9374F 32-Core

78th

380707800 ^{+/- 13023747}

AmpereOne 192-Core

78th

380116333 ^{+/- 30928264}

2 x Intel Xeon Platinum 8490H

77th

377258214 ^{+/- 2895847}

Mid-Tier

75th

< 373260000

AMD EPYC 9754 128-Core

75th

368908571 ^{+/- 16402782}

AMD EPYC 9554 64-Core

73rd

357670692 ^{+/- 7317443}

AMD Ryzen Threadripper PRO 7995WX 96-Cores

71st

353346667 ^{+/- 26736816}

2 x AMD EPYC 7763 64-Core

67th

330989739 ^{+/- 18943171}

Intel Xeon Platinum 8490H

67th

330394455 ^{+/- 3859265}

ARMv8 Neoverse-V2 72-Core

63rd

311922533 ^{+/- 15201224}

2 x AMD EPYC 7713 64-Core

62nd

310018444 ^{+/- 17767504}

2 x AMD EPYC 7773X 64-Core

62nd

308534756 ^{+/- 31938642}

Median

50th

283396000

2 x Intel Xeon Max 9480

50th

283291375 ^{+/- 16511699}

Ampere ARMv8 Neoverse-N1 256-Core

50th

281195500 ^{+/- 4224695}

2 x AMD EPYC 75F3 32-Core

47th

271577111 ^{+/- 10299537}

2 x Intel Xeon Platinum 8380

47th

270935875 ^{+/- 29812973}

AMD EPYC 8534P 64-Core

45th

267939750 ^{+/- 1345992}

2 x Intel Xeon Platinum 8362

45th

267899000 ^{+/- 2808000}

AMD EPYC 7773X 64-Core

41st

262063556 ^{+/- 19210989}

AMD EPYC 8534PN 64-Core

41st

262060000 ^{+/- 13799322}

AMD EPYC 7763 64-Core

41st

261698737 ^{+/- 9967228}

AMD Ryzen Threadripper 7980X 64-Cores

38th

255047333 ^{+/- 29685119}

2 x AMD EPYC 7543 32-Core

37th

254262750 ^{+/- 17861973}

AMD EPYC 7713 64-Core

37th

250495533 ^{+/- 10754330}

2 x AMD EPYC 7573X 32-Core

37th

250443667 ^{+/- 11430370}

2 x AMD EPYC 7513 32-Core

37th

248282000 ^{+/- 11698271}

2 x AMD EPYC 7601 32-Core

33rd

228313000 ^{+/- 1739078}

AMD EPYC 7713P 64-Core

33rd

227175000

ARMv8 Neoverse-N1 128-Core

31st

224132100 ^{+/- 1708299}

2 x AMD EPYC 74F3 24-Core

31st

223171500 ^{+/- 12313149}

AMD EPYC 9374F 32-Core

29th

215074750 ^{+/- 8443500}

Intel Xeon Platinum 8380

28th

209719455 ^{+/- 10108181}

AMD EPYC 7543 32-Core

26th

201369000 ^{+/- 5456854}

AMD EPYC 75F3 32-Core

26th

200128250 ^{+/- 8116420}

Low-Tier

25th

< 197386000

Intel Xeon Platinum 8362

25th

196539333 ^{+/- 1466470}

2 x Intel Xeon Gold 6346

23rd

186677600 ^{+/- 2220376}

2 x Intel Xeon Platinum 8280

22nd

179964250 ^{+/- 1300586}

AMD EPYC 7513 32-Core

22nd

176858000 ^{+/- 974000}

2 x AMD EPYC 7343 16-Core

20th

166108000 ^{+/- 14048664}

AMD Ryzen Threadripper 3990X 64-Core

19th

162734143 ^{+/- 941062}

2 x AMD EPYC 7373X 16-Core

18th

159731000 ^{+/- 21038042}

AMD EPYC 74F3 24-Core

18th

157047667 ^{+/- 3051296}

AMD Ryzen Threadripper 3970X 32-Core

17th

155421600 ^{+/- 1506277}

ARMv8 Neoverse-N1 32-Core

15th

138202000

2 x Intel Xeon Gold 5220R

13th

123755500 ^{+/- 457839}

AMD Ryzen Threadripper PRO 5965WX 24-Cores

11th

119973114 ^{+/- 8913913}

Intel Xeon Gold 6346

11th

117185000 ^{+/- 1572343}

AMD EPYC 7343 16-Core

10th

113232000 ^{+/- 1841170}

AMD Ryzen 9 7950X 16-Core

10th

106031271 ^{+/- 13725396}

Intel Core i9-12900K

9th

102174556 ^{+/- 1284813}

Intel Xeon Gold 6226R

8th

96798067 ^{+/- 1103602}

AMD Ryzen 9 7900X 12-Core

7th

92662233 ^{+/- 960570}

Intel Xeon Silver 4216

6th

91456067 ^{+/- 979949}

AMD Ryzen 9 5950X 16-Core

6th

82937580 ^{+/- 11461092}

AMD EPYC 7551 32-Core

5th

79429471 ^{+/- 4282382}

AMD EPYC 72F3 8-Core

3rd

61164150 ^{+/- 4971920}

Intel Xeon E-2288G

2nd

50707825 ^{+/- 239293}

Based on OpenBenchmarking.org data, the selected test / test configuration (Graph500 3.0 - Scale: 26) has an average run-time of 17 minutes. By default this test profile is set to run at least 1 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test / test settings does generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set

Support

Instructions Detected

SSE2 (SSE2)

Used by default on supported hardware.

MOVD SUBSD MOVAPD ADDSD MULSD CVTTSD2SI DIVSD MOVUPD DIVPD COMISD UCOMISD CVTSI2SD UNPCKLPD SUBPD MULPD UNPCKHPD SQRTSD CVTSS2SD MOVDQA PUNPCKLQDQ CVTSD2SS

Advanced Vector Extensions (AVX)

Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

VINSERTF128 VZEROUPPER VEXTRACTF128 VBROADCASTSD VBROADCASTSS

Advanced Vector Extensions 2 (AVX2)

Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

VPBROADCASTQ VPERM2I128 VPERMQ VINSERTI128 VPBROADCASTD

FMA (FMA)

VFMADD231SD VFMADD132SD

Advanced Vector Extensions 512 (AVX512)

Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).

(ZMM REGISTER USE)

The test / benchmark does honor compiler flag changes.

Last automated analysis: 6 April 2024

This test profile binary relies on the shared libraries libm.so.6, libmpi.so.40, libc.so.6, libopen-rte.so.40, libopen-pal.so.40, libhwloc.so.15, libz.so.1, libudev.so.1.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture

Kernel Identifier

Verified On

Intel / AMD x86 64-bit

x86_64

(Many Processors)

ARMv8 64-bit

aarch64

ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 128-Core, ARMv8 Neoverse-N1 32-Core, ARMv8 Neoverse-V1, ARMv8 Neoverse-V2 72-Core, Ampere ARMv8 Neoverse-N1 128-Core, Ampere ARMv8 Neoverse-N1 160-Core, Ampere ARMv8 Neoverse-N1 256-Core, AmpereOne 192-Core

Recent Test Results

Compare

2024-10-08-0736 1 System - 461 Benchmark Results	AMD Ryzen 7 5800X 8-Core - GIGABYTE MC12-LE0-00 v01000100 - AMD Starship Ubuntu 24.04 - 6.11.0-061100-generic - GNOME Shell 46.0
EPYC 9755 Smoke Run 3 Systems - 21 Benchmark Results	2 x AMD EPYC 9755 128-Core - AMD VOLCANO - AMD Device 153a Ubuntu 24.04 - 6.8.0-45-generic - GCC 13.2.0 + Clang 18.1.3
Intel Xeon 6980P Granite Rapids 1 System - 41 Benchmark Results	Intel Xeon 6980P - Intel BIRCHSTREAM - Intel Ice Lake IEH Ubuntu 24.04 - 6.8.0-22-generic - GCC 13.2.0
Ryzen 9 9950X Memory Performance 1 System - 102 Benchmark Results	AMD Ryzen 9 9950X 16-Core - ASUS ROG STRIX X670E-E GAMING WIFI - AMD Device 14d8 Ubuntu 24.04 - 6.10.0-phx - GNOME Shell 46.0
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks 6 Systems - 87 Benchmark Results	ARMv8 Neoverse-V1 - Amazon EC2 c7g.16xlarge - Amazon Device 0200 Ubuntu 22.04 - 5.19.0-1025-aws - GCC 11.3.0

Graph500

Project Site

Source Repository

Test Created

Last Updated

Test Maintainer

Test Type

Average Install Time

Average Run Time

Test Dependencies

Accolades

Supported Platforms

Revision History

Suites Using This Test

HPC - High Performance Computing

Performance Metrics

Graph500 3.0

Scale: 26

Does It Scale Well With Increasing Cores?

Notable Instruction Set Usage

Tested CPU Architectures

Recent Test Results

Compare

2024-10-08-0736

EPYC 9755 Smoke Run

Intel Xeon 6980P Granite Rapids

Ryzen 9 9950X Memory Performance

Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks