Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data-intensive workloads and commonly run on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark graph500.

Project Site: graph500.org
Source Repository: github.com
Test Created: 28 January 2022
Last Updated: 6 April 2024
Test Maintainer: Michael Larabel
Test Type: System
Average Install Time: 6 Seconds
Average Run Time: 34 Minutes, 37 Seconds
Test Dependencies: C/C++ Compiler Toolchain + OpenMPI
Accolades: 10k+ Downloads

Supported Platforms

* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 25 July 2024.

Revision History

pts/graph500-1.0.2 - Sat, 06 Apr 2024 06:42:49 GMT
Fix for newer compilers.

pts/graph500-1.0.1 - Sat, 29 Jan 2022 11:06:02 GMT
Add PROCS_PER_NODE_NOT_POWER_OF_TWO handling, since the program otherwise fails to run on CPU core counts that are not a power of two.

pts/graph500-1.0.0 - Fri, 28 Jan 2022 07:16:09 GMT
Initial commit of Graph500, long overdue...
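
The 1.0.1 entry above concerns CPU core counts that are not a power of two. How the test profile detects this is not shown here, so the following is only a minimal sketch of the underlying check. The function names are hypothetical and are not taken from the test profile or from the Graph500 sources.

```python
def is_power_of_two(n):
    """Return True when n is a positive power of two (1, 2, 4, 8, ...)."""
    # A power of two has exactly one bit set, so n & (n - 1) clears it to zero.
    return n > 0 and (n & (n - 1)) == 0


def largest_power_of_two_at_most(n):
    """Return the largest power of two that does not exceed n (hypothetical helper)."""
    if n < 1:
        raise ValueError("core count must be at least 1")
    return 1 << (n.bit_length() - 1)


# Core counts like those in the results on this page: 64 is a power of two, 96 and 24 are not.
for cores in (64, 96, 24):
    print(cores, is_power_of_two(cores), largest_power_of_two_at_most(cores))
```

A launcher could use a check like this to decide whether special handling is needed for a given core count; whether the profile rounds down or sets a build flag instead is an assumption left open here.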

Suites Using This Test

HPC - High Performance Computing

Performance Metrics

Graph500 3.0

Scale: 26
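
To give a sense of what "Scale: 26" means, here is a small sketch of the problem size it implies. In the Graph500 specification the scale is the base-2 logarithm of the vertex count, and the edge count is the vertex count times an edge factor whose specification default is 16. That this test profile uses the default edge factor is an assumption, and the function names are illustrative only.

```python
# Assumption: the Graph500 specification default edge factor.
EDGE_FACTOR = 16


def graph_size(scale, edge_factor=EDGE_FACTOR):
    """Return (vertices, edges) for a Graph500 problem of the given scale."""
    vertices = 2 ** scale
    edges = edge_factor * vertices
    return vertices, edges


def teps(edges_traversed, seconds):
    """Traversed edges per second, the unit behind the median_TEPS metric."""
    return edges_traversed / seconds


vertices, edges = graph_size(26)
print(vertices, edges)  # 67108864 1073741824
```

Under these assumptions, a scale of 26 corresponds to roughly 67 million vertices and roughly 1.07 billion generated edges.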

OpenBenchmarking.org metrics for this test profile configuration based on 707 public results since 28 January 2022 with the latest data as of 28 May 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that, particularly in the Linux/open-source space, OS configurations can differ vastly, so this overview is intended only as general guidance on performance expectations.

Component	Percentile Rank	sssp median_TEPS (Average)
2 x AMD EPYC 9754 128-Core	100th	763451889 +/- 34156366
2 x AMD EPYC 9684X 96-Core	96th	624749680 +/- 32679516
2 x AMD EPYC 9654 96-Core	95th	599056714 +/- 46353259
2 x AMD EPYC 9554 64-Core	90th	496178611 +/- 34397377
2 x INTEL XEON PLATINUM 8592	86th	473044500 +/- 33182461
AMD EPYC 9654 96-Core	82nd	407990154 +/- 11862849
INTEL XEON PLATINUM 8592	82nd	407245778 +/- 26388992
AMD EPYC 9684X 96-Core	81st	383754000
2 x AMD EPYC 9374F 32-Core	80th	380707800 +/- 13023747
2 x Intel Xeon Platinum 8490H	79th	377258214 +/- 2895847
AMD EPYC 9754 128-Core	76th	368908571 +/- 16402782
Mid-Tier	75th	< 363936000
AMD EPYC 9554 64-Core	75th	357670692 +/- 7317443
AMD Ryzen Threadripper PRO 7995WX 96-Cores	73rd	353346667 +/- 26736816
2 x AMD EPYC 7763 64-Core	69th	330989739 +/- 18943171
Intel Xeon Platinum 8490H	69th	330394455 +/- 3859265
ARMv8 Neoverse-V2 72-Core	65th	311922533 +/- 15201224
2 x AMD EPYC 7713 64-Core	64th	310018444 +/- 17767504
2 x AMD EPYC 7773X 64-Core	64th	308534756 +/- 31938642
2 x Intel Xeon Max 9480	52nd	283291375 +/- 16511699
Ampere ARMv8 Neoverse-N1 256-Core	52nd	281195500 +/- 4224695
Median	50th	275755000
2 x AMD EPYC 75F3 32-Core	49th	271577111 +/- 10299537
2 x Intel Xeon Platinum 8380	49th	270935875 +/- 29812973
AMD EPYC 8534P 64-Core	46th	267939750 +/- 1345992
2 x Intel Xeon Platinum 8362	46th	267899000 +/- 2808000
AMD EPYC 7773X 64-Core	43rd	262063556 +/- 19210989
AMD EPYC 8534PN 64-Core	43rd	262060000 +/- 13799322
AMD EPYC 7763 64-Core	42nd	261698737 +/- 9967228
AMD Ryzen Threadripper 7980X 64-Cores	39th	255047333 +/- 29685119
2 x AMD EPYC 7543 32-Core	38th	254262750 +/- 17861973
AMD EPYC 7713 64-Core	38th	250495533 +/- 10754330
2 x AMD EPYC 7573X 32-Core	38th	250443667 +/- 11430370
2 x AMD EPYC 7513 32-Core	38th	248282000 +/- 11698271
2 x AMD EPYC 7601 32-Core	34th	228313000 +/- 1739078
AMD EPYC 7713P 64-Core	34th	227175000
ARMv8 Neoverse-N1 128-Core	32nd	224132100 +/- 1708299
2 x AMD EPYC 74F3 24-Core	31st	223171500 +/- 12313149
AMD EPYC 9374F 32-Core	30th	215074750 +/- 8443500
Intel Xeon Platinum 8380	29th	209719455 +/- 10108181
AMD EPYC 7543 32-Core	26th	201369000 +/- 5456854
AMD EPYC 75F3 32-Core	26th	200128250 +/- 8116420
Intel Xeon Platinum 8362	26th	196539333 +/- 1466470
Low-Tier	25th	< 193091000
2 x Intel Xeon Gold 6346	24th	186677600 +/- 2220376
2 x Intel Xeon Platinum 8280	23rd	179964250 +/- 1300586
AMD EPYC 7513 32-Core	22nd	176858000 +/- 974000
2 x AMD EPYC 7343 16-Core	21st	166108000 +/- 14048664
AMD Ryzen Threadripper 3990X 64-Core	19th	162734143 +/- 941062
2 x AMD EPYC 7373X 16-Core	18th	159731000 +/- 21038042
AMD EPYC 74F3 24-Core	18th	157047667 +/- 3051296
AMD Ryzen Threadripper 3970X 32-Core	18th	155421600 +/- 1506277
ARMv8 Neoverse-N1 32-Core	16th	138202000
2 x Intel Xeon Gold 5220R	13th	123755500 +/- 457839
AMD Ryzen Threadripper PRO 5965WX 24-Cores	12th	119973114 +/- 8913913
Intel Xeon Gold 6346	11th	117185000 +/- 1572343
AMD EPYC 7343 16-Core	10th	113232000 +/- 1841170
AMD Ryzen 9 7950X 16-Core	10th	106031271 +/- 13725396
Intel Core i9-12900K	9th	102174556 +/- 1284813
Intel Xeon Gold 6226R	8th	96798067 +/- 1103602
AMD Ryzen 9 7900X 12-Core	7th	92662233 +/- 960570
Intel Xeon Silver 4216	6th	91456067 +/- 979949
AMD Ryzen 9 5950X 16-Core	6th	82937580 +/- 11461092
AMD EPYC 7551 32-Core	5th	79429471 +/- 4282382
AMD EPYC 72F3 8-Core	3rd	61164150 +/- 4971920
Intel Xeon E-2288G	2nd	50707825 +/- 239293

Based on OpenBenchmarking.org data, the selected test configuration (Graph500 3.0 - Scale: 26) has an average run-time of 18 minutes. By default this test profile is set to run at least 1 time, but the run count may increase if the standard deviation exceeds pre-defined defaults or if other calculations deem additional runs necessary for greater statistical accuracy of the result.
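
The idea of adding runs when the standard deviation is too high can be sketched as follows. This is not the Phoronix Test Suite's actual logic: the function name, the use of relative standard deviation, and the 3.5% threshold are all assumptions chosen for illustration.

```python
from statistics import mean, stdev


def needs_more_runs(results, min_runs=1, max_rel_stdev=0.035):
    """Decide whether another run is warranted (hypothetical policy).

    results: numeric results collected so far.
    min_runs: minimum number of runs before stopping.
    max_rel_stdev: assumed threshold on stdev / mean, not a real default.
    """
    if len(results) < min_runs:
        return True
    if len(results) < 2:
        # A single sample has no sample standard deviation to compare.
        return False
    return stdev(results) / mean(results) > max_rel_stdev


print(needs_more_runs([100.0, 101.0, 99.0]))  # tight spread
print(needs_more_runs([100.0, 150.0]))        # wide spread
```

A harness would call a check like this after each run and stop once it returns False or a maximum run count is reached.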

Does It Scale Well With Increasing Cores?

Yes, based on the automated analysis of the collected public benchmark data, this test configuration generally scales well with increasing CPU core counts. The analysis uses publicly available results for this configuration, separated by vendor, with each result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU having a sufficient number of test samples and statistically significant data.
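
The normalization described above can be sketched roughly as below. This is an illustration of the stated steps (divide by clock speed, group by vendor and core count, normalize to the smallest core count per vendor), not the OpenBenchmarking.org implementation. The function name, the tuple layout, and the sample numbers are hypothetical, and the sample-size and significance filtering is omitted.

```python
from collections import defaultdict
from statistics import mean


def core_scaling(samples):
    """samples: iterable of (vendor, cores, clock_ghz, result) tuples.

    Returns {vendor: {cores: scaling relative to the vendor's smallest core count}}.
    """
    # Step 1: divide each result by the reference clock and group by vendor and core count.
    grouped = defaultdict(list)
    for vendor, cores, clock_ghz, result in samples:
        grouped[(vendor, cores)].append(result / clock_ghz)

    # Step 2: average the per-clock results within each group.
    per_vendor = defaultdict(dict)
    for (vendor, cores), values in grouped.items():
        per_vendor[vendor][cores] = mean(values)

    # Step 3: normalize against the smallest core count tested for each vendor.
    scaling = {}
    for vendor, by_cores in per_vendor.items():
        base = by_cores[min(by_cores)]
        scaling[vendor] = {c: v / base for c, v in sorted(by_cores.items())}
    return scaling


# Made-up numbers purely for illustration, not real benchmark data.
demo = [
    ("VendorA", 16, 2.0, 100.0),
    ("VendorA", 32, 2.0, 190.0),
    ("VendorA", 32, 2.0, 210.0),
    ("VendorB", 8, 4.0, 80.0),
    ("VendorB", 16, 2.0, 80.0),
]
print(core_scaling(demo))
```

A value near 2.0 for a doubling of cores would indicate close to linear scaling under this normalization.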

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

SSE2 (SSE2)
Support: Used by default on supported hardware.
Instructions Detected: MOVD SUBSD MOVAPD ADDSD MULSD CVTTSD2SI DIVSD MOVUPD DIVPD COMISD UCOMISD CVTSI2SD UNPCKLPD SUBPD MULPD UNPCKHPD SQRTSD CVTSS2SD MOVDQA PUNPCKLQDQ CVTSD2SS

Advanced Vector Extensions (AVX)
Support: Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3). Found on Intel processors since Sandy Bridge (2011). Found on AMD processors since Bulldozer (2011).
Instructions Detected: VINSERTF128 VZEROUPPER VEXTRACTF128 VBROADCASTSD VBROADCASTSS

Advanced Vector Extensions 2 (AVX2)
Support: Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3). Found on Intel processors since Haswell (2013). Found on AMD processors since Excavator (2016).
Instructions Detected: VPBROADCASTQ VPERM2I128 VPERMQ VINSERTI128 VPBROADCASTD

FMA (FMA)
Instructions Detected: VFMADD231SD VFMADD132SD

Advanced Vector Extensions 512 (AVX512)
Support: Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).
Instructions Detected: (ZMM REGISTER USE)

The test / benchmark does honor compiler flag changes.

Last automated analysis: 6 April 2024

This test profile binary relies on the shared libraries libm.so.6, libmpi.so.40, libc.so.6, libopen-rte.so.40, libopen-pal.so.40, libhwloc.so.15, libz.so.1, libudev.so.1.

Tested CPU Architectures

This benchmark has been successfully tested on the architectures listed below. The CPU architectures listed are those where successful OpenBenchmarking.org result uploads occurred, which helps determine whether a given test is compatible with various alternative CPU architectures.

CPU Architecture	Kernel Identifier	Verified On
Intel / AMD x86 64-bit	x86_64	(Many Processors)
ARMv8 64-bit	aarch64	ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 128-Core, ARMv8 Neoverse-N1 32-Core, ARMv8 Neoverse-V1, ARMv8 Neoverse-V2 72-Core, Ampere ARMv8 Neoverse-N1 128-Core, Ampere ARMv8 Neoverse-N1 160-Core, Ampere ARMv8 Neoverse-N1 256-Core

Recent Test Results

Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks 6 Systems - 87 Benchmark Results	ARMv8 Neoverse-V1 - Amazon EC2 m7g.16xlarge - Amazon Device 0200 Ubuntu 22.04 - 5.19.0-1025-aws - GCC 11.3.0
dlsla 2 Systems - 32 Benchmark Results	AMD Ryzen Threadripper PRO 5965WX 24-Cores - ASUS Pro WS WRX80E-SAGE SE WIFI - AMD Starship Ubuntu 23.10 - 6.5.0-26-generic - GNOME Shell 45.0
skks 3 Systems - 43 Benchmark Results	2 x INTEL XEON PLATINUM 8592+ - Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS - Intel Device 1bce Ubuntu 23.10 - 6.6.0-rc5-phx-patched - GNOME Shell 45.0
pre 2024 4 Systems - 28 Benchmark Results	Intel Core i9-14900K - ASUS PRIME Z790-P WIFI - Intel Device 7a27 Ubuntu 23.10 - 6.8.0-phx - GNOME Shell 45.1
dgg 3 Systems - 21 Benchmark Results	AMD EPYC 8534P 64-Core - AMD Cinnabar - AMD Device 14a4 Ubuntu 23.10 - 6.8.1-060801-generic - GNOME Shell 45.2
saturday genoa 4 Systems - 21 Benchmark Results	2 x AMD EPYC 9684X 96-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 23.10 - 6.9.0-060900rc1daily20240327-generic - GCC 13.2.0
graph500-specfem3d 4 Systems - 26 Benchmark Results	AMD EPYC 7551 32-Core - GIGABYTE MZ31-AR0-00 v01010101 - AMD 17h Debian 12 - 6.1.0-10-amd64 - GCC 12.2.0
graph500 ryzen 3 Systems - 4 Benchmark Results	AMD Ryzen 9 7950X 16-Core - ASUS ROG STRIX X670E-E GAMING WIFI - AMD Device 14d8 Neon 22.04 - 6.5.0-26-generic - GNOME Shell 42.9
graph500 comp 2 Systems - 4 Benchmark Results	AMD Ryzen Threadripper 3970X 32-Core - ASUS ROG ZENITH II EXTREME - AMD Starship Ubuntu 22.04 - 6.5.0-25-generic - GNOME Shell 42.2
g500 compiler update 2 Systems - 4 Benchmark Results	AMD Ryzen Threadripper 7980X 64-Cores - System76 Thelio Major - AMD Device 14a4 Fedora Linux 40 - 6.8.0-0.rc6.49.fc40.x86_64 - GNOME Shell 46.0
Ubuntu 24.04 AMD EPYC Genoa-X Benchmark Preview 1 System - 91 Benchmark Results	2 x AMD EPYC 9684X 96-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 24.04 - 6.8.0-11-generic - GNOME Shell 45.3
Linux Distros Emerald Rapids 4 Systems - 125 Benchmark Results	2 x INTEL XEON PLATINUM 8592+ - Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS - Intel Device 1bce CentOS Stream 9 - 5.14.0-419.el9.x86_64 - GNOME Shell 40.10
Linux Distros Emerald Rapids 3 Systems - 125 Benchmark Results	2 x INTEL XEON PLATINUM 8592+ - Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS - Intel Device 1bce Ubuntu 23.10 - 6.5.0-17-generic - GCC 13.2.0
bandwidth 6 Systems - 10 Benchmark Results	ARMv8 Neoverse-V2 - Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB - 1 x 480GB DRAM-6400MT Ubuntu 22.04 - 6.5.0-1007-NVIDIA-64k - NVIDIA
Linux Distros Emerald Rapids 1 System - 125 Benchmark Results	2 x INTEL XEON PLATINUM 8592+ - Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS - Intel Device 1bce Ubuntu 23.10 - 6.5.0-17-generic - GCC 13.2.0

Most Popular Test Results

Linux Servers AlmaLinux vs. RHEL 9.0 Featured OS Comparison	2 x AMD EPYC 7773X 64-Core - AMD DAYTONA_X - AMD Starship AlmaLinux 9.0 - 5.14.0-70.13.1.el9_0.x86_64 - GNOME Shell 40.9
Ubuntu 22.04 Server Benchmarks 2 Systems - 705 Benchmark Results	2 x AMD EPYC 7713 64-Core - AMD DAYTONA_X - AMD Starship Ubuntu 22.04 - 5.15.0-47-generic - GNOME Shell 42.4
EPYC Milan-X Linux 5.19 Benchmarks Featured Kernel Comparison	2 x AMD EPYC 7773X 64-Core - AMD DAYTONA_X - AMD Starship Ubuntu 21.04 - 5.18.0-051800-generic - GNOME Shell 3.38.4
epyc extra Featured Processor Comparison	AMD EPYC 7763 64-Core - AMD DAYTONA_X - AMD Starship Ubuntu 22.04 - 5.19.0-051900rc4daily20220628-generic - GNOME Shell 42.2
GPTshop GH200 Linux Benchmarks 18 Systems - 33 Benchmark Results	2 x Intel Xeon Platinum 8380 - Intel M50CYP2SB2U - Intel Ice Lake IEH Ubuntu 23.10 - 6.6.0-rc5-phx - GNOME Shell 45.0
AMD Ryzen 7000 Series ECC DRAM 2 Systems - 244 Benchmark Results	AMD Ryzen 9 7900X 12-Core - ASRockRack B650D4U-2L2T/BCM - AMD Device 14d8 Ubuntu 22.04 - 6.6.0-060600rc1daily20230913-generic - GNOME Shell 42.9
cascade lake january 2022 2 Systems - 22 Benchmark Results	2 x Intel Xeon Platinum 8280 - GIGABYTE MD61-SC2-00 v01000100 - Intel Sky Lake-E DMI3 Registers Ubuntu 21.04 - 5.11.0-40-generic - GNOME Shell 3.38.4
Intel Xeon Platinum 8490H Linux Benchmarks 18 Systems - 110 Benchmark Results	2 x AMD EPYC 9654 96-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 22.10 - 6.0.0-060000rc3daily20220904-generic - GNOME Shell
AMD EPYC 9374F Genoa Zen 4 Linux Performance 24 Systems - 193 Benchmark Results	Intel Xeon Platinum 8362 - Intel M50CYP2SB2U - Intel Ice Lake IEH Ubuntu 22.10 - 6.0.0-060000rc3daily20220904-generic - GNOME Shell
AMD EPYC 9554/9654 Genoa Linux Review Benchmarks 20 Systems - 199 Benchmark Results	2 x AMD EPYC 9554 64-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 22.10 - 6.0.0-060000rc3daily20220904-generic - GNOME Shell
Linux LTS Performance Threadripper 3 Systems - 124 Benchmark Results	AMD Ryzen Threadripper 3990X 64-Core - Gigabyte TRX40 AORUS PRO WIFI - AMD Starship Ubuntu 22.10 - 6.1.0-rc8-phx - GNOME Shell 43.0
amd epyc extra 6 Systems - 53 Benchmark Results	AMD EPYC 7763 64-Core - AMD DAYTONA_X - AMD Starship Ubuntu 22.04 - 5.19.0-051900rc4daily20220628-generic - GNOME Shell 42.2
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks 5 Systems - 87 Benchmark Results	ARMv8 Neoverse-V1 - Amazon EC2 c7g.16xlarge - Amazon Device 0200 Ubuntu 22.04 - 5.19.0-1025-aws - GCC 11.3.0
Tau T2A 16 vCPUs 2 Systems - 136 Benchmark Results	ARMv8 Neoverse-N1 - Amazon EC2 m6g.8xlarge - Amazon Device 0200 Ubuntu 22.04 - 5.15.0-1009-aws - GCC 12.0.1 20220319
Tau T2A 16 vCPUs 3 Systems - 136 Benchmark Results	ARMv8 Neoverse-N1 - KVM Google Compute Engine - 128GB Ubuntu 22.04 - 5.15.0-1016-gcp - GCC 12.0.1 20220319
