Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark himeno.

Test Created

6 December 2010

Last Updated

24 October 2019

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Second

Average Run Time

3 Minutes, 5 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

250k+ Downloads + 10k+ Public Benchmark Results

Supported Platforms


Public Result Uploads *Reported Test Completions **Reported Installs **Test Profile Page Views ***OpenBenchmarking.orgEventsHimeno Benchmark Popularity Statisticspts/himeno2010.122011.052011.102012.032012.082013.012013.062013.112014.042014.092015.022015.072015.122016.052016.102017.032017.082018.012018.062018.112019.042019.092020.022020.072020.122021.052021.102022.032022.082023.012023.062023.1120K40K60K80K100K
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 29 November 2023.

Revision History

pts/himeno-1.3.0   [View Source]   Thu, 24 Oct 2019 14:29:04 GMT
Update himeno based on https://www.phoronix.com/forums/forum/phoronix/phoronix-test-suite/1134692-himeno-benchmark / https://blogs.fau.de/hager/archives/7850

pts/himeno-1.2.0   [View Source]   Wed, 20 Jan 2016 19:22:28 GMT
Use AVX2 by default if available.

pts/himeno-1.1.0   [View Source]   Wed, 07 Dec 2011 09:49:26 GMT
Use -O3 cc flag by default and add CFLAGS var.

pts/himeno-1.0.0   [View Source]   Mon, 06 Dec 2010 14:52:26 GMT
Initial import into OpenBenchmarking.org

Suites Using This Test

C/C++ Compiler Tests

Common Workstation Benchmarks

HPC - High Performance Computing

CPU Massive

Server CPU Tests

Scientific Computing

Bioinformatics


Performance Metrics

Analyze Test Configuration:

Himeno Benchmark 3.0

Poisson Pressure Solver

OpenBenchmarking.org metrics for this test profile configuration based on 2,906 public results since 24 October 2019 with the latest data as of 3 December 2023.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
MFLOPS (Average)
100th
9
10254 +/- 179
100th
3
10242 +/- 456
100th
7
9144 +/- 1063
100th
3
8447 +/- 21
99th
6
7783 +/- 355
99th
7
7307 +/- 177
98th
24
6270 +/- 585
97th
6
5491 +/- 125
95th
52
5295 +/- 305
93rd
61
5199 +/- 171
91st
20
5132 +/- 411
90th
9
5099 +/- 232
88th
41
5021 +/- 176
88th
48
5009 +/- 158
87th
10
4993 +/- 204
87th
26
4989 +/- 147
87th
10
4963 +/- 210
86th
34
4944 +/- 514
86th
9
4916 +/- 121
86th
3
4913 +/- 214
85th
27
4862 +/- 155
82nd
5
4747 +/- 190
82nd
5
4720 +/- 230
79th
3
4535 +/- 461
77th
4
4446 +/- 149
77th
13
4421 +/- 281
77th
10
4404 +/- 194
76th
6
4368 +/- 70
Mid-Tier
75th
< 4367
75th
13
4361 +/- 89
75th
7
4336 +/- 298
74th
45
4299 +/- 559
74th
6
4294 +/- 178
73rd
12
4271 +/- 141
73rd
16
4237 +/- 517
72nd
7
4221 +/- 85
71st
13
4189 +/- 181
71st
4
4184 +/- 68
71st
6
4177 +/- 179
71st
15
4173 +/- 60
71st
4
4166 +/- 68
70th
14
4155 +/- 289
70th
12
4147 +/- 89
70th
4
4141 +/- 163
69th
53
4118 +/- 512
68th
4
4070 +/- 210
68th
4
4048 +/- 148
68th
14
4039 +/- 61
66th
3
3968 +/- 46
65th
3
3941 +/- 240
64th
14
3933 +/- 100
64th
6
3918 +/- 44
63rd
9
3901 +/- 66
61st
9
3879 +/- 73
60th
27
3858 +/- 78
60th
16
3847 +/- 130
59th
9
3828 +/- 68
58th
21
3813 +/- 93
57th
9
3799 +/- 170
57th
6
3780 +/- 237
57th
25
3774 +/- 124
55th
10
3742 +/- 247
55th
10
3740 +/- 99
55th
3
3726 +/- 73
55th
10
3725 +/- 21
55th
12
3707 +/- 165
55th
8
3707 +/- 177
54th
3
3697 +/- 62
53rd
6
3677 +/- 155
53rd
4
3677 +/- 19
51st
9
3631 +/- 367
51st
4
3630 +/- 36
51st
6
3628 +/- 56
51st
10
3627 +/- 130
51st
3
3624 +/- 57
51st
4
3623 +/- 484
51st
18
3616 +/- 286
Median
50th
3611
50th
5
3608 +/- 54
50th
7
3588 +/- 255
49th
10
3563 +/- 246
49th
30
3546 +/- 459
49th
7
3544 +/- 306
49th
5
3533 +/- 51
48th
8
3519 +/- 206
46th
10
3490 +/- 22
46th
4
3488 +/- 14
46th
6
3465 +/- 305
46th
24
3453 +/- 41
46th
4
3452 +/- 109
45th
3
3448 +/- 110
45th
18
3429 +/- 304
43rd
6
3403 +/- 41
42nd
3
3385 +/- 24
42nd
12
3377 +/- 50
42nd
13
3376 +/- 50
42nd
3
3375 +/- 8
41st
32
3357 +/- 304
41st
6
3343 +/- 15
41st
3
3341 +/- 298
41st
4
3333 +/- 93
40th
5
3328 +/- 84
40th
3
3319 +/- 61
39th
5
3285 +/- 128
38th
5
3233 +/- 350
37th
11
3199 +/- 165
37th
3
3194 +/- 162
36th
4
3186 +/- 101
36th
13
3185 +/- 5
36th
3
3179 +/- 96
36th
4
3179 +/- 28
36th
13
3167 +/- 69
35th
6
3148 +/- 351
35th
14
3144 +/- 94
34th
10
3081 +/- 63
33rd
3
3067 +/- 105
33rd
3
3047 +/- 84
32nd
4
3007 +/- 9
32nd
5
2991 +/- 51
30th
3
2881 +/- 45
30th
15
2866 +/- 42
28th
3
2800 +/- 6
28th
16
2798 +/- 45
27th
4
2749 +/- 30
26th
4
2682 +/- 99
Low-Tier
25th
< 2668
25th
6
2642 +/- 11
25th
7
2627 +/- 104
25th
5
2627 +/- 198
24th
3
2612 +/- 8
24th
6
2578 +/- 285
24th
3
2572 +/- 147
23rd
5
2501 +/- 342
23rd
4
2454 +/- 87
22nd
6
2421 +/- 138
22nd
7
2384 +/- 92
21st
4
2255 +/- 170
21st
3
2246 +/- 1
19th
4
2082 +/- 110
19th
11
2067 +/- 102
17th
3
1868 +/- 77
17th
5
1825 +/- 89
17th
4
1802 +/- 4
17th
5
1792 +/- 84
16th
7
1702 +/- 53
16th
3
1695 +/- 11
16th
3
1674 +/- 38
15th
3
1640 +/- 22
15th
9
1633 +/- 23
13th
3
1449 +/- 157
13th
3
1420 +/- 85
12th
3
1376 +/- 52
11th
3
1178 +/- 105
11th
3
1163 +/- 2
10th
5
1011 +/- 48
7th
12
651 +/- 29
6th
6
500 +/- 37
5th
11
367 +/- 14
4th
3
305 +/- 1
OpenBenchmarking.orgDistribution Of Public Results - Poisson Pressure Solver2904 Results Range From 11 To 10709 MFLOPS112254396538671081129515091723193721512365257927933007322134353649386340774291450547194933514753615575578960036217643166456859707372877501771579298143835785718785899992139427964198551006910283104971071150100150200250

Based on OpenBenchmarking.org data, the selected test / test configuration (Himeno Benchmark 3.0 - Poisson Pressure Solver) has an average run-time of 5 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkPoisson Pressure SolverRun-Time510152025Min: 1 / Avg: 4.78 / Max: 18

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1.1%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsPoisson Pressure SolverDeviation3691215Min: 0 / Avg: 1.15 / Max: 8

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

IntelAMDOpenBenchmarking.orgRelative Core Scaling To BaseHimeno Benchmark CPU Core ScalingPoisson Pressure Solver24681012161820243248640.73961.47922.21882.95843.698

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set
Support
Instructions Detected
Used by default on supported hardware.
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

 
VBROADCASTSS VZEROUPPER VEXTRACTF128 VINSERTF128
FMA (FMA)
Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Bulldozer (2011).

 
VFMADD231SS VFMADD132SS VFMSUB132SS
Advanced Vector Extensions 512 (AVX512)
Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).
 
(ZMM REGISTER USE)
The test / benchmark does honor compiler flag changes.
Last automated analysis: 17 January 2022

This test profile binary relies on the shared libraries libc.so.6.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
SPARC64
sparc64
(Many Processors)
IBM Z
s390x
(Many Processors)
RISC-V 64-bit
riscv64
SiFive RISC-V, rv64imafdcvsu
IBM POWER (PowerPC) 64-bit
ppc64le
POWER9 4-Core, POWER9 44-Core
MIPS 64-bit
mips64
Loongson-3A R4
Loongson LoongArch 64-bit
loongarch64
Loongson-3A5000, Loongson-3A5000-HV-7A2000-1w-EVB-V1.0, Loongson-3A5000LL, Loongson-3A6000, Loongson-3A6000-HV
Intel / AMD x86 32-bit
i686
(Many Processors)
ARMv7 32-bit
armv7l
ARMv7 4-Core, ARMv7 Cortex-A15 8-Core, ARMv7 Cortex-A7 4-Core, ARMv7 Cortex-A72 4-Core, ARMv7 rev 1 4-Core, ARMv7 rev 2, ARMv7 rev 3 4-Core, ARMv7 rev 4 4-Core, ARMv7 rev 5 4-Core, Pi 4 2GB 32-bit
ARMv6 32-bit
armv6l
ARMv6-compatible rev 7, ARMv7
DEC Alpha
alpha
Alpha
ARMv8 64-bit
aarch64
AArch64 rev 1, AArch64 rev 4, ARMv8, ARMv8 Cortex-A53 4-Core, ARMv8 Cortex-A55 4-Core, ARMv8 Cortex-A57 6-Core, ARMv8 Cortex-A57 8-Core, ARMv8 Cortex-A72, ARMv8 Cortex-A72 4-Core, ARMv8 Cortex-A72 6-Core, ARMv8 Cortex-A73 2-Core, ARMv8 Cortex-A73 6-Core, ARMv8 Cortex-A73 8-Core, ARMv8 Cortex-A76 4-Core, ARMv8 Cortex-A78E 6-Core, ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 4-Core, ARMv8 Neoverse-N1 64-Core, ARMv8 Neoverse-V1, ARMv8 Neoverse-V1 4-Core, ARMv8 rev 0 6-Core, Ampere ARMv8 Neoverse-N1 160-Core, Ampere Altra ARMv8 Neoverse-N1 160-Core, Apple, Apple M1, Apple M2, FT2000AHK, HUAWEI Kunpeng 920, Pi 4 2GB 64-bit, Rockchip ARMv8 Cortex-A53 4-Core, Rockchip ARMv8 Cortex-A55 4-Core, Rockchip ARMv8 Cortex-A72 6-Core, Rockchip ARMv8 Cortex-A76 4-Core, Rockchip ARMv8 Cortex-A76 6-Core, phytium FT1500a

Recent Test Results

OpenBenchmarking.org Results Compare

4 Systems - 55 Benchmark Results

Intel Core i7-8665U - Dell 0PD9KD - Intel Cannon Point-LP

Debian - 6.4.0-4-amd64 - KDE Plasma 5.27.7

6 Systems - 97 Benchmark Results

AMD Ryzen 7 2700 Eight-Core - ASRock X370 Taichi - AMD 17h

Debian - 6.4.0-4-amd64 - KDE Plasma 5.27.7

6 Systems - 86 Benchmark Results

AMD Ryzen 7 2700 Eight-Core - ASRock X370 Taichi - AMD 17h

Debian - 6.4.0-4-amd64 - KDE Plasma 5.27.7

4 Systems - 44 Benchmark Results

Intel Core i7-8665U - Dell 0PD9KD - Intel Cannon Point-LP

Debian - 6.4.0-4-amd64 - KDE Plasma 5.27.7

1 System - 566 Benchmark Results

ARMv8 Cortex-A72 - BCM2835 Raspberry Pi 4 Model B Rev 1.5 - Broadcom BCM2711

Arch Linux ARM - 6.1.58-2-rpi-ARCH - GCC 12.1.0 + Clang 16.0.6

1 System - 6 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

8 Systems - 22 Benchmark Results

AMD Ryzen 9 3950X 16-Core - ASRockRack X570D4U - AMD Starship

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

8 Systems - 22 Benchmark Results

Intel Core i9-10900K - ASUS PRO Q470M-C - Intel Comet Lake PCH

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

1 System - 4 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

1 System - 7 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

1 System - 6 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

1 System - 6 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

2 Systems - 9 Benchmark Results

Intel Xeon E3-1245 v5 - Colorful And Development C.B250M-D - Intel Xeon E3-1200 v5

Ubuntu 22.04 - 6.2.0-36-generic - GNOME Shell 42.5

1 System - 9 Benchmark Results

Loongson-3A6000-HV - O.E.M - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

13 Systems - 102 Benchmark Results

AMD Ryzen 7 4700U - HP 876E v12.40 - 2 x 8 GB DDR4-3200MT

Debian GNU - 5.10.2 - Sway

Most Popular Test Results

OpenBenchmarking.org Results Compare

3 Systems - 268 Benchmark Results

Intel Core i5-2520M - HP 161C - Intel 2nd Generation Core DRAM

Ubuntu 18.04 - 4.18.0-20-generic - GNOME Shell 3.28.3

16 Systems - 119 Benchmark Results

2 x Intel Xeon Platinum 8259L - ASRockRack EP2C621D16-4LP - Intel Sky Lake-E DMI3 Registers

Ubuntu 19.10 - 5.3.0-64-generic - GNOME Shell 3.34.1

2 Systems - 535 Benchmark Results

AMD Ryzen 5 4500U - LENOVO LNVNB161216 - AMD Renoir Root Complex

Ubuntu 20.04 - 5.9.0-050900rc7daily20201002-generic - GNOME Shell 3.36.4

4 Systems - 15 Benchmark Results

Intel Core i3-7020U - HP 84CA - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.8.0-44-generic - GNOME

12 Systems - 593 Benchmark Results

Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH

Ubuntu 20.04 - 5.8.0-050800daily20200622-generic - GNOME Shell 3.36.2

7 Systems - 62 Benchmark Results

Intel Core i9-7960X - MSI X299 SLI PLUS - 4 x 4096 MB 3000MHz

Microsoft Windows 10 Pro Build 18362 - 10.0 - 26.20.12028.2

6 Systems - 62 Benchmark Results

AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4

4 Systems - 131 Benchmark Results

Intel Core i7-10700T - Insyde CometLake TBD by OEM - Intel

FreeBSD - 12.2-RELEASE - Clang 10.0.1

3 Systems - 173 Benchmark Results

AMD Ryzen Threadripper 3970X 32-Core - 52GB - 2 x 275GB Virtual Disk

Ubuntu 20.04 - 4.19.104-microsoft-standard - X Server

8 Systems - 439 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

3 Systems - 30 Benchmark Results

Intel Core i9-9900KS - ASUS PRIME Z390-A - Intel Cannon Lake PCH

Ubuntu 19.10 - 5.3.0-24-generic - GNOME Shell 3.34.1

3 Systems - 143 Benchmark Results

AMD EPYC 7742 64-Core - AMD DAYTONA_X - AMD Starship

Ubuntu 20.04 - 5.4.0-31-generic - GNOME Shell 3.36.1

Find More Test Results