2022-08-10-1558

KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2208128-DUST-220811054&sro.

2022-08-10-1558ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelVulkanCompilerFile-SystemScreen ResolutionSystem LayerBlockVolume - Intel Xeon Platinum 8358amdARMv8 Neoverse-N1 - BlockVolumeIntel Xeon Platinum 8358 (2 Cores / 4 Threads)QEMU Standard PC (i440FX + PIIX 1996) (1.5.1 BIOS)Intel 440FX 82441FX PMC32GB50GB BlockVolumebochs-drmdrmfbQEMU MonitorRed Hat Virtio deviceUbuntu 22.045.15.0-1016-oracle (x86_64)1.2.204GCC 11.2.0ext41024x768KVMAMD EPYC 7J13 64-Core (2 Cores / 4 Threads)ARMv8 Neoverse-N1 (2 Cores)QEMU KVM Virtual Machine (1.5.1 BIOS)Red Hat QEMU PCIe12GB21475GB BlockVolumevirtio_gpudrmfb5.15.0-1016-oracle (aarch64)OpenBenchmarking.orgKernel Details- libiscsi.debug_libiscsi_eh=1 - Transparent Huge Pages: madviseCompiler Details- BlockVolume - Intel Xeon Platinum 8358: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - amd: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - ARMv8 Neoverse-N1 - BlockVolume: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Disk Details- MQ-DEADLINE / discard,errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- BlockVolume - Intel Xeon Platinum 8358: CPU Microcode: 0x1- amd: CPU Microcode: 0x1000065Security Details- BlockVolume - Intel Xeon Platinum 8358: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - amd: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - ARMv8 Neoverse-N1 - BlockVolume: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

2022-08-10-1558dbench: 1dbench: 6dbench: 12dbench: 48dbench: 128dbench: 256ramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Copy - Floating Pointramspeed: Scale - Floating Pointramspeed: Triad - Floating Pointramspeed: Average - Floating Pointstream: Copystream: Scalestream: Triadstream: Addhpcg: npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Cffte: N=256, 3D Complex FFT Routineblake2: cachebench: Readcachebench: Writecachebench: Read / Modify / Writescimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationx264: Bosphorus 4Kx264: Bosphorus 1080phimeno: Poisson Pressure Solvercompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingstockfish: Total Timeebizzy: c-ray: Total Time - 4K, 16 Rays Per Pixelsmallpt: Global Illumination Renderer; 128 Samplescompress-gzip: Linux Source Tree Archiving To .tar.gzencode-flac: WAV To FLACencode-mp3: WAV To MP3hint: FLOATBlockVolume - Intel Xeon Platinum 8358amdARMv8 Neoverse-N1 - BlockVolume321.756391.670423.661491.926461.125403.59028687.3025662.8922264.0028856.0726393.4226503.8525719.2623124.9428699.7426017.0550419.631210.032428.632428.92.984603999.802970.64272.64274.465822.6810883.719700.493059.343151.5411987.4851713894.822780.13086122829.59968636071.252619508.61111.88258.35572.11687.23913.505.9025.063490.9691711685210649476302495054398.26953.40144.89019.7249.744391167612.65606294.806377.967412.398419.632414.234366.53936536.1635819.3135913.8635537.6336120.7136593.4336058.6836530.2336290.7036290.0744701.732760.634717.134739.54.179564811.203003.26173.33173.327492.0911476.7611909.683889.223703.1913802.6574601304.712457.76753022469.56429344805.483821589.67125.64341.00554.421010.54916.746.5227.803498.8969531765913701534654244594306.03950.11547.06918.4918.658347937957.73165183.665399.485518.781669.669666.546644.58541424.5344591.9326630.1832154.0836271.0641324.0844605.2843775.4824151.7638511.5445914.844915.742424.942194.93.25129487.75453.1681.4880.95897.09787.671049.20170.95169.238706.3411424.13252318929.47355130160.546376462.78116.42134.48473.45435.211154.353.0813.193221.2794171095089402595956143273417.14370.74346.58755.5979.279325579648.30842OpenBenchmarking.org

Dbench

Client Count: 1

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 1ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd70140210280350SE +/- 0.78, N = 3SE +/- 1.13, N = 3SE +/- 2.00, N = 3183.67321.76294.811. (CC) gcc options: -lpopt -O2

Dbench

Client Count: 6

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 6ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd90180270360450SE +/- 0.30, N = 3SE +/- 1.47, N = 3SE +/- 1.01, N = 3399.49391.67377.971. (CC) gcc options: -lpopt -O2

Dbench

Client Count: 12

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 12ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd110220330440550SE +/- 1.61, N = 3SE +/- 0.36, N = 3SE +/- 0.99, N = 3518.78423.66412.401. (CC) gcc options: -lpopt -O2

Dbench

Client Count: 48

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 48ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd140280420560700SE +/- 2.46, N = 3SE +/- 1.03, N = 3SE +/- 2.86, N = 3669.67491.93419.631. (CC) gcc options: -lpopt -O2

Dbench

Client Count: 128

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 128ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd140280420560700SE +/- 2.71, N = 3SE +/- 1.05, N = 3SE +/- 0.81, N = 3666.55461.13414.231. (CC) gcc options: -lpopt -O2

Dbench

Client Count: 256

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 256ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd140280420560700SE +/- 0.70, N = 3SE +/- 1.12, N = 3SE +/- 0.26, N = 3644.59403.59366.541. (CC) gcc options: -lpopt -O2

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: IntegerARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd9K18K27K36K45KSE +/- 22.49, N = 3SE +/- 8.50, N = 3SE +/- 126.66, N = 341424.5328687.3036536.161. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: IntegerARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd10K20K30K40K50KSE +/- 38.17, N = 3SE +/- 25.39, N = 3SE +/- 161.57, N = 344591.9325662.8935819.311. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: IntegerARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8K16K24K32K40KSE +/- 13.98, N = 3SE +/- 7.84, N = 3SE +/- 277.48, N = 326630.1822264.0035913.861. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: IntegerARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8K16K24K32K40KSE +/- 264.03, N = 3SE +/- 17.25, N = 3SE +/- 448.55, N = 332154.0828856.0735537.631. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: IntegerARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8K16K24K32K40KSE +/- 7.54, N = 3SE +/- 2.05, N = 3SE +/- 5.26, N = 336271.0626393.4236120.711. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating PointARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd9K18K27K36K45KSE +/- 152.40, N = 3SE +/- 15.85, N = 3SE +/- 101.96, N = 341324.0826503.8536593.431. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating PointARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd10K20K30K40K50KSE +/- 39.65, N = 3SE +/- 6.32, N = 3SE +/- 82.07, N = 344605.2825719.2636058.681. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating PointARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd9K18K27K36K45KSE +/- 4.27, N = 3SE +/- 12.31, N = 3SE +/- 97.39, N = 343775.4823124.9436530.231. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating PointARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8K16K24K32K40KSE +/- 187.57, N = 3SE +/- 5.10, N = 3SE +/- 262.44, N = 324151.7628699.7436290.701. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating PointARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8K16K24K32K40KSE +/- 13.16, N = 3SE +/- 4.34, N = 3SE +/- 56.45, N = 338511.5426017.0536290.071. (CC) gcc options: -O3 -march=native

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: CopyARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd11K22K33K44K55KSE +/- 19.39, N = 5SE +/- 317.51, N = 5SE +/- 22.24, N = 545914.850419.644701.71. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: ScaleARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd10K20K30K40K50KSE +/- 22.92, N = 5SE +/- 10.50, N = 5SE +/- 15.93, N = 544915.731210.032760.61. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: TriadARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd9K18K27K36K45KSE +/- 13.79, N = 5SE +/- 5.39, N = 5SE +/- 12.78, N = 542424.932428.634717.11. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: AddARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd9K18K27K36K45KSE +/- 23.55, N = 5SE +/- 7.65, N = 5SE +/- 15.38, N = 542194.932428.934739.51. (CC) gcc options: -O3 -march=native -fopenmp

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd0.94041.88082.82123.76164.702SE +/- 0.00484, N = 3SE +/- 0.02282, N = 3SE +/- 0.00667, N = 33.251292.984604.179561. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd10002000300040005000SE +/- 0.13, N = 3SE +/- 3.81, N = 3SE +/- 7.18, N = 3487.753999.804811.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd6001200180024003000SE +/- 0.73, N = 3SE +/- 10.92, N = 3SE +/- 5.89, N = 3453.162970.643003.261. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd60120180240300SE +/- 0.24, N = 3SE +/- 2.40, N = 15SE +/- 0.08, N = 381.48272.64173.331. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd60120180240300SE +/- 0.11, N = 3SE +/- 2.52, N = 9SE +/- 0.06, N = 380.95274.46173.321. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd16003200480064008000SE +/- 6.62, N = 3SE +/- 4.97, N = 3SE +/- 1.43, N = 3897.095822.687492.091. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd2K4K6K8K10KSE +/- 6.67, N = 3SE +/- 3.67, N = 3SE +/- 15.03, N = 3787.6710883.7111476.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd3K6K9K12K15KSE +/- 0.53, N = 3SE +/- 12.95, N = 3SE +/- 22.11, N = 31049.209700.4911909.681. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8001600240032004000SE +/- 0.28, N = 3SE +/- 2.19, N = 3SE +/- 6.03, N = 3170.953059.343889.221. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8001600240032004000SE +/- 0.14, N = 3SE +/- 1.15, N = 3SE +/- 1.85, N = 3169.233151.543703.191. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

FFTE

N=256, 3D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd3K6K9K12K15KSE +/- 9.64, N = 3SE +/- 20.08, N = 3SE +/- 53.94, N = 38706.3411987.4913802.661. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

BLAKE2

OpenBenchmarking.orgCycles Per Byte, Fewer Is BetterBLAKE2 20170307BlockVolume - Intel Xeon Platinum 8358amd1.08452.1693.25354.3385.4225SE +/- 0.00, N = 3SE +/- 0.00, N = 34.824.711. (CC) gcc options: -O3 -march=native -lcrypto -lz

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd2K4K6K8K10KSE +/- 0.25, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 311424.132780.132457.77MIN: 11421.78 / MAX: 11425.65MIN: 2778.63 / MAX: 2780.85MIN: 2456.73 / MAX: 2458.61. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WriteARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd5K10K15K20K25KSE +/- 7.37, N = 3SE +/- 1.36, N = 3SE +/- 16.31, N = 318929.4722829.6022469.56MIN: 17722.07 / MAX: 21000.42MIN: 19871.28 / MAX: 24617.69MIN: 19977.87 / MAX: 23209.541. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd10K20K30K40K50KSE +/- 10.94, N = 3SE +/- 10.75, N = 3SE +/- 32.09, N = 330160.5536071.2544805.48MIN: 29601.63 / MAX: 30381.52MIN: 33212.97 / MAX: 45676.07MIN: 39133.3 / MAX: 46543.391. (CC) gcc options: -lrt

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd130260390520650SE +/- 2.03, N = 3SE +/- 0.13, N = 3SE +/- 0.77, N = 3462.78508.61589.671. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd306090120150SE +/- 0.43, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3116.42111.88125.641. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd70140210280350SE +/- 0.30, N = 3SE +/- 0.42, N = 3SE +/- 6.34, N = 3134.48258.35341.001. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd120240360480600SE +/- 0.46, N = 3SE +/- 0.18, N = 3SE +/- 2.48, N = 3473.45572.11554.421. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd2004006008001000SE +/- 0.29, N = 3SE +/- 0.22, N = 3SE +/- 0.79, N = 3435.21687.231010.541. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd2004006008001000SE +/- 9.93, N = 3SE +/- 0.29, N = 3SE +/- 0.21, N = 31154.35913.50916.741. (CC) gcc options: -lm

x264

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2022-02-22Video Input: Bosphorus 4KARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd246810SE +/- 0.03, N = 9SE +/- 0.00, N = 3SE +/- 0.01, N = 33.085.906.52-m64-m641. (CC) gcc options: -ldl -lm -lpthread -O3 -flto

x264

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2022-02-22Video Input: Bosphorus 1080pARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd714212835SE +/- 0.14, N = 5SE +/- 0.04, N = 3SE +/- 0.07, N = 313.1925.0627.80-m64-m641. (CC) gcc options: -ldl -lm -lpthread -O3 -flto

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd8001600240032004000SE +/- 13.75, N = 3SE +/- 12.53, N = 3SE +/- 48.22, N = 153221.283490.973498.90-mavx2-mavx21. (CC) gcc options: -O3

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.00Test: Compression RatingARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd4K8K12K16K20KSE +/- 18.32, N = 3SE +/- 107.60, N = 3SE +/- 49.67, N = 31095016852176591. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.00Test: Decompression RatingARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd3K6K9K12K15KSE +/- 5.21, N = 3SE +/- 27.39, N = 3SE +/- 13.05, N = 3894010649137011. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimeARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd1.1M2.2M3.3M4.4M5.5MSE +/- 36660.72, N = 3SE +/- 58003.29, N = 4SE +/- 63510.80, N = 3259595647630245346542-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd30K60K90K120K150KSE +/- 2457.15, N = 15SE +/- 1857.21, N = 15SE +/- 18.05, N = 314327395054445941. (CC) gcc options: -pthread -lpthread -O3 -march=native

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd90180270360450SE +/- 2.38, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3417.14398.27306.041. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd1632486480SE +/- 0.60, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 370.7453.4050.121. (CXX) g++ options: -fopenmp -O3

Gzip Compression

Linux Source Tree Archiving To .tar.gz

OpenBenchmarking.orgSeconds, Fewer Is BetterGzip CompressionLinux Source Tree Archiving To .tar.gzARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd1122334455SE +/- 1.09, N = 15SE +/- 0.39, N = 8SE +/- 0.32, N = 1446.5944.8947.07

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.3WAV To FLACARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd1224364860SE +/- 0.03, N = 5SE +/- 0.02, N = 5SE +/- 0.02, N = 555.6019.7218.49-logg1. (CXX) g++ options: -fvisibility=hidden -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3ARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd3691215SE +/- 0.020, N = 3SE +/- 0.012, N = 3SE +/- 0.012, N = 39.2799.7448.6581. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATARMv8 Neoverse-N1 - BlockVolumeBlockVolume - Intel Xeon Platinum 8358amd80M160M240M320M400MSE +/- 208072.89, N = 3SE +/- 3855213.99, N = 6SE +/- 334657.84, N = 3325579648.31391167612.66347937957.731. (CC) gcc options: -O3 -march=native -lm


Phoronix Test Suite v10.8.4