cpu-benchmarks

ARMv8 Neoverse-N1 testing on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2105082-IB-2105076IB40&sro.

cpu-benchmarksProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionSystem Layerecs.c7.2xlargeC5.2xlargeC6g.2xlargeIntel Xeon Platinum 8369B (4 Cores / 8 Threads)Alibaba Cloud ECS (9a72d74 BIOS)Intel 440FX 82441FX PMC1 x 16384 MB RAM99GBCirrus Logic GD 5446Red Hat Virtio deviceUbuntu 20.045.4.0-73-generic (x86_64)GCC 9.3.0ext41024x768KVMIntel Xeon Platinum 8124M (4 Cores / 8 Threads)Amazon EC2 c5.2xlarge (1.0 BIOS)16GB107GB Amazon Elastic Block StoreAmazon Elastic5.4.0-1047-aws (x86_64)ARMv8 Neoverse-N1 (8 Cores)Amazon EC2 c6g.2xlarge (1.0 BIOS)Amazon Device 02005.4.0-1047-aws (aarch64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- ecs.c7.2xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - C5.2xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - C6g.2xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- ecs.c7.2xlarge: CPU Microcode: 0x1- C5.2xlarge: CPU Microcode: 0x2006906Security Details- ecs.c7.2xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - C5.2xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown - C6g.2xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

cpu-benchmarksrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solvernamd: ATPase Simulation - 327,506 Atomskvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastx264: H.264 Video Encodingx265: Bosphorus 4Kx265: Bosphorus 1080pcompress-7zip: Compress Speed Teststockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthradiance: Serialradiance: SMP Parallelopenssl: RSA 4096-bit Performancectx-clock: Context Switch Timesysbench: CPUecs.c7.2xlargeC5.2xlargeC6g.2xlarge64.10850.0123.546625.9410.7423.7842.1344.131.295.99267321133349611830984655.267203.1071076.456212506.230574.31846.1934.240845.6010.1822.4140.3035.915.9528.6021771963514310133287869.958260.6351033.6103553.86731.8611.723.587.0814.2833.043.4614.352627212781025328.5OpenBenchmarking.org

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDC5.2xlargeC6g.2xlargeecs.c7.2xlarge1632486480SE +/- 0.22, N = 3SE +/- 0.00, N = 3SE +/- 0.21, N = 374.3253.8764.111. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverC5.2xlargeC6g.2xlargeecs.c7.2xlarge1122334455SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 346.1931.8650.011. (CXX) g++ options: -O2 -lOpenCL

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 AtomsC5.2xlargeecs.c7.2xlarge0.95421.90842.86263.81684.771SE +/- 0.00473, N = 3SE +/- 0.00407, N = 34.240843.54662

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastC5.2xlargeC6g.2xlargeecs.c7.2xlarge1.33652.6734.00955.3466.6825SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.601.725.94-std=gnu991. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastC5.2xlargeC6g.2xlargeecs.c7.2xlarge3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.183.5810.74-std=gnu991. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastC5.2xlargeC6g.2xlargeecs.c7.2xlarge612182430SE +/- 0.15, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 322.417.0823.78-std=gnu991. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastC5.2xlargeC6g.2xlargeecs.c7.2xlarge1020304050SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 340.3014.2842.13-std=gnu991. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingC5.2xlargeC6g.2xlargeecs.c7.2xlarge1020304050SE +/- 0.27, N = 11SE +/- 0.30, N = 7SE +/- 0.47, N = 535.9133.0444.13-m64 -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize-m64 -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize1. (CC) gcc options: -ldl -lm -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KC5.2xlargeC6g.2xlargeecs.c7.2xlarge1.33882.67764.01645.35526.694SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.953.461.29-lnuma-lnuma1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pC5.2xlargeC6g.2xlargeecs.c7.2xlarge714212835SE +/- 0.30, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 328.6014.355.99-lnuma-lnuma1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestC5.2xlargeC6g.2xlargeecs.c7.2xlarge6K12K18K24K30KSE +/- 277.58, N = 3SE +/- 56.52, N = 3SE +/- 302.84, N = 32177126272267321. (CXX) g++ options: -pipe -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeC5.2xlargeecs.c7.2xlarge2M4M6M8M10MSE +/- 115435.75, N = 4SE +/- 57310.27, N = 39635143113334961. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthC5.2xlargeC6g.2xlargeecs.c7.2xlarge3M6M9M12M15MSE +/- 108494.97, N = 3SE +/- 87036.75, N = 3SE +/- 51543.52, N = 3101332871278102511830984

Radiance Benchmark

Test: Serial

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SerialC5.2xlargeecs.c7.2xlarge2004006008001000869.96655.27

Radiance Benchmark

Test: SMP Parallel

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SMP ParallelC5.2xlargeecs.c7.2xlarge60120180240300260.64203.11

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceC5.2xlargeC6g.2xlargeecs.c7.2xlarge2004006008001000SE +/- 0.83, N = 3SE +/- 0.03, N = 3SE +/- 0.73, N = 31033.6328.51076.4-m64-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch TimeC5.2xlargeecs.c7.2xlarge2004006008001000SE +/- 1.33, N = 31035562

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPUecs.c7.2xlarge3K6K9K12K15KSE +/- 0.14, N = 312506.231. (CC) gcc options: -std=gnu99 -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm


Phoronix Test Suite v10.8.4