Altra Friday

Ampere Altra ARMv8 Neoverse-N1 testing with a WIWYNN Mt.Jade (1.1.20201019 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101222-HA-ALTRAFRID62
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

C++ Boost Tests 2 Tests
Finance 2 Tests
HPC - High Performance Computing 3 Tests
MPI Benchmarks 2 Tests
OpenMPI Tests 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Run 1
January 22 2021
  1 Hour, 12 Minutes
2
January 22 2021
  27 Minutes
3
January 22 2021
  24 Minutes
Invert Hiding All Results Option
  41 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Altra FridayProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionRun 123Ampere Altra ARMv8 Neoverse-N1 @ 3.30GHz (160 Cores)WIWYNN Mt.Jade (1.1.20201019 BIOS)Ampere Computing LLC Device e100502GB3841GB Micron_9300_MTFDHAL3T8TDP + 960GB SAMSUNG MZ1LB960HAJQ-00007ASPEEDVE228Mellanox MT28908 + Intel I210Ubuntu 20.045.4.0-62-generic (aarch64)GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: cppc_cpufreq ondemandPython Details- Run 1: Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Run 123Result OverviewPhoronix Test Suite100%101%103%104%106%QuantLibQMCPACKNAS Parallel BenchmarksEtcpak

Altra Fridayquantlib: etcpak: DXT1etcpak: ETC1etcpak: ETC2etcpak: ETC1 + Ditheringnpb: EP.Cnpb: EP.Dnpb: LU.Cqmcpack: simple-H2Ofinancebench: Repo OpenMPfinancebench: Bonds OpenMPonnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPURun 1232077.3856.36187.25758.97986.9586667.917224.4554670.8132.23358490.70138995706.414063615423113018872193.6860.75086.82858.99486.6206742.187258.9454836.8032.02864538.73437595160.6510422127.5852.54786.62758.89187.0656658.637254.8854897.4932.019OpenBenchmarking.org

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Run 1235001000150020002500SE +/- 28.12, N = 15SE +/- 5.05, N = 3SE +/- 20.63, N = 152077.32193.62127.51. (CXX) g++ options: -O3 -march=native -rdynamic
OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Run 123400800120016002000Min: 1931.2 / Avg: 2077.27 / Max: 2192.3Min: 2184.8 / Avg: 2193.6 / Max: 2202.3Min: 2009.5 / Avg: 2127.52 / Max: 2207.51. (CXX) g++ options: -O3 -march=native -rdynamic

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1Run 1232004006008001000SE +/- 2.40, N = 3SE +/- 2.11, N = 3SE +/- 3.05, N = 3856.36860.75852.551. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1Run 123150300450600750Min: 853.58 / Avg: 856.36 / Max: 861.14Min: 857.33 / Avg: 860.75 / Max: 864.58Min: 849.38 / Avg: 852.55 / Max: 858.651. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Run 12320406080100SE +/- 0.18, N = 3SE +/- 0.44, N = 3SE +/- 0.49, N = 387.2686.8386.631. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Run 12320406080100Min: 86.9 / Avg: 87.26 / Max: 87.44Min: 85.96 / Avg: 86.83 / Max: 87.3Min: 85.84 / Avg: 86.63 / Max: 87.511. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Run 1231326395265SE +/- 0.22, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 358.9858.9958.891. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Run 1231224364860Min: 58.56 / Avg: 58.98 / Max: 59.29Min: 58.98 / Avg: 58.99 / Max: 59.01Min: 58.49 / Avg: 58.89 / Max: 59.231. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringRun 12320406080100SE +/- 0.32, N = 3SE +/- 0.35, N = 3SE +/- 0.33, N = 386.9686.6287.071. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread
OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + DitheringRun 12320406080100Min: 86.63 / Avg: 86.96 / Max: 87.6Min: 86.15 / Avg: 86.62 / Max: 87.3Min: 86.41 / Avg: 87.07 / Max: 87.441. (CXX) g++ options: -O3 -mcpu=native -std=c++11 -lpthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CRun 12314002800420056007000SE +/- 14.11, N = 3SE +/- 36.31, N = 3SE +/- 26.87, N = 36667.916742.186658.631. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CRun 12312002400360048006000Min: 6640.67 / Avg: 6667.91 / Max: 6687.91Min: 6682.54 / Avg: 6742.18 / Max: 6807.88Min: 6622.05 / Avg: 6658.63 / Max: 6711.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DRun 12316003200480064008000SE +/- 42.46, N = 3SE +/- 19.94, N = 3SE +/- 23.85, N = 37224.457258.947254.881. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DRun 12313002600390052006500Min: 7146.34 / Avg: 7224.45 / Max: 7292.38Min: 7219.54 / Avg: 7258.94 / Max: 7283.92Min: 7223.14 / Avg: 7254.88 / Max: 7301.581. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CRun 12312K24K36K48K60KSE +/- 173.59, N = 3SE +/- 87.58, N = 3SE +/- 22.08, N = 354670.8154836.8054897.491. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CRun 12310K20K30K40K50KMin: 54462.61 / Avg: 54670.81 / Max: 55015.51Min: 54664.95 / Avg: 54836.8 / Max: 54952.08Min: 54869.07 / Avg: 54897.49 / Max: 54940.971. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2ORun 123714212835SE +/- 0.15, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 332.2332.0332.021. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -mcpu=native -O3 -fomit-frame-pointer -ffast-math -pthread
OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2ORun 123714212835Min: 31.99 / Avg: 32.23 / Max: 32.52Min: 32 / Avg: 32.03 / Max: 32.07Min: 31.97 / Avg: 32.02 / Max: 32.071. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -mcpu=native -O3 -fomit-frame-pointer -ffast-math -pthread

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPRun 1214K28K42K56K70KSE +/- 465.57, N = 9SE +/- 252.72, N = 358490.7064538.731. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPRun 1211K22K33K44K55KMin: 56738.96 / Avg: 58490.7 / Max: 60708.24Min: 64275.38 / Avg: 64538.73 / Max: 65044.021. (CXX) g++ options: -O3 -march=native -fopenmp

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPRun 1220K40K60K80K100KSE +/- 1284.50, N = 3SE +/- 293.30, N = 395706.4195160.651. (CXX) g++ options: -O3 -march=native -fopenmp
OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPRun 1217K34K51K68K85KMin: 94239.86 / Avg: 95706.41 / Max: 98266.37Min: 94592.05 / Avg: 95160.65 / Max: 95569.831. (CXX) g++ options: -O3 -march=native -fopenmp

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPURun 11428425670SE +/- 0.73, N = 3611. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPURun 11224364860SE +/- 0.44, N = 3541. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPURun 1612182430SE +/- 0.17, N = 3231. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPURun 12004006008001000SE +/- 4.34, N = 311301. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPURun 1400800120016002000SE +/- 6.58, N = 318871. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt