loongson1

ARMv8 rev 4 testing with a Qualcomm MSM8998 MSM 8998 v2.1 MTP and mdssfb_90000 on Debian GNU/Linux 10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1905238-HV-1905236HV93.

loongson1ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionLoongson-3B R2msm8998-sdm835Loongson-3B R2 @ 1.20GHz (8 Cores)loongson genericAMD RS780 + SB7x0/SB8x0/SB9x04096MB500GB Western Digital WD5000AAKS-0AMD OLAND 1GBConexant CX206312243WIntel 82574LFedora 284.14.77gs-tweak-1+ (mips64)MATE 1.20.3X Server 1.19.6modesetting 1.19.64.5 Mesa 18.2.2 (LLVM 6.0.1)Clang 6.0.1ext41920x1080ARMv8 rev 4 @ 2.04GHz (8 Cores)Qualcomm MSM8998 MSM 8998 v2.1 MTP6144MB58GB KLUCG4J1ED-B0C1 + 4GB KLUCG4J1ED-B0C1 + 2GB KLUCG4J1ED-B0C1mdssfb_90000Debian GNU/Linux 104.4.153-ElementalX-OP5-5.02 (armv8l)X Server 1.20.3 + SurfaceFlingerfbdev 0.5.0GCC 8.3.0 + Clang 7.0.1-8 + LLVM 7.0.11080x3840OpenBenchmarking.orgProcessor Details- Loongson-3B R2: Scaling Governor: loongson3 performance- msm8998-sdm835: Scaling Governor: msm performanceCompiler Details- msm8998-sdm835: --build=aarch64-linux-gnu --disable-libphobos --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -v

loongson1polybench-c: Correlation Computationpolybench-c: 3 Matrix Multiplicationspolybench-c: Covariance Computationsample-program: mafft: Multiple Sequence Alignmentscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationstockfish: Total Timempcbench: Multi-Precision Benchmarkhpcg: ffte: N=256, 1D Complex FFT Routinecachebench: Readcachebench: Writecachebench: Read / Modify / Writetscp: AI Chess Performancec-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace TimeLoongson-3B R2msm8998-sdm83583.37179.3984.02250.6363.9421.7211.397.9413.5323.0852.6818097351930.061237.06130229857601452621600137413.2020.0413.6134.0310.10229.2972.5954.36219.70254.52545.28518094627830.754908961215440611296356283OpenBenchmarking.org

PolyBench-C

Test: Correlation Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Correlation ComputationLoongson-3B R2msm8998-sdm83520406080100SE +/- 0.02, N = 3SE +/- 0.20, N = 1283.3713.20-march=loongson3a-march=native1. (CC) gcc options: -O3

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: 3 Matrix MultiplicationsLoongson-3B R2msm8998-sdm8354080120160200SE +/- 0.71, N = 3SE +/- 0.33, N = 3179.3920.04-march=loongson3a-march=native1. (CC) gcc options: -O3

PolyBench-C

Test: Covariance Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 4.2Test: Covariance ComputationLoongson-3B R2msm8998-sdm83520406080100SE +/- 0.74, N = 3SE +/- 0.14, N = 384.0213.61-march=loongson3a-march=native1. (CC) gcc options: -O3

Sample Pi Program

OpenBenchmarking.orgSeconds, Fewer Is BetterSample Pi ProgramLoongson-3B R2msm8998-sdm83550100150200250SE +/- 0.29, N = 3SE +/- 0.01, N = 3250.6334.03

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence AlignmentLoongson-3B R2msm8998-sdm8351428425670SE +/- 0.20, N = 3SE +/- 0.32, N = 1563.9510.10-march=loongson3a1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeLoongson-3B R2msm8998-sdm83550100150200250SE +/- 0.20, N = 3SE +/- 0.17, N = 321.72229.291. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloLoongson-3B R2msm8998-sdm8351632486480SE +/- 0.00, N = 3SE +/- 0.08, N = 311.3972.59-march=loongson3a1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformLoongson-3B R2msm8998-sdm8351224364860SE +/- 0.04, N = 3SE +/- 0.52, N = 37.9454.36-march=loongson3a1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyLoongson-3B R2msm8998-sdm83550100150200250SE +/- 0.20, N = 3SE +/- 0.19, N = 313.53219.701. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationLoongson-3B R2msm8998-sdm83560120180240300SE +/- 0.43, N = 3SE +/- 0.07, N = 323.08254.521. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationLoongson-3B R2msm8998-sdm835120240360480600SE +/- 0.94, N = 3SE +/- 0.27, N = 352.68545.281. (CC) gcc options: -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeLoongson-3B R2msm8998-sdm8351.1M2.2M3.3M4.4M5.5MSE +/- 62451.87, N = 9SE +/- 25154.71, N = 318394635180946-march=loongson3a-mtune=cortex-a731. (CXX) g++ options: -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -flto

GNU MPC

Multi-Precision Benchmark

OpenBenchmarking.orgGlobal Score, More Is BetterGNU MPC 1.1.0Multi-Precision BenchmarkLoongson-3B R2msm8998-sdm8356001200180024003000SE +/- 3.33, N = 31932783-lm-O2 -pedantic -fomit-frame-pointer1. (CC) gcc options:

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0Loongson-3B R2msm8998-sdm8350.16880.33760.50640.67520.844SE +/- 0.00, N = 3SE +/- 0.00, N = 30.060.75

FFTE

Test: N=256, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 6.0Test: N=256, 1D Complex FFT RoutineLoongson-3B R230060090012001500SE +/- 0.67, N = 31237.061. (F9X) gfortran options: -O3 -march=loongson3a -fomit-frame-pointer -fopenmp

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadLoongson-3B R2msm8998-sdm83511002200330044005500SE +/- 0.17, N = 3SE +/- 0.49, N = 3130249081. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WriteLoongson-3B R2msm8998-sdm8352K4K6K8K10KSE +/- 0.81, N = 3SE +/- 1.36, N = 3298596121. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteLoongson-3B R2msm8998-sdm8353K6K9K12K15KSE +/- 0.08, N = 3SE +/- 2.07, N = 3760154401. (CC) gcc options: -lrt

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceLoongson-3B R2msm8998-sdm835130K260K390K520K650KSE +/- 19.53, N = 5SE +/- 214.56, N = 5145262611296-march=loongson3a-march=native1. (CC) gcc options: -O3

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelLoongson-3B R2msm8998-sdm83530060090012001500SE +/- 1.22, N = 3SE +/- 0.90, N = 31600356-march=loongson3a1. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeLoongson-3B R2msm8998-sdm83530060090012001500SE +/- 44.90, N = 9SE +/- 0.58, N = 31374283-march=loongson3a -R/usr/lib -lSDL -lpthread -lXpm1. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system


Phoronix Test Suite v10.8.4