multicore-result

ARMv8 rev 0 testing with an e3360_1099 and NVIDIA TEGRA on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2109208-IB-MULTICORE22&sor.

multicore-result: system details

Runs: mu_co 7zip, mu_co
Processor: ARMv8 rev 0 @ 1.91GHz (8 Cores)
Motherboard: e3360_1099
Memory: 32GB
Disk: 125GB DUTA42
Graphics: NVIDIA TEGRA
OS: Ubuntu 20.04
Kernel: 5.10.41-tegra (aarch64)
Display Server: X Server
Vulkan: 1.2.186
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 4096x2160

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v

Processor Details
- Scaling Governor: tegra194 performance

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Not affected
- spectre_v1: Mitigation of __user pointer sanitization
- spectre_v2: Mitigation of Branch predictor hardening
- srbds: Not affected
- tsx_async_abort: Not affected

multicore-result: results overview

Run mu_co 7zip
compress-7zip: Compress Speed Test = 15335

Run mu_co
mt-dgemm: Sustained Floating-Point Rate = 0.584248
aobench: 2048 x 2048 - Total Time = 80.306
askap: tConvolve MPI - Degridding = 1168.00
askap: tConvolve MPI - Gridding = 1108.88
build2: Time To Compile = 1010.130
c-ray: Total Time - 4K, 16 Rays Per Pixel = 195.825
coremark: CoreMark Size 666 - Iterations Per Second = 124686.175561
ffmpeg: H.264 HD To NTSC DV = 20.693
graphics-magick: HWB Color Space = 294
hpcg = 5.50072
intel-mpi: IMB-P2P PingPong = 1380584
john-the-ripper: MD5 = 208295
m-queens: Time To Solve = 222.068
mysqlslap: 1 = 1561
mysqlslap: 8 = 825
mysqlslap: 16 = 659
mysqlslap: 32 = 399
mysqlslap: 64 = 200
onednn: Convolution Batch Shapes Auto - f32 - CPU = 118.931
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU = 118.249
compress-pbzip2: 256MB File Compression = 13.733
pennant: sedovbig = 143.4677
pennant: leblancbig = 108.0833
rust-mandel: Time To Complete Serial/Parallel Mandelbrot = 137.630
rust-prime: Prime Number Test To 200,000,000 = 39.804
smallpt: Global Illumination Renderer; 128 Samples = 39.555
stockfish: Total Time = 3854119
sysbench: CPU = 11915.80
build-apache: Time To Compile = 153.221
build-eigen: Time To Compile = 433.070
build-erlang: Time To Compile = 636.771
build-gcc: Time To Compile = 6711.798
build-godot: Time To Compile = 1094.919

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02 (MIPS, More Is Better)
mu_co 7zip: 15335 (SE +/- 152.39, N = 6)
(CXX) g++ options: -pipe -lpthread

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
mu_co: 0.584248 (SE +/- 0.000799, N = 3)
(CC) gcc options: -O3 -march=native -fopenmp

AOBench

Size: 2048 x 2048 - Total Time

AOBench (Seconds, Fewer Is Better)
mu_co: 80.31 (SE +/- 0.94, N = 15)
(CC) gcc options: -lm -O3

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 (Mpix/sec, More Is Better)
mu_co: 1168.00 (SE +/- 6.22, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 (Mpix/sec, More Is Better)
mu_co: 1108.88 (SE +/- 9.52, N = 3)
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Build2

Time To Compile

Build2 0.13 (Seconds, Fewer Is Better)
mu_co: 1010.13 (SE +/- 12.56, N = 3)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
mu_co: 195.83 (SE +/- 2.52, N = 3)
(CC) gcc options: -lm -lpthread -O3

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
mu_co: 124686.18 (SE +/- 690.76, N = 3)
(CC) gcc options: -O2 -lrt" -lrt

FFmpeg

H.264 HD To NTSC DV

FFmpeg 4.0.2 (Seconds, Fewer Is Better)
mu_co: 20.69 (SE +/- 0.13, N = 15)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -lasound -pthread -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
mu_co: 294 (SE +/- 2.33, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
mu_co: 5.50072 (SE +/- 0.01475, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-P2P PingPong

Intel MPI Benchmarks 2019.3 (Average Msg/sec, More Is Better)
mu_co: 1380584 (SE +/- 16069.97, N = 3; MIN: 2395 / MAX: 4132123)
(CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

John The Ripper

Test: MD5

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
mu_co: 208295 (SE +/- 1731.44, N = 3)
(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
mu_co: 222.07 (SE +/- 0.12, N = 3)
(CXX) g++ options: -fopenmp -O2 -march=native

MariaDB

Clients: 1

MariaDB 10.6.4 (Queries Per Second, More Is Better)
mu_co: 1561 (SE +/- 19.46, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 8

MariaDB 10.6.4 (Queries Per Second, More Is Better)
mu_co: 825 (SE +/- 0.79, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 16

MariaDB 10.6.4 (Queries Per Second, More Is Better)
mu_co: 659 (SE +/- 2.18, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 32

MariaDB 10.6.4 (Queries Per Second, More Is Better)
mu_co: 399 (SE +/- 1.33, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 64

MariaDB 10.6.4 (Queries Per Second, More Is Better)
mu_co: 200 (SE +/- 0.11, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl
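
The five MariaDB results above are easier to compare as fractions of the single-client run. The following is a minimal sketch, not part of the Phoronix Test Suite output: the queries-per-second values are copied from the results above, while the names `qps_by_clients` and `relative_throughput` are hypothetical and chosen only for illustration.

```python
# Queries per second reported above for each client count (run "mu_co").
qps_by_clients = {1: 1561, 8: 825, 16: 659, 32: 399, 64: 200}


def relative_throughput(results, baseline_clients=1):
    """Express each result as a fraction of the baseline client count's result."""
    baseline = results[baseline_clients]
    return {clients: qps / baseline for clients, qps in results.items()}


relative = relative_throughput(qps_by_clients)

# Print one line per client count: raw QPS and its share of the 1-client QPS.
for clients, ratio in relative.items():
    qps = qps_by_clients[clients]
    print(f"{clients:>2} clients: {qps:>4} QPS ({ratio:.1%} of the 1-client result)")
```

Using the 1-client run as the baseline is an arbitrary choice for this sketch; passing a different `baseline_clients` value compares against another run instead.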

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
mu_co: 118.93 (SE +/- 0.97, N = 3; MIN: 114.71)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
mu_co: 118.25 (SE +/- 1.20, N = 3; MIN: 113.01)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl

Parallel BZIP2 Compression

256MB File Compression

Parallel BZIP2 Compression 1.1.12 (Seconds, Fewer Is Better)
mu_co: 13.73 (SE +/- 0.26, N = 15)
(CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
mu_co: 143.47 (SE +/- 0.65, N = 3)
(CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
mu_co: 108.08 (SE +/- 0.73, N = 3)
(CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Rust Mandelbrot

Time To Complete Serial/Parallel Mandelbrot

Rust Mandelbrot (Seconds, Fewer Is Better)
mu_co: 137.63 (SE +/- 0.75, N = 3)
(CC) gcc options: -pie -nodefaultlibs -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc

Rust Prime Benchmark

Prime Number Test To 200,000,000

Rust Prime Benchmark (Seconds, Fewer Is Better)
mu_co: 39.80 (SE +/- 0.49, N = 3)
(CC) gcc options: -pie -nodefaultlibs -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
mu_co: 39.56 (SE +/- 0.23, N = 3)
(CXX) g++ options: -fopenmp -O3

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
mu_co: 3854119 (SE +/- 8971.35, N = 3)
(CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
mu_co: 11915.80 (SE +/- 49.06, N = 3)
(CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.41 (Seconds, Fewer Is Better)
mu_co: 153.22 (SE +/- 0.56, N = 3)

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9 (Seconds, Fewer Is Better)
mu_co: 433.07 (SE +/- 6.08, N = 3)

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2 (Seconds, Fewer Is Better)
mu_co: 636.77 (SE +/- 2.41, N = 3)

Timed GCC Compilation

Time To Compile

Timed GCC Compilation 11.2.0 (Seconds, Fewer Is Better)
mu_co: 6711.80 (SE +/- 10.80, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
mu_co: 1094.92 (SE +/- 2.07, N = 3)


Phoronix Test Suite v10.8.4