multicore-result

ARMv8 rev 0 testing with a e3360_1099 and NVIDIA TEGRA on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2109171-IB-MULTICORE37
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 2 Tests
Timed Code Compilation 2 Tests
C/C++ Compiler Tests 7 Tests
Compression Tests 2 Tests
CPU Massive 14 Tests
Creator Workloads 6 Tests
HPC - High Performance Computing 6 Tests
MPI Benchmarks 4 Tests
Multi-Core 23 Tests
OpenMPI Tests 4 Tests
Programmer / Developer System Benchmarks 3 Tests
Renderers 3 Tests
Rust Tests 2 Tests
Scientific Computing 2 Tests
Server CPU Tests 9 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
mu_co 7zip
September 15 2021
  6 Minutes
mu_co
September 15 2021
  5 Hours, 2 Minutes
Invert Hiding All Results Option
  2 Hours, 34 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


multicore-result OpenBenchmarking.orgPhoronix Test SuiteARMv8 rev 0 @ 1.91GHz (8 Cores)e3360_109932GB125GB DUTA42NVIDIA TEGRAUbuntu 20.045.10.41-tegra (aarch64)X Server1.2.186GCC 9.3.0ext44096x2160ProcessorMotherboardMemoryDiskGraphicsOSKernelDisplay ServerVulkanCompilerFile-SystemScreen ResolutionMulticore-result BenchmarksSystem Logs- Transparent Huge Pages: always- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - Scaling Governor: tegra194 performance- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Branch predictor hardening + srbds: Not affected + tsx_async_abort: Not affected

multicore-result compress-7zip: Compress Speed Testmt-dgemm: Sustained Floating-Point Rateaobench: 2048 x 2048 - Total Timeaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingbuild2: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelcoremark: CoreMark Size 666 - Iterations Per Secondffmpeg: H.264 HD To NTSC DVgraphics-magick: HWB Color Spacehpcg: intel-mpi: IMB-P2P PingPongjohn-the-ripper: MD5m-queens: Time To Solvemysqlslap: 1mysqlslap: 8mysqlslap: 16mysqlslap: 32mysqlslap: 64onednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUcompress-pbzip2: 256MB File Compressionpennant: sedovbigpennant: leblancbigrust-mandel: Time To Complete Serial/Parallel Mandelbrotrust-prime: Prime Number Test To 200,000,000smallpt: Global Illumination Renderer; 128 Samplesstockfish: Total Timesysbench: CPUbuild-apache: Time To Compilemu_co 7zipmu_co153350.58424880.3061168.001108.881010.130195.825124686.17556120.6932945.500721380584208295222.0681561825659399200118.931118.24913.733143.4677108.0833137.63039.80439.555385411911915.80153.221OpenBenchmarking.org

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Testmu_co 7zip3K6K9K12K15KSE +/- 152.39, N = 6153351. (CXX) g++ options: -pipe -lpthread

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratemu_co0.13150.2630.39450.5260.6575SE +/- 0.000799, N = 30.5842481. (CC) gcc options: -O3 -march=native -fopenmp

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timemu_co20406080100SE +/- 0.94, N = 1580.311. (CC) gcc options: -lm -O3

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degriddingmu_co30060090012001500SE +/- 6.22, N = 31168.001. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Griddingmu_co2004006008001000SE +/- 9.52, N = 31108.881. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compilemu_co2004006008001000SE +/- 12.56, N = 31010.13

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelmu_co4080120160200SE +/- 2.52, N = 3195.831. (CC) gcc options: -lm -lpthread -O3

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondmu_co30K60K90K120K150KSE +/- 690.76, N = 3124686.181. (CC) gcc options: -O2 -lrt" -lrt

FFmpeg

This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVmu_co510152025SE +/- 0.13, N = 1520.691. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -lasound -pthread -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spacemu_co60120180240300SE +/- 2.33, N = 32941. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -lpthread

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1mu_co1.23772.47543.71314.95086.1885SE +/- 0.01475, N = 35.500721. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Msg/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-P2P PingPongmu_co300K600K900K1200K1500KSE +/- 16069.97, N = 31380584MIN: 2395 / MAX: 41321231. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5mu_co40K80K120K160K200KSE +/- 1731.44, N = 32082951. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvemu_co50100150200250SE +/- 0.12, N = 3222.071. (CXX) g++ options: -fopenmp -O2 -march=native

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 1mu_co30060090012001500SE +/- 19.46, N = 315611. (CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 8mu_co2004006008001000SE +/- 0.79, N = 38251. (CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 16mu_co140280420560700SE +/- 2.18, N = 36591. (CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 32mu_co90180270360450SE +/- 1.33, N = 33991. (CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 64mu_co4080120160200SE +/- 0.11, N = 32001. (CXX) g++ options: -pie -fPIC -fstack-protector -moutline-atomics -O3 -pthread -llzma -lnuma -lpcre2-8 -lcrypt -laio -lz -lm -lssl -lcrypto -lpthread -ldl

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUmu_co306090120150SE +/- 0.97, N = 3118.93MIN: 114.711. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUmu_co306090120150SE +/- 1.20, N = 3118.25MIN: 113.011. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl

Parallel BZIP2 Compression

This test measures the time needed to compress a file (a .tar package of the Linux kernel source code) using BZIP2 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.12256MB File Compressionmu_co48121620SE +/- 0.26, N = 1513.731. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigmu_co306090120150SE +/- 0.65, N = 3143.471. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigmu_co20406080100SE +/- 0.73, N = 3108.081. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Rust Mandelbrot

This test profile is of the combined time for the serial and parallel Mandelbrot sets written in Rustlang via willi-kappler/mandel-rust. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRust MandelbrotTime To Complete Serial/Parallel Mandelbrotmu_co306090120150SE +/- 0.75, N = 3137.631. (CC) gcc options: -pie -nodefaultlibs -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc

Rust Prime Benchmark

Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000mu_co918273645SE +/- 0.49, N = 339.801. (CC) gcc options: -pie -nodefaultlibs -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samplesmu_co918273645SE +/- 0.23, N = 339.561. (CXX) g++ options: -fopenmp -O3

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timemu_co800K1600K2400K3200K4000KSE +/- 8971.35, N = 338541191. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUmu_co3K6K9K12K15KSE +/- 49.06, N = 311915.801. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compilemu_co306090120150SE +/- 0.56, N = 3153.22