Benchmarking the effect of unrolling on AARCH64

Tegra X1 vs S905 1.5GHz Android TV boxen

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1702288-RI-1609085HA82
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 2 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 9 Tests
Creator Workloads 5 Tests
Database Test Suite 2 Tests
Encoding 3 Tests
Multi-Core 9 Tests
Renderers 2 Tests
Server 2 Tests
Server CPU Tests 3 Tests
Single-Threaded 3 Tests
Video Encoding 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
NVIDIA Tegra X1
July 28 2015
 
MXQ PRO+ Debian
June 30 2016
 
Mini MXIII GCC7
September 06 2016
 
Mini MXIII GCC5
September 07 2016
 
Mini MXIII GCC5 unrolled
February 27 2017
 
Mini MXIII GCC7 unrolled
February 27 2017
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Benchmarking the effect of unrolling on AARCH64 - Phoronix Test Suite

Benchmarking the effect of unrolling on AARCH64

Tegra X1 vs S905 1.5GHz Android TV boxen

HTML result view exported from: https://openbenchmarking.org/result/1702288-RI-1609085HA82&sro&grr.

Benchmarking the effect of unrolling on AARCH64ProcessorMotherboardMemoryDiskGraphicsOSKernelDisplay DriverCompilerFile-SystemScreen ResolutionNVIDIA Tegra X1MXQ PRO+ DebianMini MXIII GCC7Mini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7 unrolledCortex A57 rev 1 @ 1.91GHz (4 Cores)foster_e_hdd3072MB500GB Seagate ST500LM000-1EJ16 + 16GB SDW16G + 32GB 00000NVIDIA TEGRAUbuntu 14.103.10.61 (aarch64)fbdev 0.4.4GCC 4.9.1 + CUDA 6.5ext41920x2400AArch64 rev 4 @ 2.02GHz (4 Cores)Amlogic2048MB60GB A + 16GB AGND3R + 16GB SD16GDebian 8.33.14.65-odroidc2 (aarch64)GCC 4.9.21280x1440Unknown @ 1.50GHz (4 Cores)16GB NCard + 32GB 00000Ubuntu 16.043.14.65-61 (aarch64)GCC 7.0.0 20160904 + LLVM 3.8.0GCC 5.3.1 20160413 + LLVM 3.8.0Unknown @ 1.54GHz (4 Cores)Amlogic3.14.79-vegas95 (aarch64)GCC 5.4.0 20160609 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0GCC 7.0.1 20170220 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0OpenBenchmarking.orgCompiler Details- NVIDIA Tegra X1: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-libsanitizer --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=arm64 -v - MXQ PRO+ Debian: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-libsanitizer --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=arm64 -v - Mini MXIII GCC7: --build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - Mini MXIII GCC5: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - Mini MXIII GCC5 unrolled: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - Mini MXIII GCC7 unrolled: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new Processor Details- NVIDIA Tegra X1: Scaling Governor: tegra performance- MXQ PRO+ Debian: Scaling Governor: meson_cpufreq performance- Mini MXIII GCC7: Scaling Governor: meson_cpufreq performance- Mini MXIII GCC5: Scaling Governor: meson_cpufreq performance- Mini MXIII GCC5 unrolled: Scaling Governor: meson_cpufreq performance- Mini MXIII GCC7 unrolled: Scaling Governor: meson_cpufreq performance

Benchmarking the effect of unrolling on AARCH64redis: SETredis: GETpgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Normal Load - Read Writen-queens: Elapsed Timeffmpeg: H.264 HD To NTSC DVencode-flac: WAV To FLACstockfish: Total Timesmallpt: Global Illumination Renderer; 100 Samplesprimesieve: 1e12 Prime Number Generationc-ray: Total Timebuild-apache: Time To Compilevpxenc: vpxencfhourstones: Complex Connect-4 Solvingfftw: Stock - 2D FFT Size 2048NVIDIA Tegra X1MXQ PRO+ DebianMini MXIII GCC7Mini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7 unrolled432160.48615979.06223.82510.88109.6474.9140.00114311211340.5384.74134.5111.524705.57215.66186038.34247721.10281.37652.15164.23222.78187.9322793219656.71182.11535.295.782619.87108.88194408.30264703.30144.95598.25156.09191.79153.4820504169575.31192.70245.496.813136.27196.93197144.29270455.57210.77626.65152.03195.16158.1622375179561.50186.85250.336.693044.93196.79197794.05256786.90202.28543.32140.45151.28162.4522197169570.56157.42270.466.753228.87194.66200337.42272703.28194.57619.53154.18150.39163.5620446166524.88151.77267.697.273446.17188.14OpenBenchmarking.org

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SETMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X190K180K270K360K450KSE +/- 559.40, N = 3SE +/- 1485.31, N = 3SE +/- 957.56, N = 3SE +/- 826.93, N = 3SE +/- 594.80, N = 3SE +/- 3893.31, N = 3186038.34197144.29197794.05194408.30200337.42432160.48-std=gnu99 -pipe -g3 -O3 -funroll-loops -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O2 -O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-O2 -Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O2 -O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-O2 -Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X1130K260K390K520K650KSE +/- 3753.59, N = 3SE +/- 1411.09, N = 3SE +/- 2458.15, N = 3SE +/- 1308.37, N = 3SE +/- 1872.56, N = 3SE +/- 5219.38, N = 3247721.10270455.57256786.90264703.30272703.28615979.06-std=gnu99 -pipe -g3 -O3 -funroll-loops -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O2 -O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-O2 -Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O2 -O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-O2 -Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Single Thread - Mode: Read WriteMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X160120180240300SE +/- 0.39, N = 3SE +/- 7.30, N = 6SE +/- 5.08, N = 6SE +/- 6.47, N = 6SE +/- 6.85, N = 6SE +/- 5.00, N = 6281.37210.77202.28144.95194.57223.82-mcpu=cortex-a53 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O21. (CC) gcc options: -fno-strict-aliasing -fwrapv -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X1140280420560700SE +/- 11.16, N = 3SE +/- 72.37, N = 6SE +/- 64.22, N = 6SE +/- 74.34, N = 6SE +/- 67.95, N = 6SE +/- 29.11, N = 6652.15626.65543.32598.25619.53510.88-mcpu=cortex-a53 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O21. (CC) gcc options: -fno-strict-aliasing -fwrapv -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X14080120160200SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3164.23152.03140.45156.09154.18109.64-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts1. (CC) gcc options: -static -fopenmp -O3

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 2.6.2H.264 HD To NTSC DVMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X150100150200250SE +/- 1.33, N = 3SE +/- 0.92, N = 3SE +/- 1.57, N = 3SE +/- 0.90, N = 3SE +/- 0.97, N = 3SE +/- 0.18, N = 3222.78195.16151.28191.79150.3974.91-mcpu=cortex-a53 -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-lxcb -lxcb-shm -lX11 -mcpu=cortex-a53 -fipa-pta -march=armv8-a+crc-lXv -lX11 -lXext -lxcb -lxcb-shm -lxcb-xfixes -lxcb-render -lxcb-shape -lasound -lSDL -llzma -lbz2 -Ofast -mcpu=thunderx -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-lxcb -lxcb-shm -lX11 -mcpu=cortex-a53 -fipa-pta -march=armv8-a+crc-lXv -lX11 -lXext -lxcb -lxcb-shm -lxcb-xfixes -lxcb-render -lxcb-shape -lasound -lSDL -llzma -lbz2 -Ofast -mcpu=thunderx -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -pthread -O3 -fomit-frame-pointer -std=c99 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLACMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X14080120160200SE +/- 0.61, N = 5SE +/- 0.94, N = 5SE +/- 0.09, N = 5SE +/- 0.10, N = 5SE +/- 0.89, N = 5SE +/- 0.04, N = 5187.93158.16162.45153.48163.5640.00-mcpu=cortex-a53 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts -logg-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts -logg-O21. (CXX) g++ options: -fvisibility=hidden -lm

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total TimeMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X15K10K15K20K25KSE +/- 376.99, N = 3SE +/- 85.99, N = 3SE +/- 70.24, N = 3SE +/- 78.03, N = 3SE +/- 11.72, N = 3SE +/- 45.88, N = 3227932237522197205042044611431-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts1. (CXX) g++ options: -lpthread -O3 -flto -fno-exceptions -fno-rtti -ansi -pedantic

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X130060090012001500SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 1.45, N = 32191791691691661211-mcpu=cortex-a53 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts1. (CXX) g++ options: -fopenmp

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number GenerationMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X1140280420560700SE +/- 13.09, N = 3SE +/- 6.91, N = 3SE +/- 11.36, N = 3SE +/- 9.24, N = 4SE +/- 1.53, N = 3SE +/- 7.87, N = 6656.71561.50570.56575.31524.88340.53-mcpu=cortex-a53 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O3 -mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O21. (CXX) g++ options: -fopenmp

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X14080120160200SE +/- 0.71, N = 3SE +/- 3.19, N = 4SE +/- 1.52, N = 3SE +/- 0.20, N = 3SE +/- 0.02, N = 3SE +/- 0.54, N = 3182.11186.85157.42192.70151.7784.74-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts1. (CC) gcc options: -lm -lpthread -O3

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To CompileMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X1120240360480600SE +/- 0.29, N = 3SE +/- 0.25, N = 3SE +/- 0.39, N = 3SE +/- 0.03, N = 3SE +/- 0.25, N = 3SE +/- 1.47, N = 3535.29250.33270.46245.49267.69134.51

VP8 libvpx Encoding

vpxenc

OpenBenchmarking.orgFrames Per Second, More Is BetterVP8 libvpx Encoding 1.3.0vpxencMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X13691215SE +/- 0.10, N = 6SE +/- 0.03, N = 3SE +/- 0.11, N = 4SE +/- 0.09, N = 3SE +/- 0.11, N = 6SE +/- 0.15, N = 35.786.696.756.817.2711.52-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -flto -ffat-lto-objects -ftree-vectorize -fuse-linker-plugin-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-mcpu=cortex-a53 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc1. (CXX) g++ options: -lvpx -lgtest -lpthread -lm -O3

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingMXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X110002000300040005000SE +/- 0.95, N = 3SE +/- 1.77, N = 3SE +/- 0.72, N = 3SE +/- 2.43, N = 3SE +/- 1.88, N = 3SE +/- 1.07, N = 32619.873044.933228.873136.273446.174705.571. (CC) gcc options: -O3

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Stock - Size: 2D FFT Size 2048MXQ PRO+ DebianMini MXIII GCC5Mini MXIII GCC5 unrolledMini MXIII GCC7Mini MXIII GCC7 unrolledNVIDIA Tegra X150100150200250SE +/- 0.11, N = 5SE +/- 0.12, N = 5SE +/- 0.13, N = 5SE +/- 0.13, N = 5SE +/- 0.16, N = 5SE +/- 0.96, N = 5108.88196.79194.66196.93188.14215.66-std=gnu99 -mcpu=cortex-a53 -O3 -fipa-pta -march=armv8-a+crc -ftree-vectorize -ffast-math-O3 -mcpu=cortex-a53 -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-O3 -mcpu=cortex-a53 -fipa-pta -march=armv8-a+crc-Ofast -mcpu=thunderx -fipa-pta -march=armv8-a+crc -ftree-vectorize -funroll-loops -ftree-loop-ivcanon -fivopts-std=gnu99 -O3 -fstrict-aliasing -fno-schedule-insns -ffast-math1. (CC) gcc options: -fomit-frame-pointer -lm


Phoronix Test Suite v10.8.4