Cortex A53 GCC7 codegen comparison

Benchmarking the effect of d8c4c75 ARM patch

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1702153-RI-GCCCOMPAR59

Result identifiers and run dates:

A53 vectorize, pre-patch: January 11 2017
thunderx/vectorize, pre-patch: January 10 2017
A53 vectorize/LTO, pre patch: January 11 2017
A53, post patch: January 14 2017
A53 mtune/vectorize, post-patch: January 14 2017
A53/clang 3.8: January 28 2017
A72 vectorize: February 02 2017
thunderx mtune: February 15 2017
A53 vectorize, updated: February 15 2017

Cortex A53 GCC7 codegen comparison: system details

Distinct values reported across the nine runs, in order of appearance:

Processor: AArch64 rev 4 @ 1.50GHz (4 Cores); AArch64 rev 4 @ 1.55GHz (4 Cores); AArch64 rev 4 @ 2.00GHz (4 Cores); Unknown @ 1.54GHz (4 Cores)
Motherboard: Amlogic
Memory: 2048MB
Disk: 32GB 00000 + 16GB NCard; 16GB NCard + 32GB 00000
OS: Ubuntu 16.04
Kernel: 3.14.29 (aarch64); 3.14.79-vegas95 (aarch64)
Compiler: GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0; GCC 7.0.0 20170113 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0; Clang 3.8.0-2ubuntu4 + GCC 5.4.0 20160609 + LLVM 3.8.0; GCC 7.0.1 20170127 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0; GCC 7.0.1 20170214 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0
File-System: ext4
Screen Resolution: 1920x3240; 1280x1440

Compiler Details

A53 vectorize, pre-patch; thunderx/vectorize, pre-patch; A53 vectorize/LTO, pre patch; A53, post patch; A53 mtune/vectorize, post-patch; A72 vectorize; thunderx mtune; A53 vectorize, updated:
--build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new

Disk Details

A53 vectorize, pre-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
thunderx/vectorize, pre-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
A53 vectorize/LTO, pre patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
A53, post patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
A53 mtune/vectorize, post-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
A53/clang 3.8: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
A72 vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw
thunderx mtune: DEADLINE / commit=45,errors=remount-ro,noatime,nodiratime,rw
A53 vectorize, updated: DEADLINE / commit=45,errors=remount-ro,noatime,nodiratime,rw

Processor Details

A53 vectorize, pre-patch: Scaling Governor: meson_cpufreq performance
thunderx/vectorize, pre-patch: Scaling Governor: meson_cpufreq performance
A53 vectorize/LTO, pre patch: Scaling Governor: meson_cpufreq performance
A53, post patch: Scaling Governor: meson_cpufreq performance
A53 mtune/vectorize, post-patch: Scaling Governor: meson_cpufreq interactive
A53/clang 3.8: Scaling Governor: meson_cpufreq performance
A72 vectorize: Scaling Governor: meson_cpufreq performance
thunderx mtune: Scaling Governor: meson_cpufreq performance
A53 vectorize, updated: Scaling Governor: meson_cpufreq performance

Logarithmic Result Overview (graph, all nine runs): Primesieve, C-Ray, Timed MAFFT Alignment, PostMark, Tachyon, FFTW, Fhourstones, OpenSSL, Sudokut

Cortex A53 GCC7 codegen comparison: result overview

Test | A53 vectorize, pre-patch | thunderx/vectorize, pre-patch | A53 vectorize/LTO, pre patch | A53, post patch | A53 mtune/vectorize, post-patch | A53/clang 3.8 | A72 vectorize | thunderx mtune | A53 vectorize, updated
postmark: Disk Transaction Performance | 1363 | 1351 | 1378 | 1381 | 1378 | 1358 | 1399 | 1225 | 1217
ramspeed: Copy - Integer | 4581.32 | 2821.43 | 4829.91 | 4965.06 | 4955.97 | - | 4769.37 | 2604.10 | 4706.40
ramspeed: Copy - Floating Point | 4580.39 | 2817.45 | 4825.13 | 4964.66 | 4965.60 | - | 4862.68 | 2606.30 | 4785.59
fftw: Stock - 2D FFT Size 2048 | 196.90 | 190.63 | 180.53 | 186.21 | 184.81 | 184.62 | 175.61 | 175.46 | 185.15
mafft: Multiple Sequence Alignment | 35.42 | 34.46 | 33.16 | 33.06 | 32.17 | 27.87 | 33.02 | 31.89 | 33.90
gmpbench: Total Time | 552.84 | 554.83 | 554.37 | 552.56 | 555.10 | - | 554.06 | 550.57 | 554.11
fhourstones: Complex Connect-4 Solving | 3212.10 | 3210.20 | 3213.77 | 3209.67 | 3205.40 | 3358.40 | 3206.00 | 3189.40 | 3223.57
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 23.16 | 23.01 | 23.77 | 23.47 | 23.49 | - | 23.48 | 23.34 | 23.29
c-ray: Total Time | 187.97 | 149.82 | 184.81 | 186.69 | 186.61 | 250.77 | 150.90 | 155.28 | 161.80
primesieve: 1e12 Prime Number Generation | 543.16 | 566.21 | 540.95 | 553.13 | 573.13 | 2256.35 | 549.83 | 607.41 | 574.65
smallpt: Global Illumination Renderer; 100 Samples | 167 | 167 | 168 | 168 | 167 | - | 167 | 167 | 166
sudokut: Total Time | 101.95 | 102.75 | 101.75 | 101.88 | 102.17 | 102.33 | 102.24 | 103.09 | 102.72
tachyon: Total Time | 69.27 | 71.41 | 67.64 | 69.40 | 69.34 | 75.96 | 71.96 | 70.91 | 69.90
openssl: RSA 4096-bit Performance | 21.50 | 21.50 | 21.50 | 21.50 | 21.50 | 20.70 | 21.47 | 21.40 | 21.40
redis: GET | 310344.73 | 318926.02 | 311785.02 | 309030.64 | 313438.91 | - | 314705.37 | 277191.13 | 277268.23
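As a rough aid for reading the overview figures, the sketch below computes the relative change between two runs, flipping the sign for tests where fewer is better. The helper name `relative_gain` is hypothetical and not part of the Phoronix Test Suite. The two runs compared ("A53 vectorize, pre-patch" and "A53, post patch") also differ in compiler flags, so the percentages should not be read as the effect of the patch alone.

```python
def relative_gain(baseline, candidate, higher_is_better):
    """Percent improvement of candidate over baseline; positive means better."""
    if higher_is_better:
        return (candidate - baseline) / baseline * 100.0
    return (baseline - candidate) / baseline * 100.0

# Values copied from the overview above:
# "A53 vectorize, pre-patch" (baseline) vs "A53, post patch" (candidate).
ramspeed_gain = relative_gain(4581.32, 4965.06, higher_is_better=True)   # MB/s
mafft_gain = relative_gain(35.42, 33.06, higher_is_better=False)         # seconds

print(f"RAMspeed Copy Integer: {ramspeed_gain:+.2f}%")
print(f"MAFFT alignment time:  {mafft_gain:+.2f}%")
```

Keeping the direction as an explicit argument avoids mixing up "More Is Better" and "Fewer Is Better" tests when several results are compared in one pass.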

PostMark

This is a test of NetApp's PostMark benchmark designed to simulate small-file testing similar to the tasks endured by web and mail servers. This test profile will set PostMark to perform 25,000 transactions with 500 files simultaneously with the file sizes ranging between 5 and 512 kilobytes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: PostMark 1.51, Disk Transaction Performance (TPS, More Is Better)

A53 vectorize, pre-patch: 1363 (SE +/- 2.67, N = 3; Min: 1358 / Avg: 1363.33 / Max: 1366) [gcc]
thunderx/vectorize, pre-patch: 1351 (SE +/- 0.00, N = 3; Min: 1351 / Avg: 1351 / Max: 1351) [gcc]
A53 vectorize/LTO, pre patch: 1378 (SE +/- 2.67, N = 3; Min: 1373 / Avg: 1378.33 / Max: 1381) [gcc]
A53, post patch: 1381 (SE +/- 4.33, N = 3; Min: 1373 / Avg: 1380.67 / Max: 1388) [gcc]
A53 mtune/vectorize, post-patch: 1378 (SE +/- 5.00, N = 3; Min: 1373 / Avg: 1378 / Max: 1388) [gcc]
A53/clang 3.8: 1358 (SE +/- 4.33, N = 3; Min: 1351 / Avg: 1358.33 / Max: 1366) [clang]
A72 vectorize: 1399 (SE +/- 2.67, N = 3; Min: 1396 / Avg: 1398.67 / Max: 1404) [gcc]
thunderx mtune: 1225 (SE +/- 3.46, N = 3; Min: 1219 / Avg: 1225 / Max: 1231) [gcc]
A53 vectorize, updated: 1217 (SE +/- 2.00, N = 3; Min: 1213 / Avg: 1217 / Max: 1219) [gcc]
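The "SE +/-" figures can be cross-checked against the Min / Avg / Max values. The sketch below assumes that SE means the sample standard deviation divided by the square root of N (the page does not state this), and that PostMark samples are whole numbers, so the third sample of an N = 3 run can be recovered from its minimum, average, and maximum. The function names are hypothetical.

```python
import math
import statistics

def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))

def middle_of_three(minimum, average, maximum):
    """Recover the remaining sample of an N = 3 run.

    Assumes integer-valued samples, so rounding undoes the two-decimal
    rounding of the reported average.
    """
    return round(3 * average - minimum - maximum)

# "A53 vectorize, pre-patch": Min 1358 / Avg 1363.33 / Max 1366, N = 3.
samples = [1358, middle_of_three(1358, 1363.33, 1366), 1366]
se = standard_error(samples)

print(samples, round(se, 2))
```

Under those assumptions the reconstructed samples reproduce the SE reported for this run.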

RAMspeed SMP

OpenBenchmarking.org: RAMspeed SMP 3.5.0, Type: Copy - Benchmark: Integer (MB/s, More Is Better)

A53 vectorize, pre-patch: 4581.32
thunderx/vectorize, pre-patch: 2821.43
A53 vectorize/LTO, pre patch: 4829.91
A53, post patch: 4965.06
A53 mtune/vectorize, post-patch: 4955.97
A72 vectorize: 4769.37
thunderx mtune: 2604.10
A53 vectorize, updated: 4706.40

OpenBenchmarking.org: RAMspeed SMP 3.5.0, Type: Copy - Benchmark: Floating Point (MB/s, More Is Better)

A53 vectorize, pre-patch: 4580.39
thunderx/vectorize, pre-patch: 2817.45
A53 vectorize/LTO, pre patch: 4825.13
A53, post patch: 4964.66
A53 mtune/vectorize, post-patch: 4965.60
A72 vectorize: 4862.68
thunderx mtune: 2606.30
A53 vectorize, updated: 4785.59

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: FFTW 3.3.4, Build: Stock - Size: 2D FFT Size 2048 (Mflops, More Is Better)

A53 vectorize, pre-patch: 196.90 (SE +/- 0.99, N = 5; Min: 194.64 / Avg: 196.9 / Max: 200.5); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize
thunderx/vectorize, pre-patch: 190.63 (SE +/- 1.10, N = 5; Min: 188.41 / Avg: 190.63 / Max: 193.37); -Ofast -mcpu=thunderx -fipa-pta -ftree-vectorize
A53 vectorize/LTO, pre patch: 180.53 (SE +/- 0.49, N = 5; Min: 179.17 / Avg: 180.53 / Max: 181.62); -O3 -mcpu=cortex-a53 -fipa-pta -ftree-vectorize -flto -ffat-lto-objects
A53, post patch: 186.21 (SE +/- 0.08, N = 5; Min: 185.91 / Avg: 186.21 / Max: 186.38); -Ofast -mcpu=cortex-a53 -fipa-pta
A53 mtune/vectorize, post-patch: 184.81 (SE +/- 0.21, N = 5; Min: 184.26 / Avg: 184.81 / Max: 185.55); -Ofast -mtune=cortex-a53 -fipa-pta -ftree-vectorize
A53/clang 3.8: 184.62 (SE +/- 0.11, N = 5; Min: 184.25 / Avg: 184.62 / Max: 184.86); -O3 -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 175.61 (SE +/- 0.14, N = 5; Min: 175.32 / Avg: 175.61 / Max: 176.09); -Ofast -mcpu=cortex-a72 -fipa-pta -ftree-vectorize
thunderx mtune: 175.46 (SE +/- 0.11, N = 5; Min: 175.04 / Avg: 175.46 / Max: 175.71); -Ofast -mtune=thunderx -fipa-pta
A53 vectorize, updated: 185.15 (SE +/- 0.06, N = 5; Min: 184.94 / Avg: 185.15 / Max: 185.26); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize

1. (CC) gcc options: -fomit-frame-pointer -march=armv8-a+crc -lm

Timed MAFFT Alignment

OpenBenchmarking.org: Timed MAFFT Alignment 6.864, Multiple Sequence Alignment (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 35.42 (SE +/- 0.80, N = 6; Min: 32.14 / Avg: 35.42 / Max: 38.35) [gcc]
thunderx/vectorize, pre-patch: 34.46 (SE +/- 0.73, N = 6; Min: 31.91 / Avg: 34.46 / Max: 35.82) [gcc]
A53 vectorize/LTO, pre patch: 33.16 (SE +/- 0.70, N = 6; Min: 31.84 / Avg: 33.16 / Max: 35.37) [gcc]
A53, post patch: 33.06 (SE +/- 0.71, N = 6; Min: 31.61 / Avg: 33.06 / Max: 35.43) [gcc]
A53 mtune/vectorize, post-patch: 32.17 (SE +/- 0.01, N = 3; Min: 32.15 / Avg: 32.17 / Max: 32.19) [gcc]
A53/clang 3.8: 27.87 (SE +/- 0.04, N = 3; Min: 27.82 / Avg: 27.87 / Max: 27.95) [clang]
A72 vectorize: 33.02 (SE +/- 0.67, N = 6; Min: 31.77 / Avg: 33.02 / Max: 35.54) [gcc]
thunderx mtune: 31.89 (SE +/- 0.09, N = 3; Min: 31.76 / Avg: 31.89 / Max: 32.05) [gcc]
A53 vectorize, updated: 33.90 (SE +/- 0.79, N = 6; Min: 31.77 / Avg: 33.9 / Max: 35.67) [gcc]

GMPbench

OpenBenchmarking.org: GMPbench 0.2, Total Time (GMPbench Score, More Is Better)

A53 vectorize, pre-patch: 552.84; -Ofast -mcpu=cortex-a53 -ftree-vectorize
thunderx/vectorize, pre-patch: 554.83; -Ofast -mcpu=thunderx -ftree-vectorize
A53 vectorize/LTO, pre patch: 554.37; -O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects
A53, post patch: 552.56; -Ofast -mcpu=cortex-a53
A53 mtune/vectorize, post-patch: 555.10; -Ofast -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 554.06; -Ofast -mcpu=cortex-a72 -ftree-vectorize
thunderx mtune: 550.57; -Ofast -mtune=thunderx
A53 vectorize, updated: 554.11; -Ofast -mcpu=cortex-a53 -ftree-vectorize

1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Fhourstones

OpenBenchmarking.org: Fhourstones 3.1, Complex Connect-4 Solving (Kpos / sec, More Is Better)

A53 vectorize, pre-patch: 3212.10 (SE +/- 0.35, N = 3; Min: 3211.5 / Avg: 3212.1 / Max: 3212.7) [gcc]
thunderx/vectorize, pre-patch: 3210.20 (SE +/- 0.76, N = 3; Min: 3208.7 / Avg: 3210.2 / Max: 3211.2) [gcc]
A53 vectorize/LTO, pre patch: 3213.77 (SE +/- 0.22, N = 3; Min: 3213.5 / Avg: 3213.77 / Max: 3214.2) [gcc]
A53, post patch: 3209.67 (SE +/- 1.47, N = 3; Min: 3207.2 / Avg: 3209.67 / Max: 3212.3) [gcc]
A53 mtune/vectorize, post-patch: 3205.40 (SE +/- 1.81, N = 3; Min: 3203.2 / Avg: 3205.4 / Max: 3209) [gcc]
A53/clang 3.8: 3358.40 (SE +/- 0.99, N = 3; Min: 3356.8 / Avg: 3358.4 / Max: 3360.2) [clang]
A72 vectorize: 3206.00 (SE +/- 0.12, N = 3; Min: 3205.8 / Avg: 3206 / Max: 3206.2) [gcc]
thunderx mtune: 3189.40 (SE +/- 0.60, N = 3; Min: 3188.2 / Avg: 3189.4 / Max: 3190.1) [gcc]
A53 vectorize, updated: 3223.57 (SE +/- 3.32, N = 3; Min: 3217 / Avg: 3223.57 / Max: 3227.7) [gcc]

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: TTSIOD 3D Renderer 2.3a, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)

A53 vectorize, pre-patch: 23.16 (SE +/- 0.01, N = 3; Min: 23.14 / Avg: 23.16 / Max: 23.16); -Ofast -mcpu=cortex-a53 -ftree-vectorize
thunderx/vectorize, pre-patch: 23.01 (SE +/- 0.00, N = 3; Min: 23.01 / Avg: 23.01 / Max: 23.01); -Ofast -mcpu=thunderx -ftree-vectorize
A53 vectorize/LTO, pre patch: 23.77 (SE +/- 0.01, N = 3; Min: 23.74 / Avg: 23.77 / Max: 23.78); -O3 -mcpu=cortex-a53 -ftree-vectorize -ffat-lto-objects
A53, post patch: 23.47 (SE +/- 0.02, N = 3; Min: 23.44 / Avg: 23.47 / Max: 23.49); -Ofast -mcpu=cortex-a53
A53 mtune/vectorize, post-patch: 23.49 (SE +/- 0.01, N = 3; Min: 23.47 / Avg: 23.49 / Max: 23.51); -Ofast -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 23.48 (SE +/- 0.05, N = 3; Min: 23.39 / Avg: 23.48 / Max: 23.55); -Ofast -mcpu=cortex-a72 -ftree-vectorize
thunderx mtune: 23.34 (SE +/- 0.01, N = 3; Min: 23.32 / Avg: 23.34 / Max: 23.36); -Ofast -mtune=thunderx
A53 vectorize, updated: 23.29 (SE +/- 0.09, N = 3; Min: 23.18 / Avg: 23.29 / Max: 23.48); -Ofast -mcpu=cortex-a53 -ftree-vectorize

1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffast-math -mtune=native -flto -lSDL -lstdc++

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
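The workload size implied by that description can be worked out directly. The sketch below assumes the four cores listed in the system details and treats "8 rays per pixel" as primary rays only; secondary rays for reflections and shadows are not counted.

```python
CORES = 4                  # from the processor line ("4 Cores")
THREADS_PER_CORE = 16      # from the test description
WIDTH, HEIGHT = 1600, 1200
RAYS_PER_PIXEL = 8         # anti-aliasing samples, assumed to be primary rays

total_threads = CORES * THREADS_PER_CORE
primary_rays = WIDTH * HEIGHT * RAYS_PER_PIXEL

print(f"threads: {total_threads}, primary rays: {primary_rays:,}")
```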

OpenBenchmarking.org: C-Ray 1.1, Total Time (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 187.97 (SE +/- 0.69, N = 3; Min: 186.75 / Avg: 187.97 / Max: 189.13); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize
thunderx/vectorize, pre-patch: 149.82 (SE +/- 1.37, N = 3; Min: 148.36 / Avg: 149.82 / Max: 152.55); -Ofast -mcpu=thunderx -fipa-pta -ftree-vectorize
A53 vectorize/LTO, pre patch: 184.81 (SE +/- 0.17, N = 3; Min: 184.51 / Avg: 184.81 / Max: 185.1); -mcpu=cortex-a53 -fipa-pta -ftree-vectorize -flto -ffat-lto-objects
A53, post patch: 186.69 (SE +/- 0.14, N = 3; Min: 186.54 / Avg: 186.69 / Max: 186.96); -Ofast -mcpu=cortex-a53 -fipa-pta
A53 mtune/vectorize, post-patch: 186.61 (SE +/- 0.12, N = 3; Min: 186.37 / Avg: 186.61 / Max: 186.72); -Ofast -mtune=cortex-a53 -fipa-pta -ftree-vectorize
A53/clang 3.8: 250.77 (SE +/- 1.03, N = 3; Min: 248.72 / Avg: 250.77 / Max: 251.86); -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 150.90 (SE +/- 0.06, N = 3; Min: 150.8 / Avg: 150.9 / Max: 151.01); -Ofast -mcpu=cortex-a72 -fipa-pta -ftree-vectorize
thunderx mtune: 155.28 (SE +/- 2.98, N = 3; Min: 152.23 / Avg: 155.28 / Max: 161.24); -Ofast -mtune=thunderx -fipa-pta
A53 vectorize, updated: 161.80 (SE +/- 0.27, N = 3; Min: 161.53 / Avg: 161.8 / Max: 162.35); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize

1. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -march=armv8-a+crc

Primesieve

OpenBenchmarking.org: Primesieve 5.4.2, 1e12 Prime Number Generation (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 543.16 (SE +/- 3.01, N = 3; Min: 537.24 / Avg: 543.16 / Max: 547.08); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize -fopenmp
thunderx/vectorize, pre-patch: 566.21 (SE +/- 2.99, N = 3; Min: 561.7 / Avg: 566.21 / Max: 571.87); -Ofast -mcpu=thunderx -fipa-pta -ftree-vectorize -fopenmp
A53 vectorize/LTO, pre patch: 540.95 (SE +/- 8.42, N = 3; Min: 532.14 / Avg: 540.95 / Max: 557.79); -O3 -mcpu=cortex-a53 -fipa-pta -ftree-vectorize -flto -ffat-lto-objects -fopenmp
A53, post patch: 553.13 (SE +/- 9.14, N = 3; Min: 536.7 / Avg: 553.13 / Max: 568.29); -Ofast -mcpu=cortex-a53 -fipa-pta -fopenmp
A53 mtune/vectorize, post-patch: 573.13 (SE +/- 6.92, N = 3; Min: 560.03 / Avg: 573.13 / Max: 583.51); -Ofast -mtune=cortex-a53 -fipa-pta -ftree-vectorize -fopenmp
A53/clang 3.8: 2256.35 (SE +/- 12.87, N = 3; Min: 2238.86 / Avg: 2256.35 / Max: 2281.46); -O3 -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 549.83 (SE +/- 7.59, N = 6; Min: 521.89 / Avg: 549.83 / Max: 570.52); -Ofast -mcpu=cortex-a72 -fipa-pta -ftree-vectorize -fopenmp
thunderx mtune: 607.41 (SE +/- 10.24, N = 6; Min: 577.07 / Avg: 607.41 / Max: 639.06); -Ofast -mtune=thunderx -fipa-pta -fopenmp
A53 vectorize, updated: 574.65 (SE +/- 9.13, N = 4; Min: 557.95 / Avg: 574.65 / Max: 597.03); -Ofast -mcpu=cortex-a53 -fipa-pta -ftree-vectorize -fopenmp

1. (CXX) g++ options: -fomit-frame-pointer -march=armv8-a+crc

Smallpt

OpenBenchmarking.org: Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 167 (SE +/- 0.00, N = 3; Min: 167 / Avg: 167 / Max: 167); -Ofast -mcpu=cortex-a53 -ftree-vectorize
thunderx/vectorize, pre-patch: 167 (SE +/- 0.00, N = 3; Min: 167 / Avg: 167 / Max: 167); -Ofast -mcpu=thunderx -ftree-vectorize
A53 vectorize/LTO, pre patch: 168 (SE +/- 0.00, N = 3; Min: 168 / Avg: 168 / Max: 168); -O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects
A53, post patch: 168 (SE +/- 0.00, N = 3; Min: 168 / Avg: 168 / Max: 168); -Ofast -mcpu=cortex-a53
A53 mtune/vectorize, post-patch: 167 (SE +/- 0.00, N = 3; Min: 167 / Avg: 167 / Max: 167); -Ofast -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 167 (SE +/- 0.00, N = 3; Min: 167 / Avg: 167 / Max: 167); -Ofast -mcpu=cortex-a72 -ftree-vectorize
thunderx mtune: 167 (SE +/- 0.00, N = 3; Min: 167 / Avg: 167 / Max: 167); -Ofast -mtune=thunderx
A53 vectorize, updated: 166 (SE +/- 0.33, N = 3; Min: 166 / Avg: 166.33 / Max: 167); -Ofast -mcpu=cortex-a53 -ftree-vectorize

1. (CXX) g++ options: -fopenmp -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Sudokut

This is a test of Sudokut, which is a Sudoku puzzle solver written in Tcl. This test measures how long it takes to solve 100 Sudoku puzzles. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: Sudokut 0.4, Total Time (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 101.95 (SE +/- 0.20, N = 3; Min: 101.73 / Avg: 101.95 / Max: 102.35)
thunderx/vectorize, pre-patch: 102.75 (SE +/- 0.76, N = 3; Min: 101.98 / Avg: 102.75 / Max: 104.26)
A53 vectorize/LTO, pre patch: 101.75 (SE +/- 0.21, N = 3; Min: 101.51 / Avg: 101.75 / Max: 102.17)
A53, post patch: 101.88 (SE +/- 0.10, N = 3; Min: 101.7 / Avg: 101.88 / Max: 102.05)
A53 mtune/vectorize, post-patch: 102.17 (SE +/- 0.09, N = 3; Min: 102.06 / Avg: 102.17 / Max: 102.35)
A53/clang 3.8: 102.33 (SE +/- 0.26, N = 3; Min: 101.9 / Avg: 102.33 / Max: 102.78)
A72 vectorize: 102.24 (SE +/- 0.25, N = 3; Min: 101.93 / Avg: 102.24 / Max: 102.72)
thunderx mtune: 103.09 (SE +/- 0.33, N = 3; Min: 102.7 / Avg: 103.09 / Max: 103.75)
A53 vectorize, updated: 102.72 (SE +/- 0.20, N = 3; Min: 102.32 / Avg: 102.72 / Max: 102.92)

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: Tachyon 0.98.9, Total Time (Seconds, Fewer Is Better)

A53 vectorize, pre-patch: 69.27 (SE +/- 0.08, N = 3; Min: 69.18 / Avg: 69.27 / Max: 69.42)
thunderx/vectorize, pre-patch: 71.41 (SE +/- 0.06, N = 3; Min: 71.32 / Avg: 71.41 / Max: 71.52)
A53 vectorize/LTO, pre patch: 67.64 (SE +/- 0.11, N = 3; Min: 67.52 / Avg: 67.64 / Max: 67.86)
A53, post patch: 69.40 (SE +/- 0.12, N = 3; Min: 69.2 / Avg: 69.4 / Max: 69.61)
A53 mtune/vectorize, post-patch: 69.34 (SE +/- 0.10, N = 3; Min: 69.19 / Avg: 69.34 / Max: 69.54)
A53/clang 3.8: 75.96 (SE +/- 0.22, N = 3; Min: 75.52 / Avg: 75.96 / Max: 76.2)
A72 vectorize: 71.96 (SE +/- 0.17, N = 3; Min: 71.63 / Avg: 71.96 / Max: 72.18)
thunderx mtune: 70.91 (SE +/- 0.15, N = 3; Min: 70.61 / Avg: 70.91 / Max: 71.1)
A53 vectorize, updated: 69.90 (SE +/- 0.29, N = 3; Min: 69.38 / Avg: 69.9 / Max: 70.4)

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: OpenSSL 1.0.1g, RSA 4096-bit Performance (Signs Per Second, More Is Better)

A53 vectorize, pre-patch: 21.50 (SE +/- 0.00, N = 3; Min: 21.5 / Avg: 21.5 / Max: 21.5) [gcc]
thunderx/vectorize, pre-patch: 21.50 (SE +/- 0.00, N = 3; Min: 21.5 / Avg: 21.5 / Max: 21.5) [gcc]
A53 vectorize/LTO, pre patch: 21.50 (SE +/- 0.00, N = 3; Min: 21.5 / Avg: 21.5 / Max: 21.5) [gcc]
A53, post patch: 21.50 (SE +/- 0.00, N = 3; Min: 21.5 / Avg: 21.5 / Max: 21.5) [gcc]
A53 mtune/vectorize, post-patch: 21.50 (SE +/- 0.00, N = 3; Min: 21.5 / Avg: 21.5 / Max: 21.5) [gcc]
A53/clang 3.8: 20.70 (SE +/- 0.00, N = 3; Min: 20.7 / Avg: 20.7 / Max: 20.7) [clang]
A72 vectorize: 21.47 (SE +/- 0.03, N = 3; Min: 21.4 / Avg: 21.47 / Max: 21.5) [gcc]
thunderx mtune: 21.40 (SE +/- 0.00, N = 3; Min: 21.4 / Avg: 21.4 / Max: 21.4) [gcc]
A53 vectorize, updated: 21.40 (SE +/- 0.00, N = 3; Min: 21.4 / Avg: 21.4 / Max: 21.4) [gcc]

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org: Redis 3.0.1, Test: GET (Requests Per Second, More Is Better)

A53 vectorize, pre-patch: 310344.73 (SE +/- 4662.92, N = 6; Min: 289435.59 / Avg: 310344.73 / Max: 322268.75); -Ofast -mcpu=cortex-a53 -ftree-vectorize
thunderx/vectorize, pre-patch: 318926.02 (SE +/- 2784.59, N = 3; Min: 314465.41 / Avg: 318926.02 / Max: 324044.06); -Ofast -mcpu=thunderx -ftree-vectorize
A53 vectorize/LTO, pre patch: 311785.02 (SE +/- 2239.53, N = 3; Min: 309310.22 / Avg: 311785.02 / Max: 316255.53); -O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects
A53, post patch: 309030.64 (SE +/- 1052.91, N = 3; Min: 307314.06 / Avg: 309030.64 / Max: 310945.28); -Ofast -mcpu=cortex-a53
A53 mtune/vectorize, post-patch: 313438.91 (SE +/- 1967.34, N = 3; Min: 309693.41 / Avg: 313438.91 / Max: 316355.56); -Ofast -mtune=cortex-a53 -ftree-vectorize
A72 vectorize: 314705.37 (SE +/- 3423.31, N = 3; Min: 308071.47 / Avg: 314705.37 / Max: 319488.81); -Ofast -mcpu=cortex-a72 -ftree-vectorize
thunderx mtune: 277191.13 (SE +/- 2008.41, N = 3; Min: 273224.03 / Avg: 277191.13 / Max: 279720.25); -Ofast -mtune=thunderx
A53 vectorize, updated: 277268.23 (SE +/- 2017.17, N = 3; Min: 273298.72 / Avg: 277268.23 / Max: 279876.84); -Ofast -mcpu=cortex-a53 -ftree-vectorize

1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc