ARM64 gcc codegen comparison

gcc 5.4/6.3/7.0 benchmarks running on a Cortex-A53

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1701128-TA-GCCCOMPAR79
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

C/C++ Compiler Tests 5 Tests
CPU Massive 9 Tests
Creator Workloads 4 Tests
HPC - High Performance Computing 2 Tests
Common Kernel Benchmarks 2 Tests
Multi-Core 5 Tests
Raytracing 2 Tests
Renderers 4 Tests
Scientific Computing 2 Tests
Server 2 Tests
Server CPU Tests 4 Tests
Single-Threaded 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
gcc5 A57 vectorize
January 12 2017
 
gcc5 thunderx vectorize
January 10 2017
 
gcc5 A72 LTO
January 09 2017
 
gcc6 A53
January 11 2017
 
gcc6 A53 mtune/vectorize
January 11 2017
 
gcc6 A57 vectorize
January 12 2017
 
gcc7 A53 vectorize
January 11 2017
 
gcc7 thunderx vectorize
January 10 2017
 
gcc7 A53 vectorize LTO
January 11 2017
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ARM64 gcc codegen comparisonProcessorMotherboardMemoryDiskOSKernelCompilerFile-SystemScreen Resolutiongcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTOAArch64 rev 4 @ 1.55GHz (4 Cores)Amlogic2048MB32GB 00000 + 16GB NCardUbuntu 16.043.14.29 (aarch64)GCC 5.4.0 20160609 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0ext41920x3240AArch64 rev 4 @ 1.50GHz (4 Cores)AArch64 rev 4 @ 1.55GHz (4 Cores)GCC 6.3.0 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0AArch64 rev 4 @ 1.50GHz (4 Cores)GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0AArch64 rev 4 @ 1.55GHz (4 Cores)OpenBenchmarking.orgCompiler Details- gcc5 A57 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc5 thunderx vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc5 A72 LTO: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc6 A53: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc6 A53 mtune/vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc6 A57 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 A53 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 thunderx vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 A53 vectorize LTO: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new Disk Details- gcc5 A57 vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc5 thunderx vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc5 A72 LTO: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A53: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A53 mtune/vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A57 vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 A53 vectorize: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 thunderx vectorize: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 A53 vectorize LTO: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rwProcessor Details- Scaling Governor: meson_cpufreq performance

gcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTOResult OverviewPhoronix Test Suite100%118%136%154%RAMspeed SMPC-RayTachyonFFTWPrimesieveTTSIOD 3D RendererTimed MAFFT AlignmentRedisFhourstonesSmallptPostMarkSudokutOpenSSLGMPbench

ARM64 gcc codegen comparisonpostmark: Disk Transaction Performanceramspeed: Copy - Integerramspeed: Copy - Floating Pointfftw: Stock - 2D FFT Size 2048mafft: Multiple Sequence Alignmentgmpbench: Total Timefhourstones: Complex Connect-4 Solvingttsiod-renderer: Phong Rendering With Soft-Shadow Mappingc-ray: Total Timeprimesieve: 1e12 Prime Number Generationsmallpt: Global Illumination Renderer; 100 Samplessudokut: Total Timetachyon: Total Timeopenssl: RSA 4096-bit Performanceredis: GETgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO13614614.264613.36189.2934.01554.313052.8322.48150.16574.61173102.0579.4721.30311665.3313514472.854497.17193.8834.78554.443045.8321.86152.96591.80171103.6381.4921.20303529.9113632916.642917.54185.5534.62549.963048.7022.57223.05604.68172102.6582.0321.20305506.4913784847.384844.14172.7336.10554.943129.5022.01200.00610.66169101.9971.6521.30315587.0613784812.664809.44175.0934.23553.023125.7721.99199.00592.00169101.6171.8221.23317672.2413564621.044624.88164.6134.94555.053123.2321.71144.49571.71168101.9976.9421.23324752.0513634581.324580.39196.9035.42552.843212.1023.16187.97543.16167101.9569.2721.50310344.7313512821.432817.45190.6334.46554.833210.2023.01149.82566.21167102.7571.4121.50318926.0213784829.914825.13180.5333.16554.373213.7723.77184.81540.95168101.7567.6421.50311785.02OpenBenchmarking.org

PostMark

This is a test of NetApp's PostMark benchmark designed to simulate small-file testing similar to the tasks endured by web and mail servers. This test profile will set PostMark to perform 25,000 transactions with 500 files simultaneously with the file sizes ranging between 5 and 512 kilobytes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostMark 1.51Disk Transaction Performancegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO30060090012001500SE +/- 6.49, N = 3SE +/- 4.04, N = 3SE +/- 2.67, N = 3SE +/- 2.67, N = 3SE +/- 2.67, N = 3SE +/- 2.33, N = 3SE +/- 2.67, N = 3SE +/- 0.00, N = 3SE +/- 2.67, N = 31361135113631378137813561363135113781. (CC) gcc options: -O3
OpenBenchmarking.orgTPS, More Is BetterPostMark 1.51Disk Transaction Performancegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO2004006008001000Min: 1351 / Avg: 1360.67 / Max: 1373Min: 1344 / Avg: 1351 / Max: 1358Min: 1358 / Avg: 1363.33 / Max: 1366Min: 1373 / Avg: 1378.33 / Max: 1381Min: 1373 / Avg: 1378.33 / Max: 1381Min: 1351 / Avg: 1355.67 / Max: 1358Min: 1358 / Avg: 1363.33 / Max: 1366Min: 1351 / Avg: 1351 / Max: 1351Min: 1373 / Avg: 1378.33 / Max: 13811. (CC) gcc options: -O3

RAMspeed SMP

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integergcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO100020003000400050004614.264472.852916.644847.384812.664621.044581.322821.434829.91

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Pointgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO100020003000400050004613.364497.172917.544844.144809.444624.884580.392817.454825.13

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Stock - Size: 2D FFT Size 2048gcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO4080120160200SE +/- 0.26, N = 5SE +/- 0.04, N = 5SE +/- 0.18, N = 5SE +/- 0.29, N = 5SE +/- 0.28, N = 5SE +/- 0.17, N = 5SE +/- 0.99, N = 5SE +/- 1.10, N = 5SE +/- 0.49, N = 5189.29193.88185.55172.73175.09164.61196.90190.63180.53-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm
OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Stock - Size: 2D FFT Size 2048gcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO4080120160200Min: 188.76 / Avg: 189.29 / Max: 190.26Min: 193.75 / Avg: 193.88 / Max: 194Min: 184.83 / Avg: 185.55 / Max: 185.8Min: 171.75 / Avg: 172.73 / Max: 173.49Min: 174.42 / Avg: 175.09 / Max: 176Min: 164.11 / Avg: 164.61 / Max: 165.08Min: 194.64 / Avg: 196.9 / Max: 200.5Min: 188.41 / Avg: 190.63 / Max: 193.37Min: 179.17 / Avg: 180.53 / Max: 181.621. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Timed MAFFT Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence Alignmentgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO816243240SE +/- 0.54, N = 6SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.61, N = 6SE +/- 0.48, N = 6SE +/- 0.03, N = 3SE +/- 0.80, N = 6SE +/- 0.73, N = 6SE +/- 0.70, N = 634.0134.7834.6236.1034.2334.9435.4234.4633.161. (CC) gcc options: -O3 -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence Alignmentgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO816243240Min: 31.33 / Avg: 34.01 / Max: 34.69Min: 34.71 / Avg: 34.78 / Max: 34.9Min: 34.61 / Avg: 34.62 / Max: 34.63Min: 34.66 / Avg: 36.1 / Max: 37.5Min: 31.86 / Avg: 34.23 / Max: 34.84Min: 34.88 / Avg: 34.94 / Max: 35Min: 32.14 / Avg: 35.42 / Max: 38.35Min: 31.91 / Avg: 34.46 / Max: 35.82Min: 31.84 / Avg: 33.16 / Max: 35.371. (CC) gcc options: -O3 -lm -lpthread

GMPbench

OpenBenchmarking.orgGMPbench Score, More Is BetterGMPbench 0.2Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO120240360480600554.31554.44549.96554.94553.02555.05552.84554.83554.37-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Fhourstones

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 Solvinggcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO7001400210028003500SE +/- 1.51, N = 3SE +/- 0.86, N = 3SE +/- 1.26, N = 3SE +/- 1.20, N = 3SE +/- 3.94, N = 3SE +/- 2.96, N = 3SE +/- 0.35, N = 3SE +/- 0.76, N = 3SE +/- 0.22, N = 33052.833045.833048.703129.503125.773123.233212.103210.203213.771. (CC) gcc options: -O3
OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 Solvinggcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO6001200180024003000Min: 3050.6 / Avg: 3052.83 / Max: 3055.7Min: 3044.6 / Avg: 3045.83 / Max: 3047.5Min: 3047.2 / Avg: 3048.7 / Max: 3051.2Min: 3128.3 / Avg: 3129.5 / Max: 3131.9Min: 3117.9 / Avg: 3125.77 / Max: 3130.1Min: 3118.4 / Avg: 3123.23 / Max: 3128.6Min: 3211.5 / Avg: 3212.1 / Max: 3212.7Min: 3208.7 / Avg: 3210.2 / Max: 3211.2Min: 3213.5 / Avg: 3213.77 / Max: 3214.21. (CC) gcc options: -O3

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mappinggcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO612182430SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 322.4821.8622.5722.0121.9921.7123.1623.0123.77-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -ffat-lto-objects1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffast-math -mtune=native -flto -lSDL -lstdc++
OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mappinggcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO612182430Min: 22.23 / Avg: 22.48 / Max: 22.61Min: 21.76 / Avg: 21.86 / Max: 22.02Min: 22.56 / Avg: 22.57 / Max: 22.57Min: 21.9 / Avg: 22.01 / Max: 22.07Min: 21.93 / Avg: 21.99 / Max: 22.02Min: 21.62 / Avg: 21.71 / Max: 21.77Min: 23.14 / Avg: 23.16 / Max: 23.16Min: 23.01 / Avg: 23.01 / Max: 23.01Min: 23.74 / Avg: 23.77 / Max: 23.781. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffast-math -mtune=native -flto -lSDL -lstdc++

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO50100150200250SE +/- 0.19, N = 3SE +/- 2.40, N = 6SE +/- 0.09, N = 3SE +/- 1.39, N = 3SE +/- 1.80, N = 3SE +/- 0.08, N = 3SE +/- 0.69, N = 3SE +/- 1.37, N = 3SE +/- 0.17, N = 3150.16152.96223.05200.00199.00144.49187.97149.82184.81-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-mcpu=cortex-a53-mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc
OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO4080120160200Min: 149.97 / Avg: 150.16 / Max: 150.54Min: 147.16 / Avg: 152.96 / Max: 160.62Min: 222.93 / Avg: 223.05 / Max: 223.22Min: 197.36 / Avg: 200 / Max: 202.07Min: 197.04 / Avg: 199 / Max: 202.58Min: 144.39 / Avg: 144.49 / Max: 144.65Min: 186.75 / Avg: 187.97 / Max: 189.13Min: 148.36 / Avg: 149.82 / Max: 152.55Min: 184.51 / Avg: 184.81 / Max: 185.11. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Primesieve

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number Generationgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO130260390520650SE +/- 8.11, N = 6SE +/- 10.05, N = 6SE +/- 9.47, N = 4SE +/- 15.34, N = 6SE +/- 17.98, N = 6SE +/- 4.25, N = 3SE +/- 3.01, N = 3SE +/- 2.99, N = 3SE +/- 8.42, N = 3574.61591.80604.68610.66592.00571.71543.16566.21540.95-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fopenmp
OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number Generationgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO110220330440550Min: 554.99 / Avg: 574.61 / Max: 607.78Min: 565.8 / Avg: 591.8 / Max: 619.27Min: 583.75 / Avg: 604.68 / Max: 625.41Min: 572.46 / Avg: 610.66 / Max: 668.73Min: 537.59 / Avg: 592 / Max: 635.33Min: 565.05 / Avg: 571.71 / Max: 579.63Min: 537.24 / Avg: 543.16 / Max: 547.08Min: 561.7 / Avg: 566.21 / Max: 571.87Min: 532.14 / Avg: 540.95 / Max: 557.791. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fopenmp

Smallpt

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3173171172169169168167167168-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CXX) g++ options: -fopenmp -fomit-frame-pointer -fipa-pta -march=armv8-a+crc
OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO306090120150Min: 173 / Avg: 173 / Max: 173Min: 171 / Avg: 171 / Max: 171Min: 172 / Avg: 172 / Max: 172Min: 169 / Avg: 169 / Max: 169Min: 169 / Avg: 169.33 / Max: 170Min: 168 / Avg: 168.33 / Max: 169Min: 167 / Avg: 167 / Max: 167Min: 167 / Avg: 167 / Max: 167Min: 168 / Avg: 168 / Max: 1681. (CXX) g++ options: -fopenmp -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Sudokut

This is a test of Sudokut, which is a Sudoku puzzle solver written in Tcl. This test measures how long it takes to solve 100 Sudoku puzzles. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSudokut 0.4Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO20406080100SE +/- 0.11, N = 3SE +/- 0.78, N = 3SE +/- 0.73, N = 3SE +/- 0.23, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.20, N = 3SE +/- 0.76, N = 3SE +/- 0.21, N = 3102.05103.63102.65101.99101.61101.99101.95102.75101.75
OpenBenchmarking.orgSeconds, Fewer Is BetterSudokut 0.4Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO20406080100Min: 101.93 / Avg: 102.05 / Max: 102.26Min: 102.07 / Avg: 103.63 / Max: 104.53Min: 101.88 / Avg: 102.65 / Max: 104.11Min: 101.73 / Avg: 101.99 / Max: 102.44Min: 101.56 / Avg: 101.61 / Max: 101.71Min: 101.67 / Avg: 101.99 / Max: 102.21Min: 101.73 / Avg: 101.95 / Max: 102.35Min: 101.98 / Avg: 102.75 / Max: 104.26Min: 101.51 / Avg: 101.75 / Max: 102.17

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.98.9Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO20406080100SE +/- 0.09, N = 3SE +/- 0.53, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.24, N = 3SE +/- 0.20, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 379.4781.4982.0371.6571.8276.9469.2771.4167.64
OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.98.9Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO1632486480Min: 79.34 / Avg: 79.47 / Max: 79.64Min: 80.67 / Avg: 81.49 / Max: 82.49Min: 81.87 / Avg: 82.03 / Max: 82.2Min: 71.54 / Avg: 71.65 / Max: 71.83Min: 71.37 / Avg: 71.82 / Max: 72.16Min: 76.7 / Avg: 76.94 / Max: 77.34Min: 69.18 / Avg: 69.27 / Max: 69.42Min: 71.32 / Avg: 71.41 / Max: 71.52Min: 67.52 / Avg: 67.64 / Max: 67.86

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit Performancegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 321.3021.2021.2021.3021.2321.2321.5021.5021.501. (CC) gcc options: -O3 -fomit-frame-pointer -lssl -lcrypto -ldl
OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit Performancegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO510152025Min: 21.3 / Avg: 21.3 / Max: 21.3Min: 21.2 / Avg: 21.2 / Max: 21.2Min: 21.2 / Avg: 21.2 / Max: 21.2Min: 21.3 / Avg: 21.3 / Max: 21.3Min: 21.2 / Avg: 21.23 / Max: 21.3Min: 21.2 / Avg: 21.23 / Max: 21.3Min: 21.5 / Avg: 21.5 / Max: 21.5Min: 21.5 / Avg: 21.5 / Max: 21.5Min: 21.5 / Avg: 21.5 / Max: 21.51. (CC) gcc options: -O3 -fomit-frame-pointer -lssl -lcrypto -ldl

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO70K140K210K280K350KSE +/- 1214.02, N = 3SE +/- 5780.31, N = 3SE +/- 3249.19, N = 3SE +/- 3145.13, N = 3SE +/- 1273.24, N = 3SE +/- 1024.45, N = 3SE +/- 4662.92, N = 6SE +/- 2784.59, N = 3SE +/- 2239.53, N = 3311665.33303529.91305506.49315587.06317672.24324752.05310344.73318926.02311785.02-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-O3 -mcpu=cortex-a53-O3 -mtune=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETgcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO60K120K180K240K300KMin: 310173.69 / Avg: 311665.33 / Max: 314070.31Min: 292141.38 / Avg: 303529.91 / Max: 310945.28Min: 299132.5 / Avg: 305506.49 / Max: 309789.31Min: 309310.22 / Avg: 315587.06 / Max: 319081.03Min: 315159.16 / Avg: 317672.24 / Max: 319284.78Min: 323624.59 / Avg: 324752.05 / Max: 326797.38Min: 289435.59 / Avg: 310344.73 / Max: 322268.75Min: 314465.41 / Avg: 318926.02 / Max: 324044.06Min: 309310.22 / Avg: 311785.02 / Max: 316255.531. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc