Cortex A53 running different codegen options

AARCH64 codegen comparison

HTML result view exported from: https://openbenchmarking.org/result/1609182-LO-1609128HA85.

Cortex A53 running different codegen optionsProcessorMemoryDiskOSKernelCompilerFile-SystemScreen ResolutionA53 codegenA73 codegenmtune A53mtune A53 GCC7Unknown @ 1.50GHz (4 Cores)2048MB16GB NCard + 32GB 00000Ubuntu 16.043.14.65-61 (aarch64)GCC 7.0.0 20160904 + LLVM 3.8.0ext41280x1440GCC 5.3.1 20160413 + LLVM 3.8.0GCC 7.0.0 20160915 + LLVM 3.8.0OpenBenchmarking.orgCompiler Details- A53 codegen: --build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - A73 codegen: --build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - mtune A53: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - mtune A53 GCC7: --build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new Processor Details- Scaling Governor: meson_cpufreq performance

Cortex A53 running different codegen optionsfftw: Stock - 2D FFT Size 2048fhourstones: Complex Connect-4 Solvingvpxenc: vpxencbuild-apache: Time To Compilec-ray: Total Timeprimesieve: 1e12 Prime Number Generationsmallpt: Global Illumination Renderer; 100 Samplesstockfish: Total Timeencode-flac: WAV To FLACffmpeg: H.264 HD To NTSC DVn-queens: Elapsed Timepgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Single Thread - Read Writeredis: GETredis: SETA53 codegenA73 codegenmtune A53mtune A53 GCC7196.933136.276.81245.49192.70575.3116920504153.48191.79156.09598.25144.95264703.30194408.30187.983140.637.06248.20184.87579.6717321094162.41192.25157.93584.80204.69270145.88198057.42179.223045.837.15390.57240.34556.0817622423154.83193.89158.04599.02205.87252844.60191749.73190.173144.407.36376.56187.30567.2516920372149.15191.80149.22657.04216.23261242.76192951.04OpenBenchmarking.org

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Stock - Size: 2D FFT Size 2048A53 codegenA73 codegenmtune A53mtune A53 GCC74080120160200SE +/- 0.13, N = 5SE +/- 0.05, N = 5SE +/- 0.09, N = 5SE +/- 0.16, N = 5196.93187.98179.22190.17-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingA53 codegenA73 codegenmtune A53mtune A53 GCC77001400210028003500SE +/- 2.43, N = 3SE +/- 1.36, N = 3SE +/- 1.05, N = 3SE +/- 3.60, N = 33136.273140.633045.833144.401. (CC) gcc options: -O3

VP8 libvpx Encoding

vpxenc

OpenBenchmarking.orgFrames Per Second, More Is BetterVP8 libvpx Encoding 1.3.0vpxencA53 codegenA73 codegenmtune A53mtune A53 GCC7246810SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 36.817.067.157.36-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CXX) g++ options: -lvpx -lgtest -lpthread -lm -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To CompileA53 codegenA73 codegenmtune A53mtune A53 GCC780160240320400SE +/- 0.03, N = 3SE +/- 0.29, N = 3SE +/- 0.45, N = 3SE +/- 0.26, N = 3245.49248.20390.57376.56

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeA53 codegenA73 codegenmtune A53mtune A53 GCC750100150200250SE +/- 0.20, N = 3SE +/- 1.45, N = 3SE +/- 1.93, N = 3SE +/- 0.97, N = 3192.70184.87240.34187.30-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number GenerationA53 codegenA73 codegenmtune A53mtune A53 GCC7130260390520650SE +/- 9.24, N = 4SE +/- 16.21, N = 6SE +/- 9.30, N = 3SE +/- 0.68, N = 3575.31579.67556.08567.25-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CXX) g++ options: -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fopenmp

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesA53 codegenA73 codegenmtune A53mtune A53 GCC74080120160200SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3169173176169-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CXX) g++ options: -fopenmp -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total TimeA53 codegenA73 codegenmtune A53mtune A53 GCC75K10K15K20K25KSE +/- 78.03, N = 3SE +/- 120.77, N = 3SE +/- 54.96, N = 3SE +/- 84.05, N = 320504210942242320372-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -ffat-lto-objects-mtune=cortex-a53 -ffat-lto-objects1. (CXX) g++ options: -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fno-exceptions -fno-rtti -ansi -pedantic -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLACA53 codegenA73 codegenmtune A53mtune A53 GCC74080120160200SE +/- 0.10, N = 5SE +/- 0.12, N = 5SE +/- 0.07, N = 5SE +/- 0.06, N = 5153.48162.41154.83149.15-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CXX) g++ options: -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fvisibility=hidden -lm

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 2.6.2H.264 HD To NTSC DVA53 codegenA73 codegenmtune A53mtune A53 GCC74080120160200SE +/- 0.90, N = 3SE +/- 0.96, N = 3SE +/- 0.53, N = 3SE +/- 1.30, N = 3191.79192.25193.89191.80-mcpu=cortex-a53-lSDL -mcpu=cortex-a73-lSDL -mtune=cortex-a53 -flto -ffat-lto-objects-lSDL -mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lxcb -lxcb-shm -lX11 -lm -pthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -std=c99 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeA53 codegenA73 codegenmtune A53mtune A53 GCC7306090120150SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3156.09157.93158.04149.22-mcpu=cortex-a53-mcpu=cortex-a731. (CC) gcc options: -static -fopenmp -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteA53 codegenA73 codegenmtune A53mtune A53 GCC7140280420560700SE +/- 74.34, N = 6SE +/- 75.55, N = 6SE +/- 78.60, N = 6SE +/- 73.52, N = 6598.25584.80599.02657.04-mcpu=cortex-a53-mcpu=cortex-a72-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Single Thread - Mode: Read WriteA53 codegenA73 codegenmtune A53mtune A53 GCC750100150200250SE +/- 6.47, N = 6SE +/- 5.82, N = 6SE +/- 7.11, N = 6SE +/- 5.47, N = 6144.95204.69205.87216.23-mcpu=cortex-a53-mcpu=cortex-a72-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETA53 codegenA73 codegenmtune A53mtune A53 GCC760K120K180K240K300KSE +/- 1308.37, N = 3SE +/- 3076.94, N = 3SE +/- 110.73, N = 3SE +/- 1125.65, N = 3264703.30270145.88252844.60261242.76-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SETA53 codegenA73 codegenmtune A53mtune A53 GCC740K80K120K160K200KSE +/- 826.93, N = 3SE +/- 1065.37, N = 3SE +/- 845.07, N = 3SE +/- 1104.96, N = 3194408.30198057.42191749.73192951.04-mcpu=cortex-a53-mcpu=cortex-a73-mtune=cortex-a53 -flto -ffat-lto-objects-mtune=cortex-a53 -flto -ffat-lto-objects1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc


Phoronix Test Suite v10.8.4