ARM64 gcc codegen comparison

gcc 5.4/6.3/7.0 benchmarks running on a Cortex-A53

HTML result view exported from: https://openbenchmarking.org/result/1701128-TA-GCCCOMPAR79&grw&sro&rro.

ARM64 gcc codegen comparisonProcessorMotherboardMemoryDiskOSKernelCompilerFile-SystemScreen Resolutiongcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTOAArch64 rev 4 @ 1.55GHz (4 Cores)Amlogic2048MB32GB 00000 + 16GB NCardUbuntu 16.043.14.29 (aarch64)GCC 5.4.0 20160609 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0ext41920x3240AArch64 rev 4 @ 1.50GHz (4 Cores)AArch64 rev 4 @ 1.55GHz (4 Cores)GCC 6.3.0 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0AArch64 rev 4 @ 1.50GHz (4 Cores)GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0AArch64 rev 4 @ 1.55GHz (4 Cores)OpenBenchmarking.orgCompiler Details- gcc5 A57 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc5 thunderx vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc5 A72 LTO: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new -v - gcc6 A53: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc6 A53 mtune/vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc6 A57 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 A53 vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 thunderx vectorize: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new - gcc7 A53 vectorize LTO: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new Disk Details- gcc5 A57 vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc5 thunderx vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc5 A72 LTO: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A53: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A53 mtune/vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc6 A57 vectorize: DEADLINE / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 A53 vectorize: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 thunderx vectorize: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw- gcc7 A53 vectorize LTO: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rwProcessor Details- Scaling Governor: meson_cpufreq performance

ARM64 gcc codegen comparisonramspeed: Copy - Integerramspeed: Copy - Floating Pointfftw: Stock - 2D FFT Size 2048mafft: Multiple Sequence Alignmentpostmark: Disk Transaction Performanceprimesieve: 1e12 Prime Number Generationtachyon: Total Timec-ray: Total Timettsiod-renderer: Phong Rendering With Soft-Shadow Mappingsmallpt: Global Illumination Renderer; 100 Samplesopenssl: RSA 4096-bit Performanceredis: GETfhourstones: Complex Connect-4 Solvinggmpbench: Total Timesudokut: Total Timegcc5 A57 vectorizegcc5 thunderx vectorizegcc5 A72 LTOgcc6 A53gcc6 A53 mtune/vectorizegcc6 A57 vectorizegcc7 A53 vectorizegcc7 thunderx vectorizegcc7 A53 vectorize LTO4614.264613.36189.2934.011361574.6179.47150.1622.4817321.30311665.333052.83554.31102.054472.854497.17193.8834.781351591.8081.49152.9621.8617121.20303529.913045.83554.44103.632916.642917.54185.5534.621363604.6882.03223.0522.5717221.20305506.493048.70549.96102.654847.384844.14172.7336.101378610.6671.65200.0022.0116921.30315587.063129.50554.94101.994812.664809.44175.0934.231378592.0071.82199.0021.9916921.23317672.243125.77553.02101.614621.044624.88164.6134.941356571.7176.94144.4921.7116821.23324752.053123.23555.05101.994581.324580.39196.9035.421363543.1669.27187.9723.1616721.50310344.733212.10552.84101.952821.432817.45190.6334.461351566.2171.41149.8223.0116721.50318926.023210.20554.83102.754829.914825.13180.5333.161378540.9567.64184.8123.7716821.50311785.023213.77554.37101.75OpenBenchmarking.org

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integergcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize100020003000400050002821.434829.914581.324621.044812.664847.384472.852916.644614.26

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Pointgcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize100020003000400050002817.454825.134580.394624.884809.444844.144497.172917.544613.36

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Stock - Size: 2D FFT Size 2048gcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize4080120160200SE +/- 1.10, N = 5SE +/- 0.49, N = 5SE +/- 0.99, N = 5SE +/- 0.17, N = 5SE +/- 0.28, N = 5SE +/- 0.29, N = 5SE +/- 0.04, N = 5SE +/- 0.18, N = 5SE +/- 0.26, N = 5190.63180.53196.90164.61175.09172.73193.88185.55189.29-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence Alignmentgcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize816243240SE +/- 0.73, N = 6SE +/- 0.70, N = 6SE +/- 0.80, N = 6SE +/- 0.03, N = 3SE +/- 0.48, N = 6SE +/- 0.61, N = 6SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.54, N = 634.4633.1635.4234.9434.2336.1034.7834.6234.011. (CC) gcc options: -O3 -lm -lpthread

PostMark

Disk Transaction Performance

OpenBenchmarking.orgTPS, More Is BetterPostMark 1.51Disk Transaction Performancegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize30060090012001500SE +/- 0.00, N = 3SE +/- 2.67, N = 3SE +/- 2.67, N = 3SE +/- 2.33, N = 3SE +/- 2.67, N = 3SE +/- 2.67, N = 3SE +/- 4.04, N = 3SE +/- 2.67, N = 3SE +/- 6.49, N = 31351137813631356137813781351136313611. (CC) gcc options: -O3

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 5.4.21e12 Prime Number Generationgcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize130260390520650SE +/- 2.99, N = 3SE +/- 8.42, N = 3SE +/- 3.01, N = 3SE +/- 4.25, N = 3SE +/- 17.98, N = 6SE +/- 15.34, N = 6SE +/- 10.05, N = 6SE +/- 9.47, N = 4SE +/- 8.11, N = 6566.21540.95543.16571.71592.00610.66591.80604.68574.61-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fopenmp

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.98.9Total Timegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize20406080100SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.20, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 3SE +/- 0.53, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 371.4167.6469.2776.9471.8271.6581.4982.0379.47

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize50100150200250SE +/- 1.37, N = 3SE +/- 0.17, N = 3SE +/- 0.69, N = 3SE +/- 0.08, N = 3SE +/- 1.80, N = 3SE +/- 1.39, N = 3SE +/- 2.40, N = 6SE +/- 0.09, N = 3SE +/- 0.19, N = 3149.82184.81187.97144.49199.00200.00152.96223.05150.16-Ofast -mcpu=thunderx -ftree-vectorize-mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-mtune=cortex-a53 -ftree-vectorize-mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mappinggcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 323.0123.7723.1621.7121.9922.0121.8622.5722.48-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffast-math -mtune=native -flto -lSDL -lstdc++

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesgcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3167168167168169169171172173-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CXX) g++ options: -fopenmp -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit Performancegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 321.5021.5021.5021.2321.2321.3021.2021.2021.301. (CC) gcc options: -O3 -fomit-frame-pointer -lssl -lcrypto -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETgcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize70K140K210K280K350KSE +/- 2784.59, N = 3SE +/- 2239.53, N = 3SE +/- 4662.92, N = 6SE +/- 1024.45, N = 3SE +/- 1273.24, N = 3SE +/- 3145.13, N = 3SE +/- 5780.31, N = 3SE +/- 3249.19, N = 3SE +/- 1214.02, N = 3318926.02311785.02310344.73324752.05317672.24315587.06303529.91305506.49311665.33-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 Solvinggcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize7001400210028003500SE +/- 0.76, N = 3SE +/- 0.22, N = 3SE +/- 0.35, N = 3SE +/- 2.96, N = 3SE +/- 3.94, N = 3SE +/- 1.20, N = 3SE +/- 0.86, N = 3SE +/- 1.26, N = 3SE +/- 1.51, N = 33210.203213.773212.103123.233125.773129.503045.833048.703052.831. (CC) gcc options: -O3

GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGMPbench 0.2Total Timegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize120240360480600554.83554.37552.84555.05553.02554.94554.44549.96554.31-Ofast -mcpu=thunderx -ftree-vectorize-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects-Ofast -mcpu=cortex-a53 -ftree-vectorize-Ofast -mcpu=cortex-a57 -ftree-vectorize-O3 -mtune=cortex-a53 -ftree-vectorize-O3 -mcpu=cortex-a53-Ofast -mcpu=thunderx -ftree-vectorize-Ofast -mcpu=cortex-a72 -flto -ffat-lto-objects -fuse-linker-plugin-Ofast -mcpu=cortex-a57 -ftree-vectorize1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Sudokut

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterSudokut 0.4Total Timegcc7 thunderx vectorizegcc7 A53 vectorize LTOgcc7 A53 vectorizegcc6 A57 vectorizegcc6 A53 mtune/vectorizegcc6 A53gcc5 thunderx vectorizegcc5 A72 LTOgcc5 A57 vectorize20406080100SE +/- 0.76, N = 3SE +/- 0.21, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 3SE +/- 0.05, N = 3SE +/- 0.23, N = 3SE +/- 0.78, N = 3SE +/- 0.73, N = 3SE +/- 0.11, N = 3102.75101.75101.95101.99101.61101.99103.63102.65102.05


Phoronix Test Suite v10.8.5