Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1703238-RI-GCCLATEST31

AARCH64 codegen comparison update
gcc7's performance on Cortex A53 (32kB L1)
HTML result view exported from: https://openbenchmarking.org/result/1703238-RI-GCCLATEST31&grs&sro .
AARCH64 codegen comparison update: system details

Configurations: A53 vectorize, pre-patch; thunderx/vectorize, pre-patch; A53 vectorize/LTO, pre patch; A53, post patch; A53 mtune/vectorize, post-patch; A53 vectorize, updated; A53 vectorize, earlier build; A57 vectorize/unrolled GCC 7.0.1; A53 vectorize GCC 7.0.1; A57 vectorize/unrolled GCC 6.3

Values per field, in order of first appearance across the runs:
Processor: AArch64 rev 4 @ 1.50GHz (4 Cores); AArch64 rev 4 @ 1.55GHz (4 Cores); Unknown @ 1.54GHz (4 Cores)
Motherboard: Amlogic
Memory: 2048MB
Disk: 32GB 00000 + 16GB NCard; 16GB NCard + 32GB 00000; 8GB NCard + 32GB 00000
OS: Ubuntu 16.04
Kernel: 3.14.29 (aarch64); 3.14.79-vegas95 (aarch64)
Compiler: GCC 7.0.0 20170110; GCC 7.0.0 20170113; GCC 7.0.1 20170214; GCC 7.0.1 20170220; GCC 7.0.1 20170322; GCC 6.3.1 20170316 (each + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0)
File-System: ext4
Screen Resolution: 1920x3240; 1280x1440

Compiler Details
- A53 vectorize, pre-patch; thunderx/vectorize, pre-patch; A53 vectorize/LTO, pre patch; A53, post patch; A53 mtune/vectorize, post-patch; A53 vectorize, updated; A53 vectorize, earlier build: --build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new
- A57 vectorize/unrolled GCC 7.0.1; A53 vectorize GCC 7.0.1; A57 vectorize/unrolled GCC 6.3: --build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin --disable-libquadmath --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu --with-arch-directory=aarch64 --with-default-libstdcxx-abi=new

Disk Details
- A53 vectorize, pre-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
- thunderx/vectorize, pre-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
- A53 vectorize/LTO, pre patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
- A53, post patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
- A53 mtune/vectorize, post-patch: CFQ / commit=30,errors=remount-ro,noatime,nodiratime,rw
- A53 vectorize, updated: DEADLINE / commit=45,errors=remount-ro,noatime,nodiratime,rw
- A53 vectorize, earlier build: CFQ / commit=45,errors=remount-ro,noatime,nodiratime,rw
- A57 vectorize/unrolled GCC 7.0.1: CFQ / commit=120,errors=remount-ro,noatime,nodiratime,rw
- A53 vectorize GCC 7.0.1: CFQ / commit=120,errors=remount-ro,noatime,nodiratime,rw
- A57 vectorize/unrolled GCC 6.3: CFQ / commit=120,errors=remount-ro,noatime,nodiratime,rw

Processor Details
- A53 vectorize, pre-patch: Scaling Governor: meson_cpufreq performance
- thunderx/vectorize, pre-patch: Scaling Governor: meson_cpufreq performance
- A53 vectorize/LTO, pre patch: Scaling Governor: meson_cpufreq performance
- A53, post patch: Scaling Governor: meson_cpufreq performance
- A53 mtune/vectorize, post-patch: Scaling Governor: meson_cpufreq interactive
- A53 vectorize, updated: Scaling Governor: meson_cpufreq performance
- A53 vectorize, earlier build: Scaling Governor: meson_cpufreq performance
- A57 vectorize/unrolled GCC 7.0.1: Scaling Governor: meson_cpufreq performance
- A53 vectorize GCC 7.0.1: Scaling Governor: meson_cpufreq performance
- A57 vectorize/unrolled GCC 6.3: Scaling Governor: meson_cpufreq performance
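The configure strings under Compiler Details look nearly identical at a glance, so a small set comparison can make the differences explicit. The following is a minimal Python sketch, not part of the Phoronix Test Suite; the names `FIRST_SEVEN`, `LAST_THREE`, and `flag_diff` are hypothetical, and the two strings are copied from the entries above (the first seven configurations share one string, the last three share another).

```python
# Configure flags copied from the Compiler Details entries.
# FIRST_SEVEN: the seven configurations from "A53 vectorize, pre-patch"
# through "A53 vectorize, earlier build" (hypothetical name).
FIRST_SEVEN = """
--build=aarch64-linux-gnu --disable-browser-plugin --disable-libquadmath
--disable-werror --enable-checking=release --enable-clocale=gnu
--enable-fix-cortex-a53-843419 --enable-gnu-unique-object
--enable-languages=c,c++,fortran --enable-libstdcxx-time=yes
--enable-multiarch --enable-nls --enable-plugin --enable-shared
--enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu
--with-arch-directory=aarch64 --with-default-libstdcxx-abi=new
""".split()

# LAST_THREE: the two "A57 vectorize/unrolled" runs and
# "A53 vectorize GCC 7.0.1" (hypothetical name).
LAST_THREE = """
--build=aarch64-linux-gnu --disable-bootstrap --disable-browser-plugin
--disable-libquadmath --disable-werror --enable-checking=release
--enable-clocale=gnu --enable-gnu-unique-object
--enable-languages=c,c++,fortran --enable-libstdcxx-time=yes
--enable-multiarch --enable-nls --enable-plugin --enable-shared
--enable-threads=posix --host=aarch64-linux-gnu --target=aarch64-linux-gnu
--with-arch-directory=aarch64 --with-default-libstdcxx-abi=new
""".split()


def flag_diff(a, b):
    """Return (flags only in a, flags only in b), each sorted."""
    return sorted(set(a) - set(b)), sorted(set(b) - set(a))


only_first, only_last = flag_diff(FIRST_SEVEN, LAST_THREE)
print("only in the first seven builds:", only_first)
print("only in the last three builds: ", only_last)
```

Under these assumptions the two groups differ in exactly one flag each way: the earlier builds carry `--enable-fix-cortex-a53-843419`, and the later builds carry `--disable-bootstrap`.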
AARCH64 codegen comparison update: result overview

Tests, in column order: ramspeed: Copy - Floating Point | ramspeed: Copy - Integer | c-ray: Total Time | fftw: Stock - 2D FFT Size 2048 | postmark: Disk Transaction Performance | redis: GET | mafft: Multiple Sequence Alignment | primesieve: 1e12 Prime Number Generation | tachyon: Total Time | ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | fhourstones: Complex Connect-4 Solving | smallpt: Global Illumination Renderer; 100 Samples | openssl: RSA 4096-bit Performance | sudokut: Total Time | gmpbench: Total Time

A53 vectorize, pre-patch | 4580.39 | 4581.32 | 187.97 | 196.90 | 1363 | 310344.73 | 35.42 | 543.16 | 69.27 | 23.16 | 3212.10 | 167 | 21.50 | 101.95 | 552.84
thunderx/vectorize, pre-patch | 2817.45 | 2821.43 | 149.82 | 190.63 | 1351 | 318926.02 | 34.46 | 566.21 | 71.41 | 23.01 | 3210.20 | 167 | 21.50 | 102.75 | 554.83
A53 vectorize/LTO, pre patch | 4825.13 | 4829.91 | 184.81 | 180.53 | 1378 | 311785.02 | 33.16 | 540.95 | 67.64 | 23.77 | 3213.77 | 168 | 21.50 | 101.75 | 554.37
A53, post patch | 4964.66 | 4965.06 | 186.69 | 186.21 | 1381 | 309030.64 | 33.06 | 553.13 | 69.40 | 23.47 | 3209.67 | 168 | 21.50 | 101.88 | 552.56
A53 mtune/vectorize, post-patch | 4965.60 | 4955.97 | 186.61 | 184.81 | 1378 | 313438.91 | 32.17 | 573.13 | 69.34 | 23.49 | 3205.40 | 167 | 21.50 | 102.17 | 555.10
A53 vectorize, updated | 4785.59 | 4706.40 | 161.80 | 185.15 | 1217 | 277268.23 | 33.90 | 574.65 | 69.90 | 23.29 | 3223.57 | 166 | 21.40 | 102.72 | 554.11
A53 vectorize, earlier build | 4816.71 | 4816.69 | 162.23 | 191.54 | 1211 | 283742.83 | 34.52 | 523.43 | 69.39 | 23.49 | 3233.77 | 166 | 21.40 | 102.59 | 554.21
A57 vectorize/unrolled GCC 7.0.1 | 4193.97 | 4201.53 | 154.81 | 173.03 | 1184 | 276169.86 | 34.22 | 525.00 | - | 23.56 | 3398.47 | 168 | 21.47 | 103.04 | 553.17
A53 vectorize GCC 7.0.1 | 4188.18 | 4161.09 | 151.02 | 156.72 | 1194 | 276298.44 | 35.52 | 547.12 | 69.64 | 23.13 | 3415.07 | 166 | 21.47 | 102.97 | 552.75
A57 vectorize/unrolled GCC 6.3 | 4388.73 | 4384.53 | 149.61 | 157.97 | 1190 | 275458.70 | 35.47 | 531.95 | 72.28 | 22.29 | 3325.60 | 169 | 21.20 | 103.00 | 554.03
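Because some tests report "more is better" and others "fewer is better", raw differences in the overview are easy to misread. The following is a minimal Python sketch of one way to normalise two results into a signed percentage; the function name `relative_gain` and its sign convention (positive means the candidate run is better) are assumptions made for illustration, not something the Phoronix Test Suite defines. The sample values are copied from the overview, and the runs compared may differ in more than one respect, so this is arithmetic only.

```python
def relative_gain(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Percent by which `candidate` beats `baseline`; negative means it is worse."""
    if higher_is_better:
        ratio = candidate / baseline   # e.g. Mflops, MB/s, requests per second
    else:
        ratio = baseline / candidate   # e.g. seconds: a smaller candidate is a gain
    return (ratio - 1.0) * 100.0


# Baseline: "A53 vectorize, pre-patch"; candidate: "A53 vectorize GCC 7.0.1".
cray_gain = relative_gain(187.97, 151.02, higher_is_better=False)  # C-Ray, seconds
fftw_gain = relative_gain(196.90, 156.72, higher_is_better=True)   # FFTW, Mflops

print(f"C-Ray: {cray_gain:+.1f}%")
print(f"FFTW:  {fftw_gain:+.1f}%")
```

With this convention the C-Ray figure comes out positive (the candidate finishes sooner) and the FFTW figure negative (the candidate delivers fewer Mflops), so both can be read on the same scale.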
RAMspeed SMP 3.5.0
Type: Copy - Benchmark: Floating Point
MB/s, More Is Better
A53 mtune/vectorize, post-patch: 4965.60
A53 vectorize GCC 7.0.1: 4188.18
A53 vectorize, earlier build: 4816.71
A53 vectorize, pre-patch: 4580.39
A53 vectorize, updated: 4785.59
A53 vectorize/LTO, pre patch: 4825.13
A53, post patch: 4964.66
A57 vectorize/unrolled GCC 6.3: 4388.73
A57 vectorize/unrolled GCC 7.0.1: 4193.97
thunderx/vectorize, pre-patch: 2817.45

RAMspeed SMP 3.5.0
Type: Copy - Benchmark: Integer
MB/s, More Is Better
A53 mtune/vectorize, post-patch: 4955.97
A53 vectorize GCC 7.0.1: 4161.09
A53 vectorize, earlier build: 4816.69
A53 vectorize, pre-patch: 4581.32
A53 vectorize, updated: 4706.40
A53 vectorize/LTO, pre patch: 4829.91
A53, post patch: 4965.06
A57 vectorize/unrolled GCC 6.3: 4384.53
A57 vectorize/unrolled GCC 7.0.1: 4201.53
thunderx/vectorize, pre-patch: 2821.43

C-Ray 1.1
Total Time
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 186.61 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 151.02 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 162.23 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 187.97 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 161.80 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 184.81 (-mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 186.69 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 149.61 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 154.81 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 149.82 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 0.12, N = 3; SE +/- 0.02, N = 3; SE +/- 1.00, N = 3; SE +/- 0.69, N = 3; SE +/- 0.27, N = 3; SE +/- 0.17, N = 3; SE +/- 0.14, N = 3; SE +/- 1.47, N = 3; SE +/- 0.78, N = 3; SE +/- 1.37, N = 3
1. (CC) gcc options: -lm -lpthread -O3 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

FFTW 3.3.4
Build: Stock - Size: 2D FFT Size 2048
Mflops, More Is Better
A53 mtune/vectorize, post-patch: 184.81 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 156.72 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 191.54 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 196.90 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 185.15 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 180.53 (-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 186.21 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 157.97 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 173.03 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 190.63 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 0.21, N = 5; SE +/- 0.16, N = 5; SE +/- 0.16, N = 5; SE +/- 0.99, N = 5; SE +/- 0.06, N = 5; SE +/- 0.49, N = 5; SE +/- 0.08, N = 5; SE +/- 0.26, N = 5; SE +/- 0.10, N = 5; SE +/- 1.10, N = 5
1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

PostMark 1.51
Disk Transaction Performance
TPS, More Is Better
A53 mtune/vectorize, post-patch: 1378
A53 vectorize GCC 7.0.1: 1194
A53 vectorize, earlier build: 1211
A53 vectorize, pre-patch: 1363
A53 vectorize, updated: 1217
A53 vectorize/LTO, pre patch: 1378
A53, post patch: 1381
A57 vectorize/unrolled GCC 6.3: 1190
A57 vectorize/unrolled GCC 7.0.1: 1184
thunderx/vectorize, pre-patch: 1351
Reported standard errors: SE +/- 5.00, N = 3; SE +/- 3.67, N = 3; SE +/- 5.29, N = 3; SE +/- 2.67, N = 3; SE +/- 2.00, N = 3; SE +/- 2.67, N = 3; SE +/- 4.33, N = 3; SE +/- 0.00, N = 3
1. (CC) gcc options: -O3

Redis 3.0.1
Test: GET
Requests Per Second, More Is Better
A53 mtune/vectorize, post-patch: 313438.91 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 276298.44 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 283742.83 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 310344.73 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 277268.23 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 311785.02 (-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 309030.64 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 275458.70 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 276169.86 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 318926.02 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 1967.34, N = 3; SE +/- 2031.32, N = 3; SE +/- 419.32, N = 3; SE +/- 4662.92, N = 6; SE +/- 2017.17, N = 3; SE +/- 2239.53, N = 3; SE +/- 1052.91, N = 3; SE +/- 3267.58, N = 3; SE +/- 649.43, N = 3; SE +/- 2784.59, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -O2 -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

Timed MAFFT Alignment 6.864
Multiple Sequence Alignment
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 32.17
A53 vectorize GCC 7.0.1: 35.52
A53 vectorize, earlier build: 34.52
A53 vectorize, pre-patch: 35.42
A53 vectorize, updated: 33.90
A53 vectorize/LTO, pre patch: 33.16
A53, post patch: 33.06
A57 vectorize/unrolled GCC 6.3: 35.47
A57 vectorize/unrolled GCC 7.0.1: 34.22
thunderx/vectorize, pre-patch: 34.46
Reported standard errors: SE +/- 0.01, N = 3; SE +/- 0.97, N = 6; SE +/- 1.04, N = 6; SE +/- 0.80, N = 6; SE +/- 0.79, N = 6; SE +/- 0.70, N = 6; SE +/- 0.71, N = 6; SE +/- 0.08, N = 3; SE +/- 0.71, N = 6; SE +/- 0.73, N = 6
1. (CC) gcc options: -O3 -lm -lpthread

Primesieve 5.4.2
1e12 Prime Number Generation
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 573.13 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 547.12 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 523.43 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 543.16 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 574.65 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 540.95 (-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 553.13 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 531.95 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 525.00 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 566.21 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 6.92, N = 3; SE +/- 9.38, N = 3; SE +/- 4.17, N = 3; SE +/- 3.01, N = 3; SE +/- 9.13, N = 4; SE +/- 8.42, N = 3; SE +/- 9.14, N = 3; SE +/- 1.80, N = 3; SE +/- 3.16, N = 3; SE +/- 2.99, N = 3
1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -fopenmp

Tachyon 0.98.9
Total Time
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 69.34
A53 vectorize GCC 7.0.1: 69.64
A53 vectorize, earlier build: 69.39
A53 vectorize, pre-patch: 69.27
A53 vectorize, updated: 69.90
A53 vectorize/LTO, pre patch: 67.64
A53, post patch: 69.40
A57 vectorize/unrolled GCC 6.3: 72.28
thunderx/vectorize, pre-patch: 71.41
Reported standard errors: SE +/- 0.10, N = 3; SE +/- 0.17, N = 3; SE +/- 0.22, N = 3; SE +/- 0.08, N = 3; SE +/- 0.29, N = 3; SE +/- 0.11, N = 3; SE +/- 0.12, N = 3; SE +/- 0.03, N = 3; SE +/- 0.06, N = 3

TTSIOD 3D Renderer 2.3a
Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better
A53 mtune/vectorize, post-patch: 23.49 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 23.13 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 23.49 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 23.16 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 23.29 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 23.77 (-O3 -mcpu=cortex-a53 -ftree-vectorize -ffat-lto-objects)
A53, post patch: 23.47 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 22.29 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 23.56 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 23.01 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 0.01, N = 3; SE +/- 0.02, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.09, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -ffast-math -mtune=native -flto -lSDL -lstdc++

Fhourstones 3.1
Complex Connect-4 Solving
Kpos / sec, More Is Better
A53 mtune/vectorize, post-patch: 3205.40
A53 vectorize GCC 7.0.1: 3415.07
A53 vectorize, earlier build: 3233.77
A53 vectorize, pre-patch: 3212.10
A53 vectorize, updated: 3223.57
A53 vectorize/LTO, pre patch: 3213.77
A53, post patch: 3209.67
A57 vectorize/unrolled GCC 6.3: 3325.60
A57 vectorize/unrolled GCC 7.0.1: 3398.47
thunderx/vectorize, pre-patch: 3210.20
Reported standard errors: SE +/- 1.81, N = 3; SE +/- 2.42, N = 3; SE +/- 1.49, N = 3; SE +/- 0.35, N = 3; SE +/- 3.32, N = 3; SE +/- 0.22, N = 3; SE +/- 1.47, N = 3; SE +/- 1.97, N = 3; SE +/- 1.93, N = 3; SE +/- 0.76, N = 3
1. (CC) gcc options: -O3

Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 167 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 166 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 166 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 167 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 166 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 168 (-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 168 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 169 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 168 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 167 (-Ofast -mcpu=thunderx -ftree-vectorize)
Reported standard errors: SE +/- 0.00, N = 3; SE +/- 0.33, N = 3; SE +/- 0.00, N = 3; SE +/- 0.33, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 2.33, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -fopenmp -fomit-frame-pointer -fipa-pta -march=armv8-a+crc

OpenSSL 1.0.1g
RSA 4096-bit Performance
Signs Per Second, More Is Better
A53 mtune/vectorize, post-patch: 21.50
A53 vectorize GCC 7.0.1: 21.47
A53 vectorize, earlier build: 21.40
A53 vectorize, pre-patch: 21.50
A53 vectorize, updated: 21.40
A53 vectorize/LTO, pre patch: 21.50
A53, post patch: 21.50
A57 vectorize/unrolled GCC 6.3: 21.20
A57 vectorize/unrolled GCC 7.0.1: 21.47
thunderx/vectorize, pre-patch: 21.50
Reported standard errors: SE +/- 0.00, N = 3; SE +/- 0.03, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.03, N = 3; SE +/- 0.00, N = 3
1. (CC) gcc options: -O3 -fomit-frame-pointer -lssl -lcrypto -ldl

Sudokut 0.4
Total Time
Seconds, Fewer Is Better
A53 mtune/vectorize, post-patch: 102.17
A53 vectorize GCC 7.0.1: 102.97
A53 vectorize, earlier build: 102.59
A53 vectorize, pre-patch: 101.95
A53 vectorize, updated: 102.72
A53 vectorize/LTO, pre patch: 101.75
A53, post patch: 101.88
A57 vectorize/unrolled GCC 6.3: 103.00
A57 vectorize/unrolled GCC 7.0.1: 103.04
thunderx/vectorize, pre-patch: 102.75
Reported standard errors: SE +/- 0.09, N = 3; SE +/- 0.13, N = 3; SE +/- 0.06, N = 3; SE +/- 0.20, N = 3; SE +/- 0.20, N = 3; SE +/- 0.21, N = 3; SE +/- 0.10, N = 3; SE +/- 0.04, N = 3; SE +/- 0.04, N = 3; SE +/- 0.76, N = 3

GMPbench 0.2
Total Time
GMPbench Score, More Is Better
A53 mtune/vectorize, post-patch: 555.10 (-Ofast -mtune=cortex-a53 -ftree-vectorize)
A53 vectorize GCC 7.0.1: 552.75 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, earlier build: 554.21 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, pre-patch: 552.84 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize, updated: 554.11 (-Ofast -mcpu=cortex-a53 -ftree-vectorize)
A53 vectorize/LTO, pre patch: 554.37 (-O3 -mcpu=cortex-a53 -ftree-vectorize -flto -ffat-lto-objects)
A53, post patch: 552.56 (-Ofast -mcpu=cortex-a53)
A57 vectorize/unrolled GCC 6.3: 554.03 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
A57 vectorize/unrolled GCC 7.0.1: 553.17 (-Ofast -mtune=cortex-a57 -ftree-vectorize -funroll-loops)
thunderx/vectorize, pre-patch: 554.83 (-Ofast -mcpu=thunderx -ftree-vectorize)
1. (CC) gcc options: -fomit-frame-pointer -fipa-pta -march=armv8-a+crc -lm

Phoronix Test Suite v10.8.4