Cortex A53 vs Cortex A5 ARMv7 codegen options comparison A53 vectorize, pre-patch: Processor: AArch64 rev 4 @ 1.50GHz (4 Cores), Motherboard: Amlogic, Memory: 2048MB, Disk: 32GB 00000 + 16GB NCard OS: Ubuntu 16.04, Kernel: 3.14.29 (aarch64), Compiler: GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0, File-System: ext4, Screen Resolution: 1920x3240 thunderx/vectorize, pre-patch: Processor: AArch64 rev 4 @ 1.50GHz (4 Cores), Motherboard: Amlogic, Memory: 2048MB, Disk: 32GB 00000 + 16GB NCard OS: Ubuntu 16.04, Kernel: 3.14.29 (aarch64), Compiler: GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0, File-System: ext4, Screen Resolution: 1920x3240 A53 vectorize/LTO, pre patch: Processor: AArch64 rev 4 @ 1.55GHz (4 Cores), Motherboard: Amlogic, Memory: 2048MB, Disk: 32GB 00000 + 16GB NCard OS: Ubuntu 16.04, Kernel: 3.14.29 (aarch64), Compiler: GCC 7.0.0 20170110 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0, File-System: ext4, Screen Resolution: 1920x3240 Cortex-A53: Processor: AArch64 rev 4 @ 1.55GHz (4 Cores), Motherboard: Amlogic, Memory: 2048MB, Disk: 32GB 00000 + 16GB NCard OS: Ubuntu 16.04, Kernel: 3.14.29 (aarch64), Compiler: GCC 7.0.0 20170113 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0, File-System: ext4, Screen Resolution: 1920x3240 Cortex-A53 vectorize: Processor: AArch64 rev 4 @ 1.55GHz (4 Cores), Motherboard: Amlogic, Memory: 2048MB, Disk: 32GB 00000 + 16GB NCard OS: Ubuntu 16.04, Kernel: 3.14.29 (aarch64), Compiler: GCC 7.0.0 20170113 + Clang 3.8.0-2ubuntu4 + LLVM 3.8.0, File-System: ext4, Screen Resolution: 1920x3240 Cortex-A5 VFPV4: Processor: ARMv7 rev 1 @ 1.73GHz (4 Cores), Motherboard: ODROIDC, Memory: 916MB, Disk: 64GB 00000 OS: Ubuntu 14.04, Kernel: 3.10.96-149 (armv7l), Display Server: X Server 1.15.1, Compiler: GCC 7.0.0 20170114 + Clang 3.6.0-2ubuntu1~trusty1 + LLVM 3.6.0, File-System: ext4, Screen Resolution: 1280x1024 Cortex-A5 Neon: Processor: ARMv7 rev 1 @ 1.73GHz (4 Cores), Motherboard: ODROIDC, Memory: 916MB, Disk: 64GB 00000 OS: Ubuntu 14.04, Kernel: 3.10.96-149 (armv7l), Display Server: X Server 1.15.1, Compiler: GCC 7.0.0 20170114 + Clang 3.6.0-2ubuntu1~trusty1 + LLVM 3.6.0, File-System: ext4, Screen Resolution: 1280x1024 Cortex-A5 NEON-vfpv4: Processor: ARMv7 rev 1 @ 1.73GHz (4 Cores), Motherboard: ODROIDC, Memory: 916MB, Disk: 64GB 00000 OS: Ubuntu 14.04, Kernel: 3.10.96-149 (armv7l), Display Server: X Server 1.15.1, Compiler: GCC 7.0.0 20170114 + Clang 3.6.0-2ubuntu1~trusty1 + LLVM 3.6.0, File-System: ext4, Screen Resolution: 1280x1024 GCC 4.9 NEON-vfpv4: Processor: ARMv7 rev 1 @ 1.73GHz (4 Cores), Motherboard: ODROIDC, Memory: 916MB, Disk: 64GB 00000 OS: Ubuntu 14.04, Kernel: 3.10.96-149 (armv7l), Display Server: X Server 1.15.1, Compiler: GCC 4.9.4 + Clang 3.6.0-2ubuntu1~trusty1 + LLVM 3.6.0, File-System: ext4, Screen Resolution: 1280x1024 GCC 4.9 VFPv4: Processor: ARMv7 rev 1 @ 1.73GHz (4 Cores), Motherboard: ODROIDC, Memory: 916MB, Disk: 64GB 00000 OS: Ubuntu 14.04, Kernel: 3.10.96-149 (armv7l), Display Server: X Server 1.15.1, Compiler: GCC 4.9.4 + Clang 3.6.0-2ubuntu1~trusty1 + LLVM 3.6.0, File-System: ext4, Screen Resolution: 1280x1024 PostMark 1.51 Disk Transaction Performance TPS > Higher Is Better A53 vectorize, pre-patch ...... 1363 |======================================== thunderx/vectorize, pre-patch . 1351 |======================================== A53 vectorize/LTO, pre patch .. 1378 |========================================= Cortex-A53 .................... 1381 |========================================= Cortex-A53 vectorize .......... 1378 |========================================= Cortex-A5 VFPV4 ............... 725 |====================== Cortex-A5 Neon ................ 717 |===================== Cortex-A5 NEON-vfpv4 .......... 744 |====================== GCC 4.9 NEON-vfpv4 ............ 692 |===================== GCC 4.9 VFPv4 ................. 720 |===================== RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer MB/s > Higher Is Better A53 vectorize, pre-patch ...... 4581.32 |=================================== thunderx/vectorize, pre-patch . 2821.43 |====================== A53 vectorize/LTO, pre patch .. 4829.91 |===================================== Cortex-A53 .................... 4965.06 |====================================== Cortex-A53 vectorize .......... 4955.97 |====================================== Cortex-A5 VFPV4 ............... 1476.43 |=========== Cortex-A5 Neon ................ 1262.11 |========== Cortex-A5 NEON-vfpv4 .......... 1482.21 |=========== GCC 4.9 NEON-vfpv4 ............ 1466.11 |=========== GCC 4.9 VFPv4 ................. 1567.32 |============ RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point MB/s > Higher Is Better A53 vectorize, pre-patch ...... 4580.39 |=================================== thunderx/vectorize, pre-patch . 2817.45 |====================== A53 vectorize/LTO, pre patch .. 4825.13 |===================================== Cortex-A53 .................... 4964.66 |====================================== Cortex-A53 vectorize .......... 4965.60 |====================================== Cortex-A5 VFPV4 ............... 1365.27 |========== Cortex-A5 Neon ................ 1049.66 |======== Cortex-A5 NEON-vfpv4 .......... 1369.05 |========== GCC 4.9 NEON-vfpv4 ............ 1302.02 |========== GCC 4.9 VFPv4 ................. 1114.65 |========= FFTW 3.3.4 Build: Stock - Size: 2D FFT Size 2048 Mflops > Higher Is Better A53 vectorize, pre-patch ...... 196.90 |======================================= thunderx/vectorize, pre-patch . 190.63 |====================================== A53 vectorize/LTO, pre patch .. 180.53 |==================================== Cortex-A53 .................... 186.21 |===================================== Cortex-A53 vectorize .......... 184.81 |===================================== Cortex-A5 VFPV4 ............... 135.07 |=========================== Cortex-A5 Neon ................ 128.75 |========================== Cortex-A5 NEON-vfpv4 .......... 132.50 |========================== GCC 4.9 NEON-vfpv4 ............ 127.22 |========================= GCC 4.9 VFPv4 ................. 134.44 |=========================== Timed MAFFT Alignment 6.864 Multiple Sequence Alignment Seconds < Lower Is Better A53 vectorize, pre-patch ...... 35.42 |=============================== thunderx/vectorize, pre-patch . 34.46 |============================== A53 vectorize/LTO, pre patch .. 33.16 |============================= Cortex-A53 .................... 33.06 |============================= Cortex-A53 vectorize .......... 32.17 |============================ Cortex-A5 VFPV4 ............... 43.74 |====================================== Cortex-A5 Neon ................ 45.21 |======================================= Cortex-A5 NEON-vfpv4 .......... 46.12 |======================================== GCC 4.9 NEON-vfpv4 ............ 41.68 |==================================== GCC 4.9 VFPv4 ................. 31.59 |=========================== GMPbench 0.2 Total Time GMPbench Score > Higher Is Better A53 vectorize, pre-patch ...... 552.84 |======================================= thunderx/vectorize, pre-patch . 554.83 |======================================= A53 vectorize/LTO, pre patch .. 554.37 |======================================= Cortex-A53 .................... 552.56 |======================================= Cortex-A53 vectorize .......... 555.10 |======================================= Cortex-A5 VFPV4 ............... 221.56 |================ Cortex-A5 Neon ................ 221.63 |================ Cortex-A5 NEON-vfpv4 .......... 221.48 |================ GCC 4.9 NEON-vfpv4 ............ 221.04 |================ GCC 4.9 VFPv4 ................. 221.70 |================ Fhourstones 3.1 Complex Connect-4 Solving Kpos / sec > Higher Is Better A53 vectorize, pre-patch ...... 3212.10 |====================================== thunderx/vectorize, pre-patch . 3210.20 |====================================== A53 vectorize/LTO, pre patch .. 3213.77 |====================================== Cortex-A53 .................... 3209.67 |====================================== Cortex-A53 vectorize .......... 3205.40 |====================================== Cortex-A5 VFPV4 ............... 1211.33 |============== Cortex-A5 Neon ................ 1246.77 |=============== Cortex-A5 NEON-vfpv4 .......... 1228.77 |=============== GCC 4.9 NEON-vfpv4 ............ 1186.23 |============== GCC 4.9 VFPv4 ................. 1046.73 |============ TTSIOD 3D Renderer 2.3a Phong Rendering With Soft-Shadow Mapping FPS > Higher Is Better A53 vectorize, pre-patch ...... 23.16 |======================================= thunderx/vectorize, pre-patch . 23.01 |======================================= A53 vectorize/LTO, pre patch .. 23.77 |======================================== Cortex-A53 .................... 23.47 |======================================= Cortex-A53 vectorize .......... 23.49 |======================================== Cortex-A5 VFPV4 ............... 16.91 |============================ Cortex-A5 Neon ................ 15.75 |=========================== Cortex-A5 NEON-vfpv4 .......... 15.77 |=========================== GCC 4.9 NEON-vfpv4 ............ 15.78 |=========================== GCC 4.9 VFPv4 ................. 16.47 |============================ C-Ray 1.1 Total Time Seconds < Lower Is Better A53 vectorize, pre-patch ...... 187.97 |========================= thunderx/vectorize, pre-patch . 149.82 |==================== A53 vectorize/LTO, pre patch .. 184.81 |========================= Cortex-A53 .................... 186.69 |========================= Cortex-A53 vectorize .......... 186.61 |========================= Cortex-A5 VFPV4 ............... 273.18 |===================================== Cortex-A5 Neon ................ 235.23 |================================ Cortex-A5 NEON-vfpv4 .......... 244.62 |================================= GCC 4.9 NEON-vfpv4 ............ 265.13 |==================================== GCC 4.9 VFPv4 ................. 290.51 |======================================= Primesieve 5.4.2 1e12 Prime Number Generation Seconds < Lower Is Better A53 vectorize, pre-patch ...... 543.16 |==================== thunderx/vectorize, pre-patch . 566.21 |===================== A53 vectorize/LTO, pre patch .. 540.95 |==================== Cortex-A53 .................... 553.13 |==================== Cortex-A53 vectorize .......... 573.13 |===================== Cortex-A5 VFPV4 ............... 836.59 |=============================== Cortex-A5 Neon ................ 1021.13 |====================================== Cortex-A5 NEON-vfpv4 .......... 1029.57 |====================================== GCC 4.9 NEON-vfpv4 ............ 894.65 |================================= GCC 4.9 VFPv4 ................. 948.93 |=================================== Smallpt 1.0 Global Illumination Renderer; 100 Samples Seconds < Lower Is Better A53 vectorize, pre-patch ...... 167 |======================= thunderx/vectorize, pre-patch . 167 |======================= A53 vectorize/LTO, pre patch .. 168 |======================= Cortex-A53 .................... 168 |======================= Cortex-A53 vectorize .......... 167 |======================= Cortex-A5 VFPV4 ............... 306 |========================================== Cortex-A5 Neon ................ 289 |======================================== Cortex-A5 NEON-vfpv4 .......... 291 |======================================== GCC 4.9 NEON-vfpv4 ............ 287 |======================================= GCC 4.9 VFPv4 ................. 306 |========================================== Sudokut 0.4 Total Time Seconds < Lower Is Better A53 vectorize, pre-patch ...... 101.95 |======================= thunderx/vectorize, pre-patch . 102.75 |======================== A53 vectorize/LTO, pre patch .. 101.75 |======================= Cortex-A53 .................... 101.88 |======================= Cortex-A53 vectorize .......... 102.17 |======================= Cortex-A5 VFPV4 ............... 167.97 |====================================== Cortex-A5 Neon ................ 165.14 |====================================== Cortex-A5 NEON-vfpv4 .......... 167.41 |====================================== GCC 4.9 NEON-vfpv4 ............ 169.11 |======================================= GCC 4.9 VFPv4 ................. 170.29 |======================================= Tachyon 0.98.9 Total Time Seconds < Lower Is Better A53 vectorize, pre-patch ...... 69.27 |=========== thunderx/vectorize, pre-patch . 71.41 |=========== A53 vectorize/LTO, pre patch .. 67.64 |=========== Cortex-A53 .................... 69.40 |=========== Cortex-A53 vectorize .......... 69.34 |=========== Cortex-A5 VFPV4 ............... 107.33 |================= Cortex-A5 Neon ................ 247.96 |======================================= Cortex-A5 NEON-vfpv4 .......... 108.33 |================= GCC 4.9 NEON-vfpv4 ............ 109.85 |================= GCC 4.9 VFPv4 ................. 109.81 |================= OpenSSL 1.0.1g RSA 4096-bit Performance Signs Per Second > Higher Is Better A53 vectorize, pre-patch ...... 21.50 |======================================== thunderx/vectorize, pre-patch . 21.50 |======================================== A53 vectorize/LTO, pre patch .. 21.50 |======================================== Cortex-A53 .................... 21.50 |======================================== Cortex-A53 vectorize .......... 21.50 |======================================== Cortex-A5 VFPV4 ............... 20.23 |====================================== Cortex-A5 Neon ................ 20.30 |====================================== Cortex-A5 NEON-vfpv4 .......... 20.30 |====================================== GCC 4.9 NEON-vfpv4 ............ 20.33 |====================================== GCC 4.9 VFPv4 ................. 20.30 |====================================== Redis 3.0.1 Test: GET Requests Per Second > Higher Is Better A53 vectorize, pre-patch ...... 310344.73 |=================================== thunderx/vectorize, pre-patch . 318926.02 |==================================== A53 vectorize/LTO, pre patch .. 311785.02 |=================================== Cortex-A53 .................... 309030.64 |=================================== Cortex-A53 vectorize .......... 313438.91 |=================================== Cortex-A5 VFPV4 ............... 174917.94 |==================== Cortex-A5 Neon ................ 177754.08 |==================== Cortex-A5 NEON-vfpv4 .......... 179240.55 |==================== GCC 4.9 NEON-vfpv4 ............ 171955.12 |=================== GCC 4.9 VFPv4 ................. 165811.52 |===================