GCC 4.9 Compiler Optimization Tuning AMD Kaveri

AMD Steamroller CPU Cores on AMD A10-7850K Kaveri APU compiler optimization tuning with various march= values. Benchmarks by Michael Larabel for a future article on Phoronix.com.

System configuration (identical for the bdver3, bdver2, bdver1, barcelona, and k8 runs):

	Processor: AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores), Motherboard: Gigabyte F2A88XM-D3H, Chipset: AMD Device 1422, Memory: 7168MB, Disk: 120GB KINGSTON SV300S3, Graphics: AMD Kaveri 1024MB, Audio: ATI R6xx HDMI, Monitor: TSB-TV, Network: Realtek RTL8111/8168/8411

	OS: Ubuntu 14.04, Kernel: 3.13.0-5-generic (x86_64), Desktop: Unity 7.1.2, Display Driver: radeon 7.2.99, Compiler: GCC 4.9.0 20140126 + Clang 3.4 + LLVM 3.4, File-System: ext4, Screen Resolution: 1920x1080

C-Ray 1.1
Total Time
Seconds < Lower Is Better
bdver3 .... 40.54
bdver2 .... 40.55
bdver1 .... 40.67
barcelona . 53.33
k8 ........ 87.90

FLAC Audio Encoding 1.3.0
WAV To FLAC
Seconds < Lower Is Better
bdver3 .... 5.52
bdver2 .... 5.47
bdver1 .... 5.29
barcelona . 6.90
k8 ........ 6.62

GraphicsMagick 1.3.19
Operation: Blur
Iterations Per Minute > Higher Is Better
bdver3 .... 106
bdver2 .... 110
bdver1 .... 108
barcelona . 93
k8 ........ 97

GraphicsMagick 1.3.19
Operation: Sharpen
Iterations Per Minute > Higher Is Better
bdver3 .... 81
bdver2 .... 81
bdver1 .... 87
barcelona . 72
k8 ........ 71

GraphicsMagick 1.3.19
Operation: Resizing
Iterations Per Minute > Higher Is Better
bdver3 .... 133
bdver2 .... 133
bdver1 .... 133
barcelona . 120
k8 ........ 106

GraphicsMagick 1.3.19
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
bdver3 .... 139
bdver2 .... 139
bdver1 .... 139
barcelona . 138
k8 ........ 126

GraphicsMagick 1.3.19
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
bdver3 .... 80
bdver2 .... 81
bdver1 .... 81
barcelona . 76
k8 ........ 77

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
bdver3 .... 902.98
bdver2 .... 905.30
bdver1 .... 894.27
barcelona . 898.33
k8 ........ 867.36

SciMark 2.0
Computational Test: Composite
Mflops > Higher Is Better
bdver3 .... 641.05
bdver2 .... 644.89
bdver1 .... 640.56
barcelona . 629.44
k8 ........ 636.54

SciMark 2.0
Computational Test: Monte Carlo
Mflops > Higher Is Better
bdver3 .... 413.64
bdver2 .... 423.28
bdver1 .... 423.73
barcelona . 384.47
k8 ........ 397.86

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops > Higher Is Better
bdver3 .... 70.77
bdver2 .... 71.15
bdver1 .... 68.50
barcelona . 68.99
k8 ........ 73.59

SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops > Higher Is Better
bdver3 .... 866.81
bdver2 .... 877.91
bdver1 .... 860.41
barcelona . 849.34
k8 ........ 865.92

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
bdver3 .... 1165.94
bdver2 .... 1164.64
bdver1 .... 1162.41
barcelona . 1155.51
k8 ........ 1156.29

SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
bdver3 .... 688.08
bdver2 .... 687.47
bdver1 .... 687.76
barcelona . 688.88
k8 ........ 689.02

Timed Apache Compilation 2.4.7
Time To Compile
Seconds < Lower Is Better
bdver3 .... 58.83
bdver2 .... 58.91
bdver1 .... 59.07
barcelona . 58.68
k8 ........ 58.55

Timed PHP Compilation 5.2.9
Time To Compile
Seconds < Lower Is Better
bdver3 .... 58.48
bdver2 .... 58.54
bdver1 .... 58.45
barcelona . 56.60
k8 ........ 56.54

TSCP 1.81
AI Chess Performance
Nodes Per Second > Higher Is Better
bdver3 .... 739101
bdver2 .... 738707
bdver1 .... 738311
barcelona . 742690
k8 ........ 760113

x264 2014-01-09
H.264 Video Encoding
Frames Per Second > Higher Is Better
bdver3 .... 83.83
bdver2 .... 83.85
bdver1 .... 84.14
barcelona . 83.66
k8 ........ 83.43
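
The raw numbers mix "lower is better" and "higher is better" scales, so the march= targets are easier to compare once each result is normalized against one baseline. The following is a minimal Python sketch of that arithmetic, not part of the original benchmark run. The values are copied from the C-Ray, FLAC, and TSCP results; the names RESULTS and relative_to_baseline, and the choice of k8 as the baseline, are assumptions made for illustration.

```python
# Illustrative sketch: normalize results against a baseline march= target.
# The numbers are copied from the results in this document; the structure
# and function name are hypothetical and not produced by any benchmark tool.

RESULTS = {
    "C-Ray 1.1 (seconds)": {
        "higher_is_better": False,
        "values": {"bdver3": 40.54, "bdver2": 40.55, "bdver1": 40.67,
                   "barcelona": 53.33, "k8": 87.90},
    },
    "FLAC Audio Encoding 1.3.0 (seconds)": {
        "higher_is_better": False,
        "values": {"bdver3": 5.52, "bdver2": 5.47, "bdver1": 5.29,
                   "barcelona": 6.90, "k8": 6.62},
    },
    "TSCP 1.81 (nodes per second)": {
        "higher_is_better": True,
        "values": {"bdver3": 739101, "bdver2": 738707, "bdver1": 738311,
                   "barcelona": 742690, "k8": 760113},
    },
}


def relative_to_baseline(values, higher_is_better, baseline="k8"):
    """Return each target's performance as a ratio to the baseline.

    A ratio above 1.0 always means "faster than the baseline", regardless
    of whether the underlying metric is a time or a throughput.
    """
    base = values[baseline]
    if higher_is_better:
        return {target: value / base for target, value in values.items()}
    # For time-based metrics a smaller value is better, so invert the ratio.
    return {target: base / value for target, value in values.items()}


if __name__ == "__main__":
    for test_name, result in RESULTS.items():
        ratios = relative_to_baseline(result["values"],
                                      result["higher_is_better"])
        print(test_name)
        for target, ratio in ratios.items():
            print(f"  {target:<10} {ratio:.3f}x vs k8")
```

Read this way, C-Ray built with march=bdver3 runs at roughly 2.17 times the speed of the k8 build, while TSCP is marginally slower with bdver3 than with k8.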