GCC AMD Bulldozer Compiler Tuning

AMD FX-8150 Bulldozer compiler tuning using GCC 4.7.1 with different march options for the test profiles to look at the latest AMD GCC performance. Benchmarking by Michael Larabel for a future article on Phoronix.com.

march=nocona, march=core2, march=k8, march=k8-sse3, march=barcelona, march=bdver1:

    Processor: AMD FX-8150 Eight-Core @ 3.60GHz (8 Cores), Motherboard: ASUS Crosshair V Formula, Chipset: AMD ATI RD890 bridge, Memory: 4096MB, Disk: 60GB OCZ VERTEX2, Graphics: NVIDIA GeForce 9600 GSO 512MB (399/399MHz), Audio: Realtek ALC889, Monitor: DELL P2210H, Network: Intel 82583V Gigabit Connection

    OS: Ubuntu 12.10, Kernel: 3.5.0-14-generic (x86_64), Desktop: Unity 6.4.0, Display Server: X Server 1.13.0, Display Driver: nouveau 1.0.2, OpenGL: 3.0 Mesa 8.1-devel Gallium 0.4, Compiler: GCC 4.7, File-System: ext4, Screen Resolution: 1920x1080

GraphicsMagick 1.3.12
Operation: Blur
Iterations Per Minute > Higher Is Better
march=nocona .... 99
march=core2 ..... 96
march=k8 ........ 97
march=k8-sse3 ... 95
march=barcelona . 107
march=bdver1 .... 113

GraphicsMagick 1.3.12
Operation: Sharpen
Iterations Per Minute > Higher Is Better
march=nocona .... 65
march=core2 ..... 99
march=k8 ........ 63
march=k8-sse3 ... 62
march=barcelona . 86
march=bdver1 .... 100

GraphicsMagick 1.3.12
Operation: Resizing
Iterations Per Minute > Higher Is Better
march=nocona .... 126
march=core2 ..... 139
march=k8 ........ 128
march=k8-sse3 ... 126
march=barcelona . 139
march=bdver1 .... 144

GraphicsMagick 1.3.12
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
march=nocona .... 160
march=core2 ..... 158
march=k8 ........ 151
march=k8-sse3 ... 150
march=barcelona . 156
march=bdver1 .... 157

GraphicsMagick 1.3.12
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
march=nocona .... 68
march=core2 ..... 67
march=k8 ........ 67
march=k8-sse3 ... 66
march=barcelona . 61
march=bdver1 .... 66

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
march=nocona .... 654.27
march=core2 ..... 670.75
march=k8 ........ 614.83
march=k8-sse3 ... 678.97
march=barcelona . 669.28
march=bdver1 .... 698.92

Timed PHP Compilation 5.2.9
Time To Compile
Seconds < Lower Is Better
march=nocona .... 31.20
march=core2 ..... 33.22
march=k8 ........ 33.02
march=k8-sse3 ... 32.98
march=barcelona . 33.07
march=bdver1 .... 34.41

C-Ray 1.1
Total Time
Seconds < Lower Is Better
march=nocona .... 36.84
march=core2 ..... 36.38
march=k8 ........ 52.84
march=k8-sse3 ... 52.82
march=barcelona . 36.09
march=bdver1 .... 26.99

Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds < Lower Is Better
march=nocona .... 33
march=core2 ..... 33
march=k8 ........ 44
march=k8-sse3 ... 44
march=barcelona . 32
march=bdver1 .... 30

Open FMM Nero2D 2.0.2
Total Time
Seconds < Lower Is Better
march=nocona .... 549.43
march=core2 ..... 531.27
march=k8 ........ 643.59
march=k8-sse3 ... 636.99
march=barcelona . 534.42
march=bdver1 .... 561.85

PostgreSQL pgbench 8.4.11
TPC-B Transactions Per Second
TPS > Higher Is Better
march=nocona .... 2064.40
march=core2 ..... 2039.71
march=k8 ........ 1986.18
march=k8-sse3 ... 2018.34
march=barcelona . 2037.11
march=bdver1 .... 1916.51
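
The results mix "Higher Is Better" and "Lower Is Better" metrics, so raw numbers are hard to compare across tests. One way to compare them is to express each result as a speedup factor over a chosen baseline build. The following is a minimal sketch under stated assumptions, not part of the original benchmark run: the names RESULTS and relative_to are hypothetical, the choice of march=k8 as the baseline is arbitrary, and only two of the tests (C-Ray and GraphicsMagick Sharpen) are copied in for brevity.

```python
# Illustrative sketch: normalize results against a baseline -march build.
# RESULTS and relative_to are hypothetical names; values are copied from
# the C-Ray 1.1 and GraphicsMagick Sharpen results in this report.
RESULTS = {
    "C-Ray 1.1 (seconds)": {
        "higher_is_better": False,
        "scores": {
            "nocona": 36.84, "core2": 36.38, "k8": 52.84,
            "k8-sse3": 52.82, "barcelona": 36.09, "bdver1": 26.99,
        },
    },
    "GraphicsMagick 1.3.12 Sharpen (iterations/minute)": {
        "higher_is_better": True,
        "scores": {
            "nocona": 65, "core2": 99, "k8": 63,
            "k8-sse3": 62, "barcelona": 86, "bdver1": 100,
        },
    },
}


def relative_to(baseline, scores, higher_is_better):
    """Return each score as a speedup factor over the baseline (>1 is faster)."""
    base = scores[baseline]
    if higher_is_better:
        return {name: value / base for name, value in scores.items()}
    # For timings a smaller value is faster, so the ratio is inverted.
    return {name: base / value for name, value in scores.items()}


if __name__ == "__main__":
    for test, data in RESULTS.items():
        print(test)
        ratios = relative_to("k8", data["scores"], data["higher_is_better"])
        for name, ratio in ratios.items():
            print(f"  march={name:<10} {ratio:.2f}x vs march=k8")
```

With march=k8 as the baseline, the C-Ray entry for march=bdver1 works out to 52.84 / 26.99, or roughly 1.96x. Inverting the ratio for timing-based tests keeps "greater than 1" meaning "faster" for every test.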