AMD Bulldozer Compiler Tests

Testing the GCC 4.6, GCC 4.7 snapshot, Open64 4.2.4, and AMD Open64 4.2.5.2 compilers on an AMD FX-8150 Bulldozer CPU. Each compiler is tested first with stock compiler support and then with the mtune/march options for Bulldozer (bdver1) set in CFLAGS/CXXFLAGS. For the GCC releases there is also a run with -Ofast on top of the bdver1 tuning. Benchmarking by Michael Larabel for a future article on Phoronix.com.

System details (identical for all ten configurations apart from the reported compiler):

  Processor: AMD FX-8150 Eight-Core @ 3.61GHz (8 Cores), Motherboard: ASUS Crosshair V Formula, Chipset: ATI RD890 PCI to PCI bridge, Memory: 8192MB, Disk: 60GB OCZ VERTEX2, Graphics: NVIDIA GeForce GTX 460 768MB (675/1800MHz), Audio: Realtek ALC889, Network: Intel 82583V Gigabit Connection

  OS: Ubuntu 11.10, Kernel: 3.1.0-rc10-ic-aliasing-patch+ (x86_64), Desktop: Unity 4.22.0, Display Server: X Server 1.10.4, Display Driver: NVIDIA 285.05.09, OpenGL: 4.2.0 NVIDIA 285.05.09, File-System: ext4, Screen Resolution: 1920x1080

Reported compiler per configuration:

  GCC 4.6.1: Stock ................. GCC 4.6.1
  GCC 4.6.1: bdver1 ................ GCC 4.6.1
  GCC 4.7-20111015: Stock .......... GCC 4.7.0
  GCC 4.7-20111015: bdver1 ......... GCC 4.7.0
  Open64 4.2.4: Stock .............. Open64 4.2.4
  Open64 4.2.4: bdver1 ............. Open64 4.2.4
  AMD Open64 4.2.5.2: Stock ........ Open64 4.2.5.2
  AMD Open64 4.2.5.2: bdver1 ....... Open64 4.2.5.2
  GCC 4.7-20111015: bdver1 + Ofast . GCC 4.7.0
  GCC 4.6.1: bdver1 + Ofast ........ GCC 4.7.0

POV-Ray 3.6.1
Total Time
Seconds < Lower Is Better
GCC 4.6.1: Stock ................. 822 |====================================
GCC 4.6.1: bdver1 ................ 636 |============================
GCC 4.7-20111015: Stock .......... 851 |=====================================
GCC 4.7-20111015: bdver1 ......... 636 |============================
Open64 4.2.4: Stock .............. 823 |====================================
Open64 4.2.4: bdver1 ............. 705 |===============================
AMD Open64 4.2.5.2: Stock ........ 894 |=======================================
AMD Open64 4.2.5.2: bdver1 ....... 671 |=============================
GCC 4.7-20111015: bdver1 + Ofast . 619 |===========================
GCC 4.6.1: bdver1 + Ofast ........ 623 |===========================

GraphicsMagick 1.3.12
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
GCC 4.6.1: Stock ................. 162 |======================================
GCC 4.6.1: bdver1 ................ 83  |====================
GCC 4.7-20111015: Stock .......... 163 |======================================
GCC 4.7-20111015: bdver1 ......... 83  |====================
Open64 4.2.4: Stock .............. 86  |====================
Open64 4.2.4: bdver1 ............. 88  |=====================
AMD Open64 4.2.5.2: Stock ........ 81  |===================
AMD Open64 4.2.5.2: bdver1 ....... 82  |===================
GCC 4.7-20111015: bdver1 + Ofast . 165 |=======================================
GCC 4.6.1: bdver1 + Ofast ........ 166 |=======================================

GraphicsMagick 1.3.12
Operation: Blur
Iterations Per Minute > Higher Is Better
GCC 4.6.1: Stock ................. 98  |==================================
GCC 4.6.1: bdver1 ................ 59  |=====================
GCC 4.7-20111015: Stock .......... 98  |==================================
GCC 4.7-20111015: bdver1 ......... 60  |=====================
Open64 4.2.4: Stock .............. 59  |=====================
Open64 4.2.4: bdver1 ............. 60  |=====================
AMD Open64 4.2.5.2: Stock ........ 60  |=====================
AMD Open64 4.2.5.2: bdver1 ....... 60  |=====================
GCC 4.7-20111015: bdver1 + Ofast . 111 |=======================================
GCC 4.6.1: bdver1 + Ofast ........ 110 |=======================================

GraphicsMagick 1.3.12
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
GCC 4.6.1: Stock ................. 68 |========================================
GCC 4.6.1: bdver1 ................ 31 |==================
GCC 4.7-20111015: Stock .......... 68 |========================================
GCC 4.7-20111015: bdver1 ......... 30 |==================
Open64 4.2.4: Stock .............. 20 |============
Open64 4.2.4: bdver1 ............. 19 |===========
AMD Open64 4.2.5.2: Stock ........ 22 |=============
AMD Open64 4.2.5.2: bdver1 ....... 22 |=============
GCC 4.7-20111015: bdver1 + Ofast . 67 |=======================================
GCC 4.6.1: bdver1 + Ofast ........ 67 |=======================================

GraphicsMagick 1.3.12
Operation: Resizing
Iterations Per Minute > Higher Is Better
GCC 4.6.1: Stock ................. 131 |===================================
GCC 4.6.1: bdver1 ................ 66  |=================
GCC 4.7-20111015: Stock .......... 127 |=================================
GCC 4.7-20111015: bdver1 ......... 67  |==================
Open64 4.2.4: Stock .............. 69  |==================
Open64 4.2.4: bdver1 ............. 70  |==================
AMD Open64 4.2.5.2: Stock ........ 69  |==================
AMD Open64 4.2.5.2: bdver1 ....... 69  |==================
GCC 4.7-20111015: bdver1 + Ofast . 146 |======================================
GCC 4.6.1: bdver1 + Ofast ........ 148 |=======================================

GraphicsMagick 1.3.12
Operation: Sharpen
Iterations Per Minute > Higher Is Better
GCC 4.6.1: Stock ................. 65  |=========================
GCC 4.6.1: bdver1 ................ 39  |===============
GCC 4.7-20111015: Stock .......... 65  |=========================
GCC 4.7-20111015: bdver1 ......... 40  |===============
Open64 4.2.4: Stock .............. 35  |==============
Open64 4.2.4: bdver1 ............. 35  |==============
AMD Open64 4.2.5.2: Stock ........ 33  |=============
AMD Open64 4.2.5.2: bdver1 ....... 34  |=============
GCC 4.7-20111015: bdver1 + Ofast . 95  |=====================================
GCC 4.6.1: bdver1 + Ofast ........ 101 |=======================================

Gcrypt Library 1.4.4
CAMELLIA256-ECB Cipher
Microseconds < Lower Is Better
GCC 4.6.1: Stock ........... 2263  |========
GCC 4.6.1: bdver1 .......... 12723 |===========================================
GCC 4.7-20111015: Stock .... 2503  |========
GCC 4.7-20111015: bdver1 ... 10977 |=====================================
Open64 4.2.4: Stock ........ 1960  |=======
Open64 4.2.4: bdver1 ....... 1993  |=======
AMD Open64 4.2.5.2: Stock .. 1950  |=======
AMD Open64 4.2.5.2: bdver1 . 1987  |=======
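
Because some of these tests are "lower is better" and others "higher is better", the stock and bdver1 figures are easier to compare once they are normalized to a single ratio. The following is a minimal Python sketch of one way to do that, using three GCC 4.6.1 results copied from the tables above. The helper name relative_performance and the choice of ratio are illustrative assumptions, not part of the original test run or of the Phoronix Test Suite.

```python
def relative_performance(stock, tuned, lower_is_better):
    """Return tuned performance relative to stock.

    A value above 1.0 means the tuned build was faster than stock,
    below 1.0 means it was slower. (Hypothetical helper for illustration.)
    """
    if lower_is_better:
        # Times: a smaller tuned value is an improvement.
        return stock / tuned
    # Rates: a larger tuned value is an improvement.
    return tuned / stock


# (test name, GCC 4.6.1 stock, GCC 4.6.1 bdver1, lower_is_better)
# Values copied from the result tables above.
results = [
    ("POV-Ray 3.6.1 total time (s)", 822, 636, True),
    ("GraphicsMagick HWB Color Space (iter/min)", 162, 83, False),
    ("Gcrypt CAMELLIA256-ECB (us)", 2263, 12723, True),
]

for name, stock, bdver1, lower in results:
    ratio = relative_performance(stock, bdver1, lower)
    print(f"{name}: bdver1 runs at {ratio:.2f}x stock performance")
```

Read this way, the bdver1 build is ahead of stock in POV-Ray but well behind it in the GraphicsMagick and Gcrypt results for GCC 4.6.1.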