AMD Bulldozer Compiler Tests, slew

AMD FX-4100 Bulldozer Quad-core compiler benchmarking under Ubuntu Linux with GCC 4.5 and GCC 4.6. Testing for a future article on Phoronix.com by Michael Larabel. Thanks to Daniel Newkirk for the SSH access to this AMD Bulldozer system due to lack of AMD FX CPUs at Phoronix.

slew: AMD FX-4100 testing with an ASUS M5A88-V EVO and AMD Radeon HD 4250 on Ubuntu 11.04 via the Phoronix Test Suite.

Bulldozer GCC 4.5.2:

    Processor: AMD FX-4100 @ 3.60GHz (4 Cores), Motherboard: ASUS M5A97 EVO, Chipset: ATI RD890 PCI to PCI bridge, Memory: 16384MB, Disk: 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0, Graphics: NVIDIA GeForce 8400 GS, Audio: Realtek ALC892, Network: Realtek RTL8111/8168B

    OS: Ubuntu 11.04, Kernel: 2.6.38-11-generic (x86_64), Display Driver: NVIDIA 1.0.0, Compiler: GCC 4.5.2, File-System: xfs

Bulldozer GCC 4.6.1:

    Processor: AMD FX-4100 @ 3.60GHz (4 Cores), Motherboard: ASUS M5A97 EVO, Chipset: ATI RD890 PCI to PCI bridge, Memory: 16384MB, Disk: 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0, Graphics: NVIDIA GeForce 8400 GS, Audio: Realtek ALC892, Network: Realtek RTL8111/8168B

    OS: Ubuntu 11.04, Kernel: 2.6.38-11-generic (x86_64), Display Driver: NVIDIA 1.0.0, Compiler: GCC 4.6.1, File-System: xfs

c-ray:

    Processor: AMD FX -4100 @ 3.60GHz (4 Cores), Motherboard: ASUS M5A88-V EVO, Chipset: AMD RS880, Memory: 4096MB, Disk: 1000GB SAMSUNG HD103SJ + 164GB Maxtor 6Y160M0 + 500GB Western Digital WDC WD5000AAKS-0, Graphics: AMD Radeon HD 4250, Audio: Realtek ALC892, Monitor: SyncMaster, Network: Realtek RTL8111/8168B

    OS: Ubuntu 11.04, Kernel: 3.1.0-999-generic (x86_64), Desktop: GNOME 2.32.1, Display Server: X Server 1.10.3.902 (1.10.4 RC 2), Display Driver: radeon 6.14.0, Compiler: GCC 4.5.2, File-System: ext4, Screen Resolution: 1600x900

C-Ray 1.1
Total Time
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 106.75 |=================================================
Bulldozer GCC 4.6.1 . 105.49 |================================================
c-ray ............... 84.19 |=======================================

Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 211 |===================================================
Bulldozer GCC 4.6.1 . 215 |====================================================

POV-Ray 3.6.1
Total Time
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 913 |====================================================
Bulldozer GCC 4.6.1 . 916 |====================================================

TSCP 1.81
AI Chess Performance
Nodes Per Second > Higher Is Better
Bulldozer GCC 4.5.2 . 293780 |=================================================
Bulldozer GCC 4.6.1 . 286148 |================================================

GraphicsMagick 1.3.12
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 123 |================================================
Bulldozer GCC 4.6.1 . 134 |====================================================

GraphicsMagick 1.3.12
Operation: Blur
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 87 |===================================================
Bulldozer GCC 4.6.1 . 90 |=====================================================

GraphicsMagick 1.3.12
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 44 |=====================================================
Bulldozer GCC 4.6.1 . 43 |====================================================

GraphicsMagick 1.3.12
Operation: Resizing
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 92 |===================================================
Bulldozer GCC 4.6.1 . 96 |=====================================================

GraphicsMagick 1.3.12
Operation: Sharpen
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 44 |==========================================
Bulldozer GCC 4.6.1 . 55 |=====================================================

John The Ripper 1.7.8
Test: Traditional DES
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 3523667 |================================================
Bulldozer GCC 4.6.1 . 3321333 |=============================================

John The Ripper 1.7.8
Test: MD5
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 14082 |==================================================
Bulldozer GCC 4.6.1 . 13598 |================================================

John The Ripper 1.7.8
Test: Blowfish
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 866 |==================================================
Bulldozer GCC 4.6.1 . 895 |====================================================

OpenSSL 1.0.0a
RSA 4096-bit Performance
Signs Per Second > Higher Is Better
Bulldozer GCC 4.5.2 . 63.65 |==================================================
Bulldozer GCC 4.6.1 . 63.83 |==================================================

Gcrypt Library 1.4.4
CAMELLIA256-ECB Cipher
Microseconds < Lower Is Better
Bulldozer GCC 4.5.2 . 3777 |===================================================
Bulldozer GCC 4.6.1 . 2503 |==================================

7-Zip Compression 9.13
Compress Speed Test
MIPS > Higher Is Better
Bulldozer GCC 4.5.2 . 8877 |===================================================
Bulldozer GCC 4.6.1 . 8783 |==================================================

LAME MP3 Encoding 3.98.2
WAV To MP3
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 28.69 |================================================
Bulldozer GCC 4.6.1 . 29.76 |==================================================

FLAC Audio Encoding 1.2.1
WAV To FLAC
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 8.75 |===================================================
Bulldozer GCC 4.6.1 . 8.66 |==================================================

FFmpeg 0.8.2
AVI To NTSC VCD
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 16.50 |================================================
Bulldozer GCC 4.6.1 . 17.06 |==================================================

x264 2010-11-22
H.264 Video Encoding
Frames Per Second > Higher Is Better
Bulldozer GCC 4.5.2 . 50.19 |=================================================
Bulldozer GCC 4.6.1 . 50.89 |==================================================

Timed MAFFT Alignment 6.706
Multiple Sequence Alignment
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 28.86 |==================================================
Bulldozer GCC 4.6.1 . 28.83 |==================================================

NAS Parallel Benchmarks 3.3
Test / Class: BT.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 5982.31 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: CG.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 1788.94 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: EP.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 78.97 |==================================================

NAS Parallel Benchmarks 3.3
Test / Class: FT.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 3785.19 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: IS.C
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 99.91 |==================================================
Bulldozer GCC 4.6.1 . 99.50 |==================================================

NAS Parallel Benchmarks 3.3
Test / Class: LU.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 6058.10 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: MG.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 3645.03 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: SP.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 3735.79 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: UA.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 21.48 |==================================================

CLOMP 3.3
Static OMP Speedup
Speedup > Higher Is Better
Bulldozer GCC 4.5.2 . 2.31 |===================================================
Bulldozer GCC 4.6.1 . 2.31 |===================================================

SciMark 2.0
Computational Test: Composite
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 738.80 |=================================================
Bulldozer GCC 4.6.1 . 735.23 |=================================================

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 72.89 |==================================================
Bulldozer GCC 4.6.1 . 69.17 |===============================================

SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 690.52 |=================================================
Bulldozer GCC 4.6.1 . 690.51 |=================================================

SciMark 2.0
Computational Test: Monte Carlo
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 345.55 |=================================================
Bulldozer GCC 4.6.1 . 336.07 |================================================

SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 968.47 |=================================================
Bulldozer GCC 4.6.1 . 940.56 |================================================

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 1616.56 |===============================================
Bulldozer GCC 4.6.1 . 1639.84 |================================================

Fhourstones 3.1
Complex Connect-4 Solving
Kpos / sec > Higher Is Better
Bulldozer GCC 4.5.2 . 9425.30 |================================================
Bulldozer GCC 4.6.1 . 9221.03 |===============================================

N-Queens 1.0
Elapsed Time
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 281.74 |===========================================
Bulldozer GCC 4.6.1 . 322.56 |=================================================
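
Because the results mix "Lower Is Better" and "Higher Is Better" metrics, the raw numbers are easy to misread when comparing GCC 4.5.2 against GCC 4.6.1. The following is a minimal sketch, not part of the Phoronix Test Suite, of one way to turn a pair of results into a signed percentage. The function name relative_gain and the convention used (positive means GCC 4.6.1 did better, measured relative to the GCC 4.5.2 value) are assumptions made for illustration; the figures are copied from the results above.

```python
def relative_gain(old, new, higher_is_better):
    """Percentage improvement of `new` over `old`, relative to `old`.

    Positive values mean `new` is better, negative values mean a regression.
    For lower-is-better metrics the sign is flipped so the convention holds.
    """
    if higher_is_better:
        return (new - old) / old * 100.0
    return (old - new) / old * 100.0


# (test name, GCC 4.5.2 result, GCC 4.6.1 result, higher_is_better)
# Values are taken from the result listing; the selection is arbitrary.
RESULTS = [
    ("C-Ray 1.1 (seconds)", 106.75, 105.49, False),
    ("GraphicsMagick Sharpen (iter/min)", 44, 55, True),
    ("Gcrypt CAMELLIA256-ECB (microseconds)", 3777, 2503, False),
    ("N-Queens 1.0 (seconds)", 281.74, 322.56, False),
]

if __name__ == "__main__":
    for name, gcc45, gcc46, higher in RESULTS:
        gain = relative_gain(gcc45, gcc46, higher)
        print(f"{name}: {gain:+.2f}% with GCC 4.6.1")
```

Under this convention a single run gives no indication of noise, so small differences (for example the OpenSSL or MAFFT results) should not be read as compiler effects without repeated measurements.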