bulldozer, AMD Bulldozer Compiler Tests

AMD FX-8150 Eight-Core testing with an ASUS SABERTOOTH 990FX and NVIDIA GeForce 8800 GT 512MB on Gentoo Base release 2.1 via the Phoronix Test Suite.

AMD Bulldozer Compiler Tests: AMD FX-4100 Bulldozer Quad-core compiler benchmarking under Ubuntu Linux with GCC 4.5 and GCC 4.6. Testing for a future article on Phoronix.com by Michael Larabel. Thanks to Daniel Newkirk for the SSH access to this AMD Bulldozer system due to lack of AMD FX CPUs at Phoronix.

bulldozer:

  Processor: AMD FX-8150 Eight-Core @ 3.61GHz (8 Cores), Motherboard: ASUS SABERTOOTH 990FX, Memory: 4096MB, Disk: 300GB Western Digital WDC WD3000GLFS-0 + 500GB MAXTOR STM350032, Graphics: NVIDIA GeForce 8800 GT 512MB (660/950MHz), Network: Realtek RTL8111/8168B

  OS: Gentoo Base release 2.1, Kernel: 3.0.6-gentoo (x86_64), Desktop: KDE 4.7.2, Display Server: X Server 1.10.4, Display Driver: NVIDIA 285.05.09, OpenGL: 3.3.0 NVIDIA 285.05.09, Compiler: GCC 4.5.3, File-System: ext4, Screen Resolution: 1680x1050

Phenom2 x4 965:

  Processor: AMD Phenom II X4 965 @ 3.40GHz (4 Cores), Motherboard: Gigabyte GA-870A-UD3, Chipset: ATI RX780/RX790, Memory: 4096MB, Disk: 1000GB SAMSUNG HD103UJ, Graphics: NVIDIA GeForce GTX 460, Audio: Realtek ALC892

  OS: Ubuntu 11.10, Kernel: 3.0.0-12-generic (x86_64), Desktop: Unity 4.22.0, Display Server: X Server 1.10.4, Display Driver: NVIDIA, Compiler: GCC 4.6.1, File-System: ext4, Screen Resolution: 1680x1050

Bulldozer GCC 4.5.2:

  Processor: AMD FX-4100 @ 3.60GHz (4 Cores), Motherboard: ASUS M5A97 EVO, Chipset: ATI RD890 PCI to PCI bridge, Memory: 16384MB, Disk: 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0, Graphics: NVIDIA GeForce 8400 GS, Audio: Realtek ALC892, Network: Realtek RTL8111/8168B

  OS: Ubuntu 11.04, Kernel: 2.6.38-11-generic (x86_64), Display Driver: NVIDIA 1.0.0, Compiler: GCC 4.5.2, File-System: xfs

Bulldozer GCC 4.6.1:

  Processor: AMD FX-4100 @ 3.60GHz (4 Cores), Motherboard: ASUS M5A97 EVO, Chipset: ATI RD890 PCI to PCI bridge, Memory: 16384MB, Disk: 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0, Graphics: NVIDIA GeForce 8400 GS, Audio: Realtek ALC892, Network: Realtek RTL8111/8168B

  OS: Ubuntu 11.04, Kernel: 2.6.38-11-generic (x86_64), Display Driver: NVIDIA 1.0.0, Compiler: GCC 4.6.1, File-System: xfs

World of Padman 1.2
1680 x 1050
Frames Per Second > Higher Is Better
bulldozer ...... 431.10 |======================================================
Phenom2 x4 965 . 384.47 |================================================

ET: Quake Wars Demo
1680 x 1050
Frames Per Second > Higher Is Better
bulldozer ...... 71.75 |=======================================================
Phenom2 x4 965 . 65.32 |==================================================

GraphicsMagick 1.3.12
HWB Color Space
Iterations Per Minute > Higher Is Better
bulldozer ...... 141 |=========================================================
Phenom2 x4 965 . 123 |==================================================

GraphicsMagick 1.3.12
Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
bulldozer ...... 65 |==========================================================
Phenom2 x4 965 . 47 |==========================================

GraphicsMagick 1.3.12
Resizing
Iterations Per Minute > Higher Is Better
bulldozer ...... 120 |=========================================================
Phenom2 x4 965 . 103 |=================================================

GraphicsMagick 1.3.12
Sharpen
Iterations Per Minute > Higher Is Better
bulldozer ...... 61 |==========================================================
Phenom2 x4 965 . 52 |=================================================

John The Ripper 1.7.8
Traditional DES
Real C/S > Higher Is Better
bulldozer ...... 3918000 |=====================================================
Phenom2 x4 965 . 2510667 |==================================

John The Ripper 1.7.8
Blowfish
Real C/S > Higher Is Better
bulldozer ...... 954 |====================================================
Phenom2 x4 965 . 1021 |========================================================

TTSIOD 3D Renderer 2.1v
Phong Rendering With Soft-Shadow Mapping
FPS > Higher Is Better
bulldozer ...... 116.06 |======================================================
Phenom2 x4 965 . 67.25 |===============================

Parallel BZIP2 Compression 1.0.5
256MB File Compression
Seconds < Lower Is Better
bulldozer ...... 8.72 |===============================
Phenom2 x4 965 . 15.49 |=======================================================

7-Zip Compression 9.13
Compress Speed Test
MIPS > Higher Is Better
bulldozer ........... 18209 |==================================================
Phenom2 x4 965 ...... 10242 |============================
Bulldozer GCC 4.5.2 . 8877 |========================
Bulldozer GCC 4.6.1 . 8783 |========================

LAME MP3 Encoding 3.98.2
WAV To MP3
Seconds < Lower Is Better
bulldozer ........... 26.74 |=============================================
Phenom2 x4 965 ...... 28.56 |================================================
Bulldozer GCC 4.5.2 . 28.69 |================================================
Bulldozer GCC 4.6.1 . 29.76 |==================================================

FLAC Audio Encoding 1.2.1
WAV To FLAC
Seconds < Lower Is Better
bulldozer ........... 8.25 |================================================
Phenom2 x4 965 ...... 7.96 |==============================================
Bulldozer GCC 4.5.2 . 8.75 |===================================================
Bulldozer GCC 4.6.1 . 8.66 |==================================================

FFmpeg 0.8.2
AVI To NTSC VCD
Seconds < Lower Is Better
bulldozer ........... 9.98 |===================
Phenom2 x4 965 ...... 25.63 |==================================================
Bulldozer GCC 4.5.2 . 16.50 |================================
Bulldozer GCC 4.6.1 . 17.06 |=================================

OpenSSL 1.0.0a
RSA 4096-bit Performance
Signs Per Second > Higher Is Better
bulldozer ........... 68.68 |==================================================
Phenom2 x4 965 ...... 64.20 |===============================================
Bulldozer GCC 4.5.2 . 63.65 |==============================================
Bulldozer GCC 4.6.1 . 63.83 |==============================================

Gcrypt Library 1.4.4
CAMELLIA256-ECB Cipher
Microseconds < Lower Is Better
bulldozer ........... 2807 |======================================
Bulldozer GCC 4.5.2 . 3777 |===================================================
Bulldozer GCC 4.6.1 . 2503 |==================================

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
bulldozer ...... 237.28 |======================================================
Phenom2 x4 965 . 195.53 |============================================

PostgreSQL pgbench 9.0.4
TPC-B Transactions Per Second
TPS > Higher Is Better
bulldozer ...... 370.12 |======================================================
Phenom2 x4 965 . 175.28 |==========================

Apache Benchmark 2.2.17
Static Web Page Serving
Requests Per Second > Higher Is Better
bulldozer ...... 17849.63 |======================================
Phenom2 x4 965 . 24519.11 |====================================================

C-Ray 1.1
Total Time
Seconds < Lower Is Better
bulldozer ........... 51.07 |=======================
Phenom2 x4 965 ...... 102.62 |===============================================
Bulldozer GCC 4.5.2 . 106.75 |=================================================
Bulldozer GCC 4.6.1 . 105.49 |================================================

Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds < Lower Is Better
bulldozer ........... 104 |=========================
Phenom2 x4 965 ...... 183 |============================================
Bulldozer GCC 4.5.2 . 211 |===================================================
Bulldozer GCC 4.6.1 . 215 |====================================================

Tachyon 0.98.7
Total Time
Seconds < Lower Is Better
bulldozer . 21.11 |============================================================

Crafty 23.3
Elapsed Time
Seconds < Lower Is Better
bulldozer . 426.11 |===========================================================

TSCP 1.81
AI Chess Performance
Nodes Per Second > Higher Is Better
bulldozer ........... 322772 |=================================================
Phenom2 x4 965 ...... 289031 |============================================
Bulldozer GCC 4.5.2 . 293780 |=============================================
Bulldozer GCC 4.6.1 . 286148 |===========================================

Timed MAFFT Alignment 6.706
Multiple Sequence Alignment
Seconds < Lower Is Better
bulldozer ........... 26.37 |==============================================
Phenom2 x4 965 ...... 28.51 |=================================================
Bulldozer GCC 4.5.2 . 28.86 |==================================================
Bulldozer GCC 4.6.1 . 28.83 |==================================================

NAS Parallel Benchmarks 3.3
Test / Class: EP.B
Total Mop/s > Higher Is Better
bulldozer ........... 151.52 |=================================================
Phenom2 x4 965 ...... 92.33 |==============================
Bulldozer GCC 4.5.2 . 78.97 |==========================

NAS Parallel Benchmarks 3.3
Test / Class: IS.C
Total Mop/s > Higher Is Better
bulldozer ........... 111.59 |=================================================
Phenom2 x4 965 ...... 91.10 |========================================
Bulldozer GCC 4.5.2 . 99.91 |============================================
Bulldozer GCC 4.6.1 . 99.50 |============================================

NAS Parallel Benchmarks 3.3
Test / Class: LU.A
Total Mop/s > Higher Is Better
bulldozer ........... 11771.43 |===============================================
Phenom2 x4 965 ...... 5322.82 |=====================
Bulldozer GCC 4.5.2 . 6058.10 |========================

NAS Parallel Benchmarks 3.3
Test / Class: MG.B
Total Mop/s > Higher Is Better
bulldozer ........... 5671.96 |================================================
Phenom2 x4 965 ...... 3507.20 |==============================
Bulldozer GCC 4.5.2 . 3645.03 |===============================

NAS Parallel Benchmarks 3.3
Test / Class: UA.A
Total Mop/s > Higher Is Better
bulldozer ........... 39.95 |==================================================
Phenom2 x4 965 ...... 20.96 |==========================
Bulldozer GCC 4.5.2 . 21.48 |===========================

Stream 2009-04-11
Type: Copy
MB/s > Higher Is Better
bulldozer ...... 12128.41 |====================================================
Phenom2 x4 965 . 9473.36 |=========================================

Stream 2009-04-11
Type: Scale
MB/s > Higher Is Better
bulldozer ...... 11666.57 |====================================================
Phenom2 x4 965 . 9204.61 |=========================================

Stream 2009-04-11
Type: Add
MB/s > Higher Is Better
bulldozer ...... 11773.64 |====================================================
Phenom2 x4 965 . 10038.79 |============================================

Stream 2009-04-11
Type: Triad
MB/s > Higher Is Better
bulldozer ...... 12190.00 |====================================================
Phenom2 x4 965 . 10298.18 |============================================

POV-Ray 3.6.1
Total Time
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 913 |====================================================
Bulldozer GCC 4.6.1 . 916 |====================================================

GraphicsMagick 1.3.12
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 123 |================================================
Bulldozer GCC 4.6.1 . 134 |====================================================

GraphicsMagick 1.3.12
Operation: Blur
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 87 |===================================================
Bulldozer GCC 4.6.1 . 90 |=====================================================

GraphicsMagick 1.3.12
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 44 |=====================================================
Bulldozer GCC 4.6.1 . 43 |====================================================

GraphicsMagick 1.3.12
Operation: Resizing
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 92 |===================================================
Bulldozer GCC 4.6.1 . 96 |=====================================================

GraphicsMagick 1.3.12
Operation: Sharpen
Iterations Per Minute > Higher Is Better
Bulldozer GCC 4.5.2 . 44 |==========================================
Bulldozer GCC 4.6.1 . 55 |=====================================================

John The Ripper 1.7.8
Test: Traditional DES
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 3523667 |================================================
Bulldozer GCC 4.6.1 . 3321333 |=============================================

John The Ripper 1.7.8
Test: MD5
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 14082 |==================================================
Bulldozer GCC 4.6.1 . 13598 |================================================

John The Ripper 1.7.8
Test: Blowfish
Real C/S > Higher Is Better
Bulldozer GCC 4.5.2 . 866 |==================================================
Bulldozer GCC 4.6.1 . 895 |====================================================

x264 2010-11-22
H.264 Video Encoding
Frames Per Second > Higher Is Better
Bulldozer GCC 4.5.2 . 50.19 |=================================================
Bulldozer GCC 4.6.1 . 50.89 |==================================================

NAS Parallel Benchmarks 3.3
Test / Class: BT.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 5982.31 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: CG.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 1788.94 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: FT.B
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 3785.19 |================================================

NAS Parallel Benchmarks 3.3
Test / Class: SP.A
Total Mop/s > Higher Is Better
Bulldozer GCC 4.5.2 . 3735.79 |================================================

CLOMP 3.3
Static OMP Speedup
Speedup > Higher Is Better
Bulldozer GCC 4.5.2 . 2.31 |===================================================
Bulldozer GCC 4.6.1 . 2.31 |===================================================

SciMark 2.0
Computational Test: Composite
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 738.80 |=================================================
Bulldozer GCC 4.6.1 . 735.23 |=================================================

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 72.89 |==================================================
Bulldozer GCC 4.6.1 . 69.17 |===============================================

SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 690.52 |=================================================
Bulldozer GCC 4.6.1 . 690.51 |=================================================

SciMark 2.0
Computational Test: Monte Carlo
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 345.55 |=================================================
Bulldozer GCC 4.6.1 . 336.07 |================================================

SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 968.47 |=================================================
Bulldozer GCC 4.6.1 . 940.56 |================================================

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
Bulldozer GCC 4.5.2 . 1616.56 |===============================================
Bulldozer GCC 4.6.1 . 1639.84 |================================================

Fhourstones 3.1
Complex Connect-4 Solving
Kpos / sec > Higher Is Better
Bulldozer GCC 4.5.2 . 9425.30 |================================================
Bulldozer GCC 4.6.1 . 9221.03 |===============================================

N-Queens 1.0
Elapsed Time
Seconds < Lower Is Better
Bulldozer GCC 4.5.2 . 281.74 |===========================================
Bulldozer GCC 4.6.1 . 322.56 |=================================================