AMD Bulldozer Compiler Tests AMD FX-4100 Bulldozer Quad-core compiler benchmarking under Ubuntu Linux with GCC 4.5 and GCC 4.6. Testing for a future article on Phoronix.com by Michael Larabel. Thanks to Daniel Newkirk for the SSH access to this AMD Bulldozer system due to lack of AMD FX CPUs at Phoronix.
HTML result view exported from: https://openbenchmarking.org/result/1110159-AR-BULLDOZER08&grr .
AMD Bulldozer Compiler Tests Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Display Driver Compiler File-System Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 AMD FX-4100 @ 3.60GHz (4 Cores) ASUS M5A97 EVO ATI RD890 PCI to PCI bridge 16384MB 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0 NVIDIA GeForce 8400 GS Realtek ALC892 Realtek RTL8111/8168B Ubuntu 11.04 2.6.38-11-generic (x86_64) NVIDIA 1.0.0 GCC 4.5.2 xfs GCC 4.6.1 OpenBenchmarking.org
AMD Bulldozer Compiler Tests n-queens: Elapsed Time fhourstones: Complex Connect-4 Solving scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Monte Carlo scimark2: Jacobi Successive Over-Relaxation scimark2: Fast Fourier Transform scimark2: Composite clomp: Static OMP Speedup npb: UA.A npb: SP.A npb: MG.B npb: LU.A npb: IS.C npb: FT.B npb: EP.B npb: CG.B npb: BT.A mafft: Multiple Sequence Alignment x264: H.264 Video Encoding ffmpeg: AVI To NTSC VCD encode-flac: WAV To FLAC encode-mp3: WAV To MP3 compress-7zip: Compress Speed Test gcrypt: CAMELLIA256-ECB Cipher openssl: RSA 4096-bit Performance john-the-ripper: Blowfish john-the-ripper: MD5 john-the-ripper: Traditional DES graphics-magick: Sharpen graphics-magick: Resizing graphics-magick: Local Adaptive Thresholding graphics-magick: Blur graphics-magick: HWB Color Space tscp: AI Chess Performance povray: Total Time smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 281.74 9425.30 1616.56 968.47 345.55 690.52 72.89 738.80 2.31 21.48 3735.79 3645.03 6058.10 99.91 3785.19 78.97 1788.94 5982.31 28.86 50.19 16.50 8.75 28.69 8877 3777 63.65 866 14082 3523667 44 92 44 87 123 293780 913 211 106.75 322.56 9221.03 1639.84 940.56 336.07 690.51 69.17 735.23 2.31 99.50 28.83 50.89 17.06 8.66 29.76 8783 2503 63.83 895 13598 3321333 55 96 43 90 134 286148 916 215 105.49 OpenBenchmarking.org
N-Queens Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better N-Queens 1.0 Elapsed Time Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 70 140 210 280 350 SE +/- 0.01, N = 2 SE +/- 0.02, N = 2 281.74 322.56
Fhourstones Complex Connect-4 Solving OpenBenchmarking.org Kpos / sec, More Is Better Fhourstones 3.1 Complex Connect-4 Solving Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 2K 4K 6K 8K 10K SE +/- 8.51, N = 3 SE +/- 12.73, N = 3 9425.30 9221.03
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 400 800 1200 1600 2000 SE +/- 14.44, N = 4 SE +/- 6.26, N = 4 1616.56 1639.84
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 200 400 600 800 1000 SE +/- 6.77, N = 4 SE +/- 3.24, N = 4 968.47 940.56
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 80 160 240 320 400 SE +/- 1.31, N = 4 SE +/- 0.31, N = 4 345.55 336.07
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 150 300 450 600 750 SE +/- 1.27, N = 4 SE +/- 0.00, N = 4 690.52 690.51
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 16 32 48 64 80 SE +/- 1.30, N = 4 SE +/- 0.11, N = 4 72.89 69.17
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 160 320 480 640 800 SE +/- 4.39, N = 4 SE +/- 1.65, N = 4 738.80 735.23
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 3.3 Static OMP Speedup Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 0.5198 1.0396 1.5594 2.0792 2.599 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 2.31 2.31
NAS Parallel Benchmarks Test / Class: UA.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: UA.A Bulldozer GCC 4.5.2 5 10 15 20 25 SE +/- 0.02, N = 3 21.48
NAS Parallel Benchmarks Test / Class: SP.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: SP.A Bulldozer GCC 4.5.2 800 1600 2400 3200 4000 SE +/- 13.66, N = 3 3735.79
NAS Parallel Benchmarks Test / Class: MG.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: MG.B Bulldozer GCC 4.5.2 800 1600 2400 3200 4000 SE +/- 1.71, N = 3 3645.03
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: LU.A Bulldozer GCC 4.5.2 1300 2600 3900 5200 6500 SE +/- 82.99, N = 3 6058.10
NAS Parallel Benchmarks Test / Class: IS.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: IS.C Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 20 40 60 80 100 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 99.91 99.50
NAS Parallel Benchmarks Test / Class: FT.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: FT.B Bulldozer GCC 4.5.2 800 1600 2400 3200 4000 SE +/- 15.77, N = 3 3785.19
NAS Parallel Benchmarks Test / Class: EP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: EP.B Bulldozer GCC 4.5.2 20 40 60 80 100 SE +/- 0.14, N = 3 78.97
NAS Parallel Benchmarks Test / Class: CG.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: CG.B Bulldozer GCC 4.5.2 400 800 1200 1600 2000 SE +/- 0.92, N = 3 1788.94
NAS Parallel Benchmarks Test / Class: BT.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: BT.A Bulldozer GCC 4.5.2 1300 2600 3900 5200 6500 SE +/- 12.46, N = 3 5982.31
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.706 Multiple Sequence Alignment Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 7 14 21 28 35 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 28.86 28.83
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2010-11-22 H.264 Video Encoding Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 50.19 50.89
FFmpeg AVI To NTSC VCD OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 0.8.2 AVI To NTSC VCD Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 4 8 12 16 20 SE +/- 0.84, N = 6 SE +/- 0.11, N = 3 16.50 17.06
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 2 4 6 8 10 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 8.75 8.66
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.98.2 WAV To MP3 Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 7 14 21 28 35 SE +/- 0.01, N = 5 SE +/- 0.04, N = 5 28.69 29.76
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.13 Compress Speed Test Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 2K 4K 6K 8K 10K SE +/- 4.06, N = 3 SE +/- 55.43, N = 3 8877 8783
Gcrypt Library CAMELLIA256-ECB Cipher OpenBenchmarking.org Microseconds, Fewer Is Better Gcrypt Library 1.4.4 CAMELLIA256-ECB Cipher Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 800 1600 2400 3200 4000 SE +/- 3.33, N = 3 SE +/- 8.82, N = 3 3777 2503
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0a RSA 4096-bit Performance Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 14 28 42 56 70 SE +/- 0.25, N = 4 SE +/- 0.31, N = 4 63.65 63.83
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.8 Test: Blowfish Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 200 400 600 800 1000 SE +/- 0.00, N = 3 SE +/- 11.67, N = 3 866 895
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.8 Test: MD5 Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 3K 6K 9K 12K 15K SE +/- 4.70, N = 3 SE +/- 342.56, N = 3 14082 13598
John The Ripper Test: Traditional DES OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.8 Test: Traditional DES Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 800K 1600K 2400K 3200K 4000K SE +/- 5897.27, N = 3 SE +/- 15452.44, N = 3 3523667 3321333
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Sharpen Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 12 24 36 48 60 SE +/- 1.00, N = 6 SE +/- 2.17, N = 6 44 55
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Resizing Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 92 96
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Local Adaptive Thresholding Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 10 20 30 40 50 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 44 43
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Blur Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 87 90
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: HWB Color Space Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 123 134
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 60K 120K 180K 240K 300K SE +/- 250.13, N = 5 SE +/- 184.65, N = 5 293780 286148
POV-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.6.1 Total Time Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 200 400 600 800 1000 913 916
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 50 100 150 200 250 SE +/- 1.00, N = 3 SE +/- 1.20, N = 3 211 215
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time Bulldozer GCC 4.5.2 Bulldozer GCC 4.6.1 20 40 60 80 100 SE +/- 1.95, N = 3 SE +/- 0.70, N = 3 106.75 105.49
Phoronix Test Suite v10.8.4