AMD Bulldozer Compiler Tests

AMD FX-4100 Bulldozer Quad-core compiler benchmarking under Ubuntu Linux with GCC 4.5 and GCC 4.6. Testing for a future article on Phoronix.com by Michael Larabel. Thanks to Daniel Newkirk for the SSH access to this AMD Bulldozer system due to lack of AMD FX CPUs at Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1110161-LI-1110159AR87.

AMD Bulldozer Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay DriverCompilerFile-SystemDesktopDisplay ServerOpenGLScreen ResolutionBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840AMD FX-4100 @ 3.60GHz (4 Cores)ASUS M5A97 EVOATI RD890 PCI to PCI bridge16384MB60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0NVIDIA GeForce 8400 GSRealtek ALC892Realtek RTL8111/8168BUbuntu 11.042.6.38-11-generic (x86_64)NVIDIA 1.0.0GCC 4.5.2xfsGCC 4.6.1AMD Phenom II X4 840 @ 3.21GHz (4 Cores)Gigabyte GA-MA790GP-DS4HAMD RS780 + SB700/SB8004096MB1000GB SAMSUNG HD103SJ + 500GB Western Digital WDC WD5000AAKS-0 + 1000GB SAMSUNG HD103UJNVIDIA Quadro FX 570 256MB (460/400MHz)Realtek ALC889AFedora 142.6.35.10-74.fc14.x86_64 (x86_64)GNOME 2.32.0X Server 1.9.5NVIDIA 270.41.063.3.0 NVIDIA 270.41.06ext41920x1080OpenBenchmarking.orgSystem Details- phii840: Compiz was running on this system. SELinux: Enabled.

AMD Bulldozer Compiler Testsc-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplespovray: Total Timetscp: AI Chess Performancegraphics-magick: HWB Color Spacegraphics-magick: Blurgraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Sharpenjohn-the-ripper: Traditional DESjohn-the-ripper: MD5john-the-ripper: Blowfishopenssl: RSA 4096-bit Performancegcrypt: CAMELLIA256-ECB Ciphercompress-7zip: Compress Speed Testencode-mp3: WAV To MP3encode-flac: WAV To FLACffmpeg: AVI To NTSC VCDx264: H.264 Video Encodingmafft: Multiple Sequence Alignmentnpb: BT.Anpb: CG.Bnpb: EP.Bnpb: FT.Bnpb: IS.Cnpb: LU.Anpb: MG.Bnpb: SP.Anpb: UA.Aclomp: Static OMP Speedupscimark2: Compositescimark2: Fast Fourier Transformscimark2: Jacobi Successive Over-Relaxationscimark2: Monte Carloscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationfhourstones: Complex Connect-4 Solvingn-queens: Elapsed TimeBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840106.752119132937801238744924435236671408286663.653777887728.698.7516.5050.1928.865982.311788.9478.973785.1999.916058.103645.033735.7921.482.31738.8072.89690.52345.55968.471616.569425.30281.74105.492159162861481349043965533213331359889563.832503878329.768.6617.0650.8928.8399.502.31735.2369.17690.51336.07940.561639.849221.03322.56107.42194107127006812263451035027870001467791060.85888529.858.5113.6452.5732.026488.78532.1788.812754.6579.572601.132794.512171.9917.612.78475.5966.13704.81337.66571.28698.098830.77241.08OpenBenchmarking.org

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84020406080100SE +/- 1.95, N = 3SE +/- 0.70, N = 3SE +/- 0.21, N = 3106.75105.49107.42

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84050100150200250SE +/- 1.00, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3211215194

POV-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.6.1Total TimeBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84020040060080010009139161071

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84060K120K180K240K300KSE +/- 250.13, N = 5SE +/- 184.65, N = 5SE +/- 189.99, N = 5293780286148270068

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: HWB Color SpaceBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3123134122

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: BlurBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84020406080100SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3879063

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Local Adaptive ThresholdingBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8401020304050SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3444345

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: ResizingBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84020406080100SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 39296103

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: SharpenBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8401224364860SE +/- 1.00, N = 6SE +/- 2.17, N = 6SE +/- 0.00, N = 3445550

John The Ripper

Test: Traditional DES

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.7.8Test: Traditional DESBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840800K1600K2400K3200K4000KSE +/- 5897.27, N = 3SE +/- 15452.44, N = 3SE +/- 4509.25, N = 3352366733213332787000

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.7.8Test: MD5Bulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8403K6K9K12K15KSE +/- 4.70, N = 3SE +/- 342.56, N = 3SE +/- 15.17, N = 3140821359814677

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.7.8Test: BlowfishBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8402004006008001000SE +/- 0.00, N = 3SE +/- 11.67, N = 3SE +/- 1.00, N = 3866895910

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.0aRSA 4096-bit PerformanceBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8401428425670SE +/- 0.25, N = 4SE +/- 0.31, N = 4SE +/- 0.05, N = 463.6563.8360.85

Gcrypt Library

CAMELLIA256-ECB Cipher

OpenBenchmarking.orgMicroseconds, Fewer Is BetterGcrypt Library 1.4.4CAMELLIA256-ECB CipherBulldozer GCC 4.5.2Bulldozer GCC 4.6.18001600240032004000SE +/- 3.33, N = 3SE +/- 8.82, N = 337772503

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 9.13Compress Speed TestBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8402K4K6K8K10KSE +/- 4.06, N = 3SE +/- 55.43, N = 3SE +/- 74.42, N = 3887787838885

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.98.2WAV To MP3Bulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840714212835SE +/- 0.01, N = 5SE +/- 0.04, N = 5SE +/- 0.01, N = 528.6929.7629.85

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.2.1WAV To FLACBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840246810SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 58.758.668.51

FFmpeg

AVI To NTSC VCD

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 0.8.2AVI To NTSC VCDBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84048121620SE +/- 0.84, N = 6SE +/- 0.11, N = 3SE +/- 0.07, N = 316.5017.0613.64

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2010-11-22H.264 Video EncodingBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8401224364860SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 350.1950.8952.57

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.706Multiple Sequence AlignmentBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 328.8628.8332.02

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: BT.ABulldozer GCC 4.5.2phii84014002800420056007000SE +/- 12.46, N = 3SE +/- 11.21, N = 35982.316488.78

NAS Parallel Benchmarks

Test / Class: CG.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: CG.BBulldozer GCC 4.5.2phii840400800120016002000SE +/- 0.92, N = 3SE +/- 4.70, N = 31788.94532.17

NAS Parallel Benchmarks

Test / Class: EP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.BBulldozer GCC 4.5.2phii84020406080100SE +/- 0.14, N = 3SE +/- 0.06, N = 378.9788.81

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: FT.BBulldozer GCC 4.5.2phii8408001600240032004000SE +/- 15.77, N = 3SE +/- 7.81, N = 33785.192754.65

NAS Parallel Benchmarks

Test / Class: IS.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: IS.CBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84020406080100SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 399.9199.5079.57

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.ABulldozer GCC 4.5.2phii84013002600390052006500SE +/- 82.99, N = 3SE +/- 1.62, N = 36058.102601.13

NAS Parallel Benchmarks

Test / Class: MG.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: MG.BBulldozer GCC 4.5.2phii8408001600240032004000SE +/- 1.71, N = 3SE +/- 2.71, N = 33645.032794.51

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: SP.ABulldozer GCC 4.5.2phii8408001600240032004000SE +/- 13.66, N = 3SE +/- 4.54, N = 33735.792171.99

NAS Parallel Benchmarks

Test / Class: UA.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: UA.ABulldozer GCC 4.5.2phii840510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 321.4817.61

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 3.3Static OMP SpeedupBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8400.62551.2511.87652.5023.1275SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.02, N = 52.312.312.78

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840160320480640800SE +/- 4.39, N = 4SE +/- 1.65, N = 4SE +/- 0.84, N = 4738.80735.23475.59

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8401632486480SE +/- 1.30, N = 4SE +/- 0.11, N = 4SE +/- 0.13, N = 472.8969.1766.13

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840150300450600750SE +/- 1.27, N = 4SE +/- 0.00, N = 4SE +/- 1.61, N = 4690.52690.51704.81

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84080160240320400SE +/- 1.31, N = 4SE +/- 0.31, N = 4SE +/- 0.43, N = 4345.55336.07337.66

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8402004006008001000SE +/- 6.77, N = 4SE +/- 3.24, N = 4SE +/- 1.03, N = 4968.47940.56571.28

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii840400800120016002000SE +/- 14.44, N = 4SE +/- 6.26, N = 4SE +/- 1.29, N = 41616.561639.84698.09

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii8402K4K6K8K10KSE +/- 8.51, N = 3SE +/- 12.73, N = 3SE +/- 6.81, N = 39425.309221.038830.77

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeBulldozer GCC 4.5.2Bulldozer GCC 4.6.1phii84070140210280350SE +/- 0.01, N = 2SE +/- 0.02, N = 2SE +/- 0.33, N = 2281.74322.56241.08


Phoronix Test Suite v10.8.4