AMD Bulldozer Compiler Tests

AMD FX-4100 Bulldozer quad-core compiler benchmarking under Ubuntu Linux with GCC 4.5 and GCC 4.6. Testing for a future article on Phoronix.com by Michael Larabel. Thanks to Daniel Newkirk for providing SSH access to this AMD Bulldozer system, as no AMD FX CPUs were available at Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1110161-LI-1110159AR87&grs&sro.

System Details

Bulldozer GCC 4.5.2
  Processor: AMD FX-4100 @ 3.60GHz (4 Cores)
  Motherboard: ASUS M5A97 EVO
  Chipset: ATI RD890 PCI to PCI bridge
  Memory: 16384MB
  Disk: 60GB SSD G2 64 + 2000GB SAMSUNG HD204UI + 1000GB Western Digital WD1001FALS-0
  Graphics: NVIDIA GeForce 8400 GS
  Audio: Realtek ALC892
  Network: Realtek RTL8111/8168B
  OS: Ubuntu 11.04
  Kernel: 2.6.38-11-generic (x86_64)
  Display Driver: NVIDIA 1.0.0
  Compiler: GCC 4.5.2
  File-System: xfs

Bulldozer GCC 4.6.1
  Compiler: GCC 4.6.1
  (Only the compiler is listed separately for this run; the remaining fields are shared with Bulldozer GCC 4.5.2.)

phii840
  Processor: AMD Phenom II X4 840 @ 3.21GHz (4 Cores)
  Motherboard: Gigabyte GA-MA790GP-DS4H
  Chipset: AMD RS780 + SB700/SB800
  Memory: 4096MB
  Disk: 1000GB SAMSUNG HD103SJ + 500GB Western Digital WDC WD5000AAKS-0 + 1000GB SAMSUNG HD103UJ
  Graphics: NVIDIA Quadro FX 570 256MB (460/400MHz)
  Audio: Realtek ALC889A
  OS: Fedora 14
  Kernel: 2.6.35.10-74.fc14.x86_64 (x86_64)
  Desktop: GNOME 2.32.0
  Display Server: X Server 1.9.5
  Display Driver: NVIDIA 270.41.06
  OpenGL: 3.3.0 NVIDIA 270.41.06
  File-System: ext4
  Screen Resolution: 1920x1080

Note (phii840): Compiz was running on this system. SELinux: Enabled.

Results Overview ("-" means no result was recorded for that configuration)

Test                                                Bulldozer GCC 4.5.2  Bulldozer GCC 4.6.1  phii840
npb: CG.B                                           1788.94              -                    532.17
scimark2: Dense LU Matrix Factorization             1616.56              1639.84              698.09
npb: LU.A                                           6058.10              -                    2601.13
npb: SP.A                                           3735.79              -                    2171.99
scimark2: Sparse Matrix Multiply                    968.47               940.56               571.28
scimark2: Composite                                 738.80               735.23               475.59
gcrypt: CAMELLIA256-ECB Cipher                      3777                 2503                 -
graphics-magick: Blur                               87                   90                   63
npb: FT.B                                           3785.19              -                    2754.65
n-queens: Elapsed Time                              281.74               322.56               241.08
npb: MG.B                                           3645.03              -                    2794.51
john-the-ripper: Traditional DES                    3523667              3321333              2787000
npb: IS.C                                           99.91                99.50                79.57
npb: UA.A                                           21.48                -                    17.61
clomp: Static OMP Speedup                           2.31                 2.31                 2.78
povray: Total Time                                  913                  916                  1071
npb: EP.B                                           78.97                -                    88.81
graphics-magick: Resizing                           92                   96                   103
mafft: Multiple Sequence Alignment                  28.86                28.83                32.02
smallpt: Global Illumination Renderer; 100 Samples  211                  215                  194
scimark2: Fast Fourier Transform                    72.89                69.17                66.13
graphics-magick: HWB Color Space                    123                  134                  122
tscp: AI Chess Performance                          293780               286148               270068
npb: BT.A                                           5982.31              -                    6488.78
john-the-ripper: MD5                                14082                13598                14677
fhourstones: Complex Connect-4 Solving              9425.30              9221.03              8830.77
john-the-ripper: Blowfish                           866                  895                  910
openssl: RSA 4096-bit Performance                   63.65                63.83                60.85
x264: H.264 Video Encoding                          50.19                50.89                52.57
graphics-magick: Local Adaptive Thresholding        44                   43                   45
encode-mp3: WAV To MP3                              28.69                29.76                29.85
scimark2: Monte Carlo                               345.55               336.07               337.66
encode-flac: WAV To FLAC                            8.75                 8.66                 8.51
scimark2: Jacobi Successive Over-Relaxation         690.52               690.51               704.81
c-ray: Total Time                                   106.75               105.49               107.42
compress-7zip: Compress Speed Test                  8877                 8783                 8885
ffmpeg: AVI To NTSC VCD                             16.50                17.06                13.64
graphics-magick: Sharpen                            44                   55                   50
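One way to read the two Bulldozer columns against each other is as a relative gain of GCC 4.6.1 over GCC 4.5.2 that takes the direction of each metric into account. The sketch below is illustrative only: the function name `relative_gain` and the choice to express fewer-is-better results as a speedup ratio (baseline divided by candidate) are assumptions of this example, not something the Phoronix Test Suite export defines. The input numbers are copied from the results in this export.

```python
def relative_gain(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Percentage by which `candidate` beats `baseline` (negative means slower)."""
    if higher_is_better:
        ratio = candidate / baseline
    else:
        # Time-like metric: a smaller value means a faster run.
        ratio = baseline / candidate
    return (ratio - 1.0) * 100.0


# (test, GCC 4.5.2 result, GCC 4.6.1 result, higher_is_better)
# Values are the FX-4100 results reported in this export.
RESULTS = [
    ("gcrypt: CAMELLIA256-ECB Cipher (microseconds)", 3777.0, 2503.0, False),
    ("graphics-magick: Sharpen (iterations/minute)", 44.0, 55.0, True),
    ("n-queens: Elapsed Time (seconds)", 281.74, 322.56, False),
]

for name, gcc452, gcc461, higher in RESULTS:
    print(f"{name}: {relative_gain(gcc452, gcc461, higher):+.1f}%")
```

Under this convention a positive figure means GCC 4.6.1 was faster on the FX-4100 and a negative figure means it was slower. Differences that fall within the reported standard errors should not be read as real changes.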

NAS Parallel Benchmarks

Test / Class: CG.B

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 1788.94 (SE +/- 0.92, N = 3)
phii840: 532.17 (SE +/- 4.70, N = 3)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 1616.56 (SE +/- 14.44, N = 4)
Bulldozer GCC 4.6.1: 1639.84 (SE +/- 6.26, N = 4)
phii840: 698.09 (SE +/- 1.29, N = 4)

NAS Parallel Benchmarks

Test / Class: LU.A

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 6058.10 (SE +/- 82.99, N = 3)
phii840: 2601.13 (SE +/- 1.62, N = 3)

NAS Parallel Benchmarks

Test / Class: SP.A

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 3735.79 (SE +/- 13.66, N = 3)
phii840: 2171.99 (SE +/- 4.54, N = 3)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 968.47 (SE +/- 6.77, N = 4)
Bulldozer GCC 4.6.1: 940.56 (SE +/- 3.24, N = 4)
phii840: 571.28 (SE +/- 1.03, N = 4)

SciMark

Computational Test: Composite

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 738.80 (SE +/- 4.39, N = 4)
Bulldozer GCC 4.6.1: 735.23 (SE +/- 1.65, N = 4)
phii840: 475.59 (SE +/- 0.84, N = 4)

Gcrypt Library

CAMELLIA256-ECB Cipher

Gcrypt Library 1.4.4, Microseconds (fewer is better):
Bulldozer GCC 4.5.2: 3777 (SE +/- 3.33, N = 3)
Bulldozer GCC 4.6.1: 2503 (SE +/- 8.82, N = 3)

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.12, Iterations Per Minute (more is better):
Bulldozer GCC 4.5.2: 87 (SE +/- 0.33, N = 3)
Bulldozer GCC 4.6.1: 90 (SE +/- 0.33, N = 3)
phii840: 63 (SE +/- 0.00, N = 3)

NAS Parallel Benchmarks

Test / Class: FT.B

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 3785.19 (SE +/- 15.77, N = 3)
phii840: 2754.65 (SE +/- 7.81, N = 3)

N-Queens

Elapsed Time

N-Queens 1.0, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 281.74 (SE +/- 0.01, N = 2)
Bulldozer GCC 4.6.1: 322.56 (SE +/- 0.02, N = 2)
phii840: 241.08 (SE +/- 0.33, N = 2)

NAS Parallel Benchmarks

Test / Class: MG.B

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 3645.03 (SE +/- 1.71, N = 3)
phii840: 2794.51 (SE +/- 2.71, N = 3)

John The Ripper

Test: Traditional DES

John The Ripper 1.7.8, Real C/S (more is better):
Bulldozer GCC 4.5.2: 3523667 (SE +/- 5897.27, N = 3)
Bulldozer GCC 4.6.1: 3321333 (SE +/- 15452.44, N = 3)
phii840: 2787000 (SE +/- 4509.25, N = 3)

NAS Parallel Benchmarks

Test / Class: IS.C

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 99.91 (SE +/- 0.09, N = 3)
Bulldozer GCC 4.6.1: 99.50 (SE +/- 0.02, N = 3)
phii840: 79.57 (SE +/- 0.12, N = 3)

NAS Parallel Benchmarks

Test / Class: UA.A

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 21.48 (SE +/- 0.02, N = 3)
phii840: 17.61 (SE +/- 0.02, N = 3)

CLOMP

Static OMP Speedup

CLOMP 3.3, Speedup (more is better):
Bulldozer GCC 4.5.2: 2.31 (SE +/- 0.01, N = 5)
Bulldozer GCC 4.6.1: 2.31 (SE +/- 0.02, N = 5)
phii840: 2.78 (SE +/- 0.02, N = 5)

POV-Ray

Total Time

POV-Ray 3.6.1, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 913
Bulldozer GCC 4.6.1: 916
phii840: 1071

NAS Parallel Benchmarks

Test / Class: EP.B

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 78.97 (SE +/- 0.14, N = 3)
phii840: 88.81 (SE +/- 0.06, N = 3)

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.12, Iterations Per Minute (more is better):
Bulldozer GCC 4.5.2: 92 (SE +/- 0.33, N = 3)
Bulldozer GCC 4.6.1: 96 (SE +/- 0.33, N = 3)
phii840: 103 (SE +/- 0.33, N = 3)

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.706, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 28.86 (SE +/- 0.01, N = 3)
Bulldozer GCC 4.6.1: 28.83 (SE +/- 0.01, N = 3)
phii840: 32.02 (SE +/- 0.04, N = 3)

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 211 (SE +/- 1.00, N = 3)
Bulldozer GCC 4.6.1: 215 (SE +/- 1.20, N = 3)
phii840: 194 (SE +/- 0.33, N = 3)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 72.89 (SE +/- 1.30, N = 4)
Bulldozer GCC 4.6.1: 69.17 (SE +/- 0.11, N = 4)
phii840: 66.13 (SE +/- 0.13, N = 4)

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.12, Iterations Per Minute (more is better):
Bulldozer GCC 4.5.2: 123 (SE +/- 0.00, N = 3)
Bulldozer GCC 4.6.1: 134 (SE +/- 0.00, N = 3)
phii840: 122 (SE +/- 0.33, N = 3)

TSCP

AI Chess Performance

TSCP 1.81, Nodes Per Second (more is better):
Bulldozer GCC 4.5.2: 293780 (SE +/- 250.13, N = 5)
Bulldozer GCC 4.6.1: 286148 (SE +/- 184.65, N = 5)
phii840: 270068 (SE +/- 189.99, N = 5)

NAS Parallel Benchmarks

Test / Class: BT.A

NAS Parallel Benchmarks 3.3, Total Mop/s (more is better):
Bulldozer GCC 4.5.2: 5982.31 (SE +/- 12.46, N = 3)
phii840: 6488.78 (SE +/- 11.21, N = 3)

John The Ripper

Test: MD5

John The Ripper 1.7.8, Real C/S (more is better):
Bulldozer GCC 4.5.2: 14082 (SE +/- 4.70, N = 3)
Bulldozer GCC 4.6.1: 13598 (SE +/- 342.56, N = 3)
phii840: 14677 (SE +/- 15.17, N = 3)

Fhourstones

Complex Connect-4 Solving

Fhourstones 3.1, Kpos / sec (more is better):
Bulldozer GCC 4.5.2: 9425.30 (SE +/- 8.51, N = 3)
Bulldozer GCC 4.6.1: 9221.03 (SE +/- 12.73, N = 3)
phii840: 8830.77 (SE +/- 6.81, N = 3)

John The Ripper

Test: Blowfish

John The Ripper 1.7.8, Real C/S (more is better):
Bulldozer GCC 4.5.2: 866 (SE +/- 0.00, N = 3)
Bulldozer GCC 4.6.1: 895 (SE +/- 11.67, N = 3)
phii840: 910 (SE +/- 1.00, N = 3)

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.0a, Signs Per Second (more is better):
Bulldozer GCC 4.5.2: 63.65 (SE +/- 0.25, N = 4)
Bulldozer GCC 4.6.1: 63.83 (SE +/- 0.31, N = 4)
phii840: 60.85 (SE +/- 0.05, N = 4)

x264

H.264 Video Encoding

x264 2010-11-22, Frames Per Second (more is better):
Bulldozer GCC 4.5.2: 50.19 (SE +/- 0.06, N = 3)
Bulldozer GCC 4.6.1: 50.89 (SE +/- 0.13, N = 3)
phii840: 52.57 (SE +/- 0.11, N = 3)

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.12, Iterations Per Minute (more is better):
Bulldozer GCC 4.5.2: 44 (SE +/- 0.00, N = 3)
Bulldozer GCC 4.6.1: 43 (SE +/- 0.00, N = 3)
phii840: 45 (SE +/- 0.00, N = 3)

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.98.2, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 28.69 (SE +/- 0.01, N = 5)
Bulldozer GCC 4.6.1: 29.76 (SE +/- 0.04, N = 5)
phii840: 29.85 (SE +/- 0.01, N = 5)

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 345.55 (SE +/- 1.31, N = 4)
Bulldozer GCC 4.6.1: 336.07 (SE +/- 0.31, N = 4)
phii840: 337.66 (SE +/- 0.43, N = 4)

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.2.1, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 8.75 (SE +/- 0.01, N = 5)
Bulldozer GCC 4.6.1: 8.66 (SE +/- 0.01, N = 5)
phii840: 8.51 (SE +/- 0.00, N = 5)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Mflops (more is better):
Bulldozer GCC 4.5.2: 690.52 (SE +/- 1.27, N = 4)
Bulldozer GCC 4.6.1: 690.51 (SE +/- 0.00, N = 4)
phii840: 704.81 (SE +/- 1.61, N = 4)

C-Ray

Total Time

C-Ray 1.1, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 106.75 (SE +/- 1.95, N = 3)
Bulldozer GCC 4.6.1: 105.49 (SE +/- 0.70, N = 3)
phii840: 107.42 (SE +/- 0.21, N = 3)

7-Zip Compression

Compress Speed Test

7-Zip Compression 9.13, MIPS (more is better):
Bulldozer GCC 4.5.2: 8877 (SE +/- 4.06, N = 3)
Bulldozer GCC 4.6.1: 8783 (SE +/- 55.43, N = 3)
phii840: 8885 (SE +/- 74.42, N = 3)

FFmpeg

AVI To NTSC VCD

FFmpeg 0.8.2, Seconds (fewer is better):
Bulldozer GCC 4.5.2: 16.50 (SE +/- 0.84, N = 6)
Bulldozer GCC 4.6.1: 17.06 (SE +/- 0.11, N = 3)
phii840: 13.64 (SE +/- 0.07, N = 3)

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.12, Iterations Per Minute (more is better):
Bulldozer GCC 4.5.2: 44 (SE +/- 1.00, N = 6)
Bulldozer GCC 4.6.1: 55 (SE +/- 2.17, N = 6)
phii840: 50 (SE +/- 0.00, N = 3)


Phoronix Test Suite v10.8.4