GCC AMD Bulldozer Compiler Tuning

AMD FX-8150 Bulldozer compiler tuning using GCC 4.7.1 with different march options for the test profiles to look at the latest AMD GCC performance. Benchmarking by Michael Larabel for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1209227-RA-GCCAMDBUL32
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
march=nocona
September 14 2012
 
march=core2
September 14 2012
 
march=k8
September 14 2012
 
march=k8-sse3
September 15 2012
 
march=barcelona
September 14 2012
 
march=bdver1
September 14 2012
 
Invert Behavior (Only Show Selected Data)
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCC AMD Bulldozer Compiler TuningOpenBenchmarking.orgPhoronix Test SuiteAMD FX-8150 Eight-Core @ 3.60GHz (8 Cores)ASUS Crosshair V FormulaAMD ATI RD890 bridge4096MB60GB OCZ VERTEX2NVIDIA GeForce 9600 GSO 512MB (399/399MHz)Realtek ALC889DELL P2210HIntel 82583V Gigabit ConnectionUbuntu 12.103.5.0-14-generic (x86_64)Unity 6.4.0X Server 1.13.0nouveau 1.0.23.0 Mesa 8.1-devel Gallium 0.4GCC 4.7ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC AMD Bulldozer Compiler Tuning BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v - Scaling Governor: ondemand- march=nocona: Compiz was running on this system.- march=core2: Compiz was running on this system.- march=k8: Compiz and Firefox were running on this system.- march=k8-sse3: Compiz was running on this system.- march=barcelona: Compiz was running on this system.- march=bdver1: Compiz and Firefox were running on this system.

march=noconamarch=core2march=k8march=k8-sse3march=barcelonamarch=bdver1Result OverviewPhoronix Test Suite100%124%148%172%C-RayGraphicsMagickSmallptOpen FMM Nero2DGraphicsMagickGraphicsMagickHimeno BenchmarkGraphicsMagickTimed PHP CompilationPostgreSQL pgbenchGraphicsMagickTotal TimeSharpenG.I.R.1.STotal TimeBlurResizingP.P.SL.A.TTime To CompileT.B.T.P.SHWB Color Space

GCC AMD Bulldozer Compiler Tuninggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverbuild-php: Time To Compilec-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplesnero2d: Total Timepgbench: TPC-B Transactions Per Secondmarch=noconamarch=core2march=k8march=k8-sse3march=barcelonamarch=bdver1996512616068654.2731.2036.8433549.432064.40969913915867670.7533.2236.3833531.272039.71976312815167614.8333.0252.8444643.591986.18956212615066678.9732.9852.8244636.992018.341078613915661669.2833.0736.0932534.422037.1111310014415766698.9234.4126.9930561.851916.51OpenBenchmarking.org

GraphicsMagick

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Blurmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona306090120150SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 1.15, N = 399959796113107-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Sharpenmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona20406080100SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 36562639910086-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Resizingmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3126126128139144139-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: HWB Color Spacemarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona4080120160200SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 3160150151158157156-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Local Adaptive Thresholdingmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona1530456075SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3686667676661-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solvermarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona150300450600750SE +/- 5.51, N = 3SE +/- 7.84, N = 3SE +/- 0.75, N = 3SE +/- 8.14, N = 3SE +/- 8.36, N = 3SE +/- 6.64, N = 3654.27678.97614.83670.75698.92669.28-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -O3

Timed PHP Compilation

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To Compilemarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona816243240SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 331.2032.9833.0233.2234.4133.07-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timemarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona1224364860SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 336.8452.8252.8436.3826.9936.09-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona1020304050SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3334444333032-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CXX) g++ options: -fopenmp -O3

Open FMM Nero2D

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total Timemarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona140280420560700549.43636.99643.59531.27561.85534.42-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CXX) g++ options: -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm

PostgreSQL pgbench

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 8.4.11TPC-B Transactions Per Secondmarch=noconamarch=k8-sse3march=k8march=core2march=bdver1march=barcelona400800120016002000SE +/- 18.47, N = 3SE +/- 8.45, N = 3SE +/- 40.09, N = 6SE +/- 33.91, N = 3SE +/- 33.50, N = 6SE +/- 1.46, N = 32064.402018.341986.182039.711916.512037.11-march=nocona-march=k8-sse3-march=k8-march=core2-march=bdver1-march=barcelona1. (CC) gcc options: -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm