GCC AMD Bulldozer Compiler Tuning

Compiler tuning tests on an AMD FX-8150 (Bulldozer) using GCC 4.7.1, building the test profiles with different -march options to examine current GCC performance on AMD hardware. Benchmarking by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1209227-RA-GCCAMDBUL32&sor.

System configuration (identical for all six results: march=nocona, march=core2, march=k8, march=k8-sse3, march=barcelona, march=bdver1)

Processor: AMD FX-8150 Eight-Core @ 3.60GHz (8 Cores)
Motherboard: ASUS Crosshair V Formula
Chipset: AMD ATI RD890 bridge
Memory: 4096MB
Disk: 60GB OCZ VERTEX2
Graphics: NVIDIA GeForce 9600 GSO 512MB (399/399MHz)
Audio: Realtek ALC889
Monitor: DELL P2210H
Network: Intel 82583V Gigabit Connection
OS: Ubuntu 12.10
Kernel: 3.5.0-14-generic (x86_64)
Desktop: Unity 6.4.0
Display Server: X Server 1.13.0
Display Driver: nouveau 1.0.2
OpenGL: 3.0 Mesa 8.1-devel Gallium 0.4
Compiler: GCC 4.7
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v

Processor Details
- Scaling Governor: ondemand

System Details
- march=nocona: Compiz was running on this system.
- march=core2: Compiz was running on this system.
- march=k8: Compiz and Firefox were running on this system.
- march=k8-sse3: Compiz was running on this system.
- march=barcelona: Compiz was running on this system.
- march=bdver1: Compiz and Firefox were running on this system.

Results overview (columns are the -march targets)

Test                                                   nocona    core2       k8  k8-sse3  barcelona   bdver1
graphics-magick: Blur                                      99       96       97       95        107      113
graphics-magick: Sharpen                                   65       99       63       62         86      100
graphics-magick: Resizing                                 126      139      128      126        139      144
graphics-magick: HWB Color Space                          160      158      151      150        156      157
graphics-magick: Local Adaptive Thresholding               68       67       67       66         61       66
himeno: Poisson Pressure Solver                        654.27   670.75   614.83   678.97     669.28   698.92
build-php: Time To Compile                              31.20    33.22    33.02    32.98      33.07    34.41
c-ray: Total Time                                       36.84    36.38    52.84    52.82      36.09    26.99
smallpt: Global Illumination Renderer; 100 Samples         33       33       44       44         32       30
nero2d: Total Time                                     549.43   531.27   643.59   636.99     534.42   561.85
pgbench: TPC-B Transactions Per Second                2064.40  2039.71  1986.18  2018.34    2037.11  1916.51
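The overview mixes "more is better" and "fewer is better" metrics, so raw numbers are not directly comparable across tests. A minimal sketch of one way to normalize them is shown here: it expresses each result as a percentage gain over a baseline. The choice of march=nocona as the baseline, the `row` helper, and the `relative_gain` function are illustrative assumptions, not part of the Phoronix Test Suite output; the values are transcribed from the overview above.

```python
# Values transcribed from the results overview in this document.
TARGETS = ["nocona", "core2", "k8", "k8-sse3", "barcelona", "bdver1"]


def row(higher_is_better, *values):
    """Hypothetical helper: pair one test's results with the target names."""
    return higher_is_better, dict(zip(TARGETS, values))


RESULTS = {
    "graphics-magick: Blur": row(True, 99, 96, 97, 95, 107, 113),
    "graphics-magick: Sharpen": row(True, 65, 99, 63, 62, 86, 100),
    "graphics-magick: Resizing": row(True, 126, 139, 128, 126, 139, 144),
    "graphics-magick: HWB Color Space": row(True, 160, 158, 151, 150, 156, 157),
    "graphics-magick: Local Adaptive Thresholding": row(True, 68, 67, 67, 66, 61, 66),
    "himeno: Poisson Pressure Solver": row(True, 654.27, 670.75, 614.83, 678.97, 669.28, 698.92),
    "build-php: Time To Compile": row(False, 31.20, 33.22, 33.02, 32.98, 33.07, 34.41),
    "c-ray: Total Time": row(False, 36.84, 36.38, 52.84, 52.82, 36.09, 26.99),
    "smallpt: Global Illumination Renderer": row(False, 33, 33, 44, 44, 32, 30),
    "nero2d: Total Time": row(False, 549.43, 531.27, 643.59, 636.99, 534.42, 561.85),
    "pgbench: TPC-B Transactions Per Second": row(True, 2064.40, 2039.71, 1986.18, 2018.34, 2037.11, 1916.51),
}


def relative_gain(test, target, baseline="nocona"):
    """Percent gain of `target` over `baseline`; positive means `target` is faster.

    For "fewer is better" tests (times) the ratio is inverted so that the
    sign has the same meaning for every test.
    """
    higher_is_better, values = RESULTS[test]
    t, b = values[target], values[baseline]
    ratio = t / b if higher_is_better else b / t
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    for test in RESULTS:
        print(f"{test:46s} {relative_gain(test, 'bdver1'):+6.1f}%")
```

Inverting the ratio for timed tests keeps a single convention (positive is faster), which makes it easier to see that march=bdver1 helps C-Ray considerably while it trails the baseline on pgbench and the PHP build.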

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.12 (Iterations Per Minute, More Is Better)

march=bdver1:     113  (SE +/- 0.33, N = 3)
march=barcelona:  107  (SE +/- 1.15, N = 3)
march=nocona:      99  (SE +/- 0.67, N = 3)
march=k8:          97  (SE +/- 0.33, N = 3)
march=core2:       96  (SE +/- 0.00, N = 3)
march=k8-sse3:     95  (SE +/- 0.58, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
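The "SE +/- x, N = 3" annotations give a standard error over N runs. As a rough sketch, assuming SE here means the standard error of the mean computed from the sample standard deviation (the export does not spell out the formula), it could be reproduced as follows. The per-run values are hypothetical and chosen only for illustration; the result file does not list individual runs.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical per-run results (NOT taken from the result file).
    runs = [113, 113, 112]
    print(
        f"mean = {statistics.mean(runs):.2f}, "
        f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}"
    )
```

With only three runs per configuration, differences smaller than a couple of standard errors (for example 97 versus 96 iterations per minute) are best read as ties.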

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.12 (Iterations Per Minute, More Is Better)

march=bdver1:     100  (SE +/- 0.33, N = 3)
march=core2:       99  (SE +/- 0.33, N = 3)
march=barcelona:   86  (SE +/- 0.33, N = 3)
march=nocona:      65  (SE +/- 0.33, N = 3)
march=k8:          63  (SE +/- 0.00, N = 3)
march=k8-sse3:     62  (SE +/- 0.33, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.12 (Iterations Per Minute, More Is Better)

march=bdver1:     144  (SE +/- 0.33, N = 3)
march=barcelona:  139  (SE +/- 0.33, N = 3)
march=core2:      139  (SE +/- 0.33, N = 3)
march=k8:         128  (SE +/- 0.33, N = 3)
march=k8-sse3:    126  (SE +/- 0.33, N = 3)
march=nocona:     126  (SE +/- 0.33, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.12 (Iterations Per Minute, More Is Better)

march=nocona:     160  (SE +/- 1.20, N = 3)
march=core2:      158  (SE +/- 0.33, N = 3)
march=bdver1:     157  (SE +/- 0.33, N = 3)
march=barcelona:  156  (SE +/- 0.67, N = 3)
march=k8:         151  (SE +/- 0.33, N = 3)
march=k8-sse3:    150  (SE +/- 0.33, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.12 (Iterations Per Minute, More Is Better)

march=nocona:      68  (SE +/- 0.33, N = 3)
march=k8:          67  (SE +/- 0.00, N = 3)
march=core2:       67  (SE +/- 0.00, N = 3)
march=bdver1:      66  (SE +/- 0.00, N = 3)
march=k8-sse3:     66  (SE +/- 0.00, N = 3)
march=barcelona:   61  (SE +/- 0.00, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 (MFLOPS, More Is Better)

march=bdver1:     698.92  (SE +/- 8.36, N = 3)
march=k8-sse3:    678.97  (SE +/- 7.84, N = 3)
march=core2:      670.75  (SE +/- 8.14, N = 3)
march=barcelona:  669.28  (SE +/- 6.64, N = 3)
march=nocona:     654.27  (SE +/- 5.51, N = 3)
march=k8:         614.83  (SE +/- 0.75, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -O3

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9 (Seconds, Fewer Is Better)

march=nocona:     31.20  (SE +/- 0.01, N = 3)
march=k8-sse3:    32.98  (SE +/- 0.09, N = 3)
march=k8:         33.02  (SE +/- 0.06, N = 3)
march=barcelona:  33.07  (SE +/- 0.05, N = 3)
march=core2:      33.22  (SE +/- 0.05, N = 3)
march=bdver1:     34.41  (SE +/- 0.03, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time

C-Ray 1.1 (Seconds, Fewer Is Better)

march=bdver1:     26.99  (SE +/- 0.01, N = 3)
march=barcelona:  36.09  (SE +/- 0.00, N = 3)
march=core2:      36.38  (SE +/- 0.01, N = 3)
march=nocona:     36.84  (SE +/- 0.02, N = 3)
march=k8-sse3:    52.82  (SE +/- 0.02, N = 3)
march=k8:         52.84  (SE +/- 0.03, N = 3)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)

march=bdver1:     30  (SE +/- 0.33, N = 3)
march=barcelona:  32  (SE +/- 0.00, N = 3)
march=nocona:     33  (SE +/- 0.00, N = 3)
march=core2:      33  (SE +/- 0.00, N = 3)
march=k8:         44  (SE +/- 0.00, N = 3)
march=k8-sse3:    44  (SE +/- 0.00, N = 3)

Each result was built with the matching -march= flag.
1. (CXX) g++ options: -fopenmp -O3

Open FMM Nero2D

Total Time

Open FMM Nero2D 2.0.2 (Seconds, Fewer Is Better)

march=core2:      531.27
march=barcelona:  534.42
march=nocona:     549.43
march=bdver1:     561.85
march=k8-sse3:    636.99
march=k8:         643.59

Each result was built with the matching -march= flag.
1. (CXX) g++ options: -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm

PostgreSQL pgbench

TPC-B Transactions Per Second

PostgreSQL pgbench 8.4.11 (TPS, More Is Better)

march=nocona:     2064.40  (SE +/- 18.47, N = 3)
march=core2:      2039.71  (SE +/- 33.91, N = 3)
march=barcelona:  2037.11  (SE +/- 1.46, N = 3)
march=k8-sse3:    2018.34  (SE +/- 8.45, N = 3)
march=k8:         1986.18  (SE +/- 40.09, N = 6)
march=bdver1:     1916.51  (SE +/- 33.50, N = 6)

Each result was built with the matching -march= flag.
1. (CC) gcc options: -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm


Phoronix Test Suite v10.8.4