GCC AMD Bulldozer Compiler Tuning

AMD FX-8150 Bulldozer compiler tuning with GCC 4.7.1, building the test profiles with different -march options to look at the latest AMD GCC performance. Benchmarking by Michael Larabel for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1209227-RA-GCCAMDBUL32&grw&sro .
GCC AMD Bulldozer Compiler Tuning

Configurations: march=nocona, march=core2, march=k8, march=k8-sse3, march=barcelona, march=bdver1 (one system description is listed for all six)

Processor: AMD FX-8150 Eight-Core @ 3.60GHz (8 Cores)
Motherboard: ASUS Crosshair V Formula
Chipset: AMD ATI RD890 bridge
Memory: 4096MB
Disk: 60GB OCZ VERTEX2
Graphics: NVIDIA GeForce 9600 GSO 512MB (399/399MHz)
Audio: Realtek ALC889
Monitor: DELL P2210H
Network: Intel 82583V Gigabit Connection
OS: Ubuntu 12.10
Kernel: 3.5.0-14-generic (x86_64)
Desktop: Unity 6.4.0
Display Server: X Server 1.13.0
Display Driver: nouveau 1.0.2
OpenGL: 3.0 Mesa 8.1-devel Gallium 0.4
Compiler: GCC 4.7
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v

Processor Details
- Scaling Governor: ondemand

System Details
- march=nocona: Compiz was running on this system.
- march=core2: Compiz was running on this system.
- march=k8: Compiz and Firefox were running on this system.
- march=k8-sse3: Compiz was running on this system.
- march=barcelona: Compiz was running on this system.
- march=bdver1: Compiz and Firefox were running on this system.
GCC AMD Bulldozer Compiler Tuning

Test                                                 march=nocona  march=core2  march=k8  march=k8-sse3  march=barcelona  march=bdver1
himeno: Poisson Pressure Solver                      654.27        670.75       614.83    678.97         669.28           698.92
build-php: Time To Compile                           31.20         33.22        33.02     32.98          33.07            34.41
nero2d: Total Time                                   549.43        531.27       643.59    636.99         534.42           561.85
graphics-magick: Blur                                99            96           97        95             107              113
graphics-magick: Sharpen                             65            99           63        62             86               100
graphics-magick: Resizing                            126           139          128       126            139              144
graphics-magick: HWB Color Space                     160           158          151       150            156              157
graphics-magick: Local Adaptive Thresholding         68            67           67        66             61               66
c-ray: Total Time                                    36.84         36.38        52.84     52.82          36.09            26.99
smallpt: Global Illumination Renderer; 100 Samples   33            33           44        44             32               30
pgbench: TPC-B Transactions Per Second               2064.40       2039.71      1986.18   2018.34        2037.11          1916.51
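To compare the tuning targets on a common scale, each result can be normalized against a baseline. The sketch below is one way to do that and is not part of the exported result: it assumes march=nocona as the baseline (an arbitrary choice), covers only three of the eleven tests, and uses hypothetical names (`RESULTS`, `relative_performance`, `geometric_mean`). The values are copied from the results overview above. For "fewer is better" tests the ratio is inverted, so a number above 1.0 always means faster than the baseline.

```python
from math import prod

# Values copied from the results overview. The boolean is True when
# "more is better" and False when "fewer is better".
RESULTS = {
    "himeno (MFLOPS)": (True, {
        "nocona": 654.27, "core2": 670.75, "k8": 614.83,
        "k8-sse3": 678.97, "barcelona": 669.28, "bdver1": 698.92,
    }),
    "c-ray (seconds)": (False, {
        "nocona": 36.84, "core2": 36.38, "k8": 52.84,
        "k8-sse3": 52.82, "barcelona": 36.09, "bdver1": 26.99,
    }),
    "pgbench (TPS)": (True, {
        "nocona": 2064.40, "core2": 2039.71, "k8": 1986.18,
        "k8-sse3": 2018.34, "barcelona": 2037.11, "bdver1": 1916.51,
    }),
}


def relative_performance(results, baseline="nocona"):
    """Return {test: {march: ratio}}; ratio > 1.0 means faster than baseline."""
    out = {}
    for test, (more_is_better, values) in results.items():
        base = values[baseline]
        out[test] = {
            march: (value / base if more_is_better else base / value)
            for march, value in values.items()
        }
    return out


def geometric_mean(ratios):
    """Geometric mean of an iterable of positive ratios."""
    ratios = list(ratios)
    return prod(ratios) ** (1.0 / len(ratios))


rel = relative_performance(RESULTS)
for test, per_march in rel.items():
    for march, ratio in per_march.items():
        print(f"{test:16s} march={march:10s} {ratio:.3f}x")

# Illustrative summary over the three tests included here only.
for march in RESULTS["himeno (MFLOPS)"][1]:
    summary = geometric_mean(rel[test][march] for test in rel)
    print(f"geometric mean  march={march:10s} {summary:.3f}x")
```

With these inputs, march=bdver1 comes out at roughly 1.36x the nocona baseline on C-Ray and below 1.0x on pgbench. A geometric mean over three hand-picked tests is only a rough summary and says little about the full set.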
Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   669.28   SE +/- 6.64, N = 3    -march=barcelona
march=bdver1      698.92   SE +/- 8.36, N = 3    -march=bdver1
march=core2       670.75   SE +/- 8.14, N = 3    -march=core2
march=k8          614.83   SE +/- 0.75, N = 3    -march=k8
march=k8-sse3     678.97   SE +/- 7.84, N = 3    -march=k8-sse3
march=nocona      654.27   SE +/- 5.51, N = 3    -march=nocona

1. (CC) gcc options: -O3
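The reported standard errors give a rough sense of whether two Himeno results differ by more than run-to-run noise. The sketch below is an illustration under assumptions, not something the result file computes: it assumes each "SE +/-" figure belongs to the configuration in the same position of the listing, treats the runs as independent, and uses a hypothetical helper named `difference_in_se_units`. With only N = 3 runs per configuration it is a coarse screen rather than a statistical test.

```python
from math import sqrt

# (mean MFLOPS, standard error) per configuration, copied from the
# Himeno result above. Pairing each SE with a configuration by listing
# order is an assumption.
HIMENO = {
    "barcelona": (669.28, 6.64),
    "bdver1": (698.92, 8.36),
    "core2": (670.75, 8.14),
    "k8": (614.83, 0.75),
    "k8-sse3": (678.97, 7.84),
    "nocona": (654.27, 5.51),
}


def difference_in_se_units(a, b, data=HIMENO):
    """Difference of means (a minus b) divided by the combined standard error."""
    mean_a, se_a = data[a]
    mean_b, se_b = data[b]
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


# Compare march=bdver1 against every other configuration.
for other in HIMENO:
    if other != "bdver1":
        score = difference_in_se_units("bdver1", other)
        print(f"bdver1 vs {other:10s} {score:+.2f} combined SEs")
```

Under these assumptions the gap between march=bdver1 and march=k8 is many combined standard errors wide, while the gap to march=k8-sse3 is less than two, so the latter is much harder to distinguish from noise.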
Timed PHP Compilation 5.2.9
Time To Compile
Seconds, Fewer Is Better

Configuration     Result   Standard error        Flag
march=barcelona   33.07    SE +/- 0.05, N = 3    -march=barcelona
march=bdver1      34.41    SE +/- 0.03, N = 3    -march=bdver1
march=core2       33.22    SE +/- 0.05, N = 3    -march=core2
march=k8          33.02    SE +/- 0.06, N = 3    -march=k8
march=k8-sse3     32.98    SE +/- 0.09, N = 3    -march=k8-sse3
march=nocona      31.20    SE +/- 0.01, N = 3    -march=nocona

1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm
Open FMM Nero2D 2.0.2
Total Time
Seconds, Fewer Is Better

Configuration     Result   Flag
march=barcelona   534.42   -march=barcelona
march=bdver1      561.85   -march=bdver1
march=core2       531.27   -march=core2
march=k8          643.59   -march=k8
march=k8-sse3     636.99   -march=k8-sse3
march=nocona      549.43   -march=nocona

1. (CXX) g++ options: -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm
GraphicsMagick 1.3.12
Operation: Blur
Iterations Per Minute, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   107      SE +/- 1.15, N = 3    -march=barcelona
march=bdver1      113      SE +/- 0.33, N = 3    -march=bdver1
march=core2       96       SE +/- 0.00, N = 3    -march=core2
march=k8          97       SE +/- 0.33, N = 3    -march=k8
march=k8-sse3     95       SE +/- 0.58, N = 3    -march=k8-sse3
march=nocona      99       SE +/- 0.67, N = 3    -march=nocona

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.12
Operation: Sharpen
Iterations Per Minute, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   86       SE +/- 0.33, N = 3    -march=barcelona
march=bdver1      100      SE +/- 0.33, N = 3    -march=bdver1
march=core2       99       SE +/- 0.33, N = 3    -march=core2
march=k8          63       SE +/- 0.00, N = 3    -march=k8
march=k8-sse3     62       SE +/- 0.33, N = 3    -march=k8-sse3
march=nocona      65       SE +/- 0.33, N = 3    -march=nocona

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.12
Operation: Resizing
Iterations Per Minute, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   139      SE +/- 0.33, N = 3    -march=barcelona
march=bdver1      144      SE +/- 0.33, N = 3    -march=bdver1
march=core2       139      SE +/- 0.33, N = 3    -march=core2
march=k8          128      SE +/- 0.33, N = 3    -march=k8
march=k8-sse3     126      SE +/- 0.33, N = 3    -march=k8-sse3
march=nocona      126      SE +/- 0.33, N = 3    -march=nocona

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.12
Operation: HWB Color Space
Iterations Per Minute, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   156      SE +/- 0.67, N = 3    -march=barcelona
march=bdver1      157      SE +/- 0.33, N = 3    -march=bdver1
march=core2       158      SE +/- 0.33, N = 3    -march=core2
march=k8          151      SE +/- 0.33, N = 3    -march=k8
march=k8-sse3     150      SE +/- 0.33, N = 3    -march=k8-sse3
march=nocona      160      SE +/- 1.20, N = 3    -march=nocona

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
GraphicsMagick 1.3.12
Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better

Configuration     Result   Standard error        Flag
march=barcelona   61       SE +/- 0.00, N = 3    -march=barcelona
march=bdver1      66       SE +/- 0.00, N = 3    -march=bdver1
march=core2       67       SE +/- 0.00, N = 3    -march=core2
march=k8          67       SE +/- 0.00, N = 3    -march=k8
march=k8-sse3     66       SE +/- 0.00, N = 3    -march=k8-sse3
march=nocona      68       SE +/- 0.33, N = 3    -march=nocona

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lz -lm -lgomp -lpthread
C-Ray 1.1
Total Time
Seconds, Fewer Is Better

Configuration     Result   Standard error        Flag
march=barcelona   36.09    SE +/- 0.00, N = 3    -march=barcelona
march=bdver1      26.99    SE +/- 0.01, N = 3    -march=bdver1
march=core2       36.38    SE +/- 0.01, N = 3    -march=core2
march=k8          52.84    SE +/- 0.03, N = 3    -march=k8
march=k8-sse3     52.82    SE +/- 0.02, N = 3    -march=k8-sse3
march=nocona      36.84    SE +/- 0.02, N = 3    -march=nocona

1. (CC) gcc options: -lm -lpthread -O3
Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better

Configuration     Result   Standard error        Flag
march=barcelona   32       SE +/- 0.00, N = 3    -march=barcelona
march=bdver1      30       SE +/- 0.33, N = 3    -march=bdver1
march=core2       33       SE +/- 0.00, N = 3    -march=core2
march=k8          44       SE +/- 0.00, N = 3    -march=k8
march=k8-sse3     44       SE +/- 0.00, N = 3    -march=k8-sse3
march=nocona      33       SE +/- 0.00, N = 3    -march=nocona

1. (CXX) g++ options: -fopenmp -O3
PostgreSQL pgbench 8.4.11
TPC-B Transactions Per Second
TPS, More Is Better

Configuration     Result    Standard error         Flag
march=barcelona   2037.11   SE +/- 1.46, N = 3     -march=barcelona
march=bdver1      1916.51   SE +/- 33.50, N = 6    -march=bdver1
march=core2       2039.71   SE +/- 33.91, N = 3    -march=core2
march=k8          1986.18   SE +/- 40.09, N = 6    -march=k8
march=k8-sse3     2018.34   SE +/- 8.45, N = 3     -march=k8-sse3
march=nocona      2064.40   SE +/- 18.47, N = 3    -march=nocona

1. (CC) gcc options: -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm
Phoronix Test Suite v10.8.5