Compiler Optimizations For AMD Steamroller/Kaveri

Compiler benchmarks by Michael Larabel for a future article on Phoronix.com

HTML result view exported from: https://openbenchmarking.org/result/1408161-KH-COMPILERO99&gru&rdt.

Configurations tested: bdver3, bdver2, bdver1, barcelona, k8-sse3, k8, bdver3 + flto

Processor: AMD A10-7800 Radeon R7 12 Compute Cores 4C+8G @ 3.50GHz (4 Cores)
Motherboard: ASUS A88X-PRO
Chipset: AMD Family 15h
Memory: 7168MB
Disk: 64GB OCZ AGILITY
Graphics: ASUS AMD Radeon R7 200 1024MB
Audio: AMD Kaveri HDMI/DP
Monitor: SyncMaster
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 14.04
Kernel: 3.16.0-031600-generic (x86_64)
Desktop: Unity 7.2.2
Display Server: X Server 1.15.1
Display Driver: radeon 7.4.99
OpenGL: 3.3 Mesa 10.3.0-devel (git-b1843a2 2014-08-10 trusty-oibaf-ppa+gallium-nine) Gallium 0.4
Compiler: GCC 4.10.0 20140810
File-System: ext4
Screen Resolution: 2560x1600

Compiler Details: --disable-multilib --enable-checking=release --enable-languages=c,c++
Processor Details: Scaling Governor: acpi-cpufreq ondemand

Result Overview

Test                                          bdver3  bdver2  bdver1  barcelona  k8-sse3      k8  bdver3 + flto
graphics-magick: Blur                            100     100     100         91       92      77            100
graphics-magick: Sharpen                          67      67      68         58       42      42             67
graphics-magick: Resizing                        124     124     124        119      102     102            124
graphics-magick: HWB Color Space                 137     137     137        137      129     129            137
graphics-magick: Local Adaptive Thresholding      81      81      81         77       76      77             82
himeno: Poisson Pressure Solver               882.11  872.22  883.06     871.17   869.53  870.33         880.95
build-apache: Time To Compile                  61.40   61.71   61.29      61.07    61.07   61.06          96.69
build-php: Time To Compile                     20.23   60.57   60.55      58.15    58.12   58.09         184.15
c-ray: Total Time                              47.31   47.32   47.43      65.57    96.93   96.89          47.41
encode-flac: WAV To FLAC                        5.59    5.52    5.28       7.22     6.74    6.74           5.37
encode-mp3: WAV To MP3                         19.61   19.60   19.63      22.28    20.61   20.85          19.67
ffmpeg: H.264 HD To NTSC DV                    21.60   21.69   21.95      21.77    21.78   21.73          21.65
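
Some of these tests report throughput (more is better) and others report elapsed time (fewer is better), so comparing -march targets across tests is easier once every value is normalized to one baseline. The following is a minimal Python sketch of that normalization, using a few values copied from the summary results above. The choice of k8 as the baseline, the subset of tests, and the names RESULTS, relative_performance, and compare are illustrative assumptions and are not part of the exported result.

```python
# A few results copied from the summary above. The boolean marks whether
# a higher number is better (iterations per minute) or not (seconds).
RESULTS = {
    "graphics-magick: Sharpen": (True, {"bdver3": 67, "barcelona": 58, "k8": 42}),
    "c-ray: Total Time": (False, {"bdver3": 47.31, "barcelona": 65.57, "k8": 96.89}),
    "encode-flac: WAV To FLAC": (False, {"bdver3": 5.59, "barcelona": 7.22, "k8": 6.74}),
}


def relative_performance(value, baseline, higher_is_better):
    """Return performance relative to the baseline; above 1.0 means faster."""
    return value / baseline if higher_is_better else baseline / value


def compare(results, baseline="k8"):
    """Normalize every target in every test against the chosen baseline."""
    table = {}
    for test, (higher_is_better, values) in results.items():
        base = values[baseline]
        table[test] = {
            target: round(relative_performance(v, base, higher_is_better), 2)
            for target, v in values.items()
        }
    return table


if __name__ == "__main__":
    for test, row in compare(RESULTS).items():
        print(test, row)
```

Inverting the ratio for time-based tests keeps the reading uniform: a value above 1.0 always means the target outperformed the baseline.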

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.19, Operation: Blur
Iterations Per Minute, More Is Better

bdver3 (-march=bdver3): 100 (SE +/- 0.58, N = 3)
bdver2 (-march=bdver2): 100 (SE +/- 0.33, N = 3)
bdver1 (-march=bdver1): 100 (SE +/- 0.58, N = 3)
barcelona (-march=barcelona): 91 (SE +/- 0.33, N = 3)
k8-sse3 (-march=k8-sse3): 92 (SE +/- 0.33, N = 3)
k8 (-march=k8): 77 (SE +/- 0.33, N = 3)
bdver3 + flto (-march=bdver3 -flto): 100 (SE +/- 0.33, N = 3)

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.19, Operation: Sharpen
Iterations Per Minute, More Is Better

bdver3 (-march=bdver3): 67 (SE +/- 0.00, N = 3)
bdver2 (-march=bdver2): 67 (SE +/- 0.00, N = 3)
bdver1 (-march=bdver1): 68 (SE +/- 0.33, N = 3)
barcelona (-march=barcelona): 58 (SE +/- 0.00, N = 3)
k8-sse3 (-march=k8-sse3): 42 (SE +/- 0.00, N = 3)
k8 (-march=k8): 42 (SE +/- 0.00, N = 3)
bdver3 + flto (-march=bdver3 -flto): 67 (SE +/- 0.33, N = 3)

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.19, Operation: Resizing
Iterations Per Minute, More Is Better

bdver3 (-march=bdver3): 124 (SE +/- 0.33, N = 3)
bdver2 (-march=bdver2): 124 (SE +/- 0.33, N = 3)
bdver1 (-march=bdver1): 124 (SE +/- 0.33, N = 3)
barcelona (-march=barcelona): 119 (SE +/- 0.33, N = 3)
k8-sse3 (-march=k8-sse3): 102 (SE +/- 0.33, N = 3)
k8 (-march=k8): 102 (SE +/- 0.33, N = 3)
bdver3 + flto (-march=bdver3 -flto): 124 (SE +/- 0.00, N = 3)

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.19, Operation: HWB Color Space
Iterations Per Minute, More Is Better

bdver3 (-march=bdver3): 137 (SE +/- 0.33, N = 3)
bdver2 (-march=bdver2): 137 (SE +/- 0.00, N = 3)
bdver1 (-march=bdver1): 137 (SE +/- 0.33, N = 3)
barcelona (-march=barcelona): 137 (SE +/- 0.00, N = 3)
k8-sse3 (-march=k8-sse3): 129 (SE +/- 0.33, N = 3)
k8 (-march=k8): 129 (SE +/- 0.00, N = 3)
bdver3 + flto (-march=bdver3 -flto): 137 (SE +/- 0.00, N = 3)

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.19, Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better

bdver3 (-march=bdver3): 81 (SE +/- 0.33, N = 3)
bdver2 (-march=bdver2): 81 (SE +/- 0.33, N = 3)
bdver1 (-march=bdver1): 81 (SE +/- 0.00, N = 3)
barcelona (-march=barcelona): 77 (SE +/- 0.00, N = 3)
k8-sse3 (-march=k8-sse3): 76 (SE +/- 0.00, N = 3)
k8 (-march=k8): 77 (SE +/- 0.00, N = 3)
bdver3 + flto (-march=bdver3 -flto): 82 (SE +/- 0.33, N = 3)

1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver
MFLOPS, More Is Better

bdver3 (-march=bdver3): 882.11 (SE +/- 2.02, N = 3)
bdver2 (-march=bdver2): 872.22 (SE +/- 3.58, N = 3)
bdver1 (-march=bdver1): 883.06 (SE +/- 1.93, N = 3)
barcelona (-march=barcelona): 871.17 (SE +/- 1.09, N = 3)
k8-sse3 (-march=k8-sse3): 869.53 (SE +/- 0.82, N = 3)
k8 (-march=k8): 870.33 (SE +/- 0.27, N = 3)
bdver3 + flto (-march=bdver3 -flto): 880.95 (SE +/- 0.76, N = 3)

1. (CC) gcc options: -O3
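
The Himeno results sit within roughly 1.5% of each other, so the reported standard errors matter when deciding whether two targets really differ. One rough way to judge this is to express the gap between two means in units of their combined standard error. The Python sketch below does that. It assumes the SE values are listed in the same order as the configurations, and the function name difference_in_standard_errors is hypothetical. With N = 3 runs per configuration this is only an informal indication, not a formal significance test.

```python
import math


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Himeno MFLOPS as (mean, SE), assuming SE values follow configuration order.
bdver3 = (882.11, 2.02)
bdver2 = (872.22, 3.58)
bdver1 = (883.06, 1.93)

if __name__ == "__main__":
    print("bdver3 vs bdver2:", round(difference_in_standard_errors(*bdver3, *bdver2), 2))
    print("bdver3 vs bdver1:", round(difference_in_standard_errors(*bdver3, *bdver1), 2))
```

A ratio well below 1 suggests the two results are indistinguishable at this sample size, while a ratio of 2 or more suggests the gap is larger than run-to-run noise alone would explain.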

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7, Time To Compile
Seconds, Fewer Is Better

bdver3: 61.40 (SE +/- 0.10, N = 3)
bdver2: 61.71 (SE +/- 0.27, N = 3)
bdver1: 61.29 (SE +/- 0.04, N = 3)
barcelona: 61.07 (SE +/- 0.15, N = 3)
k8-sse3: 61.07 (SE +/- 0.14, N = 3)
k8: 61.06 (SE +/- 0.28, N = 3)
bdver3 + flto: 96.69 (SE +/- 0.10, N = 3)

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9, Time To Compile
Seconds, Fewer Is Better

bdver3 (-march=bdver3): 20.23 (SE +/- 0.03, N = 3)
bdver2 (-march=bdver2): 60.57 (SE +/- 0.06, N = 3)
bdver1 (-march=bdver1): 60.55 (SE +/- 0.03, N = 3)
barcelona (-march=barcelona): 58.15 (SE +/- 0.02, N = 3)
k8-sse3 (-march=k8-sse3): 58.12 (SE +/- 0.02, N = 3)
k8 (-march=k8): 58.09 (SE +/- 0.06, N = 3)
bdver3 + flto (-march=bdver3 -flto): 184.15 (SE +/- 0.86, N = 3)

1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time

C-Ray 1.1, Total Time
Seconds, Fewer Is Better

bdver3 (-march=bdver3): 47.31 (SE +/- 0.09, N = 3)
bdver2 (-march=bdver2): 47.32 (SE +/- 0.04, N = 3)
bdver1 (-march=bdver1): 47.43 (SE +/- 0.04, N = 3)
barcelona (-march=barcelona): 65.57 (SE +/- 0.06, N = 3)
k8-sse3 (-march=k8-sse3): 96.93 (SE +/- 0.09, N = 3)
k8 (-march=k8): 96.89 (SE +/- 0.07, N = 3)
bdver3 + flto (-march=bdver3 -flto): 47.41 (SE +/- 0.02, N = 3)

1. (CC) gcc options: -lm -lpthread -O3

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0, WAV To FLAC
Seconds, Fewer Is Better

bdver3 (-march=bdver3): 5.59 (SE +/- 0.01, N = 5)
bdver2 (-march=bdver2): 5.52 (SE +/- 0.01, N = 5)
bdver1 (-march=bdver1): 5.28 (SE +/- 0.01, N = 5)
barcelona (-march=barcelona): 7.22 (SE +/- 0.01, N = 5)
k8-sse3 (-march=k8-sse3): 6.74 (SE +/- 0.01, N = 5)
k8 (-march=k8): 6.74 (SE +/- 0.01, N = 5)
bdver3 + flto (-march=bdver3 -flto): 5.37 (SE +/- 0.01, N = 5)

1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3, WAV To MP3
Seconds, Fewer Is Better

bdver3 (-march=bdver3): 19.61 (SE +/- 0.02, N = 5)
bdver2 (-march=bdver2): 19.60 (SE +/- 0.01, N = 5)
bdver1 (-march=bdver1): 19.63 (SE +/- 0.04, N = 5)
barcelona (-march=barcelona): 22.28 (SE +/- 0.02, N = 5)
k8-sse3 (-march=k8-sse3): 20.61 (SE +/- 0.02, N = 5)
k8 (-march=k8): 20.85 (SE +/- 0.27, N = 5)
bdver3 + flto (-march=bdver3 -flto): 19.67 (SE +/- 0.02, N = 5)

1. (CC) gcc options: -pipe -O3 -lncurses -lm

FFmpeg

H.264 HD To NTSC DV

FFmpeg 2.1.1, H.264 HD To NTSC DV
Seconds, Fewer Is Better

bdver3 (-march=native): 21.60 (SE +/- 0.08, N = 3)
bdver2 (-march=bdver2): 21.69 (SE +/- 0.07, N = 3)
bdver1 (-march=bdver1): 21.95 (SE +/- 0.03, N = 3)
barcelona (-march=barcelona): 21.77 (SE +/- 0.09, N = 3)
k8-sse3 (-march=k8-sse3): 21.78 (SE +/- 0.12, N = 3)
k8 (-march=k8): 21.73 (SE +/- 0.05, N = 3)
bdver3 + flto (-march=bdver3 -flto): 21.65 (SE +/- 0.15, N = 3)

1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lXv -lX11 -lXext -lasound -lSDL -lm -pthread -O3 -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT


Phoronix Test Suite v10.8.4