GCC 4.9 Snapshot Compiler Flag Tests

GCC 4.9 compiler optimization tests on an Intel Core i7 Haswell CPU, applying different CFLAGS/CXXFLAGS to look at the impact of core-avx2 CPU optimizations and other x86_64 -march= options. Benchmarks by Michael Larabel.
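As a rough illustration of the setup described above, the six -march= values that appear in the result charts could be turned into per-run CFLAGS/CXXFLAGS environments as sketched below. The -O3 base level and the helper name build_flag_environments are assumptions made for this example; the export does not record the exact flag strings or how they were passed to the test suite.

```python
import os

# -march= values taken from the chart labels in this result file.
MARCH_VALUES = ["core-avx2", "core-avx-i", "corei7-avx", "corei7", "core2", "nocona"]


def build_flag_environments(base_flags="-O3"):
    """Return {march: env}, where env is a copy of os.environ plus CFLAGS/CXXFLAGS.

    base_flags is an assumed default, not something stated in the export.
    """
    environments = {}
    for march in MARCH_VALUES:
        flags = f"{base_flags} -march={march}"
        env = dict(os.environ)  # copy, so the caller's environment is untouched
        env["CFLAGS"] = flags
        env["CXXFLAGS"] = flags
        environments[march] = env
    return environments


if __name__ == "__main__":
    for march, env in build_flag_environments().items():
        print(f"{march}: CFLAGS={env['CFLAGS']!r}")
```

Each returned dictionary could be handed to a build or benchmark process as its environment, which keeps the six configurations isolated from one another.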

HTML result view exported from: https://openbenchmarking.org/result/1308319-SO-GCC49SNAP71&sro&grt.

System Details

Tested configurations: Core-AVX2, Core-AVX-I, Corei7-AVX, Corei7, Core2, Nocona

Processor: Intel Core i7-4900MQ @ 2.80GHz (8 Cores)
Motherboard: System76 Gazelle Professional
Chipset: Intel Xeon E3-1200 v3/4th
Memory: 8192MB
Disk: 120GB INTEL SSDSC2CW12
Graphics: Intel 4th Gen Core IGP (1300MHz)
Audio: Intel Haswell HDMI
Network: Realtek RTL8111/8168/8411 + Intel Centrino Advanced-N 6235
OS: Ubuntu 13.10
Kernel: 3.11.0-4-generic (x86_64)
Desktop: Unity 7.1.0
Display Server: X Server 1.14.2.901 (1.14.3 RC 1)
Display Driver: intel 2.21.14
OpenGL: 3.1 Mesa 9.2.0
Compiler: GCC 4.9.0 20130731
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details: --build=x86_64-linux-gnu --disable-browser-plugin --disable-nls --disable-werror --enable-checking=yes --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-objc-gc --enable-plugin --enable-shared --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.9-snap-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.9-snap-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.9-snap-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details: Scaling Governor: acpi-cpufreq ondemand

Result Overview

Core-AVX2   Core-AVX-I  Corei7-AVX  Corei7      Core2       Nocona      Test
31291.52    31007.19    30875.41    30666.53    31064.17    29830.02    apache: Static Web Page Serving
19.71       26.38       26.40       26.52       26.53       26.66       c-ray: Total Time
13.17       13.31       13.37       13.50       13.33       13.30       ffmpeg: H.264 HD To NTSC DV
5.31        5.59        5.59        5.09        5.21        5.29        encode-flac: WAV To FLAC
149         149         149         149         149         148         graphics-magick: Blur
126         116         116         116         116         115         graphics-magick: Sharpen
180         176         176         175         176         172         graphics-magick: Resizing
200         201         201         200         201         198         graphics-magick: HWB Color Space
116         116         116         115         115         114         graphics-magick: Local Adaptive Thresholding
1770.48     1868.47     1803.82     1746.95     1747.23     1695.27     himeno: Poisson Pressure Solver
14.30       14.08       14.07       12.57       12.88       12.64       encode-mp3: WAV To MP3
42.54       44.51       44.50       44.54       44.51       45.76       n-queens: Elapsed Time
7.06        7.06        7.06        7.08        7.06        7.07        encode-ogg: WAV To Ogg
93.19       93.17       93.15       93.14       93.15       93.31       primesieve: 1e12 Prime Number Generation
595.70      595.28      594.05      596.52      595.70      698.37      scimark2: Monte Carlo
335.89      329.12      315.45      316.62      319.57      318.08      scimark2: Fast Fourier Transform
2332.25     2184.67     2322.34     2388.35     2379.68     2346.98     scimark2: Sparse Matrix Multiply
2783.23     2794.16     2786.85     2842.55     2853.98     2589.06     scimark2: Dense LU Matrix Factorization
1142.92     1142.92     1142.92     1140.79     1140.79     1140.79     scimark2: Jacobi Successive Over-Relaxation
29          29          29          30          30          30          smallpt: Global Illumination Renderer; 100 Samples
133.95      133.36      132.85      150.07      146.09      144.87      ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
162.96      163.77      162.86      162.76      162.10      162.14      x264: H.264 Video Encoding

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.3 (Requests Per Second, More Is Better)

Core-AVX-I (-march=core-avx-i): 31007.19 (SE +/- 162.72, N = 3)
Core-AVX2 (-march=core-avx2): 31291.52 (SE +/- 88.76, N = 3)
Core2 (-march=core2): 31064.17 (SE +/- 43.24, N = 3)
Corei7 (-march=corei7): 30666.53 (SE +/- 225.32, N = 3)
Corei7-AVX (-march=corei7-avx): 30875.41 (SE +/- 191.20, N = 3)
Nocona (-march=nocona): 29830.02 (SE +/- 111.38, N = 3)

(CC) gcc options: -shared -fPIC -pthread -O3

C-Ray

Total Time

C-Ray 1.1 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 26.38 (SE +/- 0.01, N = 3)
Core-AVX2 (-march=core-avx2): 19.71 (SE +/- 0.01, N = 3)
Core2 (-march=core2): 26.53 (SE +/- 0.02, N = 3)
Corei7 (-march=corei7): 26.52 (SE +/- 0.01, N = 3)
Corei7-AVX (-march=corei7-avx): 26.40 (SE +/- 0.02, N = 3)
Nocona (-march=nocona): 26.66 (SE +/- 0.02, N = 3)

(CC) gcc options: -lm -lpthread -O3
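C-Ray is the test where the core-avx2 run stands out most clearly. A minimal sketch of the arithmetic, using only the times reported above, expresses the core-avx2 result as a percentage reduction in run time against each other configuration; it works out to roughly 25 to 26 percent in every case. The function name percent_faster is a hypothetical helper chosen for this example.

```python
# C-Ray 1.1 total times in seconds, copied from the results above.
C_RAY_SECONDS = {
    "core-avx-i": 26.38,
    "core-avx2": 19.71,
    "core2": 26.53,
    "corei7": 26.52,
    "corei7-avx": 26.40,
    "nocona": 26.66,
}


def percent_faster(baseline_seconds, candidate_seconds):
    """Percent reduction in run time of the candidate relative to the baseline."""
    return (baseline_seconds - candidate_seconds) / baseline_seconds * 100.0


if __name__ == "__main__":
    avx2 = C_RAY_SECONDS["core-avx2"]
    for march, seconds in C_RAY_SECONDS.items():
        if march != "core-avx2":
            print(f"core-avx2 vs {march}: {percent_faster(seconds, avx2):.1f}% less time")
```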

FFmpeg

H.264 HD To NTSC DV

FFmpeg 2.0 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 13.31 (SE +/- 0.04, N = 3)
Core-AVX2 (-march=core-avx2): 13.17 (SE +/- 0.05, N = 3)
Core2 (-march=core2): 13.33 (SE +/- 0.03, N = 3)
Corei7 (-march=corei7): 13.50 (SE +/- 0.06, N = 3)
Corei7-AVX (-march=corei7-avx): 13.37 (SE +/- 0.07, N = 3)
Nocona (-march=nocona): 13.30 (SE +/- 0.04, N = 3)

(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -O3 -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 5.59 (SE +/- 0.00, N = 5)
Core-AVX2 (-march=core-avx2): 5.31 (SE +/- 0.00, N = 5)
Core2 (-march=core2): 5.21 (SE +/- 0.00, N = 5)
Corei7 (-march=corei7): 5.09 (SE +/- 0.00, N = 5)
Corei7-AVX (-march=corei7-avx): 5.59 (SE +/- 0.00, N = 5)
Nocona (-march=nocona): 5.29 (SE +/- 0.00, N = 5)

(CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.16 (Iterations Per Minute, More Is Better)

Core-AVX-I (-march=core-avx-i): 149 (SE +/- 0.33, N = 3)
Core-AVX2 (-march=core-avx2): 149 (SE +/- 0.00, N = 3)
Core2 (-march=core2): 149 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 149 (SE +/- 0.00, N = 3)
Corei7-AVX (-march=corei7-avx): 149 (SE +/- 0.00, N = 3)
Nocona (-march=nocona): 148 (SE +/- 0.33, N = 3)

(CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.16 (Iterations Per Minute, More Is Better)

Core-AVX-I (-march=core-avx-i): 116 (SE +/- 0.00, N = 3)
Core-AVX2 (-march=core-avx2): 126 (SE +/- 0.00, N = 3)
Core2 (-march=core2): 116 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 116 (SE +/- 0.00, N = 3)
Corei7-AVX (-march=corei7-avx): 116 (SE +/- 0.00, N = 3)
Nocona (-march=nocona): 115 (SE +/- 0.33, N = 3)

(CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.16 (Iterations Per Minute, More Is Better)

Core-AVX-I (-march=core-avx-i): 176 (SE +/- 0.00, N = 3)
Core-AVX2 (-march=core-avx2): 180 (SE +/- 0.67, N = 3)
Core2 (-march=core2): 176 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 175 (SE +/- 0.00, N = 3)
Corei7-AVX (-march=corei7-avx): 176 (SE +/- 0.33, N = 3)
Nocona (-march=nocona): 172 (SE +/- 0.33, N = 3)

(CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.16 (Iterations Per Minute, More Is Better)

Core-AVX-I (-march=core-avx-i): 201 (SE +/- 0.00, N = 3)
Core-AVX2 (-march=core-avx2): 200 (SE +/- 0.33, N = 3)
Core2 (-march=core2): 201 (SE +/- 0.33, N = 3)
Corei7 (-march=corei7): 200 (SE +/- 0.33, N = 3)
Corei7-AVX (-march=corei7-avx): 201 (SE +/- 0.33, N = 3)
Nocona (-march=nocona): 198 (SE +/- 0.00, N = 3)

(CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.16 (Iterations Per Minute, More Is Better)

Core-AVX-I (-march=core-avx-i): 116 (SE +/- 0.33, N = 3)
Core-AVX2 (-march=core-avx2): 116 (SE +/- 0.33, N = 3)
Core2 (-march=core2): 115 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 115 (SE +/- 0.00, N = 3)
Corei7-AVX (-march=corei7-avx): 116 (SE +/- 0.33, N = 3)
Nocona (-march=nocona): 114 (SE +/- 0.33, N = 3)

(CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 (MFLOPS, More Is Better)

Core-AVX-I (-march=core-avx-i): 1868.47 (SE +/- 0.59, N = 3)
Core-AVX2 (-march=core-avx2): 1770.48 (SE +/- 29.31, N = 4)
Core2 (-march=core2): 1747.23 (SE +/- 1.11, N = 3)
Corei7 (-march=corei7): 1746.95 (SE +/- 1.79, N = 3)
Corei7-AVX (-march=corei7-avx): 1803.82 (SE +/- 24.88, N = 6)
Nocona (-march=nocona): 1695.27 (SE +/- 0.23, N = 3)

(CC) gcc options: -O3
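The Himeno runs show noticeably larger standard errors for some configurations than for others. One rough way to judge whether two reported means differ by more than run-to-run noise is to divide their difference by the combined standard error. The sketch below assumes the runs are independent and is only an indicator given the small run counts (N = 3 to 6); it is not part of the original result export, and the function name is a hypothetical helper.

```python
import math


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error.

    Assumes the two sets of runs are independent, so the standard errors
    combine as sqrt(se_a**2 + se_b**2).
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


if __name__ == "__main__":
    # Himeno MFLOPS (mean, SE) copied from the results above.
    core_avx_i = (1868.47, 0.59)
    core_avx2 = (1770.48, 29.31)
    ratio = difference_in_standard_errors(*core_avx_i, *core_avx2)
    print(f"core-avx-i vs core-avx2: {ratio:.2f} combined standard errors apart")
```

A larger absolute ratio suggests the gap is less likely to be noise; values near zero suggest the two configurations are indistinguishable at this sample size.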

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 14.08 (SE +/- 0.01, N = 5)
Core-AVX2 (-march=core-avx2): 14.30 (SE +/- 0.02, N = 5)
Core2 (-march=core2): 12.88 (SE +/- 0.01, N = 5)
Corei7 (-march=corei7): 12.57 (SE +/- 0.00, N = 5)
Corei7-AVX (-march=corei7-avx): 14.07 (SE +/- 0.00, N = 5)
Nocona (-march=nocona): 12.64 (SE +/- 0.01, N = 5)

(CC) gcc options: -pipe -O3 -lm

N-Queens

Elapsed Time

N-Queens 1.0 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 44.51 (SE +/- 0.00, N = 3)
Core-AVX2 (-march=core-avx2): 42.54 (SE +/- 0.01, N = 3)
Core2 (-march=core2): 44.51 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 44.54 (SE +/- 0.02, N = 3)
Corei7-AVX (-march=corei7-avx): 44.50 (SE +/- 0.00, N = 3)
Nocona (-march=nocona): 45.76 (SE +/- 0.00, N = 3)

(CC) gcc options: -static -fopenmp -O3

Ogg Encoding

WAV To Ogg

Ogg Encoding 1.3.0 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 7.06 (SE +/- 0.01, N = 5)
Core-AVX2 (-march=core-avx2): 7.06 (SE +/- 0.00, N = 5)
Core2 (-march=core2): 7.06 (SE +/- 0.01, N = 5)
Corei7 (-march=corei7): 7.08 (SE +/- 0.01, N = 5)
Corei7-AVX (-march=corei7-avx): 7.06 (SE +/- 0.00, N = 5)
Nocona (-march=nocona): 7.07 (SE +/- 0.01, N = 5)

(CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -lm -logg

Primesieve

1e12 Prime Number Generation

Primesieve 4.2 (Seconds, Fewer Is Better)

Core-AVX-I: 93.17 (SE +/- 0.43, N = 3)
Core-AVX2: 93.19 (SE +/- 0.44, N = 3)
Core2: 93.15 (SE +/- 0.41, N = 3)
Corei7: 93.14 (SE +/- 0.37, N = 3)
Corei7-AVX: 93.15 (SE +/- 0.35, N = 3)
Nocona: 93.31 (SE +/- 0.33, N = 3)

(CXX) g++ options: -O2 -fopenmp

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, More Is Better)

Core-AVX-I (-march=core-avx-i): 595.28 (SE +/- 0.41, N = 4)
Core-AVX2 (-march=core-avx2): 595.70 (SE +/- 0.48, N = 4)
Core2 (-march=core2): 595.70 (SE +/- 0.48, N = 4)
Corei7 (-march=corei7): 596.52 (SE +/- 0.00, N = 4)
Corei7-AVX (-march=corei7-avx): 594.05 (SE +/- 0.47, N = 4)
Nocona (-march=nocona): 698.37 (SE +/- 0.66, N = 4)

(CXX) g++ options: -O3

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 (Mflops, More Is Better)

Core-AVX-I (-march=core-avx-i): 329.12 (SE +/- 1.63, N = 4)
Core-AVX2 (-march=core-avx2): 335.89 (SE +/- 1.12, N = 4)
Core2 (-march=core2): 319.57 (SE +/- 0.57, N = 4)
Corei7 (-march=corei7): 316.62 (SE +/- 1.20, N = 4)
Corei7-AVX (-march=corei7-avx): 315.45 (SE +/- 0.73, N = 4)
Nocona (-march=nocona): 318.08 (SE +/- 0.48, N = 4)

(CXX) g++ options: -O3

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, More Is Better)

Core-AVX-I (-march=core-avx-i): 2184.67 (SE +/- 9.72, N = 4)
Core-AVX2 (-march=core-avx2): 2332.25 (SE +/- 3.18, N = 4)
Core2 (-march=core2): 2379.68 (SE +/- 3.31, N = 4)
Corei7 (-march=corei7): 2388.35 (SE +/- 2.84, N = 4)
Corei7-AVX (-march=corei7-avx): 2322.34 (SE +/- 3.15, N = 4)
Nocona (-march=nocona): 2346.98 (SE +/- 32.43, N = 4)

(CXX) g++ options: -O3

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 (Mflops, More Is Better)

Core-AVX-I (-march=core-avx-i): 2794.16 (SE +/- 3.50, N = 4)
Core-AVX2 (-march=core-avx2): 2783.23 (SE +/- 3.48, N = 4)
Core2 (-march=core2): 2853.98 (SE +/- 4.81, N = 4)
Corei7 (-march=corei7): 2842.55 (SE +/- 1.89, N = 4)
Corei7-AVX (-march=corei7-avx): 2786.85 (SE +/- 1.82, N = 4)
Nocona (-march=nocona): 2589.06 (SE +/- 12.57, N = 2)

(CXX) g++ options: -O3

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 (Mflops, More Is Better)

Core-AVX-I (-march=core-avx-i): 1142.92 (SE +/- 1.07, N = 4)
Core-AVX2 (-march=core-avx2): 1142.92 (SE +/- 1.07, N = 4)
Core2 (-march=core2): 1140.79 (SE +/- 1.07, N = 4)
Corei7 (-march=corei7): 1140.79 (SE +/- 1.07, N = 4)
Corei7-AVX (-march=corei7-avx): 1142.92 (SE +/- 1.07, N = 4)
Nocona (-march=nocona): 1140.79 (SE +/- 1.07, N = 4)

(CXX) g++ options: -O3

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)

Core-AVX-I (-march=core-avx-i): 29 (SE +/- 0.33, N = 3)
Core-AVX2 (-march=core-avx2): 29 (SE +/- 0.33, N = 3)
Core2 (-march=core2): 30 (SE +/- 0.00, N = 3)
Corei7 (-march=corei7): 30 (SE +/- 0.00, N = 3)
Corei7-AVX (-march=corei7-avx): 29 (SE +/- 0.33, N = 3)
Nocona (-march=nocona): 30 (SE +/- 0.00, N = 3)

(CXX) g++ options: -fopenmp -O3

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.2z (FPS, More Is Better)

Core-AVX-I (-march=core-avx-i): 133.36 (SE +/- 0.40, N = 3)
Core-AVX2 (-march=core-avx2): 133.95 (SE +/- 0.65, N = 3)
Core2 (-march=core2): 146.09 (SE +/- 0.25, N = 3)
Corei7 (-march=corei7): 150.07 (SE +/- 0.19, N = 3)
Corei7-AVX (-march=corei7-avx): 132.85 (SE +/- 0.31, N = 3)
Nocona (-march=nocona): 144.87 (SE +/- 0.90, N = 3)

(CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

x264

H.264 Video Encoding

x264 2013-06-08 (Frames Per Second, More Is Better)

Core-AVX-I (-march=core-avx-i): 163.77 (SE +/- 0.36, N = 5)
Core-AVX2 (-march=core-avx2): 162.96 (SE +/- 0.32, N = 5)
Core2 (-march=core2): 162.10 (SE +/- 0.45, N = 5)
Corei7 (-march=corei7): 162.76 (SE +/- 0.56, N = 5)
Corei7-AVX (-march=corei7-avx): 162.86 (SE +/- 0.58, N = 5)
Nocona (-march=nocona): 162.14 (SE +/- 0.30, N = 5)

(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.5