GCC 4.9 Intel Core i7 Haswell Core-AVX2

Intel Core i7-4770K (Haswell) testing of GCC 4.8.1 against an early GCC 4.9.0 compiler snapshot. Both compilers were tested with CFLAGS/CXXFLAGS of -O3 -march=core-avx2 for the Intel Core i7 Haswell CPU. Benchmarking by Michael Larabel for a future article on phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1306266-SO-GCC49INTE20&grs&sor.

GCC 4.8.1:
Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores)
Motherboard: Intel DH87RL
Chipset: Intel 4th Gen Core DRAM
Memory: 15360MB
Disk: 240GB OCZ VERTEX3
Graphics: Intel Haswell Desktop
Audio: Intel Haswell HDMI
Monitor: VA2431
Network: Intel Connection I217-V
OS: Ubuntu 13.10
Kernel: 3.10.0-999-generic (x86_64)
Desktop: KDE 4.10.4
Display Server: X Server 1.13.3
Display Driver: intel 2.21.9
OpenGL: 3.0 Mesa 9.2.0-devel (git-bbd2d57)
Compiler: GCC 4.8.1 + Clang 3.3 + LLVM 3.3
File-System: ext4
Screen Resolution: 1920x1080

GCC 4.9.0 20130623 (only the differing field is listed):
Compiler: GCC 4.9.0 20130623 + Clang 3.3 + LLVM 3.3

Compiler Details: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran
Processor Details: Scaling Governor: acpi-cpufreq ondemand

Test                                                GCC 4.8.1  GCC 4.9.0 20130623
himeno: Poisson Pressure Solver                     1048.77    1588.37
blake2: Phoronix Test Suite v4.8.0m1                5.71       5.29
build-imagemagick: Time To Compile                  79.18      74.59
scimark2: Sparse Matrix Multiply                    1204.00    1148.95
graphics-magick: Resizing                           175        181
scimark2: Fast Fourier Transform                    242.66     248.01
encode-flac: WAV To FLAC                            5.37       5.26
graphics-magick: Local Adaptive Thresholding        118        120
apache: Static Web Page Serving                     25415.42   25832.90
scimark2: Dense LU Matrix Factorization             1825.43    1851.86
build-linux-kernel: Time To Compile                 96.38      97.73
graphics-magick: HWB Color Space                    206        208
graphics-magick: Sharpen                            134        135
hmmer: Pfam Database Search                         10.47      10.40
scimark2: Composite                                 1009.99    1005.56
x264: H.264 Video Encoding                          156.26     155.67
c-ray: Total Time                                   17.06      17.03
scimark2: Jacobi Successive Over-Relaxation         1169.09    1170.18
n-queens: Elapsed Time                              36.69      36.72
primesieve: 1e12 Prime Number Generation            79.17      79.20
scimark2: Monte Carlo                               608.79     608.78
smallpt: Global Illumination Renderer; 100 Samples  25         25
graphics-magick: Blur                               137        137
mafft: Multiple Sequence Alignment                  5.47       5.52
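The overview figures above mix "more is better" and "fewer is better" metrics, so a raw difference is hard to read at a glance. The following is a minimal sketch of how the relative change from GCC 4.8.1 to GCC 4.9.0 could be computed for a handful of the results. The helper name `percent_gain` and its sign convention (positive means GCC 4.9.0 is better) are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Relative change of GCC 4.9.0 over GCC 4.8.1 for a few results from the overview.
# The helper name and sign convention are illustrative assumptions.

def percent_gain(old, new, more_is_better):
    """Return the improvement of `new` over `old` in percent (positive = better)."""
    if more_is_better:
        return (new - old) / old * 100.0
    # For time-like metrics a lower value is the improvement.
    return (old - new) / old * 100.0

# (test, GCC 4.8.1, GCC 4.9.0 20130623, more_is_better)
results = [
    ("himeno: Poisson Pressure Solver", 1048.77, 1588.37, True),
    ("blake2", 5.71, 5.29, False),
    ("build-imagemagick: Time To Compile", 79.18, 74.59, False),
    ("scimark2: Sparse Matrix Multiply", 1204.00, 1148.95, True),
    ("build-linux-kernel: Time To Compile", 96.38, 97.73, False),
]

gains = {name: percent_gain(old, new, mib) for name, old, new, mib in results}

for name, gain in gains.items():
    print(f"{name}: {gain:+.2f}%")
```

With this convention the Himeno result comes out at roughly +51% for GCC 4.9.0, while SciMark Sparse Matrix Multiply and the kernel build come out slightly negative.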

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
GCC 4.9.0 20130623: 1588.37 (SE +/- 2.58, N = 3)
GCC 4.8.1: 1048.77 (SE +/- 5.10, N = 3)
(CC) gcc options: -O3 -march=core-avx2

BLAKE2

Phoronix Test Suite v4.8.0m1

BLAKE2 20121223, Phoronix Test Suite v4.8.0m1 (Cycles Per Byte, Fewer Is Better)
GCC 4.9.0 20130623: 5.29 (SE +/- 0.00, N = 3)
GCC 4.8.1: 5.71 (SE +/- 0.08, N = 3)
(CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.8.1-10, Time To Compile (Seconds, Fewer Is Better)
GCC 4.9.0 20130623: 74.59 (SE +/- 0.11, N = 3)
GCC 4.8.1: 79.18 (SE +/- 0.28, N = 3)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
GCC 4.8.1: 1204.00 (SE +/- 8.39, N = 4)
GCC 4.9.0 20130623: 1148.95 (SE +/- 1.61, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.16, Operation: Resizing (Iterations Per Minute, More Is Better)
GCC 4.9.0 20130623: 181 (SE +/- 0.00, N = 3)
GCC 4.8.1: 175 (SE +/- 2.73, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=core-avx2 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lpthread

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
GCC 4.9.0 20130623: 248.01 (SE +/- 0.51, N = 4)
GCC 4.8.1: 242.66 (SE +/- 2.71, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0, WAV To FLAC (Seconds, Fewer Is Better)
GCC 4.9.0 20130623: 5.26 (SE +/- 0.00, N = 5)
GCC 4.8.1: 5.37 (SE +/- 0.00, N = 5)
(CXX) g++ options: -O3 -march=core-avx2 -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.16, Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)
GCC 4.9.0 20130623: 120 (SE +/- 0.33, N = 3)
GCC 4.8.1: 118 (SE +/- 0.33, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=core-avx2 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lpthread

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.3, Static Web Page Serving (Requests Per Second, More Is Better)
GCC 4.9.0 20130623: 25832.90 (SE +/- 143.63, N = 3)
GCC 4.8.1: 25415.42 (SE +/- 262.62, N = 3)
(CC) gcc options: -shared -fPIC -pthread -O3 -march=core-avx2

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
GCC 4.9.0 20130623: 1851.86 (SE +/- 2.63, N = 4)
GCC 4.8.1: 1825.43 (SE +/- 17.54, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 3.1, Time To Compile (Seconds, Fewer Is Better)
GCC 4.8.1: 96.38 (SE +/- 0.56, N = 3)
GCC 4.9.0 20130623: 97.73 (SE +/- 0.69, N = 3)

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.16, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
GCC 4.9.0 20130623: 208 (SE +/- 0.58, N = 3)
GCC 4.8.1: 206 (SE +/- 0.67, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=core-avx2 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.16, Operation: Sharpen (Iterations Per Minute, More Is Better)
GCC 4.9.0 20130623: 135 (SE +/- 0.00, N = 3)
GCC 4.8.1: 134 (SE +/- 0.33, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=core-avx2 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)
GCC 4.9.0 20130623: 10.40 (SE +/- 0.03, N = 3)
GCC 4.8.1: 10.47 (SE +/- 0.01, N = 3)
(CC) gcc options: -O3 -march=core-avx2 -pthread -lhmmer -lsquid -lm

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
GCC 4.8.1: 1009.99 (SE +/- 3.60, N = 4)
GCC 4.9.0 20130623: 1005.56 (SE +/- 0.86, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

x264

H.264 Video Encoding

x264 2013-06-08, H.264 Video Encoding (Frames Per Second, More Is Better)
GCC 4.8.1: 156.26 (SE +/- 0.79, N = 5)
GCC 4.9.0 20130623: 155.67 (SE +/- 0.90, N = 5)
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=core-avx2 -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

C-Ray

Total Time

C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
GCC 4.9.0 20130623: 17.03 (SE +/- 0.01, N = 3)
GCC 4.8.1: 17.06 (SE +/- 0.00, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=core-avx2

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
GCC 4.9.0 20130623: 1170.18 (SE +/- 0.00, N = 4)
GCC 4.8.1: 1169.09 (SE +/- 3.34, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

N-Queens

Elapsed Time

N-Queens 1.0, Elapsed Time (Seconds, Fewer Is Better)
GCC 4.8.1: 36.69 (SE +/- 0.04, N = 3)
GCC 4.9.0 20130623: 36.72 (SE +/- 0.03, N = 3)
(CC) gcc options: -static -fopenmp -O3 -march=core-avx2

Primesieve

1e12 Prime Number Generation

Primesieve 4.2, 1e12 Prime Number Generation (Seconds, Fewer Is Better)
GCC 4.8.1: 79.17 (SE +/- 0.14, N = 3)
GCC 4.9.0 20130623: 79.20 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O2 -fopenmp

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
GCC 4.8.1: 608.79 (SE +/- 1.29, N = 4)
GCC 4.9.0 20130623: 608.78 (SE +/- 0.43, N = 4)
(CXX) g++ options: -O3 -march=core-avx2

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
GCC 4.8.1: 25 (SE +/- 0.00, N = 3)
GCC 4.9.0 20130623: 25 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=core-avx2

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.16, Operation: Blur (Iterations Per Minute, More Is Better)
GCC 4.9.0 20130623: 137 (SE +/- 0.00, N = 3)
GCC 4.8.1: 137 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=core-avx2 -pthread -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864, Multiple Sequence Alignment (Seconds, Fewer Is Better)
GCC 4.8.1: 5.47 (SE +/- 0.13, N = 6)
GCC 4.9.0 20130623: 5.52 (SE +/- 0.15, N = 6)
(CC) gcc options: -O3 -lm -lpthread
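Several of the results, including the MAFFT timing above, differ by less than their reported standard errors. As a rough sketch of how one might separate real differences from run-to-run noise, the snippet below compares the gap between two means with their combined standard error. The function name `differs_beyond_noise` and the threshold `k=2.0` are assumptions chosen for illustration; with only 3 to 6 runs per result this is a heuristic, not a formal significance test.

```python
import math

def differs_beyond_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """True if |mean_a - mean_b| exceeds k times the combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) > k * combined_se

# MAFFT: 5.47 +/- 0.13 s (GCC 4.8.1) vs 5.52 +/- 0.15 s (GCC 4.9.0)
mafft_differs = differs_beyond_noise(5.47, 0.13, 5.52, 0.15)

# Himeno: means 1048.77 and 1588.37 MFLOPS, reported standard errors 5.10 and 2.58
himeno_differs = differs_beyond_noise(1048.77, 5.10, 1588.37, 2.58)

print("MAFFT differs beyond noise:", mafft_differs)
print("Himeno differs beyond noise:", himeno_differs)
```

Under this heuristic the MAFFT gap falls inside the noise, whereas the Himeno gap is far outside it.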


Phoronix Test Suite v10.8.4