Intel Ivy Bridge GCC 4.7 LTO

Quick benchmarking of GCC 4.7.1 with the Link-Time Optimization (LTO) support. Building the test profiles with CFLAGS/CXXFLAGS that includes -flto for link-time optimizing and then without. Benchmarking by Michael Larabel for a future article looking briefly at the GCC Link-Time Optimization support for some C/C++ programs.

HTML result view exported from: https://openbenchmarking.org/result/1208200-SU-INTELIVYB99&sor.

Intel Ivy Bridge GCC 4.7 LTOProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionNo LTOLTO OptimizedIntel Core i7-3517UE @ 2.10GHz (4 Cores)CompuLab Intense-PCIntel 3rd Gen Core DRAM8192MB500GB Hitachi HCC54755Intel 3rd Gen CoreRealtek ALC888VA2431Intel 82579LM Gigabit Connection + Realtek RTL8188CE 802.11b/g/nUbuntu 12.103.6.0-999-generic (x86_64)Unity 2D 6.2.0X Server 1.12.1.902 (1.12.2 RC 2)intel 2.20.32.1 Mesa 8.1-devel (git-6a3ac03)GCC 4.7 + LLVM 3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v Processor Details- Scaling Governor: ondemand

Intel Ivy Bridge GCC 4.7 LTObyte: Dhrystone 2ttsiod-renderer: Phong Rendering With Soft-Shadow Mappinghimeno: Poisson Pressure Solverbuild-php: Time To Compilec-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplescompress-lzma: 256MB File Compressionnero2d: Total Timepgbench: TPC-B Transactions Per Secondapache: Static Web Page ServingNo LTOLTO Optimized19378446.1355.951168.7764.4092.1270181.52577.51123.2516455.6226106973.2353.601177.98181.6191.8370180.37523.28125.1416489.65OpenBenchmarking.org

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2LTO OptimizedNo LTO6M12M18M24M30MSE +/- 22328.22, N = 3SE +/- 22656.15, N = 326106973.2319378446.13-flto1. (CC) gcc options: -march=native -O3

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.2wPhong Rendering With Soft-Shadow MappingNo LTOLTO Optimized1326395265SE +/- 0.17, N = 3SE +/- 0.04, N = 355.9553.601. (CXX) g++ options: -march=native -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverLTO OptimizedNo LTO30060090012001500SE +/- 2.96, N = 3SE +/- 4.94, N = 31177.981168.77-flto1. (CC) gcc options: -O3 -march=native

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To CompileNo LTOLTO Optimized4080120160200SE +/- 0.35, N = 3SE +/- 0.22, N = 364.40181.61-flto1. (CC) gcc options: -march=native -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeLTO OptimizedNo LTO20406080100SE +/- 0.20, N = 3SE +/- 0.43, N = 391.8392.12-flto1. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesNo LTOLTO Optimized1632486480SE +/- 0.00, N = 3SE +/- 0.00, N = 37070-flto1. (CXX) g++ options: -fopenmp -march=native -O3

LZMA Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterLZMA Compression256MB File CompressionLTO OptimizedNo LTO4080120160200SE +/- 4.32, N = 6SE +/- 4.30, N = 6180.37181.52-flto1. (CC) gcc options: -march=native -O3

Open FMM Nero2D

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeLTO OptimizedNo LTO120240360480600523.28577.51-flto1. (CXX) g++ options: -march=native -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm

PostgreSQL pgbench

TPC-B Transactions Per Second

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 8.4.11TPC-B Transactions Per SecondLTO OptimizedNo LTO306090120150SE +/- 0.33, N = 3SE +/- 0.23, N = 3125.14123.25-flto1. (CC) gcc options: -march=native -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.2.21Static Web Page ServingLTO OptimizedNo LTO4K8K12K16K20KSE +/- 20.76, N = 3SE +/- 72.79, N = 316489.6516455.62-flto1. (CC) gcc options: -pthread -march=native -O3 -lm -lexpat -lrt -lcrypt -lpthread -ldl


Phoronix Test Suite v10.8.4