Intel Ivy Bridge GCC 4.7 LTO

Quick benchmarking of GCC 4.7.1 with Link-Time Optimization (LTO) support. The test profiles were built once with CFLAGS/CXXFLAGS that include -flto for link-time optimization, and once without it. Benchmarking by Michael Larabel for a future article looking briefly at GCC's Link-Time Optimization support for some C/C++ programs.

HTML result view exported from: https://openbenchmarking.org/result/1208200-SU-INTELIVYB99&gru.

System Details (shared by both runs, "No LTO" and "LTO Optimized")

Processor:          Intel Core i7-3517UE @ 2.10GHz (4 Cores)
Motherboard:        CompuLab Intense-PC
Chipset:            Intel 3rd Gen Core DRAM
Memory:             8192MB
Disk:               500GB Hitachi HCC54755
Graphics:           Intel 3rd Gen Core
Audio:              Realtek ALC888
Monitor:            VA2431
Network:            Intel 82579LM Gigabit Connection + Realtek RTL8188CE 802.11b/g/n
OS:                 Ubuntu 12.10
Kernel:             3.6.0-999-generic (x86_64)
Desktop:            Unity 2D 6.2.0
Display Server:     X Server 1.12.1.902 (1.12.2 RC 2)
Display Driver:     intel 2.20.3
OpenGL:             2.1 Mesa 8.1-devel (git-6a3ac03)
Compiler:           GCC 4.7 + LLVM 3.0
File-System:        ext4
Screen Resolution:  1920x1080

Compiler Details
- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v

Processor Details
- Scaling Governor: ondemand

Results Overview

Test                                                         No LTO        LTO Optimized
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping    55.95         53.60
byte: Dhrystone 2                                            19378446.13   26106973.23
himeno: Poisson Pressure Solver                              1168.77       1177.98
apache: Static Web Page Serving                              16455.62      16489.65
pgbench: TPC-B Transactions Per Second                       123.25        125.14
build-php: Time To Compile                                   64.40         181.61
c-ray: Total Time                                            92.12         91.83
smallpt: Global Illumination Renderer; 100 Samples           70            70
compress-lzma: 256MB File Compression                        181.52        180.37
nero2d: Total Time                                           577.51        523.28
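The raw numbers in the summary above mix "more is better" and "fewer is better" units, so a per-test relative change can make them easier to compare. The following is a minimal sketch, not part of the original export: the values and their direction are copied from the results in this document, while the `RESULTS` list and the helper name `lto_gain_percent` are hypothetical names chosen for illustration.

```python
# Results copied from this document:
# (test, higher_is_better, no_lto, lto_optimized)
RESULTS = [
    ("ttsiod-renderer", True, 55.95, 53.60),        # FPS
    ("byte",            True, 19378446.13, 26106973.23),  # LPS
    ("himeno",          True, 1168.77, 1177.98),    # MFLOPS
    ("apache",          True, 16455.62, 16489.65),  # requests per second
    ("pgbench",         True, 123.25, 125.14),      # TPS
    ("build-php",       False, 64.40, 181.61),      # seconds
    ("c-ray",           False, 92.12, 91.83),       # seconds
    ("smallpt",         False, 70.0, 70.0),         # seconds
    ("compress-lzma",   False, 181.52, 180.37),     # seconds
    ("nero2d",          False, 577.51, 523.28),     # seconds
]


def lto_gain_percent(no_lto, lto, higher_is_better):
    """Return the LTO result relative to the No LTO result, in percent.

    The sign is normalized so that a positive value always means the
    LTO Optimized run did better, regardless of the unit's direction.
    """
    change = (lto - no_lto) / no_lto * 100.0
    return change if higher_is_better else -change


if __name__ == "__main__":
    for name, higher_is_better, no_lto, lto in RESULTS:
        gain = lto_gain_percent(no_lto, lto, higher_is_better)
        print(f"{name:<16} {gain:+8.2f}%")
```

Note that build-php measures how long the compiler takes to build PHP, so its large negative value reflects longer build time with -flto rather than slower generated code.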

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.2w (FPS, More Is Better)
No LTO:         55.95 (SE +/- 0.17, N = 3)
LTO Optimized:  53.60 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -march=native -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

BYTE Unix Benchmark

Computational Test: Dhrystone 2

BYTE Unix Benchmark 3.6 (LPS, More Is Better)
No LTO:                 19378446.13 (SE +/- 22656.15, N = 3)
LTO Optimized (-flto):  26106973.23 (SE +/- 22328.22, N = 3)
1. (CC) gcc options: -march=native -O3

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 (MFLOPS, More Is Better)
No LTO:                 1168.77 (SE +/- 4.94, N = 3)
LTO Optimized (-flto):  1177.98 (SE +/- 2.96, N = 3)
1. (CC) gcc options: -O3 -march=native

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.2.21 (Requests Per Second, More Is Better)
No LTO:                 16455.62 (SE +/- 72.79, N = 3)
LTO Optimized (-flto):  16489.65 (SE +/- 20.76, N = 3)
1. (CC) gcc options: -pthread -march=native -O3 -lm -lexpat -lrt -lcrypt -lpthread -ldl

PostgreSQL pgbench

TPC-B Transactions Per Second

PostgreSQL pgbench 8.4.11 (TPS, More Is Better)
No LTO:                 123.25 (SE +/- 0.23, N = 3)
LTO Optimized (-flto):  125.14 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -march=native -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9 (Seconds, Fewer Is Better)
No LTO:                 64.40 (SE +/- 0.35, N = 3)
LTO Optimized (-flto):  181.61 (SE +/- 0.22, N = 3)
1. (CC) gcc options: -march=native -O3 -pedantic -ldl -lz -lm

C-Ray

Total Time

C-Ray 1.1 (Seconds, Fewer Is Better)
No LTO:                 92.12 (SE +/- 0.43, N = 3)
LTO Optimized (-flto):  91.83 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
No LTO:                 70 (SE +/- 0.00, N = 3)
LTO Optimized (-flto):  70 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -march=native -O3

LZMA Compression

256MB File Compression

LZMA Compression (Seconds, Fewer Is Better)
No LTO:                 181.52 (SE +/- 4.30, N = 6)
LTO Optimized (-flto):  180.37 (SE +/- 4.32, N = 6)
1. (CC) gcc options: -march=native -O3

Open FMM Nero2D

Total Time

Open FMM Nero2D 2.0.2 (Seconds, Fewer Is Better)
No LTO:                 577.51
LTO Optimized (-flto):  523.28
1. (CXX) g++ options: -march=native -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm
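Several of the differences reported above are small compared with the standard errors (SE) given for each run. As a rough illustration only, the sketch below flags a difference as notable when it is larger than the sum of the two standard errors. That threshold is an assumption of this sketch, not a formal significance test and not something the original export computes. The helper name `exceeds_noise` is hypothetical, and Nero2D is left out because no SE is reported for it.

```python
# (test, no_lto, se_no_lto, lto_optimized, se_lto), copied from the results above.
# Because the check uses the sum of both SE values, it does not matter
# which SE belongs to which run.
RESULTS = [
    ("ttsiod-renderer", 55.95, 0.17, 53.60, 0.04),
    ("byte",            19378446.13, 22656.15, 26106973.23, 22328.22),
    ("himeno",          1168.77, 4.94, 1177.98, 2.96),
    ("apache",          16455.62, 72.79, 16489.65, 20.76),
    ("pgbench",         123.25, 0.23, 125.14, 0.33),
    ("build-php",       64.40, 0.35, 181.61, 0.22),
    ("c-ray",           92.12, 0.43, 91.83, 0.20),
    ("smallpt",         70.0, 0.00, 70.0, 0.00),
    ("compress-lzma",   181.52, 4.30, 180.37, 4.32),
]


def exceeds_noise(a, se_a, b, se_b):
    """Rough heuristic (assumed for this sketch): the gap between two
    means counts as notable only if it is larger than SE_a + SE_b."""
    return abs(a - b) > se_a + se_b


if __name__ == "__main__":
    for name, no_lto, se_no_lto, lto, se_lto in RESULTS:
        verdict = "notable" if exceeds_noise(no_lto, se_no_lto, lto, se_lto) else "within noise"
        print(f"{name:<16} {verdict}")
```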


Phoronix Test Suite v10.8.4