Intel Ivy Bridge GCC 4.7 LTO

Quick benchmarking of GCC 4.7.1 with Link-Time Optimization (LTO) support. The test profiles were built once with CFLAGS/CXXFLAGS that include -flto for link-time optimization and once without it. Benchmarking by Michael Larabel for a future article looking briefly at GCC's Link-Time Optimization support for some C/C++ programs.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1208200-SU-INTELIVYB99
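
The two runs differ only in whether -flto is part of CFLAGS/CXXFLAGS. A minimal sketch of preparing both environments follows. It assumes the test install scripts pick up CFLAGS/CXXFLAGS from the environment, and it takes "-march=native -O3" as the base flags because most graph footnotes in this result file report them. The helper name build_env is hypothetical.

```python
import os

# Base flags reported in most graph footnotes of this result file (assumption:
# the same base flags were exported for both runs).
BASE_FLAGS = "-march=native -O3"

# Comparison command quoted in this result file.
COMMAND = ["phoronix-test-suite", "benchmark", "1208200-SU-INTELIVYB99"]


def build_env(use_lto, base_env=None):
    """Return a copy of the environment with CFLAGS/CXXFLAGS set for one run."""
    env = dict(os.environ if base_env is None else base_env)
    flags = BASE_FLAGS + (" -flto" if use_lto else "")
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


# Build both environments from an empty base so the output is reproducible.
no_lto_env = build_env(use_lto=False, base_env={})
lto_env = build_env(use_lto=True, base_env={})
print("No LTO:       ", no_lto_env["CFLAGS"])
print("LTO Optimized:", lto_env["CFLAGS"])

# To launch a run (not executed here):
#   import subprocess
#   subprocess.run(COMMAND, env=build_env(use_lto=True), check=True)
```

The sketch only constructs the environments. Launching the suite is left commented out because it requires the Phoronix Test Suite to be installed.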

Runs:
No LTO: August 20 2012
LTO Optimized: August 20 2012


Processor: Intel Core i7-3517UE @ 2.10GHz (4 Cores)
Motherboard: CompuLab Intense-PC
Chipset: Intel 3rd Gen Core DRAM
Memory: 8192MB
Disk: 500GB Hitachi HCC54755
Graphics: Intel 3rd Gen Core
Audio: Realtek ALC888
Monitor: VA2431
Network: Intel 82579LM Gigabit Connection + Realtek RTL8188CE 802.11b/g/n
OS: Ubuntu 12.10
Kernel: 3.6.0-999-generic (x86_64)
Desktop: Unity 2D 6.2.0
Display Server: X Server 1.12.1.902 (1.12.2 RC 2)
Display Driver: intel 2.20.3
OpenGL: 2.1 Mesa 8.1-devel (git-6a3ac03)
Compiler: GCC 4.7 + LLVM 3.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs
- --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v
- Scaling Governor: ondemand

No LTO vs. LTO Optimized Comparison
Timed PHP Compilation, Time To Compile: 182% (No LTO faster)
BYTE Unix Benchmark, Dhrystone 2: 34.7% (LTO Optimized faster)
Open FMM Nero2D, Total Time: 10.4% (LTO Optimized faster)
TTSIOD 3D Renderer, Phong Rendering With Soft-Shadow Mapping: 4.4% (No LTO faster)

Intel Ivy Bridge GCC 4.7 LTO

Test                                                        No LTO        LTO Optimized
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping   55.95         53.60
byte: Dhrystone 2                                           19378446.13   26106973.23
himeno: Poisson Pressure Solver                             1168.77       1177.98
apache: Static Web Page Serving                             16455.62      16489.65
pgbench: TPC-B Transactions Per Second                      123.25        125.14
build-php: Time To Compile                                  64.40         181.61
c-ray: Total Time                                           92.12         91.83
smallpt: Global Illumination Renderer; 100 Samples          70            70
compress-lzma: 256MB File Compression                       181.52        180.37
nero2d: Total Time                                          577.51        523.28
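
The percentages in the comparison summary can be checked against the table values. The sketch below assumes each percentage is the amount by which the larger of the two results exceeds the smaller one. That is an assumption about how the chart is derived, although it does reproduce the four reported figures. The function name spread_percent is hypothetical.

```python
# (test, No LTO, LTO Optimized) copied from the results table in this file.
RESULTS = [
    ("Timed PHP Compilation (s)", 64.40, 181.61),
    ("BYTE Dhrystone 2 (LPS)", 19378446.13, 26106973.23),
    ("Open FMM Nero2D (s)", 577.51, 523.28),
    ("TTSIOD 3D Renderer (FPS)", 55.95, 53.60),
]


def spread_percent(a, b):
    """Percentage by which the larger value exceeds the smaller one."""
    return (max(a, b) / min(a, b) - 1.0) * 100.0


spreads = {name: spread_percent(no_lto, lto) for name, no_lto, lto in RESULTS}
for name, pct in spreads.items():
    print(f"{name}: {pct:.1f}%")
```

Whether a spread favors LTO depends on the unit: for results in seconds the smaller value is the better one, while for LPS and FPS the larger value is.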

TTSIOD 3D Renderer

TTSIOD 3D Renderer 2.2w, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
No LTO: 55.95 (SE +/- 0.17, N = 3; Min: 55.72 / Avg: 55.95 / Max: 56.28)
LTO Optimized: 53.60 (SE +/- 0.04, N = 3; Min: 53.54 / Avg: 53.6 / Max: 53.66)
1. (CXX) g++ options: -march=native -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

BYTE Unix Benchmark

BYTE Unix Benchmark 3.6, Computational Test: Dhrystone 2 (LPS, More Is Better)
No LTO: 19378446.13 (SE +/- 22656.15, N = 3; Min: 19338244.4 / Avg: 19378446.13 / Max: 19416651.4)
LTO Optimized: 26106973.23 (SE +/- 22328.22, N = 3; Min: 26062996.4 / Avg: 26106973.23 / Max: 26135683), built with -flto
1. (CC) gcc options: -march=native -O3

Himeno Benchmark

The Himeno benchmark is a linear solver for the pressure Poisson equation using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
No LTO: 1168.77 (SE +/- 4.94, N = 3; Min: 1163.4 / Avg: 1168.77 / Max: 1178.64)
LTO Optimized: 1177.98 (SE +/- 2.96, N = 3; Min: 1172.37 / Avg: 1177.98 / Max: 1182.44), built with -flto
1. (CC) gcc options: -O3 -march=native

Apache Benchmark

Apache Benchmark 2.2.21, Static Web Page Serving (Requests Per Second, More Is Better)
No LTO: 16455.62 (SE +/- 72.79, N = 3; Min: 16310.06 / Avg: 16455.62 / Max: 16530.69)
LTO Optimized: 16489.65 (SE +/- 20.76, N = 3; Min: 16449.66 / Avg: 16489.65 / Max: 16519.32), built with -flto
1. (CC) gcc options: -pthread -march=native -O3 -lm -lexpat -lrt -lcrypt -lpthread -ldl

PostgreSQL pgbench

PostgreSQL pgbench 8.4.11, TPC-B Transactions Per Second (TPS, More Is Better)
No LTO: 123.25 (SE +/- 0.23, N = 3; Min: 122.81 / Avg: 123.25 / Max: 123.58)
LTO Optimized: 125.14 (SE +/- 0.33, N = 3; Min: 124.68 / Avg: 125.14 / Max: 125.77), built with -flto
1. (CC) gcc options: -march=native -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm

Timed PHP Compilation

Timed PHP Compilation 5.2.9, Time To Compile (Seconds, Fewer Is Better)
No LTO: 64.40 (SE +/- 0.35, N = 3; Min: 63.72 / Avg: 64.4 / Max: 64.89)
LTO Optimized: 181.61 (SE +/- 0.22, N = 3; Min: 181.18 / Avg: 181.61 / Max: 181.92), built with -flto
1. (CC) gcc options: -march=native -O3 -pedantic -ldl -lz -lm

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
No LTO: 92.12 (SE +/- 0.43, N = 3; Min: 91.65 / Avg: 92.12 / Max: 92.97)
LTO Optimized: 91.83 (SE +/- 0.20, N = 3; Min: 91.5 / Avg: 91.83 / Max: 92.21), built with -flto
1. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Smallpt is a C++ global illumination renderer written in fewer than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing, and multi-threading is supported via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
No LTO: 70 (SE +/- 0.00, N = 3; Min: 70 / Avg: 70 / Max: 70)
LTO Optimized: 70 (SE +/- 0.00, N = 3; Min: 70 / Avg: 70 / Max: 70), built with -flto
1. (CXX) g++ options: -fopenmp -march=native -O3

LZMA Compression

LZMA Compression, 256MB File Compression (Seconds, Fewer Is Better)
No LTO: 181.52 (SE +/- 4.30, N = 6; Min: 173.28 / Avg: 181.52 / Max: 199.19)
LTO Optimized: 180.37 (SE +/- 4.32, N = 6; Min: 174.03 / Avg: 180.37 / Max: 200.24), built with -flto
1. (CC) gcc options: -march=native -O3

Open FMM Nero2D

Open FMM Nero2D 2.0.2, Total Time (Seconds, Fewer Is Better)
No LTO: 577.51
LTO Optimized: 523.28, built with -flto
1. (CXX) g++ options: -march=native -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm