Intel Ivy Bridge GCC 4.7 LTO Quick benchmarking of GCC 4.7.1 with the Link-Time Optimization (LTO) support. Building the test profiles with CFLAGS/CXXFLAGS that includes -flto for link-time optimizing and then without. Benchmarking by Michael Larabel for a future article looking briefly at the GCC Link-Time Optimization support for some C/C++ programs.
HTML result view exported from: https://openbenchmarking.org/result/1208200-SU-INTELIVYB99&sro .
Intel Ivy Bridge GCC 4.7 LTO Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution No LTO LTO Optimized Intel Core i7-3517UE @ 2.10GHz (4 Cores) CompuLab Intense-PC Intel 3rd Gen Core DRAM 8192MB 500GB Hitachi HCC54755 Intel 3rd Gen Core Realtek ALC888 VA2431 Intel 82579LM Gigabit Connection + Realtek RTL8188CE 802.11b/g/n Ubuntu 12.10 3.6.0-999-generic (x86_64) Unity 2D 6.2.0 X Server 1.12.1.902 (1.12.2 RC 2) intel 2.20.3 2.1 Mesa 8.1-devel (git-6a3ac03) GCC 4.7 + LLVM 3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v Processor Details - Scaling Governor: ondemand
Intel Ivy Bridge GCC 4.7 LTO byte: Dhrystone 2 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping himeno: Poisson Pressure Solver build-php: Time To Compile c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples compress-lzma: 256MB File Compression nero2d: Total Time pgbench: TPC-B Transactions Per Second apache: Static Web Page Serving No LTO LTO Optimized 19378446.13 55.95 1168.77 64.40 92.12 70 181.52 577.51 123.25 16455.62 26106973.23 53.60 1177.98 181.61 91.83 70 180.37 523.28 125.14 16489.65 OpenBenchmarking.org
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 LTO Optimized No LTO 6M 12M 18M 24M 30M SE +/- 22328.22, N = 3 SE +/- 22656.15, N = 3 26106973.23 19378446.13 -flto 1. (CC) gcc options: -march=native -O3
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2w Phong Rendering With Soft-Shadow Mapping LTO Optimized No LTO 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.17, N = 3 53.60 55.95 1. (CXX) g++ options: -march=native -O3 -flto -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver LTO Optimized No LTO 300 600 900 1200 1500 SE +/- 2.96, N = 3 SE +/- 4.94, N = 3 1177.98 1168.77 -flto 1. (CC) gcc options: -O3 -march=native
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 5.2.9 Time To Compile LTO Optimized No LTO 40 80 120 160 200 SE +/- 0.22, N = 3 SE +/- 0.35, N = 3 181.61 64.40 -flto 1. (CC) gcc options: -march=native -O3 -pedantic -ldl -lz -lm
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time LTO Optimized No LTO 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.43, N = 3 91.83 92.12 -flto 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples LTO Optimized No LTO 16 32 48 64 80 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 70 70 -flto 1. (CXX) g++ options: -fopenmp -march=native -O3
LZMA Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better LZMA Compression 256MB File Compression LTO Optimized No LTO 40 80 120 160 200 SE +/- 4.32, N = 6 SE +/- 4.30, N = 6 180.37 181.52 -flto 1. (CC) gcc options: -march=native -O3
Open FMM Nero2D Total Time OpenBenchmarking.org Seconds, Fewer Is Better Open FMM Nero2D 2.0.2 Total Time LTO Optimized No LTO 120 240 360 480 600 523.28 577.51 -flto 1. (CXX) g++ options: -march=native -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm
PostgreSQL pgbench TPC-B Transactions Per Second OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 8.4.11 TPC-B Transactions Per Second LTO Optimized No LTO 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.23, N = 3 125.14 123.25 -flto 1. (CC) gcc options: -march=native -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving LTO Optimized No LTO 4K 8K 12K 16K 20K SE +/- 20.76, N = 3 SE +/- 72.79, N = 3 16489.65 16455.62 -flto 1. (CC) gcc options: -pthread -march=native -O3 -lm -lexpat -lrt -lcrypt -lpthread -ldl
Phoronix Test Suite v10.8.4