Intel Ivy Bridge GCC 4.7 LTO Quick benchmarking of GCC 4.7.1 with the Link-Time Optimization (LTO) support. Building the test profiles with CFLAGS/CXXFLAGS that includes -flto for link-time optimizing and then without. Benchmarking by Michael Larabel for a future article looking briefly at the GCC Link-Time Optimization support for some C/C++ programs.
HTML result view exported from: https://openbenchmarking.org/result/1208200-SU-INTELIVYB99&grw .
Intel Ivy Bridge GCC 4.7 LTO Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution No LTO LTO Optimized Intel Core i7-3517UE @ 2.10GHz (4 Cores) CompuLab Intense-PC Intel 3rd Gen Core DRAM 8192MB 500GB Hitachi HCC54755 Intel 3rd Gen Core Realtek ALC888 VA2431 Intel 82579LM Gigabit Connection + Realtek RTL8188CE 802.11b/g/n Ubuntu 12.10 3.6.0-999-generic (x86_64) Unity 2D 6.2.0 X Server 1.12.1.902 (1.12.2 RC 2) intel 2.20.3 2.1 Mesa 8.1-devel (git-6a3ac03) GCC 4.7 + LLVM 3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v Processor Details - Scaling Governor: ondemand
Intel Ivy Bridge GCC 4.7 LTO himeno: Poisson Pressure Solver compress-lzma: 256MB File Compression build-php: Time To Compile nero2d: Total Time c-ray: Total Time ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping smallpt: Global Illumination Renderer; 100 Samples apache: Static Web Page Serving pgbench: TPC-B Transactions Per Second byte: Dhrystone 2 No LTO LTO Optimized 1168.77 181.52 64.40 577.51 92.12 55.95 70 16455.62 123.25 19378446.13 1177.98 180.37 181.61 523.28 91.83 53.60 70 16489.65 125.14 26106973.23 OpenBenchmarking.org
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver No LTO LTO Optimized 300 600 900 1200 1500 SE +/- 4.94, N = 3 SE +/- 2.96, N = 3 1168.77 1177.98 -flto 1. (CC) gcc options: -O3 -march=native
LZMA Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better LZMA Compression 256MB File Compression No LTO LTO Optimized 40 80 120 160 200 SE +/- 4.30, N = 6 SE +/- 4.32, N = 6 181.52 180.37 -flto 1. (CC) gcc options: -march=native -O3
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 5.2.9 Time To Compile No LTO LTO Optimized 40 80 120 160 200 SE +/- 0.35, N = 3 SE +/- 0.22, N = 3 64.40 181.61 -flto 1. (CC) gcc options: -march=native -O3 -pedantic -ldl -lz -lm
Open FMM Nero2D Total Time OpenBenchmarking.org Seconds, Fewer Is Better Open FMM Nero2D 2.0.2 Total Time No LTO LTO Optimized 120 240 360 480 600 577.51 523.28 -flto 1. (CXX) g++ options: -march=native -O3 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time No LTO LTO Optimized 20 40 60 80 100 SE +/- 0.43, N = 3 SE +/- 0.20, N = 3 92.12 91.83 -flto 1. (CC) gcc options: -lm -lpthread -O3 -march=native
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2w Phong Rendering With Soft-Shadow Mapping No LTO LTO Optimized 13 26 39 52 65 SE +/- 0.17, N = 3 SE +/- 0.04, N = 3 55.95 53.60 1. (CXX) g++ options: -march=native -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples No LTO LTO Optimized 16 32 48 64 80 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 70 70 -flto 1. (CXX) g++ options: -fopenmp -march=native -O3
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving No LTO LTO Optimized 4K 8K 12K 16K 20K SE +/- 72.79, N = 3 SE +/- 20.76, N = 3 16455.62 16489.65 -flto 1. (CC) gcc options: -pthread -march=native -O3 -lm -lexpat -lrt -lcrypt -lpthread -ldl
PostgreSQL pgbench TPC-B Transactions Per Second OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 8.4.11 TPC-B Transactions Per Second No LTO LTO Optimized 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 0.33, N = 3 123.25 125.14 -flto 1. (CC) gcc options: -march=native -O3 -fno-strict-aliasing -fwrapv -lpgport -lpq -lcrypt -ldl -lm
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 No LTO LTO Optimized 6M 12M 18M 24M 30M SE +/- 22656.15, N = 3 SE +/- 22328.22, N = 3 19378446.13 26106973.23 -flto 1. (CC) gcc options: -march=native -O3
Phoronix Test Suite v10.8.4