GCC 4.x Benchmarking Intel Core i7 Benchmarking of GCC 4.2 through GCC 4.8 when building the compiler the same and setting CFLAGS/CXXFLAGS of -O3 and -march=native prior to test installation and execution. Benchmarking for a future article on Phoronix.com by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1207067-PTS-GCC4BENC69&sro&grr .
GCC 4.x Benchmarking Intel Core i7 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 Intel Core i7 720Q @ 1.60GHz (8 Cores) LENOVO 4318CTO Intel Core DMI 4096MB 160GB INTEL SSDSA2M160 NVIDIA Quadro FX 880M Conexant CX20585 Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300 Ubuntu 12.10 3.5.0-2-generic (x86_64) Unity 5.12.0 X Server 1.11.3 NVIDIA 302.17 3.3.0 GCC 4.2.4 ext4 1600x900 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 OpenBenchmarking.org Compiler Details - --enable-checking=release --enable-languages=c,c++,fortran Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
GCC 4.x Benchmarking Intel Core i7 openssl: RSA 4096-bit Performance tachyon: Total Time povray: Total Time encode-mp3: WAV To MP3 encode-flac: WAV To FLAC crafty: Elapsed Time smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time compress-7zip: Compress Speed Test graphics-magick: Local Adaptive Thresholding graphics-magick: Resizing graphics-magick: Sharpen x264: H.264 Video Encoding vpxenc: vpxenc ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping john-the-ripper: Blowfish scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo fhourstones: Complex Connect-4 Solving ffte: N=64, 1D Complex FFT Routine lammps: Rhodopsin Protein npb: UA.A npb: LU.A GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 41.45 37.95 1106 23.34 11.43 109.85 138.48 8968 13 40 15 56.91 9.60 788.14 1219.06 1118.38 167.62 371.94 8823.30 72.32 41.75 38.99 1315 23.38 11.15 116.15 118.43 8753 13 42 14 56.49 9.83 16.16 786.63 1217.67 1128.38 163.31 194.68 8908.70 74.68 41.88 38.41 984 23.04 10.62 109.12 70 116.43 8823 37 78 39 56.85 9.38 53.31 2150 789.67 1219.07 1140.18 157.15 205.71 8927.97 2954.08 71.59 20.74 4268.49 41.83 36.44 979 22.98 10.58 108.61 69 105.73 8890 37 76 38 57.15 9.50 44.78 2180 786.14 1223.26 1135.42 162.05 348.62 8887.43 2648.87 71.60 20.64 4966.54 41.93 36.09 975 23.83 10.07 110.53 69 106.69 8756 40 89 41 58.34 10.02 46.57 2216 788.66 1220.46 1144.14 151.85 344.15 8588.90 3236.52 73.58 20.44 5312.37 41.88 37.28 947 23.45 9.56 108.56 69 77.71 8857 41 90 41 57.90 9.90 51.30 2211 784.62 1217.67 1153.01 162.09 264.50 8621.07 3756.73 71.82 21.02 5536.77 42.13 37.16 949 22.41 9.33 108.42 69 77.13 8840 41 88 41 58.46 9.87 59.30 2213 788.66 1227.47 1156.25 155.32 265.49 8465.13 3757.28 71.61 21.08 5522.13 OpenBenchmarking.org
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 10 20 30 40 50 SE +/- 0.03, N = 4 SE +/- 0.16, N = 4 SE +/- 0.11, N = 4 SE +/- 0.03, N = 4 SE +/- 0.05, N = 4 SE +/- 0.06, N = 4 SE +/- 0.09, N = 4 41.45 41.75 41.88 41.83 41.93 41.88 42.13 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.98.9 Total Time GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 37.95 38.99 38.41 36.44 36.09 37.28 37.16 1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
POV-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.6.1 Total Time GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 300 600 900 1200 1500 1106 1315 984 979 975 947 949 -malign-double -malign-double -malign-double -malign-double -malign-double -malign-double 1. (CXX) g++ options: -pipe -O3 -msse -mfpmath=sse -msse2 -march=k8 -mtune=k8 -march=native -lz -lSM -lICE -lX11 -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 6 12 18 24 30 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.03, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.03, N = 5 SE +/- 0.02, N = 5 23.34 23.38 23.04 22.98 23.83 23.45 22.41
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 11.43 11.15 10.62 10.58 10.07 9.56 9.33 1. (CXX) g++ options: -O3 -march=native -logg -lm
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.23, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 SE +/- 0.28, N = 3 SE +/- 0.24, N = 3 109.85 116.15 109.12 108.61 110.53 108.56 108.42 1. (CC) gcc options: -lstdc++ -lm
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 16 32 48 64 80 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 70 69 69 69 69 1. (CXX) g++ options: -fopenmp -O3 -march=native
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.22, N = 3 SE +/- 0.03, N = 3 138.48 118.43 116.43 105.73 106.69 77.71 77.13 1. (CC) gcc options: -lm -lpthread -O3 -march=native
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 2K 4K 6K 8K 10K SE +/- 4.00, N = 3 SE +/- 92.47, N = 3 SE +/- 39.28, N = 3 SE +/- 65.68, N = 3 SE +/- 39.27, N = 3 SE +/- 37.22, N = 3 SE +/- 53.62, N = 3 8968 8753 8823 8890 8756 8857 8840 1. (CXX) g++ options: -pipe -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Local Adaptive Thresholding GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 9 18 27 36 45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 13 13 37 37 40 41 41 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Resizing GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 40 42 78 76 89 90 88 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Sharpen GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 9 18 27 36 45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 15 14 39 38 41 41 41 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2011-12-06 H.264 Video Encoding GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 13 26 39 52 65 SE +/- 0.27, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 56.91 56.49 56.85 57.15 58.34 57.90 58.46
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 9.60 9.83 9.38 9.50 10.02 9.90 9.87 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2w Phong Rendering With Soft-Shadow Mapping GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 2.37, N = 6 SE +/- 0.08, N = 3 16.16 53.31 44.78 46.57 51.30 59.30 -flto -flto -flto 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9 Test: Blowfish GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 500 1000 1500 2000 2500 SE +/- 2.60, N = 3 SE +/- 0.33, N = 3 SE +/- 1.33, N = 3 SE +/- 1.67, N = 3 SE +/- 2.60, N = 3 2150 2180 2216 2211 2213 1. (CC) gcc options: -fopenmp -lcrypt
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 200 400 600 800 1000 SE +/- 0.83, N = 4 SE +/- 1.27, N = 4 SE +/- 1.28, N = 4 SE +/- 2.17, N = 4 SE +/- 1.52, N = 4 SE +/- 1.50, N = 4 SE +/- 1.93, N = 4 788.14 786.63 789.67 786.14 788.66 784.62 788.66
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 300 600 900 1200 1500 SE +/- 1.39, N = 4 SE +/- 2.27, N = 4 SE +/- 2.66, N = 4 SE +/- 2.29, N = 4 SE +/- 2.78, N = 4 SE +/- 2.27, N = 4 SE +/- 1.41, N = 4 1219.06 1217.67 1219.07 1223.26 1220.46 1217.67 1227.47
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 200 400 600 800 1000 SE +/- 2.29, N = 4 SE +/- 1.27, N = 4 SE +/- 2.99, N = 4 SE +/- 0.79, N = 4 SE +/- 1.30, N = 4 SE +/- 3.06, N = 4 SE +/- 0.81, N = 4 1118.38 1128.38 1140.18 1135.42 1144.14 1153.01 1156.25
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 40 80 120 160 200 SE +/- 0.49, N = 4 SE +/- 0.92, N = 4 SE +/- 0.36, N = 4 SE +/- 0.25, N = 4 SE +/- 0.71, N = 4 SE +/- 3.26, N = 4 SE +/- 1.13, N = 4 167.62 163.31 157.15 162.05 151.85 162.09 155.32
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 80 160 240 320 400 SE +/- 4.81, N = 7 SE +/- 3.20, N = 4 SE +/- 0.78, N = 4 SE +/- 0.46, N = 4 SE +/- 0.64, N = 4 SE +/- 1.60, N = 4 SE +/- 1.71, N = 4 371.94 194.68 205.71 348.62 344.15 264.50 265.49
Fhourstones Complex Connect-4 Solving OpenBenchmarking.org Kpos / sec, More Is Better Fhourstones 3.1 Complex Connect-4 Solving GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 2K 4K 6K 8K 10K SE +/- 6.59, N = 3 SE +/- 9.79, N = 3 SE +/- 15.62, N = 3 SE +/- 15.34, N = 3 SE +/- 10.81, N = 3 SE +/- 27.38, N = 3 SE +/- 7.60, N = 3 8823.30 8908.70 8927.97 8887.43 8588.90 8621.07 8465.13 1. (CC) gcc options: -O3
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 800 1600 2400 3200 4000 SE +/- 11.26, N = 3 SE +/- 7.58, N = 3 SE +/- 17.30, N = 3 SE +/- 5.14, N = 3 SE +/- 7.24, N = 3 2954.08 2648.87 3236.52 3756.73 3757.28 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.23, N = 3 SE +/- 0.22, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 72.32 74.68 71.59 71.60 73.58 71.82 71.61 1. (CXX) g++ options: -lfftw -lmpich
NAS Parallel Benchmarks Test / Class: UA.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: UA.A GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 20.74 20.64 20.44 21.02 21.08 1. (F9X) gfortran options: -fopenmp
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: LU.A GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 1200 2400 3600 4800 6000 SE +/- 3.88, N = 3 SE +/- 4.18, N = 3 SE +/- 12.16, N = 3 SE +/- 16.61, N = 3 SE +/- 8.86, N = 3 4268.49 4966.54 5312.37 5536.77 5522.13 1. (F9X) gfortran options: -fopenmp
Phoronix Test Suite v10.8.4