GCC 4.x Benchmarking Intel Core i7
Benchmarking of GCC 4.2 through GCC 4.8, with each compiler built the same way and CFLAGS/CXXFLAGS set to -O3 -march=native prior to test installation and execution. Benchmarking for a future article on Phoronix.com by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1207067-PTS-GCC4BENC69 .
System configuration (the same for every run except the compiler):
  Processor: Intel Core i7 720Q @ 1.60GHz (8 Cores)
  Motherboard: LENOVO 4318CTO
  Chipset: Intel Core DMI
  Memory: 4096MB
  Disk: 160GB INTEL SSDSA2M160
  Graphics: NVIDIA Quadro FX 880M
  Audio: Conexant CX20585
  Network: Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300
  OS: Ubuntu 12.10
  Kernel: 3.5.0-2-generic (x86_64)
  Desktop: Unity 5.12.0
  Display Server: X Server 1.11.3
  Display Driver: NVIDIA 302.17
  OpenGL: 3.3.0
  Compiler: GCC 4.2.4, GCC 4.3.6, GCC 4.4.7, GCC 4.5.4, GCC 4.6.3, GCC 4.7.1, GCC 4.8.0 20120701 (one per configuration)
  File-System: ext4
  Screen Resolution: 1600x900
Compiler Details: --enable-checking=release --enable-languages=c,c++,fortran
Processor Details: Scaling Governor: ondemand
System Details: Compiz was running on this system.
Results overview (columns are GCC versions; 4.8.0 is the 20120701 snapshot; - means no result was reported):
    4.2.4    4.3.6    4.4.7    4.5.4    4.6.3    4.7.1    4.8.0  Test
        -        -  4268.49  4966.54  5312.37  5536.77  5522.13  npb: LU.A
        -        -    20.74    20.64    20.44    21.02    21.08  npb: UA.A
    72.32    74.68    71.59    71.60    73.58    71.82    71.61  lammps: Rhodopsin Protein
        -        -  2954.08  2648.87  3236.52  3756.73  3757.28  ffte: N=64, 1D Complex FFT Routine
  8823.30  8908.70  8927.97  8887.43  8588.90  8621.07  8465.13  fhourstones: Complex Connect-4 Solving
   371.94   194.68   205.71   348.62   344.15   264.50   265.49  scimark2: Monte Carlo
   167.62   163.31   157.15   162.05   151.85   162.09   155.32  scimark2: Fast Fourier Transform
  1118.38  1128.38  1140.18  1135.42  1144.14  1153.01  1156.25  scimark2: Sparse Matrix Multiply
  1219.06  1217.67  1219.07  1223.26  1220.46  1217.67  1227.47  scimark2: Dense LU Matrix Factorization
   788.14   786.63   789.67   786.14   788.66   784.62   788.66  scimark2: Jacobi Successive Over-Relaxation
        -        -     2150     2180     2216     2211     2213  john-the-ripper: Blowfish
        -    16.16    53.31    44.78    46.57    51.30    59.30  ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
     9.60     9.83     9.38     9.50    10.02     9.90     9.87  vpxenc: vpxenc
    56.91    56.49    56.85    57.15    58.34    57.90    58.46  x264: H.264 Video Encoding
       15       14       39       38       41       41       41  graphics-magick: Sharpen
       40       42       78       76       89       90       88  graphics-magick: Resizing
       13       13       37       37       40       41       41  graphics-magick: Local Adaptive Thresholding
     8968     8753     8823     8890     8756     8857     8840  compress-7zip: Compress Speed Test
   138.48   118.43   116.43   105.73   106.69    77.71    77.13  c-ray: Total Time
        -        -       70       69       69       69       69  smallpt: Global Illumination Renderer; 100 Samples
   109.85   116.15   109.12   108.61   110.53   108.56   108.42  crafty: Elapsed Time
    11.43    11.15    10.62    10.58    10.07     9.56     9.33  encode-flac: WAV To FLAC
    23.34    23.38    23.04    22.98    23.83    23.45    22.41  encode-mp3: WAV To MP3
     1106     1315      984      979      975      947      949  povray: Total Time
    37.95    38.99    38.41    36.44    36.09    37.28    37.16  tachyon: Total Time
    41.45    41.75    41.88    41.83    41.93    41.88    42.13  openssl: RSA 4096-bit Performance
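The results mix "More Is Better" metrics (Mop/s, MFLOPS, FPS) with "Fewer Is Better" ones (seconds), so a direct ratio reads differently from test to test. The following is a minimal Python sketch of one way to normalize them into a speedup factor. The function name `speedup` and its parameters are hypothetical and not part of the Phoronix Test Suite; the inputs are the C-Ray and FFTE figures from this export.

```python
def speedup(baseline, candidate, more_is_better):
    """Return how many times faster `candidate` is than `baseline`.

    A value above 1.0 means the candidate compiler produced faster code.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    # For throughput metrics a larger number is faster; for timings a smaller one is.
    return candidate / baseline if more_is_better else baseline / candidate


# C-Ray total time in seconds (fewer is better): GCC 4.2.4 vs GCC 4.8.0 20120701
c_ray = speedup(138.48, 77.13, more_is_better=False)

# FFTE in MFLOPS (more is better): GCC 4.4.7 vs GCC 4.8.0 20120701
ffte = speedup(2954.08, 3757.28, more_is_better=True)

print(f"C-Ray: {c_ray:.2f}x, FFTE: {ffte:.2f}x")
```

Expressing both as "times faster" keeps the direction consistent, at the cost of hiding the original unit.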
NAS Parallel Benchmarks 3.3, Test / Class: LU.A
Total Mop/s, More Is Better
  GCC 4.4.7: 4268.49 (SE +/- 3.88, N = 3)
  GCC 4.5.4: 4966.54 (SE +/- 4.18, N = 3)
  GCC 4.6.3: 5312.37 (SE +/- 12.16, N = 3)
  GCC 4.7.1: 5536.77 (SE +/- 16.61, N = 3)
  GCC 4.8.0 20120701: 5522.13 (SE +/- 8.86, N = 3)
  1. (F9X) gfortran options: -fopenmp
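Each result is reported with a standard error (SE) over N runs, which gives a rough way to judge whether two compilers really differ on a test. The sketch below is an illustrative heuristic, not something the Phoronix Test Suite does: it treats two means as indistinguishable when they differ by no more than `k` combined standard errors. The function name `within_noise` and the default `k = 2.0` are assumptions, and with N = 3 the check is only indicative.

```python
import math


def within_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """Return True if the two means differ by at most k combined standard errors."""
    # Standard errors of independent means combine in quadrature.
    combined_se = math.hypot(se_a, se_b)
    return abs(mean_a - mean_b) <= k * combined_se


# NPB LU.A, Total Mop/s: GCC 4.7.1 vs GCC 4.8.0 20120701
close_pair = within_noise(5536.77, 16.61, 5522.13, 8.86)

# NPB LU.A, Total Mop/s: GCC 4.4.7 vs GCC 4.5.4
distinct_pair = within_noise(4268.49, 3.88, 4966.54, 4.18)

print(close_pair, distinct_pair)
```

Under this heuristic the GCC 4.7.1 and 4.8.0 results on LU.A are not separable, while the gap between GCC 4.4.7 and 4.5.4 is far larger than the measurement noise.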
NAS Parallel Benchmarks 3.3, Test / Class: UA.A
Total Mop/s, More Is Better
  GCC 4.4.7: 20.74 (SE +/- 0.03, N = 3)
  GCC 4.5.4: 20.64 (SE +/- 0.01, N = 3)
  GCC 4.6.3: 20.44 (SE +/- 0.05, N = 3)
  GCC 4.7.1: 21.02 (SE +/- 0.07, N = 3)
  GCC 4.8.0 20120701: 21.08 (SE +/- 0.06, N = 3)
  1. (F9X) gfortran options: -fopenmp
LAMMPS Molecular Dynamics Simulator 1.0, Test: Rhodopsin Protein
Loop Time, Fewer Is Better
  GCC 4.2.4: 72.32 (SE +/- 0.14, N = 3)
  GCC 4.3.6: 74.68 (SE +/- 0.23, N = 3)
  GCC 4.4.7: 71.59 (SE +/- 0.22, N = 3)
  GCC 4.5.4: 71.60 (SE +/- 0.06, N = 3)
  GCC 4.6.3: 73.58 (SE +/- 0.11, N = 3)
  GCC 4.7.1: 71.82 (SE +/- 0.09, N = 3)
  GCC 4.8.0 20120701: 71.61 (SE +/- 0.18, N = 3)
  1. (CXX) g++ options: -lfftw -lmpich
FFTE 5.0, Test: N=64, 1D Complex FFT Routine
MFLOPS, More Is Better
  GCC 4.4.7: 2954.08 (SE +/- 11.26, N = 3)
  GCC 4.5.4: 2648.87 (SE +/- 7.58, N = 3)
  GCC 4.6.3: 3236.52 (SE +/- 17.30, N = 3)
  GCC 4.7.1: 3756.73 (SE +/- 5.14, N = 3)
  GCC 4.8.0 20120701: 3757.28 (SE +/- 7.24, N = 3)
  1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
Fhourstones 3.1, Complex Connect-4 Solving
Kpos / sec, More Is Better
  GCC 4.2.4: 8823.30 (SE +/- 6.59, N = 3)
  GCC 4.3.6: 8908.70 (SE +/- 9.79, N = 3)
  GCC 4.4.7: 8927.97 (SE +/- 15.62, N = 3)
  GCC 4.5.4: 8887.43 (SE +/- 15.34, N = 3)
  GCC 4.6.3: 8588.90 (SE +/- 10.81, N = 3)
  GCC 4.7.1: 8621.07 (SE +/- 27.38, N = 3)
  GCC 4.8.0 20120701: 8465.13 (SE +/- 7.60, N = 3)
  1. (CC) gcc options: -O3
SciMark 2.0, Computational Test: Monte Carlo
Mflops, More Is Better
  GCC 4.2.4: 371.94 (SE +/- 4.81, N = 7)
  GCC 4.3.6: 194.68 (SE +/- 3.20, N = 4)
  GCC 4.4.7: 205.71 (SE +/- 0.78, N = 4)
  GCC 4.5.4: 348.62 (SE +/- 0.46, N = 4)
  GCC 4.6.3: 344.15 (SE +/- 0.64, N = 4)
  GCC 4.7.1: 264.50 (SE +/- 1.60, N = 4)
  GCC 4.8.0 20120701: 265.49 (SE +/- 1.71, N = 4)
SciMark 2.0, Computational Test: Fast Fourier Transform
Mflops, More Is Better
  GCC 4.2.4: 167.62 (SE +/- 0.49, N = 4)
  GCC 4.3.6: 163.31 (SE +/- 0.92, N = 4)
  GCC 4.4.7: 157.15 (SE +/- 0.36, N = 4)
  GCC 4.5.4: 162.05 (SE +/- 0.25, N = 4)
  GCC 4.6.3: 151.85 (SE +/- 0.71, N = 4)
  GCC 4.7.1: 162.09 (SE +/- 3.26, N = 4)
  GCC 4.8.0 20120701: 155.32 (SE +/- 1.13, N = 4)
SciMark 2.0, Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
  GCC 4.2.4: 1118.38 (SE +/- 2.29, N = 4)
  GCC 4.3.6: 1128.38 (SE +/- 1.27, N = 4)
  GCC 4.4.7: 1140.18 (SE +/- 2.99, N = 4)
  GCC 4.5.4: 1135.42 (SE +/- 0.79, N = 4)
  GCC 4.6.3: 1144.14 (SE +/- 1.30, N = 4)
  GCC 4.7.1: 1153.01 (SE +/- 3.06, N = 4)
  GCC 4.8.0 20120701: 1156.25 (SE +/- 0.81, N = 4)
SciMark 2.0, Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
  GCC 4.2.4: 1219.06 (SE +/- 1.39, N = 4)
  GCC 4.3.6: 1217.67 (SE +/- 2.27, N = 4)
  GCC 4.4.7: 1219.07 (SE +/- 2.66, N = 4)
  GCC 4.5.4: 1223.26 (SE +/- 2.29, N = 4)
  GCC 4.6.3: 1220.46 (SE +/- 2.78, N = 4)
  GCC 4.7.1: 1217.67 (SE +/- 2.27, N = 4)
  GCC 4.8.0 20120701: 1227.47 (SE +/- 1.41, N = 4)
SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
  GCC 4.2.4: 788.14 (SE +/- 0.83, N = 4)
  GCC 4.3.6: 786.63 (SE +/- 1.27, N = 4)
  GCC 4.4.7: 789.67 (SE +/- 1.28, N = 4)
  GCC 4.5.4: 786.14 (SE +/- 2.17, N = 4)
  GCC 4.6.3: 788.66 (SE +/- 1.52, N = 4)
  GCC 4.7.1: 784.62 (SE +/- 1.50, N = 4)
  GCC 4.8.0 20120701: 788.66 (SE +/- 1.93, N = 4)
John The Ripper 1.7.9, Test: Blowfish
Real C/S, More Is Better
  GCC 4.4.7: 2150 (SE +/- 2.60, N = 3)
  GCC 4.5.4: 2180 (SE +/- 0.33, N = 3)
  GCC 4.6.3: 2216 (SE +/- 1.33, N = 3)
  GCC 4.7.1: 2211 (SE +/- 1.67, N = 3)
  GCC 4.8.0 20120701: 2213 (SE +/- 2.60, N = 3)
  1. (CC) gcc options: -fopenmp -lcrypt
TTSIOD 3D Renderer 2.2w, Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better
  GCC 4.3.6: 16.16 (SE +/- 0.04, N = 3)
  GCC 4.4.7: 53.31 (SE +/- 0.16, N = 3)
  GCC 4.5.4: 44.78 (SE +/- 0.06, N = 3)
  GCC 4.6.3: 46.57 (SE +/- 0.10, N = 3)
  GCC 4.7.1: 51.30 (SE +/- 2.37, N = 6)
  GCC 4.8.0 20120701: 59.30 (SE +/- 0.08, N = 3)
  Additional per-result option, listed for three of the six results: -flto
  1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
VP8 libvpx Encoding 0.9.7-p1, vpxenc
Frames Per Second, More Is Better
  GCC 4.2.4: 9.60 (SE +/- 0.06, N = 3)
  GCC 4.3.6: 9.83 (SE +/- 0.05, N = 3)
  GCC 4.4.7: 9.38 (SE +/- 0.07, N = 3)
  GCC 4.5.4: 9.50 (SE +/- 0.13, N = 3)
  GCC 4.6.3: 10.02 (SE +/- 0.05, N = 3)
  GCC 4.7.1: 9.90 (SE +/- 0.03, N = 3)
  GCC 4.8.0 20120701: 9.87 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -m64 -lvpx -lm -lpthread
x264 2011-12-06, H.264 Video Encoding
Frames Per Second, More Is Better
  GCC 4.2.4: 56.91 (SE +/- 0.27, N = 3)
  GCC 4.3.6: 56.49 (SE +/- 0.06, N = 3)
  GCC 4.4.7: 56.85 (SE +/- 0.11, N = 3)
  GCC 4.5.4: 57.15 (SE +/- 0.19, N = 3)
  GCC 4.6.3: 58.34 (SE +/- 0.20, N = 3)
  GCC 4.7.1: 57.90 (SE +/- 0.07, N = 3)
  GCC 4.8.0 20120701: 58.46 (SE +/- 0.09, N = 3)
GraphicsMagick 1.3.12, Operation: Sharpen
Iterations Per Minute, More Is Better
  GCC 4.2.4: 15 (SE +/- 0.00, N = 3)
  GCC 4.3.6: 14 (SE +/- 0.00, N = 3)
  GCC 4.4.7: 39 (SE +/- 0.00, N = 3)
  GCC 4.5.4: 38 (SE +/- 0.00, N = 3)
  GCC 4.6.3: 41 (SE +/- 0.00, N = 3)
  GCC 4.7.1: 41 (SE +/- 0.00, N = 3)
  GCC 4.8.0 20120701: 41 (SE +/- 0.00, N = 3)
  Additional per-result options, listed for five of the seven results: -fopenmp -lrt
  1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick 1.3.12, Operation: Resizing
Iterations Per Minute, More Is Better
  GCC 4.2.4: 40 (SE +/- 0.00, N = 3)
  GCC 4.3.6: 42 (SE +/- 0.00, N = 3)
  GCC 4.4.7: 78 (SE +/- 0.00, N = 3)
  GCC 4.5.4: 76 (SE +/- 0.33, N = 3)
  GCC 4.6.3: 89 (SE +/- 0.33, N = 3)
  GCC 4.7.1: 90 (SE +/- 0.00, N = 3)
  GCC 4.8.0 20120701: 88 (SE +/- 0.33, N = 3)
  Additional per-result options, listed for five of the seven results: -fopenmp -lrt
  1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick 1.3.12, Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better
  GCC 4.2.4: 13 (SE +/- 0.00, N = 3)
  GCC 4.3.6: 13 (SE +/- 0.00, N = 3)
  GCC 4.4.7: 37 (SE +/- 0.00, N = 3)
  GCC 4.5.4: 37 (SE +/- 0.00, N = 3)
  GCC 4.6.3: 40 (SE +/- 0.00, N = 3)
  GCC 4.7.1: 41 (SE +/- 0.00, N = 3)
  GCC 4.8.0 20120701: 41 (SE +/- 0.33, N = 3)
  Additional per-result options, listed for five of the seven results: -fopenmp -lrt
  1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
7-Zip Compression 9.20.1, Compress Speed Test
MIPS, More Is Better
  GCC 4.2.4: 8968 (SE +/- 4.00, N = 3)
  GCC 4.3.6: 8753 (SE +/- 92.47, N = 3)
  GCC 4.4.7: 8823 (SE +/- 39.28, N = 3)
  GCC 4.5.4: 8890 (SE +/- 65.68, N = 3)
  GCC 4.6.3: 8756 (SE +/- 39.27, N = 3)
  GCC 4.7.1: 8857 (SE +/- 37.22, N = 3)
  GCC 4.8.0 20120701: 8840 (SE +/- 53.62, N = 3)
  1. (CXX) g++ options: -pipe -lpthread
C-Ray 1.1, Total Time
Seconds, Fewer Is Better
  GCC 4.2.4: 138.48 (SE +/- 0.09, N = 3)
  GCC 4.3.6: 118.43 (SE +/- 0.01, N = 3)
  GCC 4.4.7: 116.43 (SE +/- 0.02, N = 3)
  GCC 4.5.4: 105.73 (SE +/- 0.07, N = 3)
  GCC 4.6.3: 106.69 (SE +/- 0.02, N = 3)
  GCC 4.7.1: 77.71 (SE +/- 0.22, N = 3)
  GCC 4.8.0 20120701: 77.13 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt 1.0, Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
  GCC 4.4.7: 70 (SE +/- 0.00, N = 3)
  GCC 4.5.4: 69 (SE +/- 0.33, N = 3)
  GCC 4.6.3: 69 (SE +/- 0.33, N = 3)
  GCC 4.7.1: 69 (SE +/- 0.33, N = 3)
  GCC 4.8.0 20120701: 69 (SE +/- 0.33, N = 3)
  1. (CXX) g++ options: -fopenmp -O3 -march=native
Crafty 23.4, Elapsed Time
Seconds, Fewer Is Better
  GCC 4.2.4: 109.85 (SE +/- 0.05, N = 3)
  GCC 4.3.6: 116.15 (SE +/- 0.02, N = 3)
  GCC 4.4.7: 109.12 (SE +/- 0.23, N = 3)
  GCC 4.5.4: 108.61 (SE +/- 0.10, N = 3)
  GCC 4.6.3: 110.53 (SE +/- 0.23, N = 3)
  GCC 4.7.1: 108.56 (SE +/- 0.28, N = 3)
  GCC 4.8.0 20120701: 108.42 (SE +/- 0.24, N = 3)
  1. (CC) gcc options: -lstdc++ -lm
FLAC Audio Encoding 1.2.1, WAV To FLAC
Seconds, Fewer Is Better
  GCC 4.2.4: 11.43 (SE +/- 0.01, N = 5)
  GCC 4.3.6: 11.15 (SE +/- 0.01, N = 5)
  GCC 4.4.7: 10.62 (SE +/- 0.01, N = 5)
  GCC 4.5.4: 10.58 (SE +/- 0.01, N = 5)
  GCC 4.6.3: 10.07 (SE +/- 0.01, N = 5)
  GCC 4.7.1: 9.56 (SE +/- 0.02, N = 5)
  GCC 4.8.0 20120701: 9.33 (SE +/- 0.01, N = 5)
  1. (CXX) g++ options: -O3 -march=native -logg -lm
LAME MP3 Encoding 3.99.3, WAV To MP3
Seconds, Fewer Is Better
  GCC 4.2.4: 23.34 (SE +/- 0.02, N = 5)
  GCC 4.3.6: 23.38 (SE +/- 0.01, N = 5)
  GCC 4.4.7: 23.04 (SE +/- 0.03, N = 5)
  GCC 4.5.4: 22.98 (SE +/- 0.01, N = 5)
  GCC 4.6.3: 23.83 (SE +/- 0.02, N = 5)
  GCC 4.7.1: 23.45 (SE +/- 0.03, N = 5)
  GCC 4.8.0 20120701: 22.41 (SE +/- 0.02, N = 5)
POV-Ray 3.6.1, Total Time
Seconds, Fewer Is Better
  GCC 4.2.4: 1106
  GCC 4.3.6: 1315
  GCC 4.4.7: 984
  GCC 4.5.4: 979
  GCC 4.6.3: 975
  GCC 4.7.1: 947
  GCC 4.8.0 20120701: 949
  Additional per-result option, listed for six of the seven results: -malign-double
  1. (CXX) g++ options: -pipe -O3 -msse -mfpmath=sse -msse2 -march=k8 -mtune=k8 -march=native -lz -lSM -lICE -lX11 -lm
Tachyon 0.98.9, Total Time
Seconds, Fewer Is Better
  GCC 4.2.4: 37.95 (SE +/- 0.05, N = 3)
  GCC 4.3.6: 38.99 (SE +/- 0.07, N = 3)
  GCC 4.4.7: 38.41 (SE +/- 0.04, N = 3)
  GCC 4.5.4: 36.44 (SE +/- 0.07, N = 3)
  GCC 4.6.3: 36.09 (SE +/- 0.01, N = 3)
  GCC 4.7.1: 37.28 (SE +/- 0.03, N = 3)
  GCC 4.8.0 20120701: 37.16 (SE +/- 0.00, N = 3)
  1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
OpenSSL 1.0.0e, RSA 4096-bit Performance
Signs Per Second, More Is Better
  GCC 4.2.4: 41.45 (SE +/- 0.03, N = 4)
  GCC 4.3.6: 41.75 (SE +/- 0.16, N = 4)
  GCC 4.4.7: 41.88 (SE +/- 0.11, N = 4)
  GCC 4.5.4: 41.83 (SE +/- 0.03, N = 4)
  GCC 4.6.3: 41.93 (SE +/- 0.05, N = 4)
  GCC 4.7.1: 41.88 (SE +/- 0.06, N = 4)
  GCC 4.8.0 20120701: 42.13 (SE +/- 0.09, N = 4)
  1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Phoronix Test Suite v10.8.4