GCC 4.x Benchmarking Intel Core i7 Benchmarking of GCC 4.2 through GCC 4.8 when building the compiler the same and setting CFLAGS/CXXFLAGS of -O3 and -march=native prior to test installation and execution. Benchmarking for a future article on Phoronix.com by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1207067-PTS-GCC4BENC69&grr&sor .
GCC 4.x Benchmarking Intel Core i7 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 Intel Core i7 720Q @ 1.60GHz (8 Cores) LENOVO 4318CTO Intel Core DMI 4096MB 160GB INTEL SSDSA2M160 NVIDIA Quadro FX 880M Conexant CX20585 Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300 Ubuntu 12.10 3.5.0-2-generic (x86_64) Unity 5.12.0 X Server 1.11.3 NVIDIA 302.17 3.3.0 GCC 4.2.4 ext4 1600x900 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 OpenBenchmarking.org Compiler Details - --enable-checking=release --enable-languages=c,c++,fortran Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
GCC 4.x Benchmarking Intel Core i7 openssl: RSA 4096-bit Performance tachyon: Total Time povray: Total Time encode-mp3: WAV To MP3 encode-flac: WAV To FLAC crafty: Elapsed Time smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time compress-7zip: Compress Speed Test graphics-magick: Local Adaptive Thresholding graphics-magick: Resizing graphics-magick: Sharpen x264: H.264 Video Encoding vpxenc: vpxenc ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping john-the-ripper: Blowfish scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo fhourstones: Complex Connect-4 Solving ffte: N=64, 1D Complex FFT Routine lammps: Rhodopsin Protein npb: UA.A npb: LU.A GCC 4.2.4 GCC 4.3.6 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 41.45 37.95 1106 23.34 11.43 109.85 138.48 8968 13 40 15 56.91 9.60 788.14 1219.06 1118.38 167.62 371.94 8823.30 72.32 41.75 38.99 1315 23.38 11.15 116.15 118.43 8753 13 42 14 56.49 9.83 16.16 786.63 1217.67 1128.38 163.31 194.68 8908.70 74.68 41.88 38.41 984 23.04 10.62 109.12 70 116.43 8823 37 78 39 56.85 9.38 53.31 2150 789.67 1219.07 1140.18 157.15 205.71 8927.97 2954.08 71.59 20.74 4268.49 41.83 36.44 979 22.98 10.58 108.61 69 105.73 8890 37 76 38 57.15 9.50 44.78 2180 786.14 1223.26 1135.42 162.05 348.62 8887.43 2648.87 71.60 20.64 4966.54 41.93 36.09 975 23.83 10.07 110.53 69 106.69 8756 40 89 41 58.34 10.02 46.57 2216 788.66 1220.46 1144.14 151.85 344.15 8588.90 3236.52 73.58 20.44 5312.37 41.88 37.28 947 23.45 9.56 108.56 69 77.71 8857 41 90 41 57.90 9.90 51.30 2211 784.62 1217.67 1153.01 162.09 264.50 8621.07 3756.73 71.82 21.02 5536.77 42.13 37.16 949 22.41 9.33 108.42 69 77.13 8840 41 88 41 58.46 9.87 59.30 2213 788.66 1227.47 1156.25 155.32 265.49 8465.13 3757.28 71.61 21.08 5522.13 OpenBenchmarking.org
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance GCC 4.8.0 20120701 GCC 4.6.3 GCC 4.7.1 GCC 4.4.7 GCC 4.5.4 GCC 4.3.6 GCC 4.2.4 10 20 30 40 50 SE +/- 0.09, N = 4 SE +/- 0.05, N = 4 SE +/- 0.06, N = 4 SE +/- 0.11, N = 4 SE +/- 0.03, N = 4 SE +/- 0.16, N = 4 SE +/- 0.03, N = 4 42.13 41.93 41.88 41.88 41.83 41.75 41.45 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.98.9 Total Time GCC 4.6.3 GCC 4.5.4 GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.2.4 GCC 4.4.7 GCC 4.3.6 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 36.09 36.44 37.16 37.28 37.95 38.41 38.99 1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
POV-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.6.1 Total Time GCC 4.7.1 GCC 4.8.0 20120701 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.2.4 GCC 4.3.6 300 600 900 1200 1500 947 949 975 979 984 1106 1315 -malign-double -malign-double -malign-double -malign-double -malign-double -malign-double 1. (CXX) g++ options: -pipe -O3 -msse -mfpmath=sse -msse2 -march=k8 -mtune=k8 -march=native -lz -lSM -lICE -lX11 -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.8.0 20120701 GCC 4.5.4 GCC 4.4.7 GCC 4.2.4 GCC 4.3.6 GCC 4.7.1 GCC 4.6.3 6 12 18 24 30 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.03, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.03, N = 5 SE +/- 0.02, N = 5 22.41 22.98 23.04 23.34 23.38 23.45 23.83
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 9.33 9.56 10.07 10.58 10.62 11.15 11.43 1. (CXX) g++ options: -O3 -march=native -logg -lm
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.5.4 GCC 4.4.7 GCC 4.2.4 GCC 4.6.3 GCC 4.3.6 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.28, N = 3 SE +/- 0.10, N = 3 SE +/- 0.23, N = 3 SE +/- 0.05, N = 3 SE +/- 0.23, N = 3 SE +/- 0.02, N = 3 108.42 108.56 108.61 109.12 109.85 110.53 116.15 1. (CC) gcc options: -lstdc++ -lm
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.5.4 GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 GCC 4.4.7 16 32 48 64 80 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 69 69 69 69 70 1. (CXX) g++ options: -fopenmp -O3 -march=native
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.5.4 GCC 4.6.3 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4 30 60 90 120 150 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 77.13 77.71 105.73 106.69 116.43 118.43 138.48 1. (CC) gcc options: -lm -lpthread -O3 -march=native
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.2.4 GCC 4.5.4 GCC 4.7.1 GCC 4.8.0 20120701 GCC 4.4.7 GCC 4.6.3 GCC 4.3.6 2K 4K 6K 8K 10K SE +/- 4.00, N = 3 SE +/- 65.68, N = 3 SE +/- 37.22, N = 3 SE +/- 53.62, N = 3 SE +/- 39.28, N = 3 SE +/- 39.27, N = 3 SE +/- 92.47, N = 3 8968 8890 8857 8840 8823 8756 8753 1. (CXX) g++ options: -pipe -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Local Adaptive Thresholding GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4 9 18 27 36 45 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 41 41 40 37 37 13 13 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Resizing GCC 4.7.1 GCC 4.6.3 GCC 4.8.0 20120701 GCC 4.4.7 GCC 4.5.4 GCC 4.3.6 GCC 4.2.4 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 90 89 88 78 76 42 40 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Sharpen GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.6.3 GCC 4.4.7 GCC 4.5.4 GCC 4.2.4 GCC 4.3.6 9 18 27 36 45 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 41 41 41 39 38 15 14 -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt -fopenmp -lrt 1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2011-12-06 H.264 Video Encoding GCC 4.8.0 20120701 GCC 4.6.3 GCC 4.7.1 GCC 4.5.4 GCC 4.2.4 GCC 4.4.7 GCC 4.3.6 13 26 39 52 65 SE +/- 0.09, N = 3 SE +/- 0.20, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.27, N = 3 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 58.46 58.34 57.90 57.15 56.91 56.85 56.49
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120701 GCC 4.3.6 GCC 4.2.4 GCC 4.5.4 GCC 4.4.7 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 10.02 9.90 9.87 9.83 9.60 9.50 9.38 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2w Phong Rendering With Soft-Shadow Mapping GCC 4.8.0 20120701 GCC 4.4.7 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.3.6 13 26 39 52 65 SE +/- 0.08, N = 3 SE +/- 0.16, N = 3 SE +/- 2.37, N = 6 SE +/- 0.10, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 59.30 53.31 51.30 46.57 44.78 16.16 -flto -flto -flto 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9 Test: Blowfish GCC 4.6.3 GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.5.4 GCC 4.4.7 500 1000 1500 2000 2500 SE +/- 1.33, N = 3 SE +/- 2.60, N = 3 SE +/- 1.67, N = 3 SE +/- 0.33, N = 3 SE +/- 2.60, N = 3 2216 2213 2211 2180 2150 1. (CC) gcc options: -fopenmp -lcrypt
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.4.7 GCC 4.8.0 20120701 GCC 4.6.3 GCC 4.2.4 GCC 4.3.6 GCC 4.5.4 GCC 4.7.1 200 400 600 800 1000 SE +/- 1.28, N = 4 SE +/- 1.93, N = 4 SE +/- 1.52, N = 4 SE +/- 0.83, N = 4 SE +/- 1.27, N = 4 SE +/- 2.17, N = 4 SE +/- 1.50, N = 4 789.67 788.66 788.66 788.14 786.63 786.14 784.62
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.8.0 20120701 GCC 4.5.4 GCC 4.6.3 GCC 4.4.7 GCC 4.2.4 GCC 4.7.1 GCC 4.3.6 300 600 900 1200 1500 SE +/- 1.41, N = 4 SE +/- 2.29, N = 4 SE +/- 2.78, N = 4 SE +/- 2.66, N = 4 SE +/- 1.39, N = 4 SE +/- 2.27, N = 4 SE +/- 2.27, N = 4 1227.47 1223.26 1220.46 1219.07 1219.06 1217.67 1217.67
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.6.3 GCC 4.4.7 GCC 4.5.4 GCC 4.3.6 GCC 4.2.4 200 400 600 800 1000 SE +/- 0.81, N = 4 SE +/- 3.06, N = 4 SE +/- 1.30, N = 4 SE +/- 2.99, N = 4 SE +/- 0.79, N = 4 SE +/- 1.27, N = 4 SE +/- 2.29, N = 4 1156.25 1153.01 1144.14 1140.18 1135.42 1128.38 1118.38
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.2.4 GCC 4.3.6 GCC 4.7.1 GCC 4.5.4 GCC 4.4.7 GCC 4.8.0 20120701 GCC 4.6.3 40 80 120 160 200 SE +/- 0.49, N = 4 SE +/- 0.92, N = 4 SE +/- 3.26, N = 4 SE +/- 0.25, N = 4 SE +/- 0.36, N = 4 SE +/- 1.13, N = 4 SE +/- 0.71, N = 4 167.62 163.31 162.09 162.05 157.15 155.32 151.85
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.2.4 GCC 4.5.4 GCC 4.6.3 GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.4.7 GCC 4.3.6 80 160 240 320 400 SE +/- 4.81, N = 7 SE +/- 0.46, N = 4 SE +/- 0.64, N = 4 SE +/- 1.71, N = 4 SE +/- 1.60, N = 4 SE +/- 0.78, N = 4 SE +/- 3.20, N = 4 371.94 348.62 344.15 265.49 264.50 205.71 194.68
Fhourstones Complex Connect-4 Solving OpenBenchmarking.org Kpos / sec, More Is Better Fhourstones 3.1 Complex Connect-4 Solving GCC 4.4.7 GCC 4.3.6 GCC 4.5.4 GCC 4.2.4 GCC 4.7.1 GCC 4.6.3 GCC 4.8.0 20120701 2K 4K 6K 8K 10K SE +/- 15.62, N = 3 SE +/- 9.79, N = 3 SE +/- 15.34, N = 3 SE +/- 6.59, N = 3 SE +/- 27.38, N = 3 SE +/- 10.81, N = 3 SE +/- 7.60, N = 3 8927.97 8908.70 8887.43 8823.30 8621.07 8588.90 8465.13 1. (CC) gcc options: -O3
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.6.3 GCC 4.4.7 GCC 4.5.4 800 1600 2400 3200 4000 SE +/- 7.24, N = 3 SE +/- 5.14, N = 3 SE +/- 17.30, N = 3 SE +/- 11.26, N = 3 SE +/- 7.58, N = 3 3757.28 3756.73 3236.52 2954.08 2648.87 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein GCC 4.4.7 GCC 4.5.4 GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.2.4 GCC 4.6.3 GCC 4.3.6 20 40 60 80 100 SE +/- 0.22, N = 3 SE +/- 0.06, N = 3 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 0.23, N = 3 71.59 71.60 71.61 71.82 72.32 73.58 74.68 1. (CXX) g++ options: -lfftw -lmpich
NAS Parallel Benchmarks Test / Class: UA.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: UA.A GCC 4.8.0 20120701 GCC 4.7.1 GCC 4.4.7 GCC 4.5.4 GCC 4.6.3 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 21.08 21.02 20.74 20.64 20.44 1. (F9X) gfortran options: -fopenmp
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: LU.A GCC 4.7.1 GCC 4.8.0 20120701 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 1200 2400 3600 4800 6000 SE +/- 16.61, N = 3 SE +/- 8.86, N = 3 SE +/- 12.16, N = 3 SE +/- 4.18, N = 3 SE +/- 3.88, N = 3 5536.77 5522.13 5312.37 4966.54 4268.49 1. (F9X) gfortran options: -fopenmp
Phoronix Test Suite v10.8.4