GCC 4.8 Snapshot Intel Ivy Bridge Benchmarking of a GCC 4.8 snapshot from 2012-06-17 on an Intel Core i7 3770K Ivy Bridge system at Phoronix.com. Benchmarking for a future article. GCC 4.8 snapshot was compared to the latest of the GCC 4.6 and GCC 4.7 compilers when being built with the same compiler options. CFLAGS/CXXFLAGS of -O3 and -march=native were set each time prior to test profile installation.
HTML result view exported from: https://openbenchmarking.org/result/1206225-SU-GCC48SNAP70&grs&sro .
GCC 4.8 Snapshot Intel Ivy Bridge Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 4.8.0 20120617 GCC 4.7.1 GCC 4.6.3 Intel Core i7-3770K @ 3.50GHz (8 Cores) ECS Z77H2-A2X v1.0 Intel Ivy Bridge DRAM 16384MB 240GB OCZ VERTEX3 NVIDIA GeForce GTX 460 768MB (675/1800MHz) Realtek ALC892 Realtek RTL8111/8168B + Intel Centrino Advanced-N 6205 Ubuntu 12.04 3.2.0-25-generic (x86_64) Unity 5.12.0 X Server 1.11.3 NVIDIA 295.49 4.2.0 NVIDIA 295.49 GCC 4.8.0 20120617 ext4 1920x1080 GCC 4.7.1 GCC 4.6.3 OpenBenchmarking.org Compiler Details - --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
GCC 4.8 Snapshot Intel Ivy Bridge scimark2: Monte Carlo c-ray: Total Time mrbayes: Primate Phylogeny Analysis encode-flac: WAV To FLAC npb: EP.B scimark2: Fast Fourier Transform npb: SP.A compress-pbzip2: 256MB File Compression npb: LU.A himeno: Poisson Pressure Solver ffte: N=64, 1D Complex FFT Routine ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping scimark2: Sparse Matrix Multiply lammps: Rhodopsin Protein scimark2: Composite apache: Static Web Page Serving npb: UA.A crafty: Elapsed Time npb: CG.B compress-7zip: Compress Speed Test john-the-ripper: Blowfish npb: MG.B graphics-magick: HWB Color Space vpxenc: vpxenc x264: H.264 Video Encoding graphics-magick: Resizing scimark2: Dense LU Matrix Factorization hmmer: Pfam Database Search smallpt: Global Illumination Renderer; 100 Samples graphics-magick: Local Adaptive Thresholding graphics-magick: Sharpen graphics-magick: Blur scimark2: Jacobi Successive Over-Relaxation mafft: Multiple Sequence Alignment GCC 4.8.0 20120617 GCC 4.7.1 GCC 4.6.3 422.32 32.94 12.49 4.76 274.39 348.89 6357.88 7.85 13982.63 1681.60 5839.11 130.25 2203.64 36.61 1312.37 32548.59 49.65 72.76 3006.25 20885 5999 6333.49 205 27.11 149.63 174 2407.84 10.03 25 83 96 144 1179.18 5.72 423.15 32.59 12.45 4.79 271.85 347.48 6366.67 8.07 14002.26 1679.90 5853.23 127.38 2256.75 36.43 1322.34 33350.84 49.25 71.44 2967.90 21133 6066 6330.78 205 27.06 149.40 174 2405.12 10.02 25 83 96 144 1179.18 5.38 557.07 42.04 14.44 5.22 253.80 323.45 5981.03 7.72 13452.37 1618.93 5642.53 131.24 2266.12 37.34 1345.11 33266.72 48.64 72.35 2951.94 20826 5989 6269.66 203 27.30 150.41 173 2399.71 10.03 25 83 96 144 1179.18 6.05 OpenBenchmarking.org
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 120 240 360 480 600 SE +/- 0.72, N = 4 SE +/- 0.42, N = 4 SE +/- 0.41, N = 4 557.07 423.15 422.32
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 10 20 30 40 50 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 42.04 32.59 32.94 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.1.2 Primate Phylogeny Analysis GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 14.44 12.45 12.49
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 1.1745 2.349 3.5235 4.698 5.8725 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 5.22 4.79 4.76 1. (CXX) g++ options: -O3 -march=native -lm
NAS Parallel Benchmarks Test / Class: EP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: EP.B GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 60 120 180 240 300 SE +/- 0.27, N = 3 SE +/- 1.62, N = 3 SE +/- 1.84, N = 3 253.80 271.85 274.39 1. (F9X) gfortran options: -fopenmp
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 80 160 240 320 400 SE +/- 5.28, N = 4 SE +/- 0.88, N = 4 SE +/- 0.68, N = 4 323.45 347.48 348.89
NAS Parallel Benchmarks Test / Class: SP.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: SP.A GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 1400 2800 4200 5600 7000 SE +/- 36.88, N = 3 SE +/- 12.79, N = 3 SE +/- 9.52, N = 3 5981.03 6366.67 6357.88 1. (F9X) gfortran options: -fopenmp
Parallel BZIP2 Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.0.5 256MB File Compression GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 SE +/- 0.15, N = 6 7.72 8.07 7.85 1. (CXX) g++ options: -O2 -pthread -lpthread -lbz2
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: LU.A GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 3K 6K 9K 12K 15K SE +/- 58.72, N = 3 SE +/- 88.95, N = 3 SE +/- 47.38, N = 3 13452.37 14002.26 13982.63 1. (F9X) gfortran options: -fopenmp
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 400 800 1200 1600 2000 SE +/- 0.95, N = 3 SE +/- 2.27, N = 3 SE +/- 1.55, N = 3 1618.93 1679.90 1681.60 1. (CC) gcc options: -O3 -march=native
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 1300 2600 3900 5200 6500 SE +/- 5.60, N = 3 SE +/- 6.61, N = 3 SE +/- 2.77, N = 3 5642.53 5853.23 5839.11 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2w Phong Rendering With Soft-Shadow Mapping GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 30 60 90 120 150 SE +/- 0.74, N = 3 SE +/- 0.75, N = 3 SE +/- 0.63, N = 3 131.24 127.38 130.25 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 500 1000 1500 2000 2500 SE +/- 1.81, N = 4 SE +/- 0.00, N = 4 SE +/- 1.49, N = 4 2266.12 2256.75 2203.64
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 37.34 36.43 36.61 1. (CXX) g++ options: -lfftw -lmpich
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 300 600 900 1200 1500 SE +/- 1.43, N = 4 SE +/- 0.66, N = 4 SE +/- 0.99, N = 4 1345.11 1322.34 1312.37
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 7K 14K 21K 28K 35K SE +/- 82.48, N = 3 SE +/- 119.87, N = 3 SE +/- 340.91, N = 3 33266.72 33350.84 32548.59 1. (CC) gcc options: -pthread -O3 -march=native -lm -lrt -lcrypt -lpthread -ldl
NAS Parallel Benchmarks Test / Class: UA.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: UA.A GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 11 22 33 44 55 SE +/- 0.17, N = 3 SE +/- 0.14, N = 3 SE +/- 0.30, N = 3 48.64 49.25 49.65 1. (F9X) gfortran options: -fopenmp
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 16 32 48 64 80 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 72.35 71.44 72.76 1. (CC) gcc options: -lstdc++ -lm
NAS Parallel Benchmarks Test / Class: CG.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: CG.B GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 600 1200 1800 2400 3000 SE +/- 24.15, N = 3 SE +/- 46.27, N = 5 SE +/- 8.35, N = 3 2951.94 2967.90 3006.25 1. (F9X) gfortran options: -fopenmp
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 5K 10K 15K 20K 25K SE +/- 251.32, N = 3 SE +/- 152.12, N = 3 SE +/- 31.55, N = 3 20826 21133 20885 1. (CXX) g++ options: -pipe -lpthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9 Test: Blowfish GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 1300 2600 3900 5200 6500 SE +/- 14.42, N = 3 SE +/- 5.55, N = 3 SE +/- 9.67, N = 3 5989 6066 5999 1. (CC) gcc options: -fopenmp -lcrypt
NAS Parallel Benchmarks Test / Class: MG.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3 Test / Class: MG.B GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 1400 2800 4200 5600 7000 SE +/- 10.79, N = 3 SE +/- 11.05, N = 3 SE +/- 6.73, N = 3 6269.66 6330.78 6333.49 1. (F9X) gfortran options: -fopenmp
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: HWB Color Space GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 203 205 205 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lrt -lpthread
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 6 12 18 24 30 SE +/- 0.18, N = 3 SE +/- 0.30, N = 3 SE +/- 0.24, N = 3 27.30 27.06 27.11 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2011-12-06 H.264 Video Encoding GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 30 60 90 120 150 SE +/- 0.96, N = 3 SE +/- 0.66, N = 3 SE +/- 1.36, N = 3 150.41 149.40 149.63
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Resizing GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 173 174 174 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lrt -lpthread
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 500 1000 1500 2000 2500 SE +/- 2.69, N = 4 SE +/- 2.72, N = 4 SE +/- 3.14, N = 4 2399.71 2405.12 2407.84
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.03 10.02 10.03 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 6 12 18 24 30 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 25 25 25 1. (CXX) g++ options: -fopenmp -O3 -march=native
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Local Adaptive Thresholding GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 83 83 83 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lrt -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Sharpen GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 96 96 96 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lrt -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.12 Operation: Blur GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 144 144 144 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lrt -lpthread
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 300 600 900 1200 1500 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 1179.18 1179.18 1179.18
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment GCC 4.6.3 GCC 4.7.1 GCC 4.8.0 20120617 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.19, N = 6 SE +/- 0.14, N = 6 6.05 5.38 5.72 1. (CC) gcc options: -O3 -lm -lpthread
Phoronix Test Suite v10.8.5