GCC 4.x Benchmarking - Intel, AMD 64-bit
Benchmarks of GCC 4.2 through GCC 4.8, with each compiler release built the same way and CFLAGS/CXXFLAGS set to -O3 -march=native before test installation and execution. Benchmarking for a future article on Phoronix.com by Michael Larabel. Tested on an Intel Core i7 and an AMD Opteron 2384 running the 64-bit (x86_64 target) build of Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1207077-SU-GCCPERFOR59&grr&sro .
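The flag setup described in the summary can be sketched as follows. This is a minimal illustration, not the procedure actually used for these runs: the helper name `build_benchmark_env` is hypothetical, and the only values taken from the description are the -O3 and -march=native flags for CFLAGS and CXXFLAGS.

```python
import os

# Flags named in the result description.
OPT_FLAGS = "-O3 -march=native"


def build_benchmark_env(base_env=None):
    """Return a copy of an environment with CFLAGS and CXXFLAGS set.

    Hypothetical helper: the export does not say how the flags were
    exported before test installation.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CFLAGS"] = OPT_FLAGS
    env["CXXFLAGS"] = OPT_FLAGS
    return env


if __name__ == "__main__":
    env = build_benchmark_env()
    print("CFLAGS   =", env["CFLAGS"])
    print("CXXFLAGS =", env["CXXFLAGS"])
```

The returned dictionary could be passed as the `env` argument of `subprocess.run` when launching a build or test installation, so the flags apply to that process only.
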
System configuration

Intel Core i7 (result identifiers: GCC 4.2.4, GCC 4.3.6, GCC 4.4.7, GCC 4.5.4, GCC 4.6.3, GCC 4.7.1, GCC 4.8 20120701)
Processor: Intel Core i7 720Q @ 1.60GHz (8 Cores)
Motherboard: LENOVO 4318CTO
Chipset: Intel Core DMI
Memory: 4096MB
Disk: 160GB INTEL SSDSA2M160
Graphics: NVIDIA Quadro FX 880M
Audio: Conexant CX20585
Network: Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300
OS: Ubuntu 12.10
Kernel: 3.5.0-2-generic (x86_64)
Desktop: Unity 5.12.0
Display Server: X Server 1.11.3
Display Driver: NVIDIA 302.17
OpenGL: 3.3.0
Compiler: GCC 4.2.4, GCC 4.3.6, GCC 4.4.7, GCC 4.5.4, GCC 4.6.3, GCC 4.7.1 or GCC 4.8.0 20120701, matching the result identifier
File-System: ext4
Screen Resolution: 1600x900

AMD Opteron (same result identifiers; fields as listed in the export)
Processor: AMD Opteron 2384 @ 2.70GHz (4 Cores)
Motherboard: TYAN S2927/S2927-E
Chipset: NVIDIA MCP55
Disk: 64GB AGILITY-EX
Graphics: AMD Radeon HD 4870 512MB
Audio: ATI R6xx HDMI
Monitor: AL2223W
Display Driver: radeon 6.14.4
OpenGL: 2.1 Mesa 8.0.3 Gallium 0.4
Compiler: GCC 4.2.4, GCC 4.3.6, GCC 4.4.7, GCC 4.5.4, GCC 4.6.3, GCC 4.7.1 or GCC 4.8.0 20120701, matching the result identifier
Screen Resolution: 1680x1050

Compiler Details - --enable-checking=release --enable-languages=c,c++,fortran
Processor Details - Scaling Governor: ondemand
System Details - Compiz was running on this system.

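Result overview, Intel Core i7 (- = no result listed)

| Test | GCC 4.2.4 | GCC 4.3.6 | GCC 4.4.7 | GCC 4.5.4 | GCC 4.6.3 | GCC 4.7.1 | GCC 4.8 20120701 |
| tachyon: Total Time | 37.95 | 38.99 | 38.41 | 36.44 | 36.09 | 37.28 | 37.16 |
| povray: Total Time | 1106 | 1315 | 984 | 979 | 975 | 947 | 949 |
| encode-mp3: WAV To MP3 | 23.34 | 23.38 | 23.04 | 22.98 | 23.83 | 23.45 | 22.41 |
| encode-flac: WAV To FLAC | 11.43 | 11.15 | 10.62 | 10.58 | 10.07 | 9.56 | 9.33 |
| crafty: Elapsed Time | 109.85 | 116.15 | 109.12 | 108.61 | 110.53 | 108.56 | 108.42 |
| c-ray: Total Time | 138.48 | 118.43 | 116.43 | 105.73 | 106.69 | 77.71 | 77.13 |
| graphics-magick: Local Adaptive Thresholding | 13 | 13 | 37 | 37 | 40 | 41 | 41 |
| graphics-magick: Resizing | 40 | 42 | 78 | 76 | 89 | 90 | 88 |
| graphics-magick: Sharpen | 15 | 14 | 39 | 38 | 41 | 41 | 41 |
| x264: H.264 Video Encoding | 56.91 | 56.49 | 56.85 | 57.15 | 58.34 | 57.90 | 58.46 |
| vpxenc: vpxenc | 9.60 | 9.83 | 9.38 | 9.50 | 10.02 | 9.90 | 9.87 |
| ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | - | 16.16 | 53.31 | 44.78 | 46.57 | 51.30 | 59.30 |
| john-the-ripper: Blowfish | - | - | 2150 | 2180 | 2216 | 2211 | 2213 |
| scimark2: Jacobi Successive Over-Relaxation | 788.14 | 786.63 | 789.67 | 786.14 | 788.66 | 784.62 | 788.66 |
| scimark2: Dense LU Matrix Factorization | 1219.06 | 1217.67 | 1219.07 | 1223.26 | 1220.46 | 1217.67 | 1227.47 |
| scimark2: Sparse Matrix Multiply | 1118.38 | 1128.38 | 1140.18 | 1135.42 | 1144.14 | 1153.01 | 1156.25 |
| scimark2: Fast Fourier Transform | 167.62 | 163.31 | 157.15 | 162.05 | 151.85 | 162.09 | 155.32 |
| scimark2: Monte Carlo | 371.94 | 194.68 | 205.71 | 348.62 | 344.15 | 264.50 | 265.49 |
| ffte: N=64, 1D Complex FFT Routine | - | - | 2954.08 | 2648.87 | 3236.52 | 3756.73 | 3757.28 |
| lammps: Rhodopsin Protein | 72.32 | 74.68 | 71.59 | 71.60 | 73.58 | 71.82 | 71.61 |
| npb: UA.A | - | - | 20.74 | 20.64 | 20.44 | 21.02 | 21.08 |
| npb: LU.A | - | - | 4268.49 | 4966.54 | 5312.37 | 5536.77 | 5522.13 |

Result overview, AMD Opteron (- = no result listed)

| Test | GCC 4.2.4 | GCC 4.3.6 | GCC 4.4.7 | GCC 4.5.4 | GCC 4.6.3 | GCC 4.7.1 | GCC 4.8 20120701 |
| tachyon: Total Time | 39.24 | 39.53 | 41.11 | 35.58 | 37.10 | 36.67 | 36.40 |
| povray: Total Time | 1346 | 1579 | 1268 | 1274 | 1301 | 1226 | 1219 |
| encode-mp3: WAV To MP3 | 30.16 | 27.14 | 27.22 | 26.80 | 27.48 | 28.75 | 29.04 |
| encode-flac: WAV To FLAC | 10.42 | 9.94 | 9.69 | 9.71 | 9.79 | 9.56 | 9.37 |
| crafty: Elapsed Time | 124.49 | 129.18 | 129.38 | 124.65 | 124.79 | 122.96 | 122.93 |
| c-ray: Total Time | 157.85 | 119.33 | 115.81 | 107.75 | 103.47 | 72.01 | 72.01 |
| graphics-magick: Local Adaptive Thresholding | 12 | 15 | 41 | 41 | 41 | 42 | 42 |
| graphics-magick: Resizing | 42 | 52 | 97 | 95 | 95 | 95 | 93 |
| graphics-magick: Sharpen | 14 | 24 | 61 | 61 | 57 | 58 | 58 |
| x264: H.264 Video Encoding | 57.60 | 57.24 | 57.64 | 57.89 | 59.35 | 59.35 | 59.23 |
| vpxenc: vpxenc | 13.04 | 12.94 | 12.98 | 13.19 | 13.51 | 13.26 | 13.19 |
| ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | - | 14.16 | 52.08 | 58.67 | 62.50 | 54.90 | 56.03 |
| john-the-ripper: Blowfish | - | - | 3031 | 3077 | 3156 | 2995 | 2981 |
| scimark2: Jacobi Successive Over-Relaxation | 600.56 | 605.90 | 595.32 | 598.81 | 599.98 | 592.44 | 600.56 |
| scimark2: Dense LU Matrix Factorization | 946.48 | 996.89 | 1021.74 | 751.78 | 1055.07 | 1012.02 | 1092.91 |
| scimark2: Sparse Matrix Multiply | 714.26 | 709.28 | 702.60 | 664.43 | 681.01 | 678.73 | 676.50 |
| scimark2: Fast Fourier Transform | 67.94 | 62.54 | 69.07 | 66.69 | 61.59 | 62.14 | 66.33 |
| scimark2: Monte Carlo | 284.25 | 183.70 | 188.22 | 284.25 | 304.67 | 225.11 | 252.94 |
| ffte: N=64, 1D Complex FFT Routine | - | - | 3187.78 | 3003.51 | 3334.60 | 3501.68 | 3520.29 |
| lammps: Rhodopsin Protein | 83.87 | 84.71 | 83.71 | 84.02 | 86.15 | 84.29 | 86.01 |
| npb: UA.A | - | - | 18.83 | 18.61 | 19.13 | 19.04 | 18.65 |
| npb: LU.A | - | - | 4545.22 | 4893.90 | 5008.86 | 5287.87 | 5299.79 |
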
Tachyon 0.98.9 - Total Time (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 39.24, 39.53, 41.11, 35.58, 37.10, 36.67, 36.40
Intel Core i7: 37.95, 38.99, 38.41, 36.44, 36.09, 37.28, 37.16
SE +/- (N = 3, in export order): 0.04, 0.04, 0.04, 0.01, 0.17, 0.05, 0.05, 0.05, 0.07, 0.04, 0.07, 0.01, 0.03, 0.00
1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

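Each result in this export carries an "SE +/- x, N = n" annotation. A minimal sketch of how such a standard error can be computed from repeated runs is shown here, assuming the usual definition (sample standard deviation divided by the square root of N); the export does not state which estimator the Phoronix Test Suite uses, and the run times in the example are hypothetical, since the individual runs are not part of this export.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical run times in seconds; not taken from the export.
    runs = [39.20, 39.24, 39.28]
    mean = statistics.mean(runs)
    se = standard_error(runs)
    print(f"{mean:.2f} s, SE +/- {se:.2f}, N = {len(runs)}")
```

A small SE relative to the gap between two compilers suggests the gap is not just run-to-run noise; with N = 3 this remains a rough indication only.
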
POV-Ray 3.6.1 - Total Time (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 1346, 1579, 1268, 1274, 1301, 1226, 1219
Intel Core i7: 1106, 1315, 984, 979, 975, 947, 949
1. (CXX) g++ options: -pipe -O3 -msse -mfpmath=sse -msse2 -march=k8 -mtune=k8 -march=native -lz -lSM -lICE -lX11 -lm

LAME MP3 Encoding 3.99.3 - WAV To MP3 (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 30.16, 27.14, 27.22, 26.80, 27.48, 28.75, 29.04
Intel Core i7: 23.34, 23.38, 23.04, 22.98, 23.83, 23.45, 22.41
SE +/- (N = 5, in export order): 0.03, 0.03, 0.02, 0.01, 0.01, 0.01, 0.02, 0.02, 0.01, 0.03, 0.01, 0.02, 0.03, 0.02

FLAC Audio Encoding 1.2.1 - WAV To FLAC (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 10.42, 9.94, 9.69, 9.71, 9.79, 9.56, 9.37
Intel Core i7: 11.43, 11.15, 10.62, 10.58, 10.07, 9.56, 9.33
SE +/- (N = 5, in export order): 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.02, 0.01
1. (CXX) g++ options: -O3 -march=native -logg -lm

Crafty 23.4 - Elapsed Time (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 124.49, 129.18, 129.38, 124.65, 124.79, 122.96, 122.93
Intel Core i7: 109.85, 116.15, 109.12, 108.61, 110.53, 108.56, 108.42
SE +/- (N = 3, in export order): 0.11, 0.54, 0.22, 0.43, 0.05, 0.40, 0.11, 0.05, 0.02, 0.23, 0.10, 0.23, 0.28, 0.24
1. (CC) gcc options: -lstdc++ -lm

C-Ray 1.1 - Total Time (Seconds, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 157.85, 119.33, 115.81, 107.75, 103.47, 72.01, 72.01
Intel Core i7: 138.48, 118.43, 116.43, 105.73, 106.69, 77.71, 77.13
SE +/- (N = 3, in export order): 0.12, 0.07, 0.06, 0.04, 0.06, 0.03, 0.07, 0.09, 0.01, 0.02, 0.07, 0.02, 0.22, 0.03
1. (CC) gcc options: -lm -lpthread -O3 -march=native

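C-Ray shows the largest change across compiler releases among the timed tests, so it makes a convenient worked example for a "fewer is better" metric. The sketch below computes the percentage of run time saved relative to GCC 4.2.4. It assumes the first seven C-Ray values in the export belong to the AMD Opteron and the last seven to the Intel Core i7, following the legend order; the function name is illustrative.

```python
GCC_VERSIONS = ["4.2.4", "4.3.6", "4.4.7", "4.5.4", "4.6.3", "4.7.1", "4.8 20120701"]

# C-Ray 1.1 total time in seconds, as listed in this export.
C_RAY_SECONDS = {
    "AMD Opteron": [157.85, 119.33, 115.81, 107.75, 103.47, 72.01, 72.01],
    "Intel Core i7": [138.48, 118.43, 116.43, 105.73, 106.69, 77.71, 77.13],
}


def percent_time_saved(baseline, candidate):
    """Percentage of the baseline run time saved by the candidate.

    Positive means the candidate is faster (fewer seconds).
    """
    return (baseline - candidate) / baseline * 100.0


if __name__ == "__main__":
    for system, times in C_RAY_SECONDS.items():
        baseline = times[0]  # GCC 4.2.4
        for version, seconds in zip(GCC_VERSIONS[1:], times[1:]):
            saved = percent_time_saved(baseline, seconds)
            print(f"{system}: GCC {version} saves {saved:.1f}% vs GCC 4.2.4")
```

With these numbers, GCC 4.8 saves roughly 54% of the GCC 4.2.4 run time on the Opteron and roughly 44% on the Core i7. For "more is better" metrics such as frames per second, the ratio would be inverted.
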
GraphicsMagick 1.3.12 - Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 12, 15, 41, 41, 41, 42, 42
Intel Core i7: 13, 13, 37, 37, 40, 41, 41
SE +/- (N = 3, in export order): 0.00, 0.00, 0.00, 0.00, 0.00, 0.33, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.33
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick 1.3.12 - Operation: Resizing (Iterations Per Minute, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 42, 52, 97, 95, 95, 95, 93
Intel Core i7: 40, 42, 78, 76, 89, 90, 88
SE +/- (N = 3, in export order): 0.00, 0.00, 0.33, 0.00, 0.33, 0.00, 0.00, 0.00, 0.00, 0.00, 0.33, 0.33, 0.00, 0.33
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick 1.3.12 - Operation: Sharpen (Iterations Per Minute, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 14, 24, 61, 61, 57, 58, 58
Intel Core i7: 15, 14, 39, 38, 41, 41, 41
SE +/- (N = 3, in export order): 0.00 for all 14 results
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lz -lm -lpthread

x264 2011-12-06 - H.264 Video Encoding (Frames Per Second, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 57.60, 57.24, 57.64, 57.89, 59.35, 59.35, 59.23
Intel Core i7: 56.91, 56.49, 56.85, 57.15, 58.34, 57.90, 58.46
SE +/- (N = 3, in export order): 0.21, 0.24, 0.26, 0.31, 0.28, 0.20, 0.31, 0.27, 0.06, 0.11, 0.19, 0.20, 0.07, 0.09

VP8 libvpx Encoding 0.9.7-p1 - vpxenc (Frames Per Second, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 13.04, 12.94, 12.98, 13.19, 13.51, 13.26, 13.19
Intel Core i7: 9.60, 9.83, 9.38, 9.50, 10.02, 9.90, 9.87
SE +/- (N = 3, in export order): 0.12, 0.09, 0.09, 0.08, 0.13, 0.03, 0.04, 0.06, 0.05, 0.07, 0.13, 0.05, 0.03, 0.10
1. (CC) gcc options: -m64 -lvpx -lm -lpthread

TTSIOD 3D Renderer 2.2w - Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
GCC version: 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 14.16, 52.08, 58.67, 62.50, 54.90, 56.03
Intel Core i7: 16.16, 53.31, 44.78, 46.57, 51.30, 59.30
SE +/- (N = 3 unless noted, in export order): 0.03, 0.03, 0.05, 0.02, 0.02, 0.02, 0.04, 0.16, 0.06, 0.10, 2.37 (N = 6), 0.08
1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

John The Ripper 1.7.9 - Test: Blowfish (Real C/S, More Is Better)
GCC version: 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 3031, 3077, 3156, 2995, 2981
Intel Core i7: 2150, 2180, 2216, 2211, 2213
SE +/- (N = 3, in export order): 1.86, 2.00, 0.00, 0.33, 3.67, 2.60, 0.33, 1.33, 1.67, 2.60
1. (CC) gcc options: -fopenmp -lcrypt

SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 600.56, 605.90, 595.32, 598.81, 599.98, 592.44, 600.56
Intel Core i7: 788.14, 786.63, 789.67, 786.14, 788.66, 784.62, 788.66
SE +/- (N = 4, in export order): 0.59, 0.00, 0.67, 0.96, 0.68, 0.57, 0.59, 0.83, 1.27, 1.28, 2.17, 1.52, 1.50, 1.93

SciMark 2.0 - Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 946.48, 996.89, 1021.74, 751.78, 1055.07, 1012.02, 1092.91
Intel Core i7: 1219.06, 1217.67, 1219.07, 1223.26, 1220.46, 1217.67, 1227.47
SE +/- (N = 4, in export order): 1.60, 1.87, 3.20, 4.52, 1.05, 1.11, 1.83, 1.39, 2.27, 2.66, 2.29, 2.78, 2.27, 1.41

SciMark 2.0 - Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 714.26, 709.28, 702.60, 664.43, 681.01, 678.73, 676.50
Intel Core i7: 1118.38, 1128.38, 1140.18, 1135.42, 1144.14, 1153.01, 1156.25
SE +/- (N = 4, in export order): 3.30, 1.84, 2.30, 2.82, 3.11, 2.32, 2.63, 2.29, 1.27, 2.99, 0.79, 1.30, 3.06, 0.81

SciMark 2.0 - Computational Test: Fast Fourier Transform (Mflops, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 67.94, 62.54, 69.07, 66.69, 61.59, 62.14, 66.33
Intel Core i7: 167.62, 163.31, 157.15, 162.05, 151.85, 162.09, 155.32
SE +/- (N = 4 unless noted, in export order): 1.72, 1.65, 0.42, 0.13, 0.16 (N = 3), 0.11, 0.24, 0.49, 0.92, 0.36, 0.25, 0.71, 3.26, 1.13

SciMark 2.0 - Computational Test: Monte Carlo (Mflops, More Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 284.25, 183.70, 188.22, 284.25, 304.67, 225.11, 252.94
Intel Core i7: 371.94, 194.68, 205.71, 348.62, 344.15, 264.50, 265.49
SE +/- (N = 4 unless noted, in export order): 0.19, 0.16, 0.16, 0.19, 2.51, 0.27, 0.30, 4.81 (N = 7), 3.20, 0.78, 0.46, 0.64, 1.60, 1.71

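The five SciMark 2.0 kernels move in different directions between compiler releases, so a single aggregate can help when reading them together. The export does not include a composite score; the sketch below is one possible aggregation, a geometric mean of the GCC 4.8 / GCC 4.2.4 ratios for the Intel Core i7, and is not a figure reported by the test suite. It assumes the last seven values of each SciMark entry belong to the Intel system, following the legend order.

```python
import math

# Intel Core i7 SciMark 2.0 results in Mflops, as listed in this export:
# (GCC 4.2.4, GCC 4.8 20120701)
SCIMARK_INTEL = {
    "Jacobi Successive Over-Relaxation": (788.14, 788.66),
    "Dense LU Matrix Factorization": (1219.06, 1227.47),
    "Sparse Matrix Multiply": (1118.38, 1156.25),
    "Fast Fourier Transform": (167.62, 155.32),
    "Monte Carlo": (371.94, 265.49),
}


def geometric_mean_ratio(pairs):
    """Geometric mean of new/old ratios for more-is-better scores."""
    logs = [math.log(new / old) for old, new in pairs]
    return math.exp(sum(logs) / len(logs))


if __name__ == "__main__":
    ratio = geometric_mean_ratio(SCIMARK_INTEL.values())
    print(f"GCC 4.8 vs GCC 4.2.4, geometric mean of ratios: {ratio:.3f}")
```

A value above 1 would mean GCC 4.8 is faster on aggregate. For these five kernels the result comes out below 1, driven mainly by the Monte Carlo kernel. The geometric mean is used here so that one kernel with large absolute Mflops does not dominate the aggregate.
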
FFTE 5.0 - Test: N=64, 1D Complex FFT Routine (MFLOPS, More Is Better)
GCC version: 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 3187.78, 3003.51, 3334.60, 3501.68, 3520.29
Intel Core i7: 2954.08, 2648.87, 3236.52, 3756.73, 3757.28
SE +/- (N = 3, in export order): 0.90, 1.60, 3.46, 14.17, 3.97, 11.26, 7.58, 17.30, 5.14, 7.24
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread

LAMMPS Molecular Dynamics Simulator 1.0 - Test: Rhodopsin Protein (Loop Time, Fewer Is Better)
GCC version: 4.2.4, 4.3.6, 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 83.87, 84.71, 83.71, 84.02, 86.15, 84.29, 86.01
Intel Core i7: 72.32, 74.68, 71.59, 71.60, 73.58, 71.82, 71.61
SE +/- (N = 3, in export order): 0.12, 0.17, 0.16, 0.52, 0.60, 0.13, 0.46, 0.14, 0.23, 0.22, 0.06, 0.11, 0.09, 0.18
1. (CXX) g++ options: -lfftw -lmpich

NAS Parallel Benchmarks 3.3 - Test / Class: UA.A (Total Mop/s, More Is Better)
GCC version: 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 18.83, 18.61, 19.13, 19.04, 18.65
Intel Core i7: 20.74, 20.64, 20.44, 21.02, 21.08
SE +/- (N = 3, in export order): 0.02, 0.01, 0.01, 0.02, 0.04, 0.03, 0.01, 0.05, 0.07, 0.06
1. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks 3.3 - Test / Class: LU.A (Total Mop/s, More Is Better)
GCC version: 4.4.7, 4.5.4, 4.6.3, 4.7.1, 4.8 20120701
AMD Opteron: 4545.22, 4893.90, 5008.86, 5287.87, 5299.79
Intel Core i7: 4268.49, 4966.54, 5312.37, 5536.77, 5522.13
SE +/- (N = 3 unless noted, in export order): 4.19, 2.74, 115.00 (N = 6), 100.10 (N = 6), 90.79 (N = 6), 3.88, 4.18, 12.16, 16.61, 8.86
1. (F9X) gfortran options: -fopenmp

Phoronix Test Suite v10.8.5