GCC 4.x Benchmarking - Intel, AMD
64-bit benchmarking of GCC 4.2 through GCC 4.8, with each compiler built the same way and CFLAGS/CXXFLAGS set to -O3 -march=native prior to test installation and execution. Benchmarking for a future article on Phoronix.com by Michael Larabel. Testing was done on an Intel Core i7 and an AMD Opteron 2384 using the 64-bit (x86_64 target) build of Ubuntu Linux.
HTML result view exported from: https://openbenchmarking.org/result/1207077-SU-GCCPERFOR59&rdt&grr .
Intel Core i7 system
  Processor:          Intel Core i7 720Q @ 1.60GHz (8 Cores)
  Motherboard:        LENOVO 4318CTO
  Chipset:            Intel Core DMI
  Memory:             4096MB
  Disk:               160GB INTEL SSDSA2M160
  Graphics:           NVIDIA Quadro FX 880M
  Audio:              Conexant CX20585
  Network:            Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300
  OS:                 Ubuntu 12.10
  Kernel:             3.5.0-2-generic (x86_64)
  Desktop:            Unity 5.12.0
  Display Server:     X Server 1.11.3
  Display Driver:     NVIDIA 302.17
  OpenGL:             3.3.0
  Compiler:           GCC 4.8.0 20120701, GCC 4.7.1, GCC 4.6.3, GCC 4.5.4, GCC 4.4.7, GCC 4.3.6, GCC 4.2.4
  File-System:        ext4
  Screen Resolution:  1600x900

AMD Opteron system
  Processor:          AMD Opteron 2384 @ 2.70GHz (4 Cores)
  Motherboard:        TYAN S2927/S2927-E
  Chipset:            NVIDIA MCP55
  Disk:               64GB AGILITY-EX
  Graphics:           AMD Radeon HD 4870 512MB
  Audio:              ATI R6xx HDMI
  Monitor:            AL2223W
  Display Driver:     radeon 6.14.4
  OpenGL:             2.1 Mesa 8.0.3 Gallium 0.4
  Compiler:           GCC 4.8.0 20120701, GCC 4.7.1, GCC 4.6.3, GCC 4.5.4, GCC 4.4.7, GCC 4.3.6, GCC 4.2.4
  Screen Resolution:  1680x1050

Compiler Details: --enable-checking=release --enable-languages=c,c++,fortran
Processor Details: Scaling Governor: ondemand
System Details: Compiz was running on this system.

Results overview. A dash marks a configuration with no reported result.

Intel Core i7
   GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4  Test
     37.16     37.28     36.09     36.44     38.41     38.99     37.95  tachyon: Total Time
       949       947       975       979       984      1315      1106  povray: Total Time
     22.41     23.45     23.83     22.98     23.04     23.38     23.34  encode-mp3: WAV To MP3
      9.33      9.56     10.07     10.58     10.62     11.15     11.43  encode-flac: WAV To FLAC
    108.42    108.56    110.53    108.61    109.12    116.15    109.85  crafty: Elapsed Time
     77.13     77.71    106.69    105.73    116.43    118.43    138.48  c-ray: Total Time
        41        41        40        37        37        13        13  graphics-magick: Local Adaptive Thresholding
        88        90        89        76        78        42        40  graphics-magick: Resizing
        41        41        41        38        39        14        15  graphics-magick: Sharpen
     58.46     57.90     58.34     57.15     56.85     56.49     56.91  x264: H.264 Video Encoding
      9.87      9.90     10.02      9.50      9.38      9.83      9.60  vpxenc: vpxenc
     59.30     51.30     46.57     44.78     53.31     16.16         -  ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
      2213      2211      2216      2180      2150         -         -  john-the-ripper: Blowfish
    788.66    784.62    788.66    786.14    789.67    786.63    788.14  scimark2: Jacobi Successive Over-Relaxation
   1227.47   1217.67   1220.46   1223.26   1219.07   1217.67   1219.06  scimark2: Dense LU Matrix Factorization
   1156.25   1153.01   1144.14   1135.42   1140.18   1128.38   1118.38  scimark2: Sparse Matrix Multiply
    155.32    162.09    151.85    162.05    157.15    163.31    167.62  scimark2: Fast Fourier Transform
    265.49    264.50    344.15    348.62    205.71    194.68    371.94  scimark2: Monte Carlo
   3757.28   3756.73   3236.52   2648.87   2954.08         -         -  ffte: N=64, 1D Complex FFT Routine
     71.61     71.82     73.58     71.60     71.59     74.68     72.32  lammps: Rhodopsin Protein
     21.08     21.02     20.44     20.64     20.74         -         -  npb: UA.A
   5522.13   5536.77   5312.37   4966.54   4268.49         -         -  npb: LU.A

AMD Opteron
   GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4  Test
     36.40     36.67     37.10     35.58     41.11     39.53     39.24  tachyon: Total Time
      1219      1226      1301      1274      1268      1579      1346  povray: Total Time
     29.04     28.75     27.48     26.80     27.22     27.14     30.16  encode-mp3: WAV To MP3
      9.37      9.56      9.79      9.71      9.69      9.94     10.42  encode-flac: WAV To FLAC
    122.93    122.96    124.79    124.65    129.38    129.18    124.49  crafty: Elapsed Time
     72.01     72.01    103.47    107.75    115.81    119.33    157.85  c-ray: Total Time
        42        42        41        41        41        15        12  graphics-magick: Local Adaptive Thresholding
        93        95        95        95        97        52        42  graphics-magick: Resizing
        58        58        57        61        61        24        14  graphics-magick: Sharpen
     59.23     59.35     59.35     57.89     57.64     57.24     57.60  x264: H.264 Video Encoding
     13.19     13.26     13.51     13.19     12.98     12.94     13.04  vpxenc: vpxenc
     56.03     54.90     62.50     58.67     52.08     14.16         -  ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
      2981      2995      3156      3077      3031         -         -  john-the-ripper: Blowfish
    600.56    592.44    599.98    598.81    595.32    605.90    600.56  scimark2: Jacobi Successive Over-Relaxation
   1092.91   1012.02   1055.07    751.78   1021.74    996.89    946.48  scimark2: Dense LU Matrix Factorization
    676.50    678.73    681.01    664.43    702.60    709.28    714.26  scimark2: Sparse Matrix Multiply
     66.33     62.14     61.59     66.69     69.07     62.54     67.94  scimark2: Fast Fourier Transform
    252.94    225.11    304.67    284.25    188.22    183.70    284.25  scimark2: Monte Carlo
   3520.29   3501.68   3334.60   3003.51   3187.78         -         -  ffte: N=64, 1D Complex FFT Routine
     86.01     84.29     86.15     84.02     83.71     84.71     83.87  lammps: Rhodopsin Protein
     18.65     19.04     19.13     18.61     18.83         -         -  npb: UA.A
   5299.79   5287.87   5008.86   4893.90   4545.22         -         -  npb: LU.A
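
The overview mixes "Fewer Is Better" timings with "More Is Better" throughput figures, so the numbers need normalizing before they can be combined. The following is a minimal sketch of one way to do that: convert each result to a speedup ratio and take the geometric mean. The four tests, the GCC 4.4.7 to GCC 4.8 comparison, and the function names are arbitrary choices for illustration, and the geometric mean is an assumption here rather than something the export itself reports.

```python
import math

# (test, GCC 4.4.7 result, GCC 4.8 result, lower_is_better)
# Intel Core i7 values copied from the overview above; the subset is arbitrary.
RESULTS = [
    ("c-ray: Total Time", 116.43, 77.13, True),
    ("povray: Total Time", 984.0, 949.0, True),
    ("graphics-magick: Resizing", 78.0, 88.0, False),
    ("scimark2: Monte Carlo", 205.71, 265.49, False),
]


def speedup(old, new, lower_is_better):
    """Return a ratio above 1.0 when the newer compiler is faster."""
    return old / new if lower_is_better else new / old


def geometric_mean(values):
    """Geometric mean, computed through logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


speedups = [speedup(old, new, lower) for _, old, new, lower in RESULTS]
for (name, _, _, _), ratio in zip(RESULTS, speedups):
    print(f"{name}: {ratio:.3f}x")
print(f"geometric mean: {geometric_mean(speedups):.3f}x")
```

A ratio is used instead of a raw difference so that a test measured in seconds and a test measured in Mflops carry the same weight.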

Tachyon 0.98.9 - Total Time (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7     37.16     37.28     36.09     36.44     38.41     38.99     37.95
  SE +/-           0.00      0.03      0.01      0.07      0.04      0.07      0.05
AMD Opteron       36.40     36.67     37.10     35.58     41.11     39.53     39.24
  SE +/-           0.05      0.05      0.17      0.01      0.04      0.04      0.04
N = 3 for every result.
1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
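
Each result in this export carries an "SE +/-" figure and a run count N. The export does not state how SE is computed. The sketch below assumes the usual definition of the standard error of the mean, the sample standard deviation divided by the square root of N. The run times in it are hypothetical and are not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumed definition; the export only reports the final SE value.
    """
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate the standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical run times in seconds for one test with N = 3.
runs = [37.10, 37.20, 37.30]
print(f"mean = {statistics.mean(runs):.2f} s, SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```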

POV-Ray 3.6.1 - Total Time (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7       949       947       975       979       984      1315      1106
AMD Opteron        1219      1226      1301      1274      1268      1579      1346
1. (CXX) g++ options: -pipe -O3 -msse -mfpmath=sse -msse2 -march=k8 -mtune=k8 -march=native -lz -lSM -lICE -lX11 -lm

LAME MP3 Encoding 3.99.3 - WAV To MP3 (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7     22.41     23.45     23.83     22.98     23.04     23.38     23.34
  SE +/-           0.02      0.03      0.02      0.01      0.03      0.01      0.02
AMD Opteron       29.04     28.75     27.48     26.80     27.22     27.14     30.16
  SE +/-           0.02      0.01      0.01      0.01      0.02      0.03      0.03
N = 5 for every result.

FLAC Audio Encoding 1.2.1 - WAV To FLAC (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7      9.33      9.56     10.07     10.58     10.62     11.15     11.43
  SE +/-           0.01      0.02      0.01      0.01      0.01      0.01      0.01
AMD Opteron        9.37      9.56      9.79      9.71      9.69      9.94     10.42
  SE +/-           0.01      0.01      0.01      0.01      0.01      0.01      0.01
N = 5 for every result.
1. (CXX) g++ options: -O3 -march=native -logg -lm

Crafty 23.4 - Elapsed Time (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7    108.42    108.56    110.53    108.61    109.12    116.15    109.85
  SE +/-           0.24      0.28      0.23      0.10      0.23      0.02      0.05
AMD Opteron      122.93    122.96    124.79    124.65    129.38    129.18    124.49
  SE +/-           0.11      0.40      0.05      0.43      0.22      0.54      0.11
N = 3 for every result.
1. (CC) gcc options: -lstdc++ -lm

C-Ray 1.1 - Total Time (Seconds, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7     77.13     77.71    106.69    105.73    116.43    118.43    138.48
  SE +/-           0.03      0.22      0.02      0.07      0.02      0.01      0.09
AMD Opteron       72.01     72.01    103.47    107.75    115.81    119.33    157.85
  SE +/-           0.07      0.03      0.06      0.04      0.06      0.07      0.12
N = 3 for every result.
1. (CC) gcc options: -lm -lpthread -O3 -march=native
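
C-Ray shows one of the largest spreads between the oldest and newest compilers in this result file. As a small worked example, the sketch below computes the percentage reduction in run time from GCC 4.2.4 to GCC 4.8 on both systems. The values are the C-Ray totals reported above. The helper name `percent_reduction` is made up for illustration.

```python
def percent_reduction(old_seconds, new_seconds):
    """Percentage of run time saved going from old to new (positive = faster)."""
    return (old_seconds - new_seconds) / old_seconds * 100.0


# C-Ray 1.1 total time in seconds, taken from the results above.
C_RAY_SECONDS = {
    "Intel Core i7": {"GCC 4.2.4": 138.48, "GCC 4.8": 77.13},
    "AMD Opteron": {"GCC 4.2.4": 157.85, "GCC 4.8": 72.01},
}

for system, times in C_RAY_SECONDS.items():
    saved = percent_reduction(times["GCC 4.2.4"], times["GCC 4.8"])
    print(f"{system}: {saved:.1f}% less time with GCC 4.8 than with GCC 4.2.4")
```

The same helper applies to any "Fewer Is Better" test. For "More Is Better" tests the ratio of new to old is the more natural figure.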

GraphicsMagick 1.3.12 - Operation: Local Adaptive Thresholding (Iterations Per Minute, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7        41        41        40        37        37        13        13
  SE +/-           0.33      0.00      0.00      0.00      0.00      0.00      0.00
AMD Opteron          42        42        41        41        41        15        12
  SE +/-           0.00      0.33      0.00      0.00      0.00      0.00      0.00
N = 3 for every result.
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick 1.3.12 - Operation: Resizing (Iterations Per Minute, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7        88        90        89        76        78        42        40
  SE +/-           0.33      0.00      0.33      0.33      0.00      0.00      0.00
AMD Opteron          93        95        95        95        97        52        42
  SE +/-           0.00      0.00      0.33      0.00      0.33      0.00      0.00
N = 3 for every result.
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick 1.3.12 - Operation: Sharpen (Iterations Per Minute, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7        41        41        41        38        39        14        15
  SE +/-           0.00      0.00      0.00      0.00      0.00      0.00      0.00
AMD Opteron          58        58        57        61        61        24        14
  SE +/-           0.00      0.00      0.00      0.00      0.00      0.00      0.00
N = 3 for every result.
1. (CC) gcc options: -std=gnu99 -O3 -march=native -pthread -ltiff -lXext -lSM -lICE -lX11 -lz -lm -lpthread

x264 2011-12-06 - H.264 Video Encoding (Frames Per Second, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7     58.46     57.90     58.34     57.15     56.85     56.49     56.91
  SE +/-           0.09      0.07      0.20      0.19      0.11      0.06      0.27
AMD Opteron       59.23     59.35     59.35     57.89     57.64     57.24     57.60
  SE +/-           0.31      0.20      0.28      0.31      0.26      0.24      0.21
N = 3 for every result.

VP8 libvpx Encoding 0.9.7-p1 - vpxenc (Frames Per Second, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7      9.87      9.90     10.02      9.50      9.38      9.83      9.60
  SE +/-           0.10      0.03      0.05      0.13      0.07      0.05      0.06
AMD Opteron       13.19     13.26     13.51     13.19     12.98     12.94     13.04
  SE +/-           0.04      0.03      0.13      0.08      0.09      0.09      0.12
N = 3 for every result.
1. (CC) gcc options: -m64 -lvpx -lm -lpthread

TTSIOD 3D Renderer 2.2w - Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6
Intel Core i7     59.30     51.30     46.57     44.78     53.31     16.16
  SE +/-           0.08      2.37      0.10      0.06      0.16      0.04
AMD Opteron       56.03     54.90     62.50     58.67     52.08     14.16
  SE +/-           0.02      0.02      0.02      0.05      0.03      0.03
N = 3 for every result except Intel Core i7 with GCC 4.7.1 (N = 6).
1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -lSDL -lstdc++

John The Ripper 1.7.9 - Test: Blowfish (Real C/S, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7
Intel Core i7      2213      2211      2216      2180      2150
  SE +/-           2.60      1.67      1.33      0.33      2.60
AMD Opteron        2981      2995      3156      3077      3031
  SE +/-           3.67      0.33      0.00      2.00      1.86
N = 3 for every result.
1. (CC) gcc options: -fopenmp -lcrypt

SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7    788.66    784.62    788.66    786.14    789.67    786.63    788.14
  SE +/-           1.93      1.50      1.52      2.17      1.28      1.27      0.83
AMD Opteron      600.56    592.44    599.98    598.81    595.32    605.90    600.56
  SE +/-           0.59      0.57      0.68      0.96      0.67      0.00      0.59
N = 4 for every result.

SciMark 2.0 - Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7   1227.47   1217.67   1220.46   1223.26   1219.07   1217.67   1219.06
  SE +/-           1.41      2.27      2.78      2.29      2.66      2.27      1.39
AMD Opteron     1092.91   1012.02   1055.07    751.78   1021.74    996.89    946.48
  SE +/-           1.83      1.11      1.05      4.52      3.20      1.87      1.60
N = 4 for every result.

SciMark 2.0 - Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7   1156.25   1153.01   1144.14   1135.42   1140.18   1128.38   1118.38
  SE +/-           0.81      3.06      1.30      0.79      2.99      1.27      2.29
AMD Opteron      676.50    678.73    681.01    664.43    702.60    709.28    714.26
  SE +/-           2.63      2.32      3.11      2.82      2.30      1.84      3.30
N = 4 for every result.

SciMark 2.0 - Computational Test: Fast Fourier Transform (Mflops, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7    155.32    162.09    151.85    162.05    157.15    163.31    167.62
  SE +/-           1.13      3.26      0.71      0.25      0.36      0.92      0.49
AMD Opteron       66.33     62.14     61.59     66.69     69.07     62.54     67.94
  SE +/-           0.24      0.11      0.16      0.13      0.42      1.65      1.72
N = 4 for every result except AMD Opteron with GCC 4.6.3 (N = 3).

SciMark 2.0 - Computational Test: Monte Carlo (Mflops, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7    265.49    264.50    344.15    348.62    205.71    194.68    371.94
  SE +/-           1.71      1.60      0.64      0.46      0.78      3.20      4.81
AMD Opteron      252.94    225.11    304.67    284.25    188.22    183.70    284.25
  SE +/-           0.30      0.27      2.51      0.19      0.16      0.16      0.19
N = 4 for every result except Intel Core i7 with GCC 4.2.4 (N = 7).

FFTE 5.0 - Test: N=64, 1D Complex FFT Routine (MFLOPS, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7
Intel Core i7   3757.28   3756.73   3236.52   2648.87   2954.08
  SE +/-           7.24      5.14     17.30      7.58     11.26
AMD Opteron     3520.29   3501.68   3334.60   3003.51   3187.78
  SE +/-           3.97     14.17      3.46      1.60      0.90
N = 3 for every result.
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread

LAMMPS Molecular Dynamics Simulator 1.0 - Test: Rhodopsin Protein (Loop Time, Fewer Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7 GCC 4.3.6 GCC 4.2.4
Intel Core i7     71.61     71.82     73.58     71.60     71.59     74.68     72.32
  SE +/-           0.18      0.09      0.11      0.06      0.22      0.23      0.14
AMD Opteron       86.01     84.29     86.15     84.02     83.71     84.71     83.87
  SE +/-           0.46      0.13      0.60      0.52      0.16      0.17      0.12
N = 3 for every result.
1. (CXX) g++ options: -lfftw -lmpich

NAS Parallel Benchmarks 3.3 - Test / Class: UA.A (Total Mop/s, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7
Intel Core i7     21.08     21.02     20.44     20.64     20.74
  SE +/-           0.06      0.07      0.05      0.01      0.03
AMD Opteron       18.65     19.04     19.13     18.61     18.83
  SE +/-           0.04      0.02      0.01      0.01      0.02
N = 3 for every result.
1. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks 3.3 - Test / Class: LU.A (Total Mop/s, More Is Better)
                GCC 4.8 GCC 4.7.1 GCC 4.6.3 GCC 4.5.4 GCC 4.4.7
Intel Core i7   5522.13   5536.77   5312.37   4966.54   4268.49
  SE +/-           8.86     16.61     12.16      4.18      3.88
AMD Opteron     5299.79   5287.87   5008.86   4893.90   4545.22
  SE +/-          90.79    100.10    115.00      2.74      4.19
N = 3 for every result except AMD Opteron with GCC 4.8, GCC 4.7.1 and GCC 4.6.3 (N = 6).
1. (F9X) gfortran options: -fopenmp

Phoronix Test Suite v10.8.5