Intel Haswell GCC 4.8 core-avx2 Tuning

Testing Intel Core i7 4770K with different CFLAGS/CXXFLAGS to look at the core-avx2 Haswell GCC 4.8.1 compiler optimizations. Benchmarks by Michael Larabel of Phoronix for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1309148-SO-1309136DA35
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Bioinformatics 2 Tests
Timed Code Compilation 2 Tests
C/C++ Compiler Tests 8 Tests
CPU Massive 9 Tests
Creator Workloads 6 Tests
Encoding 2 Tests
HPC - High Performance Computing 2 Tests
Multi-Core 8 Tests
Programmer / Developer System Benchmarks 2 Tests
Renderers 3 Tests
Scientific Computing 2 Tests
Server CPU Tests 4 Tests
Single-Threaded 2 Tests
Video Encoding 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
nocona
June 14 2013
 
core2
June 14 2013
 
corei7
June 14 2013
 
corei7-avx
June 14 2013
 
core-avx-i
June 14 2013
 
core-avx2
June 14 2013
 
test
September 13 2013
 
i7-3770K core-avx-i
September 13 2013
 
Q9300@3.33GHz
September 14 2013
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel Haswell GCC 4.8 core-avx2 Tuning Testing Intel Core i7 4770K with different CFLAGS/CXXFLAGS to look at the core-avx2 Haswell GCC 4.8.1 compiler optimizations. Benchmarks by Michael Larabel of Phoronix for a future article. ,,"core-avx2","core2","corei7","corei7-avx","core-avx-i","nocona","test","i7-3770K core-avx-i","Q9300@3.33GHz" Processor,,Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-4770K @ 3.50GHz (8 Cores),Intel Core i7-3770K @ 3.90GHz (8 Cores),Intel Core i7-3770K @ 3.90GHz (8 Cores),Intel Core 2 Quad Q9300 @ 3.33GHz (4 Cores) Motherboard,,Intel DH87RL,Intel DH87RL,Intel DH87RL,Intel DH87RL,Intel DH87RL,Intel DH87RL,ASRock Z77 Pro4-M,ASRock Z77 Pro4-M,ASUS P5K3 Deluxe Chipset,,Intel Haswell DRAM,Intel Haswell DRAM,Intel Haswell DRAM,Intel Haswell DRAM,Intel Haswell DRAM,Intel Haswell DRAM,,,Intel 82G33/G31/P35/P31 + ICH9R Memory,,15360MB,15360MB,15360MB,15360MB,15360MB,15360MB,16384MB,16384MB,8192MB Disk,,240GB OCZ VERTEX3,240GB OCZ VERTEX3,240GB OCZ VERTEX3,240GB OCZ VERTEX3,240GB OCZ VERTEX3,240GB OCZ VERTEX3,256GB OCZ VECTOR + 2 x 1000GB SAMSUNG HD103UJ + 80GB INTEL SSDSA2M080,256GB OCZ VECTOR + 2 x 1000GB SAMSUNG HD103UJ + 80GB INTEL SSDSA2M080,1000GB Seagate ST31000340AS Graphics,,Intel Haswell IGP,Intel Haswell IGP,Intel Haswell IGP,Intel Haswell IGP,Intel Haswell IGP,Intel Haswell IGP,Gallium 0.4 on AMD TAHITI 3072MB (810/1250MHz),Gallium 0.4 on AMD TAHITI 3072MB (810/1250MHz),LLVMpipe Audio,,Intel Haswell HDMI,Intel Haswell HDMI,Intel Haswell HDMI,Intel Haswell HDMI,Intel Haswell HDMI,Intel Haswell HDMI,,,Analog Devices AD1988B Monitor,,VA2431,VA2431,VA2431,VA2431,VA2431,VA2431,LCD3090WQXi,LCD3090WQXi,SyncMaster Network,,Intel Connection I217-V,Intel Connection I217-V,Intel Connection I217-V,Intel Connection I217-V,Intel Connection I217-V,Intel Connection I217-V,,,Marvell 88E8056 PCI-E Gigabit OS,,Ubuntu 13.04,Ubuntu 13.04,Ubuntu 13.04,Ubuntu 13.04,Ubuntu 13.04,Ubuntu 13.04,Gentoo Base 2.2,Gentoo Base 2.2,Slackware 14.0 Kernel,,3.10.0-999-generic (x86_64),3.10.0-999-generic (x86_64),3.10.0-999-generic (x86_64),3.10.0-999-generic (x86_64),3.10.0-999-generic (x86_64),3.10.0-999-generic (x86_64),3.11.0-drmfixes20130912-core-avx-i (x86_64),3.11.0-drmfixes20130912-core-avx-i (x86_64),3.2.45 (x86_64) Desktop,,Unity 7.0.0,Unity 7.0.0,Unity 7.0.0,Unity 7.0.0,Unity 7.0.0,Unity 7.0.0,KDE,KDE, Display Server,,X Server 1.13.3,X Server 1.13.3,X Server 1.13.3,X Server 1.13.3,X Server 1.13.3,X Server 1.13.3,X Server 1.14.2.902 (1.14.3 RC 2),X Server 1.14.2.902 (1.14.3 RC 2),X Server 1.12.4 Display Driver,,intel 2.21.9,intel 2.21.9,intel 2.21.9,intel 2.21.9,intel 2.21.9,intel 2.21.9,radeon 7.2.99,radeon 7.2.99,nouveau 0.0.16 OpenGL,,3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.2.0-devel (git-a2e3b1c),3.0 Mesa 9.3.0-devel (git-f4e35f8) Gallium 0.4,3.0 Mesa 9.3.0-devel (git-f4e35f8) Gallium 0.4,2.1 Mesa 8.0.4 Gallium 0.4 Compiler,,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + LLVM 3.2,GCC 4.8.1 + Clang 3.4 + LLVM 3.4svn,GCC 4.8.1 + Clang 3.4 + LLVM 3.4svn,GCC 4.7.1 + Clang 3.0 + LLVM 3.0 File-System,,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4 Screen Resolution,,1920x1080,1920x1080,1920x1080,1920x1080,1920x1080,1920x1080,2560x1600,2560x1600,1680x1050 ,,"core-avx2","core2","corei7","corei7-avx","core-avx-i","nocona","test","i7-3770K core-avx-i","Q9300@3.33GHz" "Smallpt - Global Illumination Renderer; 100 Samples (sec)",LIB,24,26,26,26,26,26,87,25,172 "SciMark - Computational Test: Fast Fourier Transform (Mflops)",HIB,226.57,250.93,249.11,251.86,247.35,245.07,339.88,346.41,93.72 "SciMark - Computational Test: Dense LU Matrix Factorization (Mflops)",HIB,1817.03,1859.97,1863.19,1851.10,1824.28,1825.73,2386.29,2378.31,865.11 "C-Ray - Total Time (sec)",LIB,17.02,22.95,22.95,22.84,22.83,23.07,27.78,28.18,44.15 "Apache Benchmark - Static Web Page Serving (Reqs/sec)",HIB,25644.10,25606.17,25490.14,25580.44,25549.84,24888.11,23897.32,23771.72,12787.34 "TTSIOD 3D Renderer - Phong Rendering With Soft-Shadow Mapping (FPS)",HIB,119.78,121.58,123.14,117.71,116.54,122.02,148.75,148.59,0.93 "Timed HMMer Search - Pfam Database Search (sec)",LIB,10.55,10.14,10.22,10.62,10.45,10.16,10.13,9.87,18.74 "SciMark - Computational Test: Monte Carlo (Mflops)",HIB,596.16,616.21,616.65,616.65,615.76,615.33,553.48,553.48,325.95 "x264 - H.264 Video Encoding (FPS)",HIB,155.18,156.74,156.06,155.63,156.08,156.80,158.19,157.85,84.66 "Timed ImageMagick Compilation - Time To Compile (sec)",LIB,80.66,79.03,79.64,80.91,81.06,76.98,59.51,64.08,106.30 "GraphicsMagick - Operation: Sharpen (Iterations/min)",HIB,136,84,84,96,96,83,83,95, "Timed Linux Kernel Compilation - Time To Compile (sec)",LIB,97.25,97.63,97.77,98.10,97.85,97.89,89.94,89.90,144.27 "FFmpeg - H.264 HD To NTSC DV (sec)",LIB,13.01,13.16,12.93,12.86,13.00,12.94,11.86,11.89,18.06 "Botan - Test: AES-256 (Mbytes/s)",HIB,158.43,158.35,157.96,158.19,158.31,157.97,,,227.29 "Himeno Benchmark - Poisson Pressure Solver (MFLOPS)",HIB,1282.30,1564.22,1560.18,1404.92,1630.12,1517.03,1686.65,1677.67,1190.98 "Botan - Test: Tiger (Mbytes/s)",HIB,424.56,438.87,427.31,442.47,440.37,438.78,,,331.70 "Botan - Test: CAST-256 (Mbytes/s)",HIB,95.76,95.80,95.54,95.77,95.79,95.48,,,75.73 "GraphicsMagick - Operation: Blur (Iterations/min)",HIB,138,117,116,122,122,115,132,138, "GraphicsMagick - Operation: Resizing (Iterations/min)",HIB,182,160,160,166,167,157,161,167, "GraphicsMagick - Operation: Local Adaptive Thresholding (Iterations/min)",HIB,121,120,120,119,120,118,123,116,