Skylake Xeon GCC Compiler Optimization Tests

Intel Xeon E3-1280 v5 testing with a MSI C236A WORKSTATION. GCC compiler optimization CFLAGS/CXXFLAGS benchmarks by Michael Larabel.

-O0:

    Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 120GB Samsung SSD 850, Graphics: MSI AMD Radeon R7 370 / R9 270/370 OEM 4096MB, Audio: Realtek ALC1150, Monitor: DELL P2415Q, Network: Intel Connection

    OS: Ubuntu 16.04, Kernel: 4.4.0-7-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.17.3, Display Driver: radeon 7.6.1, OpenGL: 4.1 Mesa 11.1.2 Gallium 0.4, Compiler: GCC 5.3.1 20160222, File-System: ext4, Screen Resolution: 3840x2160

-O1, -O2, -O2 -march=native, -O3, -O3 -march=native, -Ofast -march=native:

    Same hardware and software configuration as -O0.

Timed HMMer Search 2.3.2
Pfam Database Search
Seconds < Lower Is Better
-O0 .................. 26.89 |=================================================
-O1 .................. 8.86  |================
-O2 .................. 8.35  |===============
-O2 -march=native .... 8.35  |===============
-O3 .................. 8.17  |===============
-O3 -march=native .... 8.16  |===============
-Ofast -march=native . 8.00  |===============

TTSIOD 3D Renderer 2.3a
Phong Rendering With Soft-Shadow Mapping
FPS > Higher Is Better
-O0 .................. 48.14  |==========
-O1 .................. 187.34 |======================================
-O2 .................. 187.08 |======================================
-O2 -march=native .... 192.90 |=======================================
-O3 .................. 229.78 |===============================================
-O3 -march=native .... 235.67 |================================================
-Ofast -march=native . 236.52 |================================================

GraphicsMagick 1.3.19
Operation: Blur
Iterations Per Minute > Higher Is Better
-O0 .................. 107 |===============================
-O1 .................. 170 |=================================================
-O2 .................. 169 |================================================
-O2 -march=native .... 173 |==================================================
-O3 .................. 170 |=================================================
-O3 -march=native .... 178 |===================================================
-Ofast -march=native . 177 |===================================================

GraphicsMagick 1.3.19
Operation: Sharpen
Iterations Per Minute > Higher Is Better
-O0 .................. 59  |=====================
-O1 .................. 131 |==============================================
-O2 .................. 132 |==============================================
-O2 -march=native .... 140 |=================================================
-O3 .................. 132 |==============================================
-O3 -march=native .... 142 |==================================================
-Ofast -march=native . 145 |===================================================

GraphicsMagick 1.3.19
Operation: Resizing
Iterations Per Minute > Higher Is Better
-O0 .................. 109 |==========================
-O1 .................. 198 |================================================
-O2 .................. 191 |==============================================
-O2 -march=native .... 204 |=================================================
-O3 .................. 201 |=================================================
-O3 -march=native .... 210 |===================================================
-Ofast -march=native . 211 |===================================================

GraphicsMagick 1.3.19
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
-O0 .................. 136 |=============================
-O1 .................. 226 |=================================================
-O2 .................. 213 |==============================================
-O2 -march=native .... 228 |=================================================
-O3 .................. 228 |=================================================
-O3 -march=native .... 234 |==================================================
-Ofast -march=native . 237 |===================================================

GraphicsMagick 1.3.19
Operation: Local Adaptive Thresholding
Iterations Per Minute > Higher Is Better
-O0 .................. 22  |===========
-O1 .................. 92  |=============================================
-O2 .................. 100 |=================================================
-O2 -march=native .... 103 |===================================================
-O3 .................. 103 |===================================================
-O3 -march=native .... 104 |===================================================
-Ofast -march=native . 104 |===================================================

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
-O0 .................. 496.06  |========
-O1 .................. 1302.83 |======================
-O2 .................. 2429.37 |=========================================
-O2 -march=native .... 2653.07 |=============================================
-O3 .................. 2485.14 |==========================================
-O3 -march=native .... 2689.91 |==============================================
-Ofast -march=native . 2757.77 |===============================================

Timed Apache Compilation 2.4.7
Time To Compile
Seconds < Lower Is Better
-O0 .................. 11.08 |======================
-O1 .................. 17.00 |=================================
-O2 .................. 23.12 |=============================================
-O2 -march=native .... 22.84 |============================================
-O3 .................. 24.88 |================================================
-O3 -march=native .... 25.17 |=================================================
-Ofast -march=native . 25.16 |=================================================

Timed ImageMagick Compilation 6.9.0
Time To Compile
Seconds < Lower Is Better
-O0 .................. 12.23 |============
-O1 .................. 24.96 |========================
-O2 .................. 39.67 |=======================================
-O2 -march=native .... 37.57 |=====================================
-O3 .................. 49.70 |=================================================
-O3 -march=native .... 48.97 |================================================
-Ofast -march=native . 49.99 |=================================================

Timed PHP Compilation 5.2.9
Time To Compile
Seconds < Lower Is Better
-O0 .................. 7.27  |================
-O1 .................. 13.35 |==============================
-O2 .................. 19.12 |===========================================
-O2 -march=native .... 18.68 |==========================================
-O3 .................. 21.21 |================================================
-O3 -march=native .... 21.68 |=================================================
-Ofast -march=native . 21.75 |=================================================

C-Ray 1.1
Total Time
Seconds < Lower Is Better
-O0 .................. 61.53 |=================================================
-O1 .................. 43.15 |==================================
-O2 .................. 38.19 |==============================
-O2 -march=native .... 27.83 |======================
-O3 .................. 19.65 |================
-O3 -march=native .... 14.60 |============
-Ofast -march=native . 13.92 |===========

FLAC Audio Encoding 1.3.1
WAV To FLAC
Seconds < Lower Is Better
-O0 .................. 37.56 |=================================================
-O1 .................. 5.97  |========
-O2 .................. 5.22  |=======
-O2 -march=native .... 4.87  |======
-O3 .................. 5.10  |=======
-O3 -march=native .... 4.88  |======
-Ofast -march=native . 4.89  |======

LAME MP3 Encoding 3.99.3
WAV To MP3
Seconds < Lower Is Better
-O0 .................. 30.50 |=================================================
-O1 .................. 13.25 |=====================
-O2 .................. 12.90 |=====================
-O2 -march=native .... 11.99 |===================
-O3 .................. 12.10 |===================
-O3 -march=native .... 10.69 |=================
-Ofast -march=native . 9.51  |===============

Redis 3.0.1
Test: GET
Requests Per Second > Higher Is Better
-O0 .................. 1663526.96 |=======================
-O1 .................. 2953039.46 |==========================================
-O2 .................. 3007445.96 |==========================================
-O2 -march=native .... 2951037.21 |==========================================
-O3 .................. 2910194.79 |=========================================
-O3 -march=native .... 3116395.33 |============================================
-Ofast -march=native . 3023321.58 |===========================================

Redis 3.0.1
Test: SET
Requests Per Second > Higher Is Better
-O0 .................. 1189110.86 |=======================
-O1 .................. 2175575.75 |===========================================
-O2 .................. 2136957.54 |==========================================
-O2 -march=native .... 2076614.15 |=========================================
-O3 .................. 2183433.83 |===========================================
-O3 -march=native .... 2104521.04 |=========================================
-Ofast -march=native . 2243907.58 |============================================

Hierarchical INTegration 1.0
Test: FLOAT
QUIPs > Higher Is Better
-O0 .................. 127505415.48 |=============
-O1 .................. 295705864.24 |==============================
-O2 .................. 370257620.55 |======================================
-O2 -march=native .... 408904948.74 |==========================================
-O3 .................. 381719210.59 |=======================================
-O3 -march=native .... 407575756.08 |==========================================
-Ofast -march=native . 392373251.20 |========================================
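
The raw figures mix "lower is better" and "higher is better" units, so one way to compare flag sets is to express each result as a speedup over the -O0 baseline. The sketch below is not part of the original results; it is an illustration that copies the C-Ray and Himeno numbers from the tables above. The names `C_RAY_SECONDS`, `HIMENO_MFLOPS` and `speedup_vs_baseline` are hypothetical helpers introduced here.

```python
# Values copied from the C-Ray 1.1 results above (seconds, lower is better).
C_RAY_SECONDS = {
    "-O0": 61.53,
    "-O1": 43.15,
    "-O2": 38.19,
    "-O2 -march=native": 27.83,
    "-O3": 19.65,
    "-O3 -march=native": 14.60,
    "-Ofast -march=native": 13.92,
}

# Values copied from the Himeno 3.0 results above (MFLOPS, higher is better).
HIMENO_MFLOPS = {
    "-O0": 496.06,
    "-O1": 1302.83,
    "-O2": 2429.37,
    "-O2 -march=native": 2653.07,
    "-O3": 2485.14,
    "-O3 -march=native": 2689.91,
    "-Ofast -march=native": 2757.77,
}


def speedup_vs_baseline(results, baseline="-O0", lower_is_better=True):
    """Return {flags: speedup factor relative to the baseline flags}.

    For time-like metrics the factor is baseline / value; for
    throughput-like metrics it is value / baseline. In both cases a
    factor above 1.0 means the flag set beat the baseline.
    """
    base = results[baseline]
    return {
        flags: (base / value if lower_is_better else value / base)
        for flags, value in results.items()
    }


if __name__ == "__main__":
    tables = (
        ("C-Ray 1.1", C_RAY_SECONDS, True),
        ("Himeno 3.0", HIMENO_MFLOPS, False),
    )
    for name, table, lower in tables:
        print(name)
        for flags, factor in speedup_vs_baseline(table, lower_is_better=lower).items():
            print(f"  {flags:<22} {factor:5.2f}x")
```

The same helper can be applied to any other test above by copying its seven values into a dictionary and setting `lower_is_better` to match the test's header line. For the compile-time tests the factor falls below 1.0 at higher optimization levels, because there the measured quantity is the time GCC itself takes to build the project.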