Skylake Xeon GCC Compiler Optimization Tests

Intel Xeon E3-1280 v5 testing with a MSI C236A WORKSTATION. GCC compiler optimization CFLAGS/CXXFLAGS benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1602298-BE-1602289GA06&sor.

Skylake Xeon GCC Compiler Optimization TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/nativeIntel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MB120GB Samsung SSD 850MSI AMD Radeon R7 370 / R9 270/370 OEM 4096MBRealtek ALC1150DELL P2415QIntel ConnectionUbuntu 16.044.4.0-7-generic (x86_64)Unity 7.4.0X Server 1.17.3radeon 7.6.14.1 Mesa 11.1.2 Gallium 0.4GCC 5.3.1 20160222ext43840x2160Intel Core i7-3930K @ 5.70GHz (12 Cores)ASUS P9X79 PROIntel Xeon E5/Core32768MB240GB Patriot Pyro SE + 128GB Patriot Torqx 12NVIDIA GeForce GTX 680 2048MB (810/3004MHz)Realtek ALC898Intel 82579V Gigabit ConnectionFedora 234.3.5-300.fc23.x86_64 (x86_64)Xfce 4.12NVIDIA 361.184.4.0Clang 3.7.0 + LLVM 3.7.0 + CUDA 5.07680x2160OpenBenchmarking.orgCompiler Details- -O0, -O1, -O2, -O2 -march=native, -O3, -O3 -march=native, -Ofast -march=native: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate powersaveSystem Details- i7, i7-fast/native: SELinux: Enabled.

Skylake Xeon GCC Compiler Optimization Testshmmer: Pfam Database Searchttsiod-renderer: Phong Rendering With Soft-Shadow Mappinggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverbuild-apache: Time To Compilebuild-imagemagick: Time To Compilebuild-php: Time To Compilec-ray: Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3redis: GETredis: SEThint: FLOAT-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native26.8948.141075910913622496.0611.0812.237.2761.5337.5630.501663526.961189110.86127505415.488.86187.34170131198226921302.8317.0024.9613.3543.155.9713.252953039.462175575.75295705864.248.35187.081691321912131002429.3723.1239.6719.1238.195.2212.903007445.962136957.54370257620.558.35192.901731402042281032653.0722.8437.5718.6827.834.8711.992951037.212076614.15408904948.748.17229.781701322012281032485.1424.8849.7021.2119.655.1012.102910194.792183433.83381719210.598.16235.671781422102341042689.9125.1748.9721.6814.604.8810.693116395.332104521.04407575756.088.00236.521771452112371042757.7725.1649.9921.7513.924.899.513023321.582243907.58392373251.207.23244.79163156203205971858.3523.8440.5420.0716.606.9611.461810651.911516309.15360004342.856.83249.01167160207224992088.0624.5648.2420.0814.856.5811.071924469.841364890.50343569657.42OpenBenchmarking.org

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Searchi7-fast/nativei7-Ofast -march=native-O3 -march=native-O3-O2-O2 -march=native-O1-O0612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 36.837.238.008.168.178.358.358.8626.89-Ofast -march=native-O2-Ofast -march=native-O3 -march=native-O3-O2-O2 -march=native-O1-O01. (CC) gcc options: -pthread -lhmmer -lsquid -lm

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mappingi7-fast/nativei7-Ofast -march=native-O3 -march=native-O3-O2 -march=native-O1-O2-O050100150200250SE +/- 0.67, N = 3SE +/- 2.06, N = 3SE +/- 0.78, N = 3SE +/- 0.37, N = 3SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.20, N = 3SE +/- 0.42, N = 3SE +/- 0.04, N = 3249.01244.79236.52235.67229.78192.90187.34187.0848.14-Ofast -march=native -lpthread-O3 -lpthread-Ofast -march=native-O3 -march=native-O3-O2 -march=native-O1-O2-O01. (CXX) g++ options: -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Blur-O3 -march=native-Ofast -march=native-O2 -march=native-O3-O1-O2i7-fast/nativei7-O04080120160200SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.76, N = 3SE +/- 0.88, N = 3SE +/- 0.00, N = 3178177173170170169167163107-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -march=native -ljbig-O3 -ljbig-O1 -ljbig-O2 -ljbig-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml2-O2 -llcms2 -lfreetype -ljasper -lxml2-O0 -ljbig1. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Sharpeni7-fast/nativei7-Ofast -march=native-O3 -march=native-O2 -march=native-O3-O2-O1-O04080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316015614514214013213213159-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml2-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -ljbig-O3 -march=native -ljbig-O2 -march=native -ljbig-O3 -ljbig-O2 -ljbig-O1 -ljbig-O0 -ljbig1. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Resizing-Ofast -march=native-O3 -march=nativei7-fast/native-O2 -march=nativei7-O3-O1-O2-O050100150200250SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.53, N = 3SE +/- 0.00, N = 3211210207204203201198191109-Ofast -march=native -ljbig-O3 -march=native -ljbig-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml2-O2 -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-O3 -ljbig-O1 -ljbig-O2 -ljbig-O0 -ljbig1. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color Space-Ofast -march=native-O3 -march=native-O3-O2 -march=native-O1i7-fast/native-O2i7-O050100150200250SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.73, N = 3SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 0.00, N = 3237234228228226224213205136-Ofast -march=native -ljbig-O3 -march=native -ljbig-O3 -ljbig-O2 -march=native -ljbig-O1 -ljbig-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml2-O2 -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-O0 -ljbig1. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive Thresholding-Ofast -march=native-O3 -march=native-O3-O2 -march=native-O2i7-fast/nativei7-O1-O020406080100SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310410410310310099979222-Ofast -march=native -ljbig-O3 -march=native -ljbig-O3 -ljbig-O2 -march=native -ljbig-O2 -ljbig-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml2-O2 -llcms2 -lfreetype -ljasper -lxml2-O1 -ljbig-O0 -ljbig1. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-Ofast -march=native-O3 -march=native-O2 -march=native-O3-O2i7-fast/nativei7-O1-O06001200180024003000SE +/- 8.78, N = 3SE +/- 13.72, N = 3SE +/- 5.04, N = 3SE +/- 2.83, N = 3SE +/- 2.14, N = 3SE +/- 1.09, N = 3SE +/- 5.77, N = 3SE +/- 7.15, N = 3SE +/- 1.58, N = 32757.772689.912653.072485.142429.372088.061858.351302.83496.06-Ofast -march=native -mavx2-march=native -mavx2-O2 -march=native -mavx2-mavx2-O2 -mavx2-Ofast -march=native-O1 -mavx2-O0 -mavx21. (CC) gcc options: -O3

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O0-O1-O2 -march=native-O2i7i7-fast/native-O3-Ofast -march=native-O3 -march=native612182430SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.24, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 311.0817.0022.8423.1223.8424.5624.8825.1625.17

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O0-O1-O2 -march=native-O2i7i7-fast/native-O3 -march=native-O3-Ofast -march=native1122334455SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.25, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 312.2324.9637.5739.6740.5448.2448.9749.7049.99

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To Compile-O0-O1-O2 -march=native-O2i7i7-fast/native-O3-O3 -march=native-Ofast -march=native510152025SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.30, N = 6SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 37.2713.3518.6819.1220.0720.0821.2121.6821.75-O0-O1-O2 -march=native-O2-O2-Ofast -march=native-O3-O3 -march=native-Ofast -march=native1. (CC) gcc options: -pedantic -ldl -lz -lm

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-Ofast -march=native-O3 -march=nativei7-fast/nativei7-O3-O2 -march=native-O2-O1-O01428425670SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 313.9214.6014.8516.6019.6527.8338.1943.1561.53-Ofast -march=native-march=native-Ofast -march=native-O2 -march=native-O2-O1-O01. (CC) gcc options: -lm -lpthread -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLAC-O2 -march=native-O3 -march=native-Ofast -march=native-O3-O2-O1i7-fast/nativei7-O0918273645SE +/- 0.00, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.04, N = 5SE +/- 0.03, N = 5SE +/- 0.09, N = 54.874.884.895.105.225.976.586.9637.56-O2 -march=native-O3 -march=native-Ofast -march=native-O3-O2-O1-Ofast -march=native-O2-O01. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3-Ofast -march=native-O3 -march=nativei7-fast/nativei7-O2 -march=native-O3-O2-O1-O0714212835SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.03, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.00, N = 5SE +/- 0.07, N = 59.5110.6911.0711.4611.9912.1012.9013.2530.50-Ofast -march=native-O3 -march=native-Ofast -march=native-O3 -ffast-math -funroll-loops-O2 -march=native-O3-O2-O1-O01. (CC) gcc options: -pipe -lncurses -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GET-O3 -march=native-Ofast -march=native-O2-O1-O2 -march=native-O3i7-fast/nativei7-O0700K1400K2100K2800K3500KSE +/- 112169.48, N = 6SE +/- 119850.30, N = 6SE +/- 115634.67, N = 6SE +/- 149849.11, N = 6SE +/- 147992.17, N = 6SE +/- 114387.02, N = 6SE +/- 12423.96, N = 3SE +/- 33773.84, N = 3SE +/- 38735.61, N = 63116395.333023321.583007445.962953039.462951037.212910194.791924469.841810651.911663526.96-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SET-Ofast -march=native-O3-O1-O2-O3 -march=native-O2 -march=nativei7i7-fast/native-O0500K1000K1500K2000K2500KSE +/- 9379.40, N = 3SE +/- 5504.91, N = 3SE +/- 42146.85, N = 3SE +/- 42585.88, N = 3SE +/- 62171.83, N = 6SE +/- 59339.63, N = 6SE +/- 73282.81, N = 6SE +/- 20673.84, N = 3SE +/- 18888.09, N = 62243907.582183433.832175575.752136957.542104521.042076614.151516309.151364890.501189110.86-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O2 -march=native-O3 -march=native-Ofast -march=native-O3-O2i7i7-fast/native-O1-O090M180M270M360M450MSE +/- 839964.65, N = 3SE +/- 469746.65, N = 3SE +/- 623958.85, N = 3SE +/- 705518.12, N = 3SE +/- 807099.70, N = 3SE +/- 683702.70, N = 3SE +/- 124219.71, N = 3SE +/- 261713.92, N = 3SE +/- 333507.82, N = 3408904948.74407575756.08392373251.20381719210.59370257620.55360004342.85343569657.42295705864.24127505415.48-O2 -march=native-O3 -march=native-Ofast -march=native-O3-O2-O3 -march=native-Ofast -march=native-O1-O01. (CC) gcc options: -lm


Phoronix Test Suite v10.8.4