Skylake Xeon GCC Compiler Optimization Tests

Intel Xeon E3-1280 v5 testing with a MSI C236A WORKSTATION. GCC compiler optimization CFLAGS/CXXFLAGS benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1602298-BE-1602289GA06.

Skylake Xeon GCC Compiler Optimization TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/nativeIntel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MB120GB Samsung SSD 850MSI AMD Radeon R7 370 / R9 270/370 OEM 4096MBRealtek ALC1150DELL P2415QIntel ConnectionUbuntu 16.044.4.0-7-generic (x86_64)Unity 7.4.0X Server 1.17.3radeon 7.6.14.1 Mesa 11.1.2 Gallium 0.4GCC 5.3.1 20160222ext43840x2160Intel Core i7-3930K @ 5.70GHz (12 Cores)ASUS P9X79 PROIntel Xeon E5/Core32768MB240GB Patriot Pyro SE + 128GB Patriot Torqx 12NVIDIA GeForce GTX 680 2048MB (810/3004MHz)Realtek ALC898Intel 82579V Gigabit ConnectionFedora 234.3.5-300.fc23.x86_64 (x86_64)Xfce 4.12NVIDIA 361.184.4.0Clang 3.7.0 + LLVM 3.7.0 + CUDA 5.07680x2160OpenBenchmarking.orgCompiler Details- -O0, -O1, -O2, -O2 -march=native, -O3, -O3 -march=native, -Ofast -march=native: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate powersaveSystem Details- i7, i7-fast/native: SELinux: Enabled.

Skylake Xeon GCC Compiler Optimization Testshmmer: Pfam Database Searchttsiod-renderer: Phong Rendering With Soft-Shadow Mappinggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverbuild-apache: Time To Compilebuild-imagemagick: Time To Compilebuild-php: Time To Compilec-ray: Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3redis: GETredis: SEThint: FLOAT-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native26.8948.141075910913622496.0611.0812.237.2761.5337.5630.501663526.961189110.86127505415.488.86187.34170131198226921302.8317.0024.9613.3543.155.9713.252953039.462175575.75295705864.248.35187.081691321912131002429.3723.1239.6719.1238.195.2212.903007445.962136957.54370257620.558.35192.901731402042281032653.0722.8437.5718.6827.834.8711.992951037.212076614.15408904948.748.17229.781701322012281032485.1424.8849.7021.2119.655.1012.102910194.792183433.83381719210.598.16235.671781422102341042689.9125.1748.9721.6814.604.8810.693116395.332104521.04407575756.088.00236.521771452112371042757.7725.1649.9921.7513.924.899.513023321.582243907.58392373251.207.23244.79163156203205971858.3523.8440.5420.0716.606.9611.461810651.911516309.15360004342.856.83249.01167160207224992088.0624.5648.2420.0814.856.5811.071924469.841364890.50343569657.42OpenBenchmarking.org

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 326.898.868.358.358.178.168.007.236.83-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O2-Ofast -march=native1. (CC) gcc options: -pthread -lhmmer -lsquid -lm

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow Mapping-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native50100150200250SE +/- 0.04, N = 3SE +/- 0.20, N = 3SE +/- 0.42, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.37, N = 3SE +/- 0.78, N = 3SE +/- 2.06, N = 3SE +/- 0.67, N = 348.14187.34187.08192.90229.78235.67236.52244.79249.01-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O3 -lpthread-Ofast -march=native -lpthread1. (CXX) g++ options: -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Blur-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.88, N = 3SE +/- 1.76, N = 3107170169173170178177163167-O0 -ljbig-O1 -ljbig-O2 -ljbig-O2 -march=native -ljbig-O3 -ljbig-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml21. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Sharpen-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 359131132140132142145156160-O0 -ljbig-O1 -ljbig-O2 -ljbig-O2 -march=native -ljbig-O3 -ljbig-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml21. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Resizing-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native50100150200250SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 1.53, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.58, N = 3109198191204201210211203207-O0 -ljbig-O1 -ljbig-O2 -ljbig-O2 -march=native -ljbig-O3 -ljbig-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml21. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color Space-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native50100150200250SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 1.73, N = 3136226213228228234237205224-O0 -ljbig-O1 -ljbig-O2 -ljbig-O2 -march=native -ljbig-O3 -ljbig-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml21. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive Thresholding-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 322921001031031041049799-O0 -ljbig-O1 -ljbig-O2 -ljbig-O2 -march=native -ljbig-O3 -ljbig-O3 -march=native -ljbig-Ofast -march=native -ljbig-O2 -llcms2 -lfreetype -ljasper -lxml2-Ofast -march=native -llcms2 -lfreetype -ljasper -lxml21. (CC) gcc options: -fopenmp -pthread -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native6001200180024003000SE +/- 1.58, N = 3SE +/- 7.15, N = 3SE +/- 2.14, N = 3SE +/- 5.04, N = 3SE +/- 2.83, N = 3SE +/- 13.72, N = 3SE +/- 8.78, N = 3SE +/- 5.77, N = 3SE +/- 1.09, N = 3496.061302.832429.372653.072485.142689.912757.771858.352088.06-O0 -mavx2-O1 -mavx2-O2 -mavx2-O2 -march=native -mavx2-mavx2-march=native -mavx2-Ofast -march=native -mavx2-Ofast -march=native1. (CC) gcc options: -O3

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native612182430SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.24, N = 3SE +/- 0.15, N = 311.0817.0023.1222.8424.8825.1725.1623.8424.56

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native1122334455SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.18, N = 3SE +/- 0.25, N = 3SE +/- 0.23, N = 312.2324.9639.6737.5749.7048.9749.9940.5448.24

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To Compile-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native510152025SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.30, N = 6SE +/- 0.02, N = 37.2713.3519.1218.6821.2121.6821.7520.0720.08-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O2-Ofast -march=native1. (CC) gcc options: -pedantic -ldl -lz -lm

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native1428425670SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 361.5343.1538.1927.8319.6514.6013.9216.6014.85-O0-O1-O2-O2 -march=native-march=native-Ofast -march=native-Ofast -march=native1. (CC) gcc options: -lm -lpthread -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLAC-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native918273645SE +/- 0.09, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.04, N = 537.565.975.224.875.104.884.896.966.58-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O2-Ofast -march=native1. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native714212835SE +/- 0.07, N = 5SE +/- 0.00, N = 5SE +/- 0.03, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.03, N = 530.5013.2512.9011.9912.1010.699.5111.4611.07-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O3 -ffast-math -funroll-loops-Ofast -march=native1. (CC) gcc options: -pipe -lncurses -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GET-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native700K1400K2100K2800K3500KSE +/- 38735.61, N = 6SE +/- 149849.11, N = 6SE +/- 115634.67, N = 6SE +/- 147992.17, N = 6SE +/- 114387.02, N = 6SE +/- 112169.48, N = 6SE +/- 119850.30, N = 6SE +/- 33773.84, N = 3SE +/- 12423.96, N = 31663526.962953039.463007445.962951037.212910194.793116395.333023321.581810651.911924469.84-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SET-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native500K1000K1500K2000K2500KSE +/- 18888.09, N = 6SE +/- 42146.85, N = 3SE +/- 42585.88, N = 3SE +/- 59339.63, N = 6SE +/- 5504.91, N = 3SE +/- 62171.83, N = 6SE +/- 9379.40, N = 3SE +/- 73282.81, N = 6SE +/- 20673.84, N = 31189110.862175575.752136957.542076614.152183433.832104521.042243907.581516309.151364890.50-std=gnu99 -pipe -g3 -O3 -funroll-loops1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=nativei7i7-fast/native90M180M270M360M450MSE +/- 333507.82, N = 3SE +/- 261713.92, N = 3SE +/- 807099.70, N = 3SE +/- 839964.65, N = 3SE +/- 705518.12, N = 3SE +/- 469746.65, N = 3SE +/- 623958.85, N = 3SE +/- 683702.70, N = 3SE +/- 124219.71, N = 3127505415.48295705864.24370257620.55408904948.74381719210.59407575756.08392373251.20360004342.85343569657.42-O0-O1-O2-O2 -march=native-O3-O3 -march=native-Ofast -march=native-O3 -march=native-Ofast -march=native1. (CC) gcc options: -lm


Phoronix Test Suite v10.8.4