Intel Haswell GCC 4.8 core-avx2 Tuning

Testing Intel Core i7 4770K with different CFLAGS/CXXFLAGS to look at the core-avx2 Haswell GCC 4.8.1 compiler optimizations. Benchmarks by Michael Larabel of Phoronix for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1309148-SO-1309136DA35&grr.

Intel Haswell GCC 4.8 core-avx2 TuningProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolutionnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHzIntel Core i7-4770K @ 3.50GHz (8 Cores)Intel DH87RLIntel Haswell DRAM15360MB240GB OCZ VERTEX3Intel Haswell IGPIntel Haswell HDMIVA2431Intel Connection I217-VUbuntu 13.043.10.0-999-generic (x86_64)Unity 7.0.0X Server 1.13.3intel 2.21.93.0 Mesa 9.2.0-devel (git-a2e3b1c)GCC 4.8.1 + LLVM 3.2ext41920x1080Intel Core i7-3770K @ 3.90GHz (8 Cores)ASRock Z77 Pro4-M16384MB256GB OCZ VECTOR + 2 x 1000GB SAMSUNG HD103UJ + 80GB INTEL SSDSA2M080Gallium 0.4 on AMD TAHITI 3072MB (810/1250MHz)LCD3090WQXiGentoo Base 2.23.11.0-drmfixes20130912-core-avx-i (x86_64)KDEX Server 1.14.2.902 (1.14.3 RC 2)radeon 7.2.993.0 Mesa 9.3.0-devel (git-f4e35f8) Gallium 0.4GCC 4.8.1 + Clang 3.4 + LLVM 3.4svn2560x1600Intel Core 2 Quad Q9300 @ 3.33GHz (4 Cores)ASUS P5K3 DeluxeIntel 82G33/G31/P35/P31 + ICH9R8192MB1000GB Seagate ST31000340ASLLVMpipeAnalog Devices AD1988BSyncMasterMarvell 88E8056 PCI-E GigabitSlackware 14.03.2.45 (x86_64)X Server 1.12.4nouveau 0.0.162.1 Mesa 8.0.4 Gallium 0.4GCC 4.7.1 + Clang 3.0 + LLVM 3.01680x1050OpenBenchmarking.orgCompiler Details- nocona: --enable-checking=release --enable-languages=c,c++,fortran- core2: --enable-checking=release --enable-languages=c,c++,fortran- corei7: --enable-checking=release --enable-languages=c,c++,fortran- corei7-avx: --enable-checking=release --enable-languages=c,c++,fortran- core-avx-i: --enable-checking=release --enable-languages=c,c++,fortran- core-avx2: --enable-checking=release --enable-languages=c,c++,fortran- test: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/4.8.1 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1 --disable-altivec --disable-fixed-point --disable-isl-version-check --disable-libgcj --disable-libssp --disable-lto --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-languages=c,c++,fortran --enable-libgomp --enable-libmudflap --enable-libstdcxx-time --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.8.1/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/man --with-cloog --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/python - i7-3770K core-avx-i: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/4.8.1 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1 --disable-altivec --disable-fixed-point --disable-isl-version-check --disable-libgcj --disable-libssp --disable-lto --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-languages=c,c++,fortran --enable-libgomp --enable-libmudflap --enable-libstdcxx-time --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.8.1/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/man --with-cloog --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/python - Q9300@3.33GHz: --build=x86_64-slackware-linux --disable-gtktest --disable-libunwind-exceptions --disable-multilib --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-java-awt=gtk --enable-java-home --enable-languages=ada,c,c++,fortran,go,java,lto,objc --enable-libssp --enable-lto --enable-objc-gc --enable-shared --enable-threads=posix --host=x86_64-slackware-linux --mandir=/usr/man --target=x86_64-slackware-linux --verbose --with-antlr-jar=/slack/TMPTMPTMP/gcc-round-two/antlr-runtime-3.4.jar --with-arch-directory=amd64 --with-gnu-ld --with-java-home=/usr/lib64/jvm/jre --with-jvm-jar-dir=/usr/lib64/jvm/jvm-exports --with-jvm-root-dir=/usr/lib64/jvm --with-python-dir=/lib64/python2.7/site-packages Processor Details- nocona: Scaling Governor: acpi- freq ondemand- core2: Scaling Governor: acpi- freq ondemand- corei7: Scaling Governor: acpi- freq ondemand- corei7-avx: Scaling Governor: acpi- freq ondemand- core-avx-i: Scaling Governor: acpi- freq ondemand- core-avx2: Scaling Governor: acpi- freq ondemand- test: Scaling Governor: intel_pstate powersave- i7-3770K core-avx-i: Scaling Governor: intel_pstate powersave

Intel Haswell GCC 4.8 core-avx2 Tuningapache: Static Web Page Servingffmpeg: H.264 HD To NTSC DVsmallpt: Global Illumination Renderer; 100 Samplesc-ray: Total Timebuild-linux-kernel: Time To Compilebuild-imagemagick: Time To Compilehimeno: Poisson Pressure Solvergraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Sharpengraphics-magick: Blurx264: H.264 Video Encodingttsiod-renderer: Phong Rendering With Soft-Shadow Mappingscimark2: Dense LU Matrix Factorizationscimark2: Fast Fourier Transformscimark2: Monte Carlobotan: CAST-256botan: AES-256botan: Tigerhmmer: Pfam Database Searchnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz24888.1112.942623.0797.8976.981517.0311815783115156.80122.021825.73245.07615.3395.48157.97438.7810.1625606.1713.162622.9597.6379.031564.2212016084117156.74121.581859.97250.93616.2195.80158.35438.8710.1425490.1412.932622.9597.7779.641560.1812016084116156.06123.141863.19249.11616.6595.54157.96427.3110.2225580.4412.862622.8498.1080.911404.9211916696122155.63117.711851.10251.86616.6595.77158.19442.4710.6225549.8413.002622.8397.8581.061630.1212016796122156.08116.541824.28247.35615.7695.79158.31440.3710.4525644.1013.012417.0297.2580.661282.30121182136138155.18119.781817.03226.57596.1695.76158.43424.5610.5523897.3211.868727.7889.9459.511686.6512316183132158.19148.752386.29339.88553.4810.1323771.7211.892528.1889.9064.081677.6711616795138157.85148.592378.31346.41553.489.8712787.3418.0617244.15144.27106.301190.9884.660.93865.1193.72325.9575.73227.29331.7018.74OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.3Static Web Page Servingnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz5K10K15K20K25KSE +/- 107.43, N = 3SE +/- 229.80, N = 3SE +/- 193.34, N = 3SE +/- 178.37, N = 3SE +/- 126.25, N = 3SE +/- 170.85, N = 3SE +/- 94.17, N = 3SE +/- 100.68, N = 3SE +/- 65.23, N = 324888.1125606.1725490.1425580.4425549.8425644.1023897.3223771.7212787.34-O3 -march=nocona-O3 -march=core2-O3 -march=corei7-O3 -march=corei7-avx-O3 -march=core-avx-i-O3 -march=core-avx2-O2-march=core-avx-i -O3-O21. (CC) gcc options: -shared -fPIC -pthread

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 1.1H.264 HD To NTSC DVnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz48121620SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 312.9413.1612.9312.8613.0013.0111.8611.8918.06-march=nocona-march=core2-march=corei7-march=corei7-avx-march=core-avx-i-march=core-avx2-lva -lpthread -lrt-lva -lpthread -lrt -march=core-avx-i-lpthread -lrt1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -O3 -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 32626262626248725172-O3 -march=nocona-O3 -march=core2-O3 -march=corei7-O3 -march=corei7-avx-O3 -march=core-avx-i-O3 -march=core-avx2-march=core-avx-i -O31. (CXX) g++ options: -fopenmp

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timenoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz1020304050SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 323.0722.9522.9522.8422.8317.0227.7828.1844.15-march=nocona-march=core2-march=corei7-march=corei7-avx-march=core-avx-i-march=core-avx2-march=core-avx-i1. (CC) gcc options: -lm -lpthread -O3

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 3.1Time To Compilenoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz306090120150SE +/- 0.59, N = 3SE +/- 0.54, N = 3SE +/- 0.69, N = 3SE +/- 0.54, N = 3SE +/- 0.76, N = 3SE +/- 0.60, N = 3SE +/- 0.79, N = 3SE +/- 0.59, N = 3SE +/- 2.13, N = 597.8997.6397.7798.1097.8597.2589.9489.90144.27

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.8.1-10Time To Compilenoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz20406080100SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.31, N = 3SE +/- 0.32, N = 3SE +/- 0.08, N = 3SE +/- 0.23, N = 3SE +/- 0.23, N = 376.9879.0379.6480.9181.0680.6659.5164.08106.30

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solvernoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz400800120016002000SE +/- 1.20, N = 3SE +/- 3.07, N = 3SE +/- 0.75, N = 3SE +/- 105.46, N = 6SE +/- 1.05, N = 3SE +/- 19.87, N = 6SE +/- 0.98, N = 3SE +/- 1.04, N = 3SE +/- 2.83, N = 31517.031564.221560.181404.921630.121282.301686.651677.671190.98-march=nocona-march=core2-march=corei7-march=corei7-avx-march=core-avx-i-march=core-avx2-march=core-avx-i1. (CC) gcc options: -O3

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: Local Adaptive Thresholdingnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-i306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3118120120119120121123116-O3 -march=nocona -ljbig-O3 -march=core2 -ljbig-O3 -march=corei7 -ljbig-O3 -march=corei7-avx -ljbig-O3 -march=core-avx-i -ljbig-O3 -march=core-avx2 -ljbig-O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt-march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: Resizingnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-i4080120160200SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3157160160166167182161167-O3 -march=nocona -ljbig-O3 -march=core2 -ljbig-O3 -march=corei7 -ljbig-O3 -march=corei7-avx -ljbig-O3 -march=core-avx-i -ljbig-O3 -march=core-avx2 -ljbig-O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt-march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: Sharpennoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-i306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 383848496961368395-O3 -march=nocona -ljbig-O3 -march=core2 -ljbig-O3 -march=corei7 -ljbig-O3 -march=corei7-avx -ljbig-O3 -march=core-avx-i -ljbig-O3 -march=core-avx2 -ljbig-O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt-march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: Blurnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-i306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.00, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3115117116122122138132138-O3 -march=nocona -ljbig-O3 -march=core2 -ljbig-O3 -march=corei7 -ljbig-O3 -march=corei7-avx -ljbig-O3 -march=core-avx-i -ljbig-O3 -march=core-avx2 -ljbig-O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt-march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2013-06-08H.264 Video Encodingnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz306090120150SE +/- 0.30, N = 5SE +/- 0.55, N = 5SE +/- 0.51, N = 5SE +/- 0.20, N = 5SE +/- 0.50, N = 5SE +/- 0.90, N = 5SE +/- 0.52, N = 5SE +/- 0.20, N = 5SE +/- 0.12, N = 5156.80156.74156.06155.63156.08155.18158.19157.8584.66-march=nocona-march=core2-march=corei7-march=corei7-avx-march=core-avx-i-march=core-avx2-lavformat -lavcodec -lavutil -lswscale-lavformat -lavcodec -lavutil -lswscale -march=core-avx-i1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.2zPhong Rendering With Soft-Shadow Mappingnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz306090120150SE +/- 0.39, N = 3SE +/- 0.76, N = 3SE +/- 0.45, N = 3SE +/- 0.66, N = 3SE +/- 0.36, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.81, N = 3SE +/- 0.00, N = 3122.02121.58123.14117.71116.54119.78148.75148.590.93-march=nocona -flto-march=core2 -flto-march=corei7 -flto-march=corei7-avx -flto-march=core-avx-i -flto-march=core-avx2 -flto-lpthread-march=core-avx-i -lpthread1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorizationnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz5001000150020002500SE +/- 21.90, N = 4SE +/- 5.53, N = 4SE +/- 3.12, N = 4SE +/- 22.67, N = 4SE +/- 23.35, N = 4SE +/- 28.95, N = 4SE +/- 3.08, N = 4SE +/- 2.64, N = 4SE +/- 1.76, N = 41825.731859.971863.191851.101824.281817.032386.292378.31865.11

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transformnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz80160240320400SE +/- 2.50, N = 4SE +/- 0.67, N = 4SE +/- 0.86, N = 4SE +/- 1.22, N = 4SE +/- 2.13, N = 4SE +/- 2.02, N = 4SE +/- 0.34, N = 4SE +/- 0.00, N = 4SE +/- 0.10, N = 4245.07250.93249.11251.86247.35226.57339.88346.4193.72

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlonoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz130260390520650SE +/- 0.72, N = 4SE +/- 0.51, N = 4SE +/- 0.44, N = 4SE +/- 0.44, N = 4SE +/- 0.44, N = 4SE +/- 20.17, N = 8SE +/- 0.00, N = 4SE +/- 0.00, N = 4SE +/- 8.41, N = 8615.33616.21616.65616.65615.76596.16553.48553.48325.95

Botan

Test: CAST-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: CAST-256noconacore2corei7corei7-avxcore-avx-icore-avx2Q9300@3.33GHz2040608010095.4895.8095.5495.7795.7995.7675.731. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: AES-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: AES-256noconacore2corei7corei7-avxcore-avx-icore-avx2Q9300@3.33GHz50100150200250157.97158.35157.96158.19158.31158.43227.291. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: Tiger

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: Tigernoconacore2corei7corei7-avxcore-avx-icore-avx2Q9300@3.33GHz100200300400500438.78438.87427.31442.47440.37424.56331.701. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Searchnoconacore2corei7corei7-avxcore-avx-icore-avx2testi7-3770K core-avx-iQ9300@3.33GHz510152025SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 310.1610.1410.2210.6210.4510.5510.139.8718.74-O3 -march=nocona-O3 -march=core2-O3 -march=corei7-O3 -march=corei7-avx-O3 -march=core-avx-i-O3 -march=core-avx2-O2-march=core-avx-i -O3-O21. (CC) gcc options: -pthread -lhmmer -lsquid -lm


Phoronix Test Suite v10.8.4