Intel Haswell GCC 4.8 core-avx2 Tuning

Testing an Intel Core i7 4770K with different CFLAGS/CXXFLAGS to examine the core-avx2 (Haswell) optimizations in the GCC 4.8.1 compiler. Benchmarks by Michael Larabel of Phoronix for a future article.
HTML result view exported from: https://openbenchmarking.org/result/1309148-SO-1309136DA35&sor&grr .
System Details

Configurations nocona, core2, corei7, corei7-avx, core-avx-i, core-avx2:
  Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores)
  Motherboard: Intel DH87RL
  Chipset: Intel Haswell DRAM
  Memory: 15360MB
  Disk: 240GB OCZ VERTEX3
  Graphics: Intel Haswell IGP
  Audio: Intel Haswell HDMI
  Monitor: VA2431
  Network: Intel Connection I217-V
  OS: Ubuntu 13.04
  Kernel: 3.10.0-999-generic (x86_64)
  Desktop: Unity 7.0.0
  Display Server: X Server 1.13.3
  Display Driver: intel 2.21.9
  OpenGL: 3.0 Mesa 9.2.0-devel (git-a2e3b1c)
  Compiler: GCC 4.8.1 + LLVM 3.2
  File-System: ext4
  Screen Resolution: 1920x1080

Configurations test, i7-3770K core-avx-i:
  Processor: Intel Core i7-3770K @ 3.90GHz (8 Cores)
  Motherboard: ASRock Z77 Pro4-M
  Memory: 16384MB
  Disk: 256GB OCZ VECTOR + 2 x 1000GB SAMSUNG HD103UJ + 80GB INTEL SSDSA2M080
  Graphics: Gallium 0.4 on AMD TAHITI 3072MB (810/1250MHz)
  Monitor: LCD3090WQXi
  OS: Gentoo Base 2.2
  Kernel: 3.11.0-drmfixes20130912-core-avx-i (x86_64)
  Desktop: KDE
  Display Server: X Server 1.14.2.902 (1.14.3 RC 2)
  Display Driver: radeon 7.2.99
  OpenGL: 3.0 Mesa 9.3.0-devel (git-f4e35f8) Gallium 0.4
  Compiler: GCC 4.8.1 + Clang 3.4 + LLVM 3.4svn
  Screen Resolution: 2560x1600

Configuration [email protected]:
  Processor: Intel Core 2 Quad Q9300 @ 3.33GHz (4 Cores)
  Motherboard: ASUS P5K3 Deluxe
  Chipset: Intel 82G33/G31/P35/P31 + ICH9R
  Memory: 8192MB
  Disk: 1000GB Seagate ST31000340AS
  Graphics: LLVMpipe
  Audio: Analog Devices AD1988B
  Monitor: SyncMaster
  Network: Marvell 88E8056 PCI-E Gigabit
  OS: Slackware 14.0
  Kernel: 3.2.45 (x86_64)
  Display Server: X Server 1.12.4
  Display Driver: nouveau 0.0.16
  OpenGL: 2.1 Mesa 8.0.4 Gallium 0.4
  Compiler: GCC 4.7.1 + Clang 3.0 + LLVM 3.0
  Screen Resolution: 1680x1050

Compiler Details
- nocona, core2, corei7, corei7-avx, core-avx-i, core-avx2: --enable-checking=release --enable-languages=c,c++,fortran
- test, i7-3770K core-avx-i: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/4.8.1 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1 --disable-altivec --disable-fixed-point --disable-isl-version-check --disable-libgcj --disable-libssp --disable-lto --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-languages=c,c++,fortran --enable-libgomp --enable-libmudflap --enable-libstdcxx-time --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/4.8.1/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/man --with-cloog --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/4.8.1/python
- [email protected]: --build=x86_64-slackware-linux --disable-gtktest --disable-libunwind-exceptions --disable-multilib --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-java-awt=gtk --enable-java-home --enable-languages=ada,c,c++,fortran,go,java,lto,objc --enable-libssp --enable-lto --enable-objc-gc --enable-shared --enable-threads=posix --host=x86_64-slackware-linux --mandir=/usr/man --target=x86_64-slackware-linux --verbose --with-antlr-jar=/slack/TMPTMPTMP/gcc-round-two/antlr-runtime-3.4.jar --with-arch-directory=amd64 --with-gnu-ld --with-java-home=/usr/lib64/jvm/jre --with-jvm-jar-dir=/usr/lib64/jvm/jvm-exports --with-jvm-root-dir=/usr/lib64/jvm --with-python-dir=/lib64/python2.7/site-packages

Processor Details
- nocona, core2, corei7, corei7-avx, core-avx-i, core-avx2: Scaling Governor: acpi-cpufreq ondemand
- test, i7-3770K core-avx-i: Scaling Governor: intel_pstate powersave

Result Overview

Columns, left to right: (1) nocona, (2) core2, (3) corei7, (4) corei7-avx, (5) core-avx-i, (6) core-avx2, (7) test, (8) i7-3770K core-avx-i, (9) [email protected]. A dash marks a test with no result for that configuration.

         (1)       (2)       (3)       (4)       (5)       (6)       (7)       (8)       (9)
apache: Static Web Page Serving
    24888.11  25606.17  25490.14  25580.44  25549.84  25644.10  23897.32  23771.72  12787.34
ffmpeg: H.264 HD To NTSC DV
       12.94     13.16     12.93     12.86     13.00     13.01     11.86     11.89     18.06
smallpt: Global Illumination Renderer; 100 Samples
          26        26        26        26        26        24        87        25       172
c-ray: Total Time
       23.07     22.95     22.95     22.84     22.83     17.02     27.78     28.18     44.15
build-linux-kernel: Time To Compile
       97.89     97.63     97.77     98.10     97.85     97.25     89.94     89.90    144.27
build-imagemagick: Time To Compile
       76.98     79.03     79.64     80.91     81.06     80.66     59.51     64.08    106.30
himeno: Poisson Pressure Solver
     1517.03   1564.22   1560.18   1404.92   1630.12   1282.30   1686.65   1677.67   1190.98
graphics-magick: Local Adaptive Thresholding
         118       120       120       119       120       121       123       116         -
graphics-magick: Resizing
         157       160       160       166       167       182       161       167         -
graphics-magick: Sharpen
          83        84        84        96        96       136        83        95         -
graphics-magick: Blur
         115       117       116       122       122       138       132       138         -
x264: H.264 Video Encoding
      156.80    156.74    156.06    155.63    156.08    155.18    158.19    157.85     84.66
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
      122.02    121.58    123.14    117.71    116.54    119.78    148.75    148.59      0.93
scimark2: Dense LU Matrix Factorization
     1825.73   1859.97   1863.19   1851.10   1824.28   1817.03   2386.29   2378.31    865.11
scimark2: Fast Fourier Transform
      245.07    250.93    249.11    251.86    247.35    226.57    339.88    346.41     93.72
scimark2: Monte Carlo
      615.33    616.21    616.65    616.65    615.76    596.16    553.48    553.48    325.95
botan: CAST-256
       95.48     95.80     95.54     95.77     95.79     95.76         -         -     75.73
botan: AES-256
      157.97    158.35    157.96    158.19    158.31    158.43         -         -    227.29
botan: Tiger
      438.78    438.87    427.31    442.47    440.37    424.56         -         -    331.70
hmmer: Pfam Database Search
       10.16     10.14     10.22     10.62     10.45     10.55     10.13      9.87     18.74

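The overview mixes "more is better" and "fewer is better" units, so comparing two configurations is easier once the numbers are turned into relative changes. The following is a minimal sketch of that arithmetic, not something the Phoronix Test Suite itself provides: the `relative_gain` helper and the `RESULTS` layout are hypothetical names chosen for illustration, and the figures are the nocona and core-avx2 results copied from four of the tests above.

```python
# Illustrative only: values copied from the result overview for the
# nocona and core-avx2 configurations on the Core i7-4770K.
# Each entry: (nocona result, core-avx2 result, higher_is_better)
RESULTS = {
    "c-ray: Total Time (seconds)": (23.07, 17.02, False),
    "graphics-magick: Sharpen (iterations/minute)": (83, 136, True),
    "himeno: Poisson Pressure Solver (MFLOPS)": (1517.03, 1282.30, True),
    "scimark2: Fast Fourier Transform (Mflops)": (245.07, 226.57, True),
}


def relative_gain(baseline, candidate, higher_is_better):
    """Percent improvement of candidate over baseline; negative means a regression."""
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    # For timings, express the change as a speedup so a positive number is still "better".
    return (baseline / candidate - 1.0) * 100.0


for name, (base, cand, higher_is_better) in RESULTS.items():
    gain = relative_gain(base, cand, higher_is_better)
    print(f"{name}: core-avx2 vs nocona {gain:+.1f}%")
```

Expressing timings as a speedup (baseline divided by candidate) keeps the sign convention the same for both kinds of unit, so a positive percentage always favors the candidate configuration.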
Apache Benchmark 2.4.3
Static Web Page Serving
Requests Per Second, More Is Better

  core-avx2: 25644.10 (SE +/- 170.85, N = 3), options: -O3 -march=core-avx2
  core2: 25606.17 (SE +/- 229.80, N = 3), options: -O3 -march=core2
  corei7-avx: 25580.44 (SE +/- 178.37, N = 3), options: -O3 -march=corei7-avx
  core-avx-i: 25549.84 (SE +/- 126.25, N = 3), options: -O3 -march=core-avx-i
  corei7: 25490.14 (SE +/- 193.34, N = 3), options: -O3 -march=corei7
  nocona: 24888.11 (SE +/- 107.43, N = 3), options: -O3 -march=nocona
  test: 23897.32 (SE +/- 94.17, N = 3), options: -O2
  i7-3770K core-avx-i: 23771.72 (SE +/- 100.68, N = 3), options: -march=core-avx-i -O3
  [email protected]: 12787.34 (SE +/- 65.23, N = 3), options: -O2

1. (CC) gcc options: -shared -fPIC -pthread

FFmpeg 1.1
H.264 HD To NTSC DV
Seconds, Fewer Is Better

  test: 11.86 (SE +/- 0.06, N = 3), options: -lva -lpthread -lrt
  i7-3770K core-avx-i: 11.89 (SE +/- 0.03, N = 3), options: -lva -lpthread -lrt -march=core-avx-i
  corei7-avx: 12.86 (SE +/- 0.06, N = 3), options: -march=corei7-avx
  corei7: 12.93 (SE +/- 0.01, N = 3), options: -march=corei7
  nocona: 12.94 (SE +/- 0.07, N = 3), options: -march=nocona
  core-avx-i: 13.00 (SE +/- 0.05, N = 3), options: -march=core-avx-i
  core-avx2: 13.01 (SE +/- 0.12, N = 3), options: -march=core-avx2
  core2: 13.16 (SE +/- 0.09, N = 3), options: -march=core2
  [email protected]: 18.06 (SE +/- 0.12, N = 3), options: -lpthread -lrt

1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -std=c99 -fomit-frame-pointer -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better

  core-avx2: 24 (SE +/- 0.33, N = 3), options: -O3 -march=core-avx2
  i7-3770K core-avx-i: 25 (SE +/- 0.33, N = 3), options: -march=core-avx-i -O3
  nocona: 26 (SE +/- 0.00, N = 3), options: -O3 -march=nocona
  core2: 26 (SE +/- 0.00, N = 3), options: -O3 -march=core2
  corei7: 26 (SE +/- 0.33, N = 3), options: -O3 -march=corei7
  corei7-avx: 26 (SE +/- 0.33, N = 3), options: -O3 -march=corei7-avx
  core-avx-i: 26 (SE +/- 0.33, N = 3), options: -O3 -march=core-avx-i
  test: 87 (SE +/- 0.00, N = 3)
  [email protected]: 172 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -fopenmp

C-Ray 1.1
Total Time
Seconds, Fewer Is Better

  core-avx2: 17.02 (SE +/- 0.00, N = 3), options: -march=core-avx2
  core-avx-i: 22.83 (SE +/- 0.00, N = 3), options: -march=core-avx-i
  corei7-avx: 22.84 (SE +/- 0.01, N = 3), options: -march=corei7-avx
  core2: 22.95 (SE +/- 0.01, N = 3), options: -march=core2
  corei7: 22.95 (SE +/- 0.01, N = 3), options: -march=corei7
  nocona: 23.07 (SE +/- 0.02, N = 3), options: -march=nocona
  test: 27.78 (SE +/- 0.00, N = 3)
  i7-3770K core-avx-i: 28.18 (SE +/- 0.02, N = 3), options: -march=core-avx-i
  [email protected]: 44.15 (SE +/- 0.01, N = 3)

1. (CC) gcc options: -lm -lpthread -O3

Timed Linux Kernel Compilation 3.1
Time To Compile
Seconds, Fewer Is Better

  i7-3770K core-avx-i: 89.90 (SE +/- 0.59, N = 3)
  test: 89.94 (SE +/- 0.79, N = 3)
  core-avx2: 97.25 (SE +/- 0.60, N = 3)
  core2: 97.63 (SE +/- 0.54, N = 3)
  corei7: 97.77 (SE +/- 0.69, N = 3)
  core-avx-i: 97.85 (SE +/- 0.76, N = 3)
  nocona: 97.89 (SE +/- 0.59, N = 3)
  corei7-avx: 98.10 (SE +/- 0.54, N = 3)
  [email protected]: 144.27 (SE +/- 2.13, N = 5)

Timed ImageMagick Compilation 6.8.1-10
Time To Compile
Seconds, Fewer Is Better

  test: 59.51 (SE +/- 0.08, N = 3)
  i7-3770K core-avx-i: 64.08 (SE +/- 0.23, N = 3)
  nocona: 76.98 (SE +/- 0.04, N = 3)
  core2: 79.03 (SE +/- 0.06, N = 3)
  corei7: 79.64 (SE +/- 0.18, N = 3)
  core-avx2: 80.66 (SE +/- 0.32, N = 3)
  corei7-avx: 80.91 (SE +/- 0.10, N = 3)
  core-avx-i: 81.06 (SE +/- 0.31, N = 3)
  [email protected]: 106.30 (SE +/- 0.23, N = 3)

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS, More Is Better

  test: 1686.65 (SE +/- 0.98, N = 3)
  i7-3770K core-avx-i: 1677.67 (SE +/- 1.04, N = 3), options: -march=core-avx-i
  core-avx-i: 1630.12 (SE +/- 1.05, N = 3), options: -march=core-avx-i
  core2: 1564.22 (SE +/- 3.07, N = 3), options: -march=core2
  corei7: 1560.18 (SE +/- 0.75, N = 3), options: -march=corei7
  nocona: 1517.03 (SE +/- 1.20, N = 3), options: -march=nocona
  corei7-avx: 1404.92 (SE +/- 105.46, N = 6), options: -march=corei7-avx
  core-avx2: 1282.30 (SE +/- 19.87, N = 6), options: -march=core-avx2
  [email protected]: 1190.98 (SE +/- 2.83, N = 3)

1. (CC) gcc options: -O3

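The Himeno standard errors differ by two orders of magnitude between configurations (for example SE +/- 1.20 for nocona against SE +/- 105.46 for corei7-avx), so some of the gaps in this chart may be run-to-run noise. One rough way to check is to divide the difference between two means by their combined standard error. The sketch below is a common rule of thumb, not a method used by the source; `combined_se_ratio` and the threshold of 2.0 are assumptions made for illustration, and it assumes the SE entries are listed in the same order as the configuration labels.

```python
import math


def combined_se_ratio(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means in units of their combined standard error."""
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# (mean MFLOPS, standard error) as reported for the Himeno benchmark.
nocona = (1517.03, 1.20)
corei7_avx = (1404.92, 105.46)
core_avx2 = (1282.30, 19.87)

for name, result in (("corei7-avx", corei7_avx), ("core-avx2", core_avx2)):
    ratio = combined_se_ratio(*nocona, *result)
    # Assumed threshold: a gap above roughly two combined SEs is unlikely to be noise.
    verdict = "likely a real difference" if ratio > 2.0 else "within run-to-run noise"
    print(f"nocona vs {name}: {ratio:.2f} combined SEs ({verdict})")
```

With only three to six runs per configuration, this is a coarse screen rather than a formal significance test.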
GraphicsMagick 1.3.16
Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better

  test: 123 (SE +/- 0.33, N = 3), options: -O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  core-avx2: 121 (SE +/- 0.00, N = 3), options: -O3 -march=core-avx2 -ljbig
  core-avx-i: 120 (SE +/- 0.33, N = 3), options: -O3 -march=core-avx-i -ljbig
  corei7: 120 (SE +/- 0.00, N = 3), options: -O3 -march=corei7 -ljbig
  core2: 120 (SE +/- 0.33, N = 3), options: -O3 -march=core2 -ljbig
  corei7-avx: 119 (SE +/- 0.00, N = 3), options: -O3 -march=corei7-avx -ljbig
  nocona: 118 (SE +/- 0.33, N = 3), options: -O3 -march=nocona -ljbig
  i7-3770K core-avx-i: 116 (SE +/- 0.33, N = 3), options: -march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt

1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Resizing
Iterations Per Minute, More Is Better

  core-avx2: 182 (SE +/- 0.33, N = 3), options: -O3 -march=core-avx2 -ljbig
  i7-3770K core-avx-i: 167 (SE +/- 0.00, N = 3), options: -march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  core-avx-i: 167 (SE +/- 0.00, N = 3), options: -O3 -march=core-avx-i -ljbig
  corei7-avx: 166 (SE +/- 0.33, N = 3), options: -O3 -march=corei7-avx -ljbig
  test: 161 (SE +/- 0.33, N = 3), options: -O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  corei7: 160 (SE +/- 0.00, N = 3), options: -O3 -march=corei7 -ljbig
  core2: 160 (SE +/- 0.00, N = 3), options: -O3 -march=core2 -ljbig
  nocona: 157 (SE +/- 0.33, N = 3), options: -O3 -march=nocona -ljbig

1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Sharpen
Iterations Per Minute, More Is Better

  core-avx2: 136 (SE +/- 0.00, N = 3), options: -O3 -march=core-avx2 -ljbig
  core-avx-i: 96 (SE +/- 0.00, N = 3), options: -O3 -march=core-avx-i -ljbig
  corei7-avx: 96 (SE +/- 0.00, N = 3), options: -O3 -march=corei7-avx -ljbig
  i7-3770K core-avx-i: 95 (SE +/- 0.00, N = 3), options: -march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  corei7: 84 (SE +/- 0.00, N = 3), options: -O3 -march=corei7 -ljbig
  core2: 84 (SE +/- 0.00, N = 3), options: -O3 -march=core2 -ljbig
  test: 83 (SE +/- 0.00, N = 3), options: -O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  nocona: 83 (SE +/- 0.00, N = 3), options: -O3 -march=nocona -ljbig

1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Blur
Iterations Per Minute, More Is Better

  i7-3770K core-avx-i: 138 (SE +/- 0.00, N = 3), options: -march=core-avx-i -O3 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  core-avx2: 138 (SE +/- 0.88, N = 3), options: -O3 -march=core-avx2 -ljbig
  test: 132 (SE +/- 0.00, N = 3), options: -O2 -llcms2 -ltiff -lfreetype -lxml2 -lrt
  core-avx-i: 122 (SE +/- 0.67, N = 3), options: -O3 -march=core-avx-i -ljbig
  corei7-avx: 122 (SE +/- 1.00, N = 3), options: -O3 -march=corei7-avx -ljbig
  core2: 117 (SE +/- 0.00, N = 3), options: -O3 -march=core2 -ljbig
  corei7: 116 (SE +/- 1.00, N = 3), options: -O3 -march=corei7 -ljbig
  nocona: 115 (SE +/- 0.00, N = 3), options: -O3 -march=nocona -ljbig

1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

x264 2013-06-08
H.264 Video Encoding
Frames Per Second, More Is Better

  test: 158.19 (SE +/- 0.52, N = 5), options: -lavformat -lavcodec -lavutil -lswscale
  i7-3770K core-avx-i: 157.85 (SE +/- 0.20, N = 5), options: -lavformat -lavcodec -lavutil -lswscale -march=core-avx-i
  nocona: 156.80 (SE +/- 0.30, N = 5), options: -march=nocona
  core2: 156.74 (SE +/- 0.55, N = 5), options: -march=core2
  core-avx-i: 156.08 (SE +/- 0.50, N = 5), options: -march=core-avx-i
  corei7: 156.06 (SE +/- 0.51, N = 5), options: -march=corei7
  corei7-avx: 155.63 (SE +/- 0.20, N = 5), options: -march=corei7-avx
  core-avx2: 155.18 (SE +/- 0.90, N = 5), options: -march=core-avx2
  [email protected]: 84.66 (SE +/- 0.12, N = 5)

1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

TTSIOD 3D Renderer 2.2z
Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better

  test: 148.75 (SE +/- 0.27, N = 3), options: -lpthread
  i7-3770K core-avx-i: 148.59 (SE +/- 0.81, N = 3), options: -march=core-avx-i -lpthread
  corei7: 123.14 (SE +/- 0.45, N = 3), options: -march=corei7 -flto
  nocona: 122.02 (SE +/- 0.39, N = 3), options: -march=nocona -flto
  core2: 121.58 (SE +/- 0.76, N = 3), options: -march=core2 -flto
  core-avx2: 119.78 (SE +/- 0.09, N = 3), options: -march=core-avx2 -flto
  corei7-avx: 117.71 (SE +/- 0.66, N = 3), options: -march=corei7-avx -flto
  core-avx-i: 116.54 (SE +/- 0.36, N = 3), options: -march=core-avx-i -flto
  [email protected]: 0.93 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better

  test: 2386.29 (SE +/- 3.08, N = 4)
  i7-3770K core-avx-i: 2378.31 (SE +/- 2.64, N = 4)
  corei7: 1863.19 (SE +/- 3.12, N = 4)
  core2: 1859.97 (SE +/- 5.53, N = 4)
  corei7-avx: 1851.10 (SE +/- 22.67, N = 4)
  nocona: 1825.73 (SE +/- 21.90, N = 4)
  core-avx-i: 1824.28 (SE +/- 23.35, N = 4)
  core-avx2: 1817.03 (SE +/- 28.95, N = 4)
  [email protected]: 865.11 (SE +/- 1.76, N = 4)

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops, More Is Better

  i7-3770K core-avx-i: 346.41 (SE +/- 0.00, N = 4)
  test: 339.88 (SE +/- 0.34, N = 4)
  corei7-avx: 251.86 (SE +/- 1.22, N = 4)
  core2: 250.93 (SE +/- 0.67, N = 4)
  corei7: 249.11 (SE +/- 0.86, N = 4)
  core-avx-i: 247.35 (SE +/- 2.13, N = 4)
  nocona: 245.07 (SE +/- 2.50, N = 4)
  core-avx2: 226.57 (SE +/- 2.02, N = 4)
  [email protected]: 93.72 (SE +/- 0.10, N = 4)

SciMark 2.0
Computational Test: Monte Carlo
Mflops, More Is Better

  corei7-avx: 616.65 (SE +/- 0.44, N = 4)
  corei7: 616.65 (SE +/- 0.44, N = 4)
  core2: 616.21 (SE +/- 0.51, N = 4)
  core-avx-i: 615.76 (SE +/- 0.44, N = 4)
  nocona: 615.33 (SE +/- 0.72, N = 4)
  core-avx2: 596.16 (SE +/- 20.17, N = 8)
  i7-3770K core-avx-i: 553.48 (SE +/- 0.00, N = 4)
  test: 553.48 (SE +/- 0.00, N = 4)
  [email protected]: 325.95 (SE +/- 8.41, N = 8)

Botan 1.10.3
Test: CAST-256
Mbytes/s, More Is Better

  core2: 95.80
  core-avx-i: 95.79
  corei7-avx: 95.77
  core-avx2: 95.76
  corei7: 95.54
  nocona: 95.48
  [email protected]: 75.73

1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan 1.10.3
Test: AES-256
Mbytes/s, More Is Better

  [email protected]: 227.29
  core-avx2: 158.43
  core2: 158.35
  core-avx-i: 158.31
  corei7-avx: 158.19
  nocona: 157.97
  corei7: 157.96

1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan 1.10.3
Test: Tiger
Mbytes/s, More Is Better

  corei7-avx: 442.47
  core-avx-i: 440.37
  core2: 438.87
  nocona: 438.78
  corei7: 427.31
  core-avx2: 424.56
  [email protected]: 331.70

1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Timed HMMer Search 2.3.2
Pfam Database Search
Seconds, Fewer Is Better

  i7-3770K core-avx-i: 9.87 (SE +/- 0.01, N = 3), options: -march=core-avx-i -O3
  test: 10.13 (SE +/- 0.03, N = 3), options: -O2
  core2: 10.14 (SE +/- 0.01, N = 3), options: -O3 -march=core2
  nocona: 10.16 (SE +/- 0.06, N = 3), options: -O3 -march=nocona
  corei7: 10.22 (SE +/- 0.06, N = 3), options: -O3 -march=corei7
  core-avx-i: 10.45 (SE +/- 0.04, N = 3), options: -O3 -march=core-avx-i
  core-avx2: 10.55 (SE +/- 0.17, N = 3), options: -O3 -march=core-avx2
  corei7-avx: 10.62 (SE +/- 0.02, N = 3), options: -O3 -march=corei7-avx
  [email protected]: 18.74 (SE +/- 0.03, N = 3), options: -O2

1. (CC) gcc options: -pthread -lhmmer -lsquid -lm

Phoronix Test Suite v10.8.5