Intel Core i9-7980XE testing with GCC 9 compiler by Michael Larabel.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1905120-HV-COREII79889 Core i9 7980XE GCC 9 Compiler Benchmarks - Phoronix Test Suite Core i9 7980XE GCC 9 Compiler Benchmarks Intel Core i9-7980XE testing with GCC 9 compiler by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1905120-HV-COREII79889&obr_sgm=y&obr_sor=y&obr_rro=y&grt&rdt .
Core i9 7980XE GCC 9 Compiler Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads) ASUS PRIME X299-A (1704 BIOS) Intel Sky Lake-E DMI3 Registers 16384MB Samsung SSD 970 EVO 500GB NVIDIA NV120 12GB Realtek ALC1220 ASUS PB278 Intel I219-V Clear Linux OS 29350 5.0.14-753.native (x86_64) GNOME Shell 3.32.1 X Server 1.20.4 nouveau 1.0.16 4.3 Mesa 19.1.0-devel GCC 9.1.1 20190509 gcc-9-branch@271025 + clang (Clear Linux OS for Intel Architecture) 9.1.1 20190509 gcc-9-branch@271025 + LLVM 8.0.0 ext4 2560x1440 OpenBenchmarking.org Environment Details - -O3 -march=skylake: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx" - -O3 -march=skylake-avx512 -mprefer-vector-width=512: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=512 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx" - -O3 -march=skylake-avx512 -mprefer-vector-width=128: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=128 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx" - -O3 -march=skylake-avx512 -mprefer-vector-width=256: CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags CXXFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 MESA_GLSL_CACHE_DISABLE=0 CFLAGS=-O3-march=skylake-avx512-mprefer-vector-width=256 THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx" Compiler Details - --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell Processor Details - Scaling Governor: intel_pstate performance Python Details - Python 3.7.3 Security Details - KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable
Core i9 7980XE GCC 9 Compiler Benchmarks compress-7zip: Compress Speed Test aobench: 2048 x 2048 - Total Time apache: Static Web Page Serving blake2: botan: KASUMI - Encrypt botan: KASUMI - Decrypt botan: AES-256 - Encrypt botan: AES-256 - Decrypt botan: Twofish - Encrypt botan: Twofish - Decrypt botan: Blowfish - Encrypt botan: Blowfish - Decrypt botan: CAST-256 - Encrypt botan: CAST-256 - Decrypt c-ray: Total Time - 4K, 16 Rays Per Pixel clomp: Static OMP Speedup cloverleaf: Lagrangian-Eulerian Hydrodynamics cpp-perf-bench: Atol cpp-perf-bench: Ctype cpp-perf-bench: Math Library cpp-perf-bench: Rand Numbers cpp-perf-bench: Stepanov Vector cpp-perf-bench: Function Objects cpp-perf-bench: Stepanov Abstraction cpuminer-opt: m7m cpuminer-opt: deep cpuminer-opt: lbry cpuminer-opt: skein cpuminer-opt: myr-gr cpuminer-opt: sha256t crafty: Elapsed Time dav1d: Summer Nature 4K dav1d: Summer Nature 1080p ffmpeg: H.264 HD To NTSC DV fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 4096 encode-flac: WAV To FLAC graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space himeno: Poisson Pressure Solver encode-mp3: WAV To MP3 tjbench: Decompression Throughput luajit: Composite lzbench: XZ 0 - Compression lzbench: XZ 0 - Decompression lzbench: Zstd 1 - Compression lzbench: Zstd 1 - Decompression lzbench: Brotli 0 - Compression lzbench: Brotli 0 - Decompression lzbench: Libdeflate 1 - Compression lzbench: Libdeflate 1 - Decompression m-queens: Time To Solve mcperf: Add mcperf: Get mcperf: Set mcperf: Append mcperf: Delete mcperf: Prepend mcperf: Replace mkl-dnn: IP Batch 1D - f32 mkl-dnn: Deconvolution Batch deconv_1d - f32 mkl-dnn: Convolution Batch conv_alexnet - f32 mkl-dnn: Deconvolution Batch deconv_all - f32 mkl-dnn: Convolution Batch conv_googlenet_v3 - f32 npb: BT.A npb: EP.C npb: FT.A npb: FT.B npb: LU.A npb: LU.C npb: SP.A nginx: Static Web Page Serving parboil: OpenMP LBM parboil: OpenMP CUTCP parboil: OpenMP Stencil parboil: OpenMP MRI Gridding polybench-c: 3 Matrix Multiplications pgbench: Buffer Test - Normal Load - Read Only pgbench: Buffer Test - Normal Load - Read Write primesieve: 1e12 Prime Number Generation redis: LPOP redis: SADD redis: LPUSH redis: GET redis: SET rodinia: OpenMP LavaMD rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster stockfish: Total Time svt-av1: 1080p 8-bit YUV To AV1 Video Encode svt-hevc: 1080p 8-bit YUV To HEVC Video Encode svt-vp9: 1080p 8-bit YUV To VP9 Video Encode t-test1: 2 hmmer: Pfam Database Search build-linux-kernel: Time To Compile mrbayes: Primate Phylogeny Analysis build-php: Time To Compile tscp: AI Chess Performance ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping x264: H.264 Video Encoding x265: H.265 1080p Video Encoding xsbench: compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 100157 31.17 36040 4.02 96.02 95.01 4220 4212 400 393 295 293 145 145 30.87 2.12 2.32 57.08 29.69 260 846 82.98 13.62 32.49 499 15323 48547 56227 16360 125653 8852846 19.15 6.93 5.45 6327 17183 8.77 252 291 204 227 277 198 288 3302 9.32 196 1606 39 108 451 1137 430 620 248 1189 48.42 72123 117212 72585 78294 120897 77484 78460 20.79 23.03 1822 10446 752 4391 2038 8581 9376 37394 33669 4062 47570 68.54 2.38 7.74 132 2.64 520100 13311 12.79 3140283 2339849 2059237 2958563 2163293 16.96 11.52 13.63 47112136 24.44 304 342 5.93 3.49 36.34 65.67 51.80 1368057 710 139 53.71 5164164 15.88 9.71 100416 31.29 33240 4.02 96.25 95.30 3835 3839 401 395 295 294 145 146 35.86 2.12 2.30 57.06 30.56 263 846 98.92 13.39 37.27 479 14490 53620 62027 15310 124567 8840945 19.67 7.05 5.45 5673 17334 9.14 228 250 186 196 243 195 253 2814 10.19 197 1608 39 108 452 1138 428 621 249 1191 48.41 73346 112844 72940 77932 115719 79234 78798 20.66 23.07 1819 10424 751 4391 2113 8543 9387 37305 33676 4080 44072 68.18 2.41 7.80 132 2.63 498539 13424 12.96 3059844 2383273 2026656 3002750 2135179 16.98 11.71 14.12 46997822 24.43 304 337 5.82 3.88 37.36 65.66 52.19 1373514 669 140 50.05 5166043 15.99 9.76 99546 31.15 35982 4.02 96.22 95.26 3847 3840 401 395 295 294 145 146 33.34 2.11 2.29 57.07 30.57 262 821 82.88 13.28 32.44 503 15497 52793 62813 15560 124672 8840395 19.47 6.97 5.50 6863 17522 8.76 251 290 207 225 277 197 289 3310 9.05 196 1605 39 107 451 1138 429 620 249 1190 48.42 73307 107497 73045 78021 122131 77547 78432 20.84 23.01 1820 10397 749 4773 2090 8581 9337 39791 34962 4073 46938 68.35 2.38 7.98 135 2.61 519930 13495 12.80 3095145 2624995 1962269 2992962 2189741 16.99 11.47 13.72 46810779 24.37 339 346 5.81 3.40 36.33 63.06 51.84 1388054 738 140 53.34 5171354 15.84 9.59 99960 31.11 35946 4.02 96.21 95.17 3833 3855 401 395 295 293 145 146 33.47 2.12 2.29 57.00 30.89 264 857 83.16 13.70 32.44 506 15490 53473 62560 15695 124030 8845689 19.34 6.95 5.47 6615 17326 8.66 251 290 206 224 275 198 288 3292 9.30 194 1607 39 107 452 1137 429 620 249 1191 48.40 72947 117621 72688 77703 114714 79076 77373 20.74 22.93 1819 10405 751 4784 2109 8544 9331 39343 35007 4185 49221 68.26 2.38 7.77 134 2.63 521901 13576 12.78 3139485 2449355 2013442 2944968 1971745 16.95 11.31 13.88 47046623 24.34 337 347 5.82 3.50 36.93 63.20 51.90 1380406 729 140 53.97 5140078 15.80 9.69 OpenBenchmarking.org
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 20K 40K 60K 80K 100K SE +/- 264.40, N = 3 SE +/- 414.93, N = 3 SE +/- 201.98, N = 3 SE +/- 174.10, N = 3 99546 99960 100157 100416 1. (CXX) g++ options: -pipe -lpthread
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 7 14 21 28 35 SE +/- 0.15, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 31.29 31.17 31.15 31.11 -march=skylake 1. (CC) gcc options: -lm -O3
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 8K 16K 24K 32K 40K SE +/- 36.99, N = 3 SE +/- 63.22, N = 3 SE +/- 144.00, N = 3 SE +/- 52.04, N = 3 33240 35946 35982 36040 -march=skylake 1. (CC) gcc options: -shared -fPIC -pthread -O3
BLAKE2 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20170307 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 0.9045 1.809 2.7135 3.618 4.5225 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.02 4.02 4.02 4.02 1. (CC) gcc options: -O3 -march=native -lcrypto -lz
Botan Test: KASUMI - Encrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: KASUMI - Encrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 96.02 96.21 96.22 96.25 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: KASUMI - Decrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 95.01 95.17 95.26 95.30 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 - Encrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Encrypt -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 900 1800 2700 3600 4500 SE +/- 1.21, N = 3 SE +/- 1.37, N = 3 SE +/- 0.98, N = 3 SE +/- 0.41, N = 3 3833 3835 3847 4220 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: AES-256 - Decrypt -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 900 1800 2700 3600 4500 SE +/- 1.97, N = 3 SE +/- 2.47, N = 3 SE +/- 1.90, N = 3 SE +/- 0.57, N = 3 3839 3840 3855 4212 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Encrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Encrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 90 180 270 360 450 SE +/- 1.99, N = 3 SE +/- 0.17, N = 3 SE +/- 0.12, N = 3 SE +/- 0.34, N = 3 400 401 401 401 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Twofish - Decrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 90 180 270 360 450 SE +/- 1.58, N = 3 SE +/- 0.07, N = 3 SE +/- 0.13, N = 3 SE +/- 0.21, N = 3 393 395 395 395 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Encrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Encrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 60 120 180 240 300 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.22, N = 3 SE +/- 0.17, N = 3 295 295 295 295 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: Blowfish - Decrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 60 120 180 240 300 SE +/- 0.34, N = 3 SE +/- 0.18, N = 3 SE +/- 0.03, N = 3 SE +/- 0.51, N = 3 293 293 294 294 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Encrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Encrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.18, N = 3 145 145 145 145 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.8.0 Test: CAST-256 - Decrypt -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 145 146 146 146 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 35.86 33.47 33.34 30.87 -march=skylake 1. (CC) gcc options: -lm -lpthread -O3
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 3.3 Static OMP Speedup -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 0.477 0.954 1.431 1.908 2.385 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 2.11 2.12 2.12 2.12 1. (CC) gcc options: --openmp -O3 -lm
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 0.522 1.044 1.566 2.088 2.61 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.32 2.30 2.29 2.29 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CppPerformanceBenchmarks Test: Atol OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 13 26 39 52 65 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 57.08 57.07 57.06 57.00 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Ctype OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 7 14 21 28 35 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 30.89 30.57 30.56 29.69 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Math Library OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 60 120 180 240 300 SE +/- 1.04, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 264 263 262 260 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Random Numbers OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 200 400 600 800 1000 SE +/- 0.00, N = 3 SE +/- 0.10, N = 3 SE +/- 0.29, N = 3 SE +/- 0.19, N = 3 857 846 846 821 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Stepanov Vector OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 98.92 83.16 82.98 82.88 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Function Objects OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 13.70 13.62 13.39 13.28 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
CppPerformanceBenchmarks Test: Stepanov Abstraction OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 37.27 32.49 32.44 32.44 -march=skylake 1. (CXX) g++ options: -O3 -std=c++11
Cpuminer-Opt Algorithm: m7m OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: m7m -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 110 220 330 440 550 SE +/- 1.44, N = 3 SE +/- 1.19, N = 3 SE +/- 1.28, N = 3 SE +/- 0.55, N = 3 479 499 503 506 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: deep OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: deep -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 3K 6K 9K 12K 15K SE +/- 65.06, N = 3 SE +/- 21.86, N = 3 SE +/- 15.28, N = 3 SE +/- 31.80, N = 3 14490 15323 15490 15497 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: lbry OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: lbry -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 11K 22K 33K 44K 55K SE +/- 3.33, N = 3 SE +/- 41.77, N = 3 SE +/- 283.57, N = 3 SE +/- 706.79, N = 4 48547 52793 53473 53620 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: skein OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: skein -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 13K 26K 39K 52K 65K SE +/- 3.33, N = 3 SE +/- 1026.67, N = 3 SE +/- 955.00, N = 3 SE +/- 678.39, N = 3 56227 62027 62560 62813 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: myr-gr OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: myr-gr -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 4K 8K 12K 16K 20K SE +/- 32.15, N = 3 SE +/- 92.92, N = 3 SE +/- 219.87, N = 4 SE +/- 76.38, N = 3 15310 15560 15695 16360 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: sha256t OpenBenchmarking.org kH/s - Hash Speed, More Is Better Cpuminer-Opt 3.8.8.1 Algorithm: sha256t -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 30K 60K 90K 120K 150K SE +/- 240.62, N = 3 SE +/- 661.72, N = 3 SE +/- 1469.65, N = 6 SE +/- 3.33, N = 3 124030 124567 124672 125653 -march=skylake 1. (CXX) g++ options: -O3 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 2M 4M 6M 8M 10M SE +/- 5800.58, N = 3 SE +/- 7719.64, N = 3 SE +/- 7027.53, N = 3 SE +/- 4114.60, N = 3 8840395 8840945 8845689 8852846 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.3 Video Input: Summer Nature 4K -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 19.67 19.47 19.34 19.15 -march=skylake 1. (CC) gcc options: -O3 -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.3 Video Input: Summer Nature 1080p -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 7.05 6.97 6.95 6.93 -march=skylake 1. (CC) gcc options: -O3 -pthread
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 4.0.2 H.264 HD To NTSC DV -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 1.2375 2.475 3.7125 4.95 6.1875 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 5.50 5.47 5.45 5.45 -march=skylake 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shm -lxcb-shape -lxcb-xfixes -pthread -lSDL2 -lbz2 -O3 -std=c11 -fomit-frame-pointer -fPIC -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 1500 3000 4500 6000 7500 SE +/- 31.31, N = 3 SE +/- 55.64, N = 3 SE +/- 19.27, N = 3 SE +/- 28.38, N = 3 5673 6327 6615 6863 -march=skylake 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 4K 8K 12K 16K 20K SE +/- 39.05, N = 3 SE +/- 90.51, N = 3 SE +/- 71.84, N = 3 SE +/- 111.64, N = 3 17183 17326 17334 17522 -march=skylake 1. (CC) gcc options: -pthread -O3 -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 9.14 8.77 8.76 8.66 -march=skylake 1. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 60 120 180 240 300 SE +/- 1.00, N = 3 SE +/- 1.33, N = 3 SE +/- 0.33, N = 3 228 251 251 252 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 60 120 180 240 300 SE +/- 1.00, N = 3 250 290 290 291 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 186 204 206 207 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 50 100 150 200 250 SE +/- 0.33, N = 3 196 224 225 227 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 60 120 180 240 300 SE +/- 0.58, N = 3 243 275 277 277 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 40 80 120 160 200 SE +/- 0.67, N = 3 195 197 198 198 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 60 120 180 240 300 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 253 288 288 289 -march=skylake 1. (CC) gcc options: -fopenmp -O3 -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lgomp -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 700 1400 2100 2800 3500 SE +/- 9.47, N = 3 SE +/- 4.67, N = 3 SE +/- 5.71, N = 3 SE +/- 2.31, N = 3 2814 3292 3302 3310 -march=skylake 1. (CC) gcc options: -O3 -mavx2
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 10.19 9.32 9.30 9.05 -march=skylake 1. (CC) gcc options: -O3 -lncurses -lm
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.06, N = 3 SE +/- 0.64, N = 3 SE +/- 0.42, N = 3 194 196 196 197 -march=skylake 1. (CC) gcc options: -O3 -lm
LuaJIT Test: Composite OpenBenchmarking.org Mflops, More Is Better LuaJIT 2.1-git Test: Composite -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 300 600 900 1200 1500 SE +/- 0.83, N = 3 SE +/- 2.67, N = 3 SE +/- 5.01, N = 3 SE +/- 1.35, N = 3 1605 1606 1607 1608 -march=skylake 1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector
lzbench Test: XZ 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 9 18 27 36 45 39 39 39 39 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Decompression -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 20 40 60 80 100 107 107 108 108 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 100 200 300 400 500 SE +/- 0.58, N = 3 451 451 452 452 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 1137 1137 1138 1138 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Compression -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 90 180 270 360 450 SE +/- 1.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 428 429 429 430 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Brotli 0 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Decompression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 130 260 390 520 650 SE +/- 1.00, N = 3 SE +/- 1.00, N = 3 SE +/- 0.58, N = 3 620 620 620 621 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Compression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 50 100 150 200 250 SE +/- 0.33, N = 3 248 249 249 249 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Libdeflate 1 - Process: Decompression OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.67, N = 3 1189 1190 1191 1191 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
m-queens Time To Solve OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.2 Time To Solve -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 48.42 48.42 48.41 48.40 -march=skylake 1. (CXX) g++ options: -fopenmp -O3 -O2 -march=native
Memcached mcperf Method: Add OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 16K 32K 48K 64K 80K SE +/- 582.90, N = 3 SE +/- 163.23, N = 3 SE +/- 259.01, N = 3 SE +/- 622.40, N = 3 72123 72947 73307 73346 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Get OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Get -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 30K 60K 90K 120K 150K SE +/- 876.94, N = 3 SE +/- 1768.29, N = 15 SE +/- 1600.60, N = 15 SE +/- 1645.69, N = 15 107497 112844 117212 117621 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Set OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Set -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 16K 32K 48K 64K 80K SE +/- 335.26, N = 3 SE +/- 384.96, N = 3 SE +/- 141.11, N = 3 SE +/- 429.23, N = 3 72585 72688 72940 73045 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Append OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 20K 40K 60K 80K 100K SE +/- 182.38, N = 3 SE +/- 290.53, N = 3 SE +/- 510.45, N = 3 SE +/- 495.69, N = 3 77703 77932 78021 78294 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Delete OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Delete -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 30K 60K 90K 120K 150K SE +/- 1539.98, N = 15 SE +/- 1672.82, N = 15 SE +/- 1837.94, N = 3 SE +/- 1623.43, N = 4 114714 115719 120897 122131 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Prepend OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 20K 40K 60K 80K 100K SE +/- 755.85, N = 3 SE +/- 109.89, N = 3 SE +/- 428.91, N = 3 SE +/- 121.74, N = 3 77484 77547 79076 79234 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
Memcached mcperf Method: Replace OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 20K 40K 60K 80K 100K SE +/- 642.32, N = 3 SE +/- 549.37, N = 3 SE +/- 240.24, N = 3 SE +/- 216.99, N = 3 77373 78432 78460 78798 -march=skylake 1. (CC) gcc options: -O3 -lm -rdynamic
MKL-DNN Harness: IP Batch 1D - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: IP Batch 1D - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 20.84 20.79 20.74 20.66 MIN: 19.89 -march=skylake - MIN: 19.84 MIN: 19.8 MIN: 19.73 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN Harness: Deconvolution Batch deconv_1d - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_1d - Data Type: f32 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 6 12 18 24 30 SE +/- 0.22, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 23.07 23.03 23.01 22.93 MIN: 22.7 -march=skylake - MIN: 22.69 MIN: 22.68 MIN: 22.61 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN Harness: Convolution Batch conv_alexnet - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_alexnet - Data Type: f32 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 400 800 1200 1600 2000 SE +/- 0.89, N = 3 SE +/- 0.22, N = 3 SE +/- 0.28, N = 3 SE +/- 0.68, N = 3 1822 1820 1819 1819 -march=skylake - MIN: 1818.58 MIN: 1817.59 MIN: 1816.37 MIN: 1815.94 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN Harness: Deconvolution Batch deconv_all - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Deconvolution Batch deconv_all - Data Type: f32 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 2K 4K 6K 8K 10K SE +/- 15.77, N = 3 SE +/- 6.33, N = 3 SE +/- 5.96, N = 3 SE +/- 3.81, N = 3 10446 10424 10405 10397 -march=skylake - MIN: 10397.8 MIN: 10383.1 MIN: 10370.6 MIN: 10360.8 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
MKL-DNN Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 OpenBenchmarking.org ms, Fewer Is Better MKL-DNN 2019-04-16 Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 160 320 480 640 800 SE +/- 0.92, N = 3 SE +/- 0.50, N = 3 SE +/- 0.64, N = 3 SE +/- 1.26, N = 3 752 751 751 749 -march=skylake - MIN: 744 MIN: 744.39 MIN: 745 MIN: 741.69 1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl
NAS Parallel Benchmarks Test / Class: BT.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: BT.A -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 1000 2000 3000 4000 5000 SE +/- 10.11, N = 3 SE +/- 4.76, N = 3 SE +/- 7.81, N = 3 SE +/- 9.00, N = 3 4391 4391 4773 4784 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: EP.C -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 500 1000 1500 2000 2500 SE +/- 71.82, N = 12 SE +/- 9.18, N = 3 SE +/- 6.91, N = 3 SE +/- 14.49, N = 3 2038 2090 2109 2113 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: FT.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: FT.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 2K 4K 6K 8K 10K SE +/- 32.16, N = 3 SE +/- 16.12, N = 3 SE +/- 18.45, N = 3 SE +/- 15.81, N = 3 8543 8544 8581 8581 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: FT.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: FT.B -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 2K 4K 6K 8K 10K SE +/- 39.71, N = 3 SE +/- 63.29, N = 3 SE +/- 11.85, N = 3 SE +/- 26.14, N = 3 9331 9337 9376 9387 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: LU.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: LU.A -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 9K 18K 27K 36K 45K SE +/- 98.50, N = 3 SE +/- 40.23, N = 3 SE +/- 73.39, N = 3 SE +/- 36.60, N = 3 37305 37394 39343 39791 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: LU.C -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 7K 14K 21K 28K 35K SE +/- 62.55, N = 3 SE +/- 86.87, N = 3 SE +/- 79.15, N = 3 SE +/- 34.14, N = 3 33669 33676 34962 35007 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NAS Parallel Benchmarks Test / Class: SP.A OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.3.1 Test / Class: SP.A -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 900 1800 2700 3600 4500 SE +/- 41.78, N = 3 SE +/- 7.44, N = 3 SE +/- 17.99, N = 3 SE +/- 38.60, N = 3 4062 4073 4080 4185 -march=skylake 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. 3.2
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.9.9 Static Web Page Serving -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 11K 22K 33K 44K 55K SE +/- 88.86, N = 3 SE +/- 834.73, N = 15 SE +/- 643.64, N = 15 SE +/- 314.76, N = 3 44072 46938 47570 49221 -march=skylake 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 15 30 45 60 75 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 68.54 68.35 68.26 68.18 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 0.5423 1.0846 1.6269 2.1692 2.7115 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.41 2.38 2.38 2.38 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.08, N = 3 7.98 7.80 7.77 7.74 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 30 60 90 120 150 SE +/- 0.73, N = 3 SE +/- 0.33, N = 3 SE +/- 0.63, N = 3 SE +/- 0.79, N = 3 135 134 132 132 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
PolyBench-C Test: 3 Matrix Multiplications OpenBenchmarking.org Seconds, Fewer Is Better PolyBench-C 4.2 Test: 3 Matrix Multiplications -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 0.594 1.188 1.782 2.376 2.97 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 2.64 2.63 2.63 2.61 -march=skylake 1. (CC) gcc options: -O3 -march=native
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 110K 220K 330K 440K 550K SE +/- 1960.25, N = 3 SE +/- 772.32, N = 3 SE +/- 1607.28, N = 3 SE +/- 2392.99, N = 3 498539 519930 520100 521901 -march=skylake 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.3 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 3K 6K 9K 12K 15K SE +/- 128.04, N = 3 SE +/- 82.67, N = 3 SE +/- 44.82, N = 3 SE +/- 45.37, N = 3 13311 13424 13495 13576 -march=skylake 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 12.96 12.80 12.79 12.78 -march=skylake 1. (CXX) g++ options: -O3 -lpthread
Redis Test: LPOP OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 700K 1400K 2100K 2800K 3500K SE +/- 59550.07, N = 15 SE +/- 71217.78, N = 12 SE +/- 61579.63, N = 15 SE +/- 55290.27, N = 15 3059844 3095145 3139485 3140283 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: SADD OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 600K 1200K 1800K 2400K 3000K SE +/- 33796.90, N = 15 SE +/- 50359.83, N = 15 SE +/- 56601.20, N = 15 SE +/- 20507.83, N = 3 2339849 2383273 2449355 2624995 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: LPUSH OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 400K 800K 1200K 1600K 2000K SE +/- 36055.36, N = 15 SE +/- 32556.93, N = 15 SE +/- 36075.54, N = 15 SE +/- 34258.37, N = 15 1962269 2013442 2026656 2059237 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 600K 1200K 1800K 2400K 3000K SE +/- 56848.49, N = 15 SE +/- 45086.74, N = 15 SE +/- 54716.38, N = 15 SE +/- 58238.45, N = 15 2944968 2958563 2992962 3002750 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 500K 1000K 1500K 2000K 2500K SE +/- 25293.75, N = 3 SE +/- 44027.91, N = 15 SE +/- 42954.48, N = 15 SE +/- 38284.96, N = 15 1971745 2135179 2163293 2189741 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.99 16.98 16.96 16.95 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 3 6 9 12 15 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 SE +/- 0.08, N = 3 11.71 11.52 11.47 11.31 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 4 8 12 16 20 SE +/- 0.20, N = 4 SE +/- 0.13, N = 15 SE +/- 0.11, N = 15 SE +/- 0.13, N = 15 14.12 13.88 13.72 13.63 1. (CXX) g++ options: -O2 -lOpenCL
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 9 Total Time -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake 10M 20M 30M 40M 50M SE +/- 146682.07, N = 3 SE +/- 311424.12, N = 3 SE +/- 342455.58, N = 3 SE +/- 407607.78, N = 3 46810779 46997822 47046623 47112136 -march=skylake 1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto
SVT-AV1 1080p 8-bit YUV To AV1 Video Encode OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2019-03-07 1080p 8-bit YUV To AV1 Video Encode -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake 6 12 18 24 30 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 24.34 24.37 24.43 24.44 -march=skylake 1. (CXX) g++ options: -O3 -pie -lpthread -lm
SVT-HEVC 1080p 8-bit YUV To HEVC Video Encode OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 2019-02-03 1080p 8-bit YUV To HEVC Video Encode -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 70 140 210 280 350 SE +/- 3.61, N = 3 SE +/- 3.22, N = 3 SE +/- 3.64, N = 3 SE +/- 2.96, N = 3 304 304 337 339 -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
SVT-VP9 1080p 8-bit YUV To VP9 Video Encode OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 2019-02-17 1080p 8-bit YUV To VP9 Video Encode -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 80 160 240 320 400 SE +/- 4.48, N = 5 SE +/- 4.85, N = 4 SE +/- 4.39, N = 4 SE +/- 5.15, N = 4 337 342 346 347 -march=skylake 1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
t-test1 Threads: 2 OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 1.3343 2.6686 4.0029 5.3372 6.6715 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 5.93 5.82 5.82 5.81 -march=skylake 1. (CC) gcc options: -pthread -O3
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 0.873 1.746 2.619 3.492 4.365 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 3.88 3.50 3.49 3.40 -march=skylake 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 4.18 Time To Compile -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 9 18 27 36 45 SE +/- 0.53, N = 4 SE +/- 0.53, N = 4 SE +/- 0.38, N = 8 SE +/- 0.38, N = 8 37.36 36.93 36.34 36.33
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.1.2 Primate Phylogeny Analysis -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 15 30 45 60 75 SE +/- 0.03, N = 3 SE +/- 0.11, N = 3 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 65.67 65.66 63.20 63.06 -march=skylake 1. (CC) gcc options: -march=native -O3 -msse -mfpmath=sse -lm -pthread -lmpi
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.1.9 Time To Compile -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 52.19 51.90 51.84 51.80 -march=skylake 1. (CC) gcc options: -O3 -pedantic -ldl -lz -lm
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 300K 600K 900K 1200K 1500K SE +/- 1357.03, N = 5 SE +/- 1083.24, N = 5 SE +/- 1890.20, N = 5 SE +/- 1307.98, N = 5 1368057 1373514 1380406 1388054 -march=skylake 1. (CC) gcc options: -O3 -march=native
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 160 320 480 640 800 SE +/- 0.52, N = 3 SE +/- 1.31, N = 3 SE +/- 7.91, N = 7 SE +/- 1.58, N = 3 669 710 729 738 -march=skylake 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lpthread -fopenmp -fwhole-program -lstdc++
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 30 60 90 120 150 SE +/- 1.19, N = 3 SE +/- 1.04, N = 3 SE +/- 1.54, N = 3 SE +/- 1.37, N = 3 139 140 140 140 -march=skylake 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 H.265 1080p Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 12 24 36 48 60 SE +/- 0.05, N = 3 SE +/- 0.21, N = 3 SE +/- 0.25, N = 3 SE +/- 0.07, N = 3 50.05 53.34 53.71 53.97 -march=skylake 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Xsbench OpenBenchmarking.org Lookups/s, More Is Better Xsbench 2017-07-06 -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake-avx512 -mprefer-vector-width=128 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 24818.37, N = 3 SE +/- 5774.93, N = 3 SE +/- 5240.08, N = 3 SE +/- 3451.22, N = 3 5140078 5164164 5166043 5171354 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm
XZ Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=128 -O3 -march=skylake-avx512 -mprefer-vector-width=256 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 15.99 15.88 15.84 15.80 -march=skylake 1. (CC) gcc options: -pthread -fvisibility=hidden -O3
Zstd Compression Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 9.76 9.71 9.69 9.59 -march=skylake 1. (CC) gcc options: -O3 -pthread -lz
Geometric Mean Of All Test Results Result Composite - Core i9 7980XE GCC 9 Compiler Benchmarks OpenBenchmarking.org Geometric Mean, More Is Better Geometric Mean Of All Test Results Result Composite - Core i9 7980XE GCC 9 Compiler Benchmarks -O3 -march=skylake-avx512 -mprefer-vector-width=512 -O3 -march=skylake -O3 -march=skylake-avx512 -mprefer-vector-width=256 -O3 -march=skylake-avx512 -mprefer-vector-width=128 90 180 270 360 450 405.16 414.04 415.02 415.90
Phoronix Test Suite v10.8.4