GCC 9 Compiler Tuning

Intel Core i9-7980XE compiler benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1901086-SP-GCC9COMPI66&rdt&grr.

GCC 9 Compiler Tuning: system configuration

The same system was used for all seven runs (-O2, -O2 -ftree-vectorize -ftree-slp-vectorize, -O3, -O3 -march=native, -O0, -Og, -O1).

Processor: Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)
Motherboard: ASUS PRIME X299-A (1602 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16384MB
Disk: 15GB Ultra USB 3.0 + Samsung SSD 970 EVO 500GB
Graphics: NVIDIA NV120 12GB
Audio: Realtek ALC1220
Monitor: ASUS PB278
Network: Intel I219-V
OS: Clear Linux OS 27030
Kernel: 4.19.13-680.native (x86_64)
Desktop: GNOME Shell 3.30.2
Display Server: X Server 1.20.3
Display Driver: nouveau 1.0.15
OpenGL: 4.3 Mesa 19.0.0-devel
Compiler: GCC 9.0.0 20181228 + Clang 7.0.1 + LLVM 7.0.1
File-System: ext4
Screen Resolution: 2560x1440

Environment Details

For each run, CFLAGS, CXXFLAGS, CFLAGS_OVERRIDE and CXXFLAGS_OVERRIDE were all set to that run's flags: -O2, -O2 -ftree-vectorize -ftree-slp-vectorize, -O3, -O3 -march=native, -O0, -Og or -O1. The export prints multi-flag values run together, for example CFLAGS=-O2-ftree-vectorize-ftree-slp-vectorize and CFLAGS=-O3-march=native. The remaining variables were identical in every run and are reproduced as exported, with the individual flags run together:

CFFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-Wl--copy-dt-needed-entries-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl now-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl-sort-common-Wl--enable-new-dtags
FFLAGS=-g-O3-feliminate-unused-debug-types-pipe-Wall-Wp-D_FORTIFY_SOURCE=2-fexceptions-fstack-protector--param=ssp-buffer-size=32-Wl--copy-dt-needed-entries-m64-fasynchronous-unwind-tables-Wp-D_REENTRANT-ftree-loop-distribute-patterns-Wl-z-Wl relro-malign-data=abi-fno-semantic-interposition-ftree-vectorize-ftree-loop-vectorize-Wl--enable-new-dtags
MESA_GLSL_CACHE_DISABLE=0
THEANO_FLAGS=floatX=float32 openmp=true gcc.cxxflags="-ftree-vectorize-mavx"

Compiler Details: --disable-multi-lib --disable-multilib --enable-checking=release

Processor Details: Scaling Governor: intel_pstate performance

Security Details: KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

GCC 9 Compiler Tuning: result overview

Columns, in order: -O2 | -O2 -ftree-vectorize -ftree-slp-vectorize | -O3 | -O3 -march=native | -O0 | -Og | -O1. Units and direction (more or fewer is better) are given in the per-test sections that follow.

dav1d: Summer Nature 4K | 82.55 | 81.51 | 80.78 | 80.95 | 78.20 | 79.84 | 78.12
nginx: Static Web Page Serving | 45790 | 47967 | 47523 | 47670 | 45556 | 37865 | 37172
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 | 75.01 | 75.55 | 72.66 | 73.06 | 108.34 | 78.08 | 76.44
c-ray: Total Time - 4K, 16 Rays Per Pixel | 78.69 | 79.00 | 44.15 | 33.61 | 127.58 | 88.50 | 87.42
pgbench: Buffer Test - Normal Load - Read Only | 495259 | 499438 | 503005 | 503111 | 314089 | 429547 | 452185
dav1d: Summer Nature 1080p | 18.63 | 20.00 | 19.75 | 19.67 | 22.57 | 19.40 | 19.17
ebizzy | 595906 | 641913 | 612542 | 648338 | 598738 | 628510 | 663780
himeno: Poisson Pressure Solver | 3044 | 3000 | 2988 | 3210 | 452 | 1515 | 1477
m-queens: Time To Solve | 49.54 | 49.55 | 49.49 | 48.48 | 104.39 | 57.89 | 50.57
aobench: 2048 x 2048 - Total Time | 53.26 | 51.48 | 50.03 | 30.81 | 65.32 | 54.40 | 56.56
build-php: Time To Compile | 40.49 | 41.22 | 52.07 | 52.55 | 13.65 | 17.68 | 22.99
redis: SET | 2323211 | 2305436 | 2349704 | 2291356 | 2390460 | 2322162 | 2202227
encode-flac: WAV To FLAC | 10.38 | 10.47 | 10.46 | 9.24 | 68.11 | 12.27 | 11.29
cpuminer-opt: skein | 59763 | 62853 | 62230 | 61980 | (no result) | 51990 | 62157
cpuminer-opt: lbry | 50180 | 53133 | 53793 | 53657 | (no result) | 45080 | 52880
scimark2: Composite | 2019 | 2357 | 2453 | 2710 | 696 | 1809 | 2094
redis: GET | 3060926 | 3148044 | 3229670 | 3258540 | 3424738 | 3223612 | 3356679
smallpt: Global Illumination Renderer; 128 Samples | 11.82 | 11.89 | 11.87 | 6.07 | 70.69 | 9.26 | 12.51
build-imagemagick: Time To Compile | 23.63 | 24.39 | 26.30 | 26.22 | 5.82 | 8.69 | 17.73
encode-mp3: WAV To MP3 | 11.73 | 10.20 | 10.05 | 9.24 | 31.87 | 13.64 | 11.76
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 | 10.19 | 10.31 | 10.31 | 10.29 | 18.34 | 10.54 | 10.08
x265: H.265 Video Encoding | 59.07 | 59.06 | 59.12 | 59.91 | 59.26 | 59.31 | 59.12
hmmer: Pfam Database Search | 4.21 | 4.18 | 4.09 | 4.12 | 7.37 | 5.05 | 4.69
x264: H.264 Video Encoding | 140 | 139 | 142 | 139 | 96.58 | 135.54 | 140.42
bullet: 3000 Fall | 3.95 | 3.99 | 3.98 | 3.57 | 3.98 | 3.99 | 3.97
bullet: 1000 Convex | 4.09 | 4.22 | 4.16 | 3.89 | 4.22 | 4.09 | 4.14
bullet: 1000 Stack | 4.68 | 4.72 | 4.67 | 3.93 | 4.75 | 4.70 | 4.72
scimark2: Jacobi Successive Over-Relaxation | 1210 | 1234 | 1607 | 2156 | 1208 | 1230 | 1250
scimark2: Dense LU Matrix Factorization | 3938 | 5632 | 5742 | 6373 | 980 | 3659 | 4415
scimark2: Sparse Matrix Multiply | 3404 | 3420 | 3400 | 3467 | 799 | 3261 | 3342
scimark2: Fast Fourier Transform | 612 | 575 | 572 | 584 | 341 | 595 | 601
scimark2: Monte Carlo | 933 | 924 | 944 | 972 | 152 | 301 | 865

dav1d

Video Input: Summer Nature 4K

dav1d 0.1, Video Input: Summer Nature 4K. Seconds, Fewer Is Better.

-O2: 82.55 (SE +/- 1.98, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 81.51 (SE +/- 1.44, N = 3)
-O3: 80.78 (SE +/- 1.53, N = 3)
-O3 -march=native: 80.95 (SE +/- 1.54, N = 3)
-O0: 78.20 (SE +/- 1.23, N = 4)
-Og: 79.84 (SE +/- 1.32, N = 3)
-O1: 78.12 (SE +/- 1.56, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread
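Each result in this file is reported together with an "SE +/-" figure and a run count N. The sketch below shows one common way such a figure can be computed, assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of N). That reading is an assumption, and the run times used are hypothetical, since the export does not include the individual samples.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical run times in seconds; NOT taken from the result file.
hypothetical_runs = [80.0, 82.0, 84.0]

mean = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)

# Print in the same style the result file uses.
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(hypothetical_runs)})")
```

When two configurations differ by less than roughly their combined SE, as several of the dav1d results do, the difference is hard to distinguish from run-to-run noise.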

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.9.9, Static Web Page Serving. Requests Per Second, More Is Better.

-O2: 45790 (SE +/- 624.66, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 47967 (SE +/- 98.46, N = 3)
-O3: 47523 (SE +/- 264.29, N = 3)
-O3 -march=native: 47670 (SE +/- 316.48, N = 3)
-O0: 45556 (SE +/- 239.67, N = 3)
-Og: 37865 (SE +/- 199.70, N = 3)
-O1: 37172 (SE +/- 406.63, N = 12)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O0; -Og; -O1
(CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9. Seconds, Fewer Is Better.

-O2: 75.01 (SE +/- 0.19, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 75.55 (SE +/- 0.09, N = 3)
-O3: 72.66 (SE +/- 0.21, N = 3)
-O3 -march=native: 73.06 (SE +/- 0.43, N = 3)
-O0: 108.34 (SE +/- 0.07, N = 3)
-Og: 78.08 (SE +/- 0.24, N = 3)
-O1: 76.44 (SE +/- 0.20, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread -fvisibility=hidden

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel. Seconds, Fewer Is Better.

-O2: 78.69 (SE +/- 0.03, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 79.00 (SE +/- 0.03, N = 3)
-O3: 44.15 (SE +/- 0.01, N = 3)
-O3 -march=native: 33.61 (SE +/- 0.02, N = 3)
-O0: 127.58 (SE +/- 0.02, N = 3)
-Og: 88.50 (SE +/- 0.01, N = 3)
-O1: 87.42 (SE +/- 0.03, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CC) gcc options: -lm -lpthread -O3
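C-Ray shows one of the widest spreads between optimization levels in this result file. As a minimal sketch of how the raw times can be turned into relative speedups, the snippet below divides the -O2 time by each configuration's time. The seven values are the C-Ray results reported above; the choice of -O2 as the baseline and the helper name `speedup_vs_baseline` are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# C-Ray 1.1 total times in seconds, copied from the results above.
c_ray_seconds = {
    "-O2": 78.69,
    "-O2 -ftree-vectorize -ftree-slp-vectorize": 79.00,
    "-O3": 44.15,
    "-O3 -march=native": 33.61,
    "-O0": 127.58,
    "-Og": 88.50,
    "-O1": 87.42,
}


def speedup_vs_baseline(times, baseline="-O2"):
    """Return baseline_time / time per configuration (for fewer-is-better results)."""
    base = times[baseline]
    return {config: base / seconds for config, seconds in times.items()}


speedups = speedup_vs_baseline(c_ray_seconds)

# List configurations from fastest to slowest relative to -O2.
for config, factor in sorted(speedups.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{config:45s} {factor:5.2f}x")
```

For more-is-better results (requests per second, MFLOPS and so on) the ratio would be inverted, dividing each configuration's value by the baseline value instead.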

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Only. TPS, More Is Better.

-O2: 495259 (SE +/- 164.82, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 499438 (SE +/- 856.13, N = 3)
-O3: 503005 (SE +/- 662.70, N = 3)
-O3 -march=native: 503111 (SE +/- 389.64, N = 3)
-O0: 314089 (SE +/- 238.22, N = 3)
-Og: 429547 (SE +/- 491.34, N = 3)
-O1: 452185 (SE +/- 111.16, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

dav1d

Video Input: Summer Nature 1080p

dav1d 0.1, Video Input: Summer Nature 1080p. Seconds, Fewer Is Better.

-O2: 18.63 (SE +/- 0.41, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 20.00 (SE +/- 0.26, N = 11)
-O3: 19.75 (SE +/- 0.25, N = 11)
-O3 -march=native: 19.67 (SE +/- 0.31, N = 10)
-O0: 22.57 (SE +/- 0.28, N = 12)
-Og: 19.40 (SE +/- 0.27, N = 11)
-O1: 19.17 (SE +/- 0.29, N = 11)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread

ebizzy

ebizzy 0.3. Records/s, More Is Better.

-O2: 595906 (SE +/- 9064.12, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 641913 (SE +/- 7472.49, N = 12)
-O3: 612542 (SE +/- 7719.24, N = 12)
-O3 -march=native: 648338 (SE +/- 10257.52, N = 12)
-O0: 598738 (SE +/- 8592.54, N = 12)
-Og: 628510 (SE +/- 10433.66, N = 12)
-O1: 663780 (SE +/- 3655.74, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver. MFLOPS, More Is Better.

-O2: 3044 (SE +/- 5.49, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3000 (SE +/- 13.57, N = 3)
-O3: 2988 (SE +/- 15.72, N = 3)
-O3 -march=native: 3210 (SE +/- 9.28, N = 3)
-O0: 452 (SE +/- 0.84, N = 3)
-Og: 1515 (SE +/- 5.91, N = 3)
-O1: 1477 (SE +/- 2.80, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CC) gcc options: -O3 -mavx2

m-queens

Time To Solve

m-queens 1.2, Time To Solve. Seconds, Fewer Is Better.

-O2: 49.54 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 49.55 (SE +/- 0.01, N = 3)
-O3: 49.49 (SE +/- 0.02, N = 3)
-O3 -march=native: 48.48 (SE +/- 0.02, N = 3)
-O0: 104.39 (SE +/- 0.12, N = 3)
-Og: 57.89 (SE +/- 0.02, N = 3)
-O1: 50.57 (SE +/- 0.02, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CXX) g++ options: -fopenmp

AOBench

Size: 2048 x 2048 - Total Time

AOBench, Size: 2048 x 2048 - Total Time. Seconds, Fewer Is Better.

-O2: 53.26 (SE +/- 0.24, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 51.48 (SE +/- 0.01, N = 3)
-O3: 50.03 (SE +/- 0.03, N = 3)
-O3 -march=native: 30.81 (SE +/- 0.04, N = 3)
-O0: 65.32 (SE +/- 0.01, N = 3)
-Og: 54.40 (SE +/- 0.01, N = 3)
-O1: 56.56 (SE +/- 0.01, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CC) gcc options: -lm -O3

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.1.9, Time To Compile. Seconds, Fewer Is Better.

-O2: 40.49 (SE +/- 0.07, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 41.22 (SE +/- 0.08, N = 3)
-O3: 52.07 (SE +/- 0.17, N = 3)
-O3 -march=native: 52.55 (SE +/- 0.13, N = 3)
-O0: 13.65 (SE +/- 0.01, N = 3)
-Og: 17.68 (SE +/- 0.02, N = 3)
-O1: 22.99 (SE +/- 0.07, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pedantic -ldl -lz -lm

Redis

Test: SET

Redis 4.0.8, Test: SET. Requests Per Second, More Is Better.

-O2: 2323211 (SE +/- 25950.06, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 2305436 (SE +/- 22955.99, N = 12)
-O3: 2349704 (SE +/- 22836.26, N = 3)
-O3 -march=native: 2291356 (SE +/- 40055.48, N = 12)
-O0: 2390460 (SE +/- 5033.74, N = 3)
-Og: 2322162 (SE +/- 32899.51, N = 12)
-O1: 2202227 (SE +/- 51461.98, N = 12)

Flags listed per run: -O2 -ftree-vectorize -ftree-slp-vectorize
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2, WAV To FLAC. Seconds, Fewer Is Better.

-O2: 10.38 (SE +/- 0.04, N = 5)
-O2 -ftree-vectorize -ftree-slp-vectorize: 10.47 (SE +/- 0.01, N = 5)
-O3: 10.46 (SE +/- 0.03, N = 5)
-O3 -march=native: 9.24 (SE +/- 0.01, N = 5)
-O0: 68.11 (SE +/- 0.02, N = 5)
-Og: 12.27 (SE +/- 0.03, N = 5)
-O1: 11.29 (SE +/- 0.03, N = 5)

Flags listed per run: -O0; -Og
(CXX) g++ options: -fvisibility=hidden -logg -lm

Cpuminer-Opt

Algorithm: skein

Cpuminer-Opt 3.8.8.1, Algorithm: skein. kH/s - Hash Speed, More Is Better. No -O0 result is reported for this test.

-O2: 59763
-O2 -ftree-vectorize -ftree-slp-vectorize: 62853
-O3: 62230
-O3 -march=native: 61980
-Og: 51990
-O1: 62157

SE values as exported (only four are listed for the six results, so they are not attributed to specific runs): SE +/- 12.02, N = 3; SE +/- 8.82, N = 3; SE +/- 10.00, N = 3; SE +/- 391.68, N = 3
Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -Og; -O1
(CXX) g++ options: -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: lbry

Cpuminer-Opt 3.8.8.1, Algorithm: lbry. kH/s - Hash Speed, More Is Better. No -O0 result is reported for this test.

-O2: 50180 (SE +/- 63.51, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 53133 (SE +/- 86.86, N = 3)
-O3: 53793 (SE +/- 990.96, N = 3)
-O3 -march=native: 53657 (SE +/- 686.69, N = 3)
-Og: 45080 (SE +/- 370.41, N = 3)
-O1: 52880 (SE +/- 10.00, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -Og; -O1
(CXX) g++ options: -lcurl -lz -lpthread -lssl -lcrypto -lgmp

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite. Mflops, More Is Better.

-O2: 2019 (SE +/- 4.53, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 2357 (SE +/- 32.10, N = 3)
-O3: 2453 (SE +/- 11.70, N = 3)
-O3 -march=native: 2710 (SE +/- 12.91, N = 3)
-O0: 696 (SE +/- 1.10, N = 3)
-Og: 1809 (SE +/- 3.97, N = 3)
-O1: 2094 (SE +/- 19.25, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm

Redis

Test: GET

Redis 4.0.8, Test: GET. Requests Per Second, More Is Better.

-O2: 3060926 (SE +/- 67207.12, N = 12)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3148044 (SE +/- 11876.71, N = 3)
-O3: 3229670 (SE +/- 25216.53, N = 3)
-O3 -march=native: 3258540 (SE +/- 66381.24, N = 12)
-O0: 3424738 (SE +/- 11768.92, N = 3)
-Og: 3223612 (SE +/- 65213.19, N = 12)
-O1: 3356679 (SE +/- 40253.30, N = 3)

Flags listed per run: -O2 -ftree-vectorize -ftree-slp-vectorize
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0, Global Illumination Renderer; 128 Samples. Seconds, Fewer Is Better.

-O2: 11.82 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 11.89 (SE +/- 0.00, N = 3)
-O3: 11.87 (SE +/- 0.00, N = 3)
-O3 -march=native: 6.07 (SE +/- 0.01, N = 3)
-O0: 70.69 (SE +/- 0.85, N = 3)
-Og: 9.26 (SE +/- 0.01, N = 3)
-O1: 12.51 (SE +/- 0.01, N = 3)

Flags listed per run: -O0
(CXX) g++ options: -fopenmp -O3

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0, Time To Compile. Seconds, Fewer Is Better.

-O2: 23.63 (SE +/- 0.06, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 24.39 (SE +/- 0.10, N = 3)
-O3: 26.30 (SE +/- 0.10, N = 3)
-O3 -march=native: 26.22 (SE +/- 0.14, N = 3)
-O0: 5.82 (SE +/- 0.06, N = 3)
-Og: 8.69 (SE +/- 0.05, N = 3)
-O1: 17.73 (SE +/- 0.01, N = 3)

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100, WAV To MP3. Seconds, Fewer Is Better.

-O2: 11.73 (SE +/- 0.02, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 10.20 (SE +/- 0.04, N = 3)
-O3: 10.05 (SE +/- 0.01, N = 3)
-O3 -march=native: 9.24 (SE +/- 0.00, N = 3)
-O0: 31.87 (SE +/- 0.02, N = 3)
-Og: 13.64 (SE +/- 0.02, N = 3)
-O1: 11.76 (SE +/- 0.01, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lncurses -lm

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19. Seconds, Fewer Is Better.

-O2: 10.19 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 10.31 (SE +/- 0.03, N = 3)
-O3: 10.31 (SE +/- 0.02, N = 3)
-O3 -march=native: 10.29 (SE +/- 0.04, N = 3)
-O0: 18.34 (SE +/- 0.03, N = 3)
-Og: 10.54 (SE +/- 0.01, N = 3)
-O1: 10.08 (SE +/- 0.03, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread -lz -llzma -llz4

x265

H.265 Video Encoding

x265 2.8, H.265 Video Encoding. Frames Per Second, More Is Better.

-O2: 59.07 (SE +/- 0.15, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 59.06 (SE +/- 0.10, N = 3)
-O3: 59.12 (SE +/- 0.19, N = 3)
-O3 -march=native: 59.91 (SE +/- 0.20, N = 3)
-O0: 59.26 (SE +/- 0.07, N = 3)
-Og: 59.31 (SE +/- 0.16, N = 3)
-O1: 59.12 (SE +/- 0.10, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2, Pfam Database Search. Seconds, Fewer Is Better.

-O2: 4.21 (SE +/- 0.01, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4.18 (SE +/- 0.04, N = 3)
-O3: 4.09 (SE +/- 0.04, N = 3)
-O3 -march=native: 4.12 (SE +/- 0.02, N = 3)
-O0: 7.37 (SE +/- 0.01, N = 3)
-Og: 5.05 (SE +/- 0.02, N = 3)
-O1: 4.69 (SE +/- 0.06, N = 6)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -pthread -lhmmer -lsquid -lm

x264

H.264 Video Encoding

x264 2018-09-25, H.264 Video Encoding. Frames Per Second, More Is Better.

-O2: 140.00 (SE +/- 0.55, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 139.00 (SE +/- 1.64, N = 3)
-O3: 142.00 (SE +/- 0.31, N = 3)
-O3 -march=native: 139.00 (SE +/- 0.79, N = 3)
-O0: 96.58 (SE +/- 0.09, N = 3)
-Og: 135.54 (SE +/- 0.07, N = 3)
-O1: 140.42 (SE +/- 0.67, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81, Test: 3000 Fall. Seconds, Fewer Is Better.

-O2: 3.95 (SE +/- 0.00, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3.99 (SE +/- 0.01, N = 3)
-O3: 3.98 (SE +/- 0.01, N = 3)
-O3 -march=native: 3.57 (SE +/- 0.03, N = 3)
-O0: 3.98 (SE +/- 0.01, N = 3)
-Og: 3.99 (SE +/- 0.02, N = 3)
-O1: 3.97 (SE +/- 0.01, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81, Test: 1000 Convex. Seconds, Fewer Is Better.

-O2: 4.09 (SE +/- 0.10, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4.22 (SE +/- 0.07, N = 3)
-O3: 4.16 (SE +/- 0.07, N = 3)
-O3 -march=native: 3.89 (SE +/- 0.07, N = 3)
-O0: 4.22 (SE +/- 0.02, N = 3)
-Og: 4.09 (SE +/- 0.06, N = 3)
-O1: 4.14 (SE +/- 0.07, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81, Test: 1000 Stack. Seconds, Fewer Is Better.

-O2: 4.68 (SE +/- 0.03, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 4.72 (SE +/- 0.03, N = 3)
-O3: 4.67 (SE +/- 0.00, N = 3)
-O3 -march=native: 3.93 (SE +/- 0.01, N = 3)
-O0: 4.75 (SE +/- 0.05, N = 3)
-Og: 4.70 (SE +/- 0.05, N = 3)
-O1: 4.72 (SE +/- 0.05, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -march=native; -O0; -Og; -O1
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation. Mflops, More Is Better.

-O2: 1210 (SE +/- 0.70, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 1234 (SE +/- 13.01, N = 3)
-O3: 1607 (SE +/- 24.96, N = 3)
-O3 -march=native: 2156 (SE +/- 34.54, N = 3)
-O0: 1208 (SE +/- 18.94, N = 3)
-Og: 1230 (SE +/- 19.55, N = 3)
-O1: 1250 (SE +/- 19.48, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization. Mflops, More Is Better.

-O2: 3938 (SE +/- 3.87, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 5632 (SE +/- 167.53, N = 3)
-O3: 5742 (SE +/- 41.42, N = 3)
-O3 -march=native: 6373 (SE +/- 11.06, N = 3)
-O0: 980 (SE +/- 15.14, N = 3)
-Og: 3659 (SE +/- 2.18, N = 3)
-O1: 4415 (SE +/- 74.37, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply. Mflops, More Is Better.

-O2: 3404 (SE +/- 21.48, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 3420 (SE +/- 10.98, N = 3)
-O3: 3400 (SE +/- 7.90, N = 3)
-O3 -march=native: 3467 (SE +/- 28.11, N = 3)
-O0: 799 (SE +/- 0.17, N = 3)
-Og: 3261 (SE +/- 16.41, N = 3)
-O1: 3342 (SE +/- 28.10, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform. Mflops, More Is Better.

-O2: 612 (SE +/- 7.13, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 575 (SE +/- 17.15, N = 3)
-O3: 572 (SE +/- 13.69, N = 3)
-O3 -march=native: 584 (SE +/- 16.32, N = 3)
-O0: 341 (SE +/- 2.23, N = 3)
-Og: 595 (SE +/- 16.14, N = 3)
-O1: 601 (SE +/- 15.63, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo. Mflops, More Is Better.

-O2: 933 (SE +/- 0.51, N = 3)
-O2 -ftree-vectorize -ftree-slp-vectorize: 924 (SE +/- 0.72, N = 3)
-O3: 944 (SE +/- 0.26, N = 3)
-O3 -march=native: 972 (SE +/- 0.03, N = 3)
-O0: 152 (SE +/- 0.04, N = 3)
-Og: 301 (SE +/- 0.45, N = 3)
-O1: 865 (SE +/- 0.15, N = 3)

Flags listed per run: -O2; -O2 -ftree-vectorize -ftree-slp-vectorize; -O3; -O3 -march=native; -O0; -Og; -O1
(CC) gcc options: -lm


Phoronix Test Suite v10.8.4