AMD EPYC Compiler Tuning

GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1902194-SP-AMDEPYCCO19&grt&rdt.

AMD EPYC Compiler TuningProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolution-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads)Dell 02MJ3T (1.2.5 BIOS)AMD Family 17h16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860Matrox G200eW3VE2282 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 18.045.0.0-050000rc6-generic (x86_64) 20190210GNOME Shell 3.28.3X ServerGCC 9.0.1 20190210ext41600x1200OpenBenchmarking.orgEnvironment Details- -O3 -march=znver1: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1- -O0: CXXFLAGS=-O0 CFLAGS=-O0- -O1: CXXFLAGS=-O1 CFLAGS=-O1- -O2: CXXFLAGS=-O2 CFLAGS=-O2- -O3: CXXFLAGS=-O3 CFLAGS=-O3- -O2 -march=znver1: CXXFLAGS=-O2-march=znver1 CFLAGS=-O2-march=znver1- -Ofast -march=znver1: CXXFLAGS=-Ofast-march=znver1 CFLAGS=-Ofast-march=znver1- -O2 -ftree-vectorize -ftree-slp-vectorize: CXXFLAGS=-O2-ftree-vectorize-ftree-slp-vectorize CFLAGS=-O2-ftree-vectorize-ftree-slp-vectorize- -O3 -march=znver1 -flto: CXXFLAGS=-O3 march=znver1-flto CFLAGS=-O3-march=znver1-flto- -O2 -flto: CXXFLAGS=-O2-flto CFLAGS=-O2-flto- -Og: CXXFLAGS=-Og CFLAGS=-OgCompiler Details- --disable-multilib --enable-checking=releaseSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

AMD EPYC Compiler Tuningaobench: 2048 x 2048 - Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshc-ray: Total Time - 4K, 16 Rays Per Pixelctx-clock: Context Switch Timefftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 4096encode-flac: WAV To FLACgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacehint: FLOAThint: DOUBLEhimeno: Poisson Pressure Solverjohn-the-ripper: Blowfishjohn-the-ripper: Traditional DESencode-mp3: WAV To MP3tjbench: Decompression Throughputpgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Single Thread - Read Onlypgbench: Buffer Test - Single Thread - Read Writescimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationstockfish: Total Timesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-vp9: 1080p 8-bit YUV To VP9 Video Encodebuild-apache: Time To Compilehmmer: Pfam Database Searchbuild-imagemagick: Time To Compilebuild-php: Time To Compiletscp: AI Chess Performancevpxenc: vpxenc VP9 1080p Video Encodevpxenc: vpxenc VP9 1080p Video Encodex264: H.264 Video Encodingx265: H.265 1080p Video Encodingcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og51.493.095.075.805.183.061.111.3211.3550061275213.8519519018319112718021026850647258928992610116682326001966710.57144505031506815188114519615572272482485116891064979941.685.8925.946.2924.8878.1386573212.4114435.5714.3792.503.115.146.005.363.141.111.3544.921708219396.779698829074921022674044455985453423831517921823200041.79111419700458590638864341082015165128321058681751.695.8811.439.025.2315.1986545912.5010235.0023.1256.613.125.166.005.373.151.111.3528.7446321346815.011941911801871261792102684555785850600297856599525706720014.3213951510243011330310651519576226241134669191056980921.705.8717.516.9318.4229.0586410212.5314535.6214.1155.543.125.146.015.373.141.111.3525.8446251339113.6519519118118913118021126731197059948160510176271825740766714.0714051534041671493110371369560230252726099191044804221.695.8123.826.6223.6352.1786437312.5414434.5514.4853.533.115.166.055.383.151.111.3512.6047511355513.6118918317418111817220326731564759542804710086580625386858310.84141490551426215099107918005602322475430714271041218401.735.9026.086.5725.0678.1986491512.3114735.2113.6654.353.105.085.805.193.051.111.3221.5850741334613.8919619118319112718021126726802361751662610016130925595700014.00142510425427215111106015015572292584323110161060842761.705.8495.9123.826.5423.7851.9686491512.4220.0514434.8014.7151.733.095.095.805.183.061.111.3210.4048851316613.951961891821931241872092670554076053318331022628412587706679.80144508384410215352112518255612212579408916761065072441.705.9197.8026.116.0025.2186437312.3720.1314434.9113.7755.533.115.145.985.373.151.111.3525.7713248051328513.7019619018018812817821226717214560253529710076358625705800010.9613952969942391535311251724560231251543969191041978651.675.8994.8224.036.8223.9152.5886491612.5620.3414435.4113.6752.0811.3113255711311014.2119418818318612517820926723940561864410110005876425477733310.381444542564319160121074174714802302052330016751.715.8497.2628.626.16118.4886301812.7520.8613.1655.523.055.216.325.403.241.091.3325.9613250911321413.6419619118319012818021426817340062664040010226511726073666714.1414052057040951485111271307568232229925159181045366051.695.9095.7926.506.5698.6786410135.0714.0877.413.115.156.025.373.141.111.3528.6413243661264215.581811811561731201681952673686715972342667725645323928933316.7814150720337671333310801205210257218825399191057096901.735.8692.6814.597.397.8921.4286518712.5220.3914234.7614.39OpenBenchmarking.org

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og20406080100SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 351.4992.5056.6155.5453.5354.3551.7355.5352.0855.5277.41-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm -O3

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og0.7021.4042.1062.8083.51SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.093.113.123.123.113.103.093.113.053.11-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1.17232.34463.51694.68925.8615SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.075.145.165.145.165.085.095.145.215.15-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.806.006.006.016.055.805.805.986.326.02-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1.2152.433.6454.866.075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.185.365.375.375.385.195.185.375.405.37-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og0.7291.4582.1872.9163.645SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.063.143.153.143.153.053.063.153.243.14-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og0.24980.49960.74940.99921.249SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.111.111.111.111.111.111.111.111.091.11-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og0.30380.60760.91141.21521.519SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.321.351.351.351.351.321.321.351.331.35-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1020304050SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 311.3544.9228.7425.8412.6021.5810.4025.7711.3125.9628.64-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm -lpthread -O3

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Time-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og306090120150132132132132-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options:

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og12002400360048006000SE +/- 47.88, N = 3SE +/- 5.17, N = 3SE +/- 6.37, N = 3SE +/- 2.28, N = 3SE +/- 41.07, N = 3SE +/- 13.33, N = 3SE +/- 10.24, N = 3SE +/- 26.82, N = 3SE +/- 26.34, N = 3SE +/- 5.87, N = 3SE +/- 11.16, N = 350061708463246254751507448854805557150914366-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -pthread -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og3K6K9K12K15KSE +/- 78.62, N = 3SE +/- 1.00, N = 3SE +/- 70.29, N = 3SE +/- 160.95, N = 3SE +/- 115.66, N = 3SE +/- 15.71, N = 3SE +/- 160.71, N = 8SE +/- 72.78, N = 3SE +/- 49.65, N = 3SE +/- 134.39, N = 3SE +/- 165.58, N = 3127522193134681339113555133461316613285131101321412642-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -pthread -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og20406080100SE +/- 0.10, N = 5SE +/- 0.12, N = 5SE +/- 0.09, N = 5SE +/- 0.08, N = 5SE +/- 0.12, N = 5SE +/- 0.09, N = 5SE +/- 0.08, N = 5SE +/- 0.14, N = 5SE +/- 0.10, N = 5SE +/- 0.11, N = 5SE +/- 0.11, N = 513.8596.7715.0113.6513.6113.8913.9513.7014.2113.6415.58-O01. (CXX) g++ options: -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og4080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 319596194195189196196196194196181-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og4080120160200SE +/- 0.33, N = 319098191191183191189190188191181-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og4080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 318382180181174183182180183183156-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og4080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 3.18, N = 3SE +/- 0.33, N = 319190187189181191193188186190173-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og306090120150SE +/- 1.94, N = 5SE +/- 1.43, N = 12SE +/- 2.52, N = 3SE +/- 1.20, N = 3SE +/- 1.32, N = 10SE +/- 1.53, N = 3SE +/- 1.50, N = 8SE +/- 1.50, N = 12SE +/- 1.40, N = 812774126131118127124128125128120-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og4080120160200SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 318092179180172180187178178180168-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color Space-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og50100150200250SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3210102210211203211209212209214195-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og60M120M180M240M300MSE +/- 1208988.13, N = 3SE +/- 321625.93, N = 3SE +/- 1109543.66, N = 3SE +/- 232284.24, N = 3SE +/- 219028.41, N = 3SE +/- 211192.91, N = 3SE +/- 193963.83, N = 3SE +/- 54052.05, N = 3SE +/- 67545.97, N = 3SE +/- 1057235.60, N = 3SE +/- 144731.32, N = 3268506472267404445268455578267311970267315647267268023267055407267172145267239405268173400267368671-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -O3 -march=native -lm

Hierarchical INTegration

Test: DOUBLE

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: DOUBLE-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og130M260M390M520M650MSE +/- 7042504.19, N = 3SE +/- 8177784.85, N = 3SE +/- 1617493.63, N = 3SE +/- 6546514.72, N = 3SE +/- 1832535.45, N = 3SE +/- 9814749.06, N = 4SE +/- 7419115.18, N = 3SE +/- 2338894.48, N = 3SE +/- 7234705.24, N = 9SE +/- 10585229.89, N = 3SE +/- 9099115.17, N = 9589289926598545342585060029599481605595428047617516626605331833602535297618644101626640400597234266-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -O3 -march=native -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og2004006008001000SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 5.78, N = 3SE +/- 5.25, N = 3SE +/- 6.27, N = 3SE +/- 2.90, N = 3SE +/- 8.54, N = 3SE +/- 7.21, N = 3SE +/- 2.81, N = 3SE +/- 2.58, N = 3SE +/- 4.02, N = 310113837851017100810011022100710001022772-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -O3 -mavx2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Blowfish-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og14K28K42K56K70KSE +/- 1082.96, N = 12SE +/- 215.16, N = 12SE +/- 1098.57, N = 4SE +/- 1387.83, N = 12SE +/- 1049.43, N = 3SE +/- 1967.27, N = 12SE +/- 1454.31, N = 11SE +/- 1953.49, N = 12SE +/- 1598.30, N = 12SE +/- 1395.50, N = 12SE +/- 1339.16, N = 966823151796599562718658066130962841635865876465117564531. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

John The Ripper

Test: Traditional DES

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Traditional DES-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og60M120M180M240M300MSE +/- 2346357.84, N = 3SE +/- 2756947.41, N = 3SE +/- 2774527.11, N = 10SE +/- 2178423.16, N = 3SE +/- 3859011.69, N = 12SE +/- 2041895.52, N = 3SE +/- 1656338.97, N = 3SE +/- 2839112.24, N = 3SE +/- 1374420.40, N = 3SE +/- 642920.77, N = 3SE +/- 2445677.03, N = 32600196672182320002570672002574076672538685832559570002587706672570580002547773332607366672392893331. (CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1020304050SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.5741.7914.3214.0710.8414.009.8010.9610.3814.1416.78-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression Throughput-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og306090120150SE +/- 0.06, N = 3SE +/- 0.76, N = 3SE +/- 0.03, N = 3SE +/- 0.67, N = 3SE +/- 0.03, N = 3SE +/- 0.91, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3144111139140141142144139144140141-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og110K220K330K440K550KSE +/- 3819.62, N = 3SE +/- 4794.85, N = 3SE +/- 6061.06, N = 3SE +/- 2768.55, N = 3SE +/- 8068.95, N = 9SE +/- 5952.34, N = 3SE +/- 1629.04, N = 3SE +/- 3875.50, N = 3SE +/- 7546.41, N = 4SE +/- 4765.38, N = 3SE +/- 3395.99, N = 3505031419700515102515340490551510425508384529699454256520570507203-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og11002200330044005500SE +/- 50.49, N = 3SE +/- 29.67, N = 3SE +/- 71.59, N = 9SE +/- 49.50, N = 9SE +/- 29.58, N = 3SE +/- 12.79, N = 3SE +/- 66.04, N = 9SE +/- 55.61, N = 6SE +/- 66.20, N = 5SE +/- 26.99, N = 3SE +/- 47.02, N = 350684585430141674262427241024239431940953767-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Only-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og3K6K9K12K15KSE +/- 101.16, N = 3SE +/- 69.77, N = 3SE +/- 119.98, N = 3SE +/- 48.94, N = 3SE +/- 32.93, N = 3SE +/- 122.77, N = 3SE +/- 125.79, N = 3SE +/- 295.94, N = 3SE +/- 224.15, N = 6SE +/- 172.87, N = 3SE +/- 149.12, N = 3151889063133031493115099151111535215353160121485113333-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Single Thread - Mode: Read Write-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og2004006008001000SE +/- 19.20, N = 3SE +/- 6.16, N = 3SE +/- 8.54, N = 3SE +/- 17.89, N = 3SE +/- 12.77, N = 8SE +/- 12.77, N = 9SE +/- 19.14, N = 3SE +/- 1.22, N = 3SE +/- 9.80, N = 3SE +/- 3.41, N = 3SE +/- 4.15, N = 31145886106510371079106011251125107411271080-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og400800120016002000SE +/- 11.89, N = 3SE +/- 5.30, N = 3SE +/- 6.18, N = 3SE +/- 12.17, N = 3SE +/- 7.96, N = 3SE +/- 23.45, N = 5SE +/- 20.59, N = 3SE +/- 8.94, N = 3SE +/- 35.09, N = 3SE +/- 24.31, N = 3SE +/- 18.65, N = 51961434151913691800150118251724174713071205-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og30060090012001500SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.27, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3SE +/- 0.28, N = 35571085765605605575615601480568210-O3 -march=znver1-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og60120180240300SE +/- 0.03, N = 3SE +/- 1.00, N = 3SE +/- 0.44, N = 3SE +/- 0.61, N = 3SE +/- 0.51, N = 3SE +/- 1.88, N = 3SE +/- 1.04, N = 3SE +/- 0.71, N = 3SE +/- 1.03, N = 3SE +/- 2.57, N = 3SE +/- 2.03, N = 3227201226230232229221231230232257-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og6001200180024003000SE +/- 10.26, N = 3SE +/- 5.25, N = 3SE +/- 59.22, N = 3SE +/- 11.61, N = 3SE +/- 3.37, N = 3SE +/- 14.53, N = 3SE +/- 12.59, N = 3SE +/- 12.19, N = 3SE +/- 2.13, N = 3SE +/- 5.41, N = 3SE +/- 3.66, N = 32482516241125272475258425792515205222992188-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og10002000300040005000SE +/- 65.35, N = 3SE +/- 30.37, N = 3SE +/- 32.13, N = 3SE +/- 54.41, N = 3SE +/- 42.05, N = 3SE +/- 173.68, N = 3SE +/- 107.53, N = 3SE +/- 57.07, N = 3SE +/- 178.79, N = 3SE +/- 129.41, N = 3SE +/- 139.26, N = 34851512346626094307323140894396330025152539-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og400800120016002000SE +/- 0.21, N = 3SE +/- 0.35, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 316898329199191427101616769191675918919-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og20M40M60M80M100MSE +/- 402849.09, N = 3SE +/- 1595135.10, N = 3SE +/- 549524.55, N = 3SE +/- 1016511.53, N = 3SE +/- 673773.08, N = 3SE +/- 324013.38, N = 3SE +/- 460638.54, N = 3SE +/- 468403.16, N = 3SE +/- 579693.09, N = 3SE +/- 823190.91, N = 3106497994105868175105698092104480422104121840106084276106507244104197865104536605105709690-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2-Og1. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2019-02-031080p 8-bit YUV To AV1 Video Encode-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og0.38930.77861.16791.55721.9465SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 9SE +/- 0.02, N = 8SE +/- 0.02, N = 3SE +/- 0.02, N = 6SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 31.681.691.701.691.731.701.701.671.711.691.731. (CC) gcc options: -mavx2 -fPIE -fPIC -O2 -pie -lpthread -lm

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2019-02-151080p 8-bit YUV To AV1 Video Encode-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1.32982.65963.98945.31926.649SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 35.895.885.875.815.905.845.915.895.845.905.861. (CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video Encode-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og20406080100SE +/- 0.18, N = 3SE +/- 0.30, N = 3SE +/- 1.09, N = 3SE +/- 0.50, N = 3SE +/- 0.75, N = 3SE +/- 1.17, N = 395.9197.8094.8297.2695.7992.68-march=native-Ofast -march=znver1-ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1-Og1. (CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To Compile-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og714212835SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 325.9411.4317.5123.8226.0823.8226.1124.0328.6226.5014.59

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database Search-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og3691215SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 36.299.026.936.626.576.546.006.826.166.567.39-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -pthread -lhmmer -lsquid -lm

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og306090120150SE +/- 0.30, N = 3SE +/- 0.10, N = 3SE +/- 0.21, N = 8SE +/- 0.10, N = 3SE +/- 0.44, N = 3SE +/- 0.45, N = 3SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.45, N = 3SE +/- 0.98, N = 3SE +/- 0.04, N = 324.885.2318.4223.6325.0623.7825.2123.91118.4898.677.89

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-Og20406080100SE +/- 0.33, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 3SE +/- 0.22, N = 3SE +/- 0.25, N = 3SE +/- 0.26, N = 3SE +/- 0.10, N = 378.1315.1929.0552.1778.1951.9652.5821.42-O3 -march=znver1-O1-O2-O3-O2 -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-Og1. (CC) gcc options: -pedantic -ldl -lz -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og200K400K600K800K1000KSE +/- 667.00, N = 5SE +/- 333.13, N = 5SE +/- 542.88, N = 5SE +/- 507.80, N = 5SE +/- 272.00, N = 5SE +/- 272.00, N = 5SE +/- 507.80, N = 5SE +/- 508.06, N = 5SE +/- 270.20, N = 5SE +/- 331.91, N = 5SE +/- 333.13, N = 5865732865459864102864373864915864915864373864916863018864101865187-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -O3 -march=native

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-Og3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 5SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.4112.5012.5312.5412.3112.4212.3712.5612.7512.52-march=znver1 -std=c++11-O0 -std=c++11-O1 -std=c++11-O2 -std=c++11-std=c++11-O2 -march=znver1 -std=c++11-Ofast -march=znver1 -std=c++11-O2 -ftree-vectorize -ftree-slp-vectorize -std=c++11-march=znver1 -flto-Og -std=c++111. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-Og510152025SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 320.0520.1320.3420.8620.39-O2 -march=native -std=c++11-Ofast -march=znver1 -std=c++11-O2 -ftree-vectorize -ftree-slp-vectorize -std=c++11-march=znver1 -flto-Og -std=c++111. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-Og306090120150SE +/- 1.78, N = 3SE +/- 0.12, N = 3SE +/- 1.49, N = 3SE +/- 0.97, N = 3SE +/- 0.81, N = 3SE +/- 1.40, N = 3SE +/- 0.52, N = 3SE +/- 0.47, N = 3SE +/- 1.09, N = 3144102145144147144144144142-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-Og1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og816243240SE +/- 0.18, N = 3SE +/- 0.67, N = 3SE +/- 0.37, N = 11SE +/- 0.24, N = 3SE +/- 0.41, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.46, N = 3SE +/- 0.38, N = 3SE +/- 0.58, N = 335.5735.0035.6234.5535.2134.8034.9135.4135.0734.76-march=znver1-O0-O1-O2-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O2 -flto-Og1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og612182430SE +/- 0.44, N = 12SE +/- 0.38, N = 4SE +/- 0.25, N = 12SE +/- 0.49, N = 9SE +/- 0.38, N = 12SE +/- 0.35, N = 12SE +/- 0.21, N = 12SE +/- 0.33, N = 12SE +/- 0.24, N = 11SE +/- 0.31, N = 12SE +/- 0.29, N = 1214.3723.1214.1114.4813.6614.7113.7713.6713.1614.0814.39-O3 -march=znver1-O0-O1-O2-O3-O2 -march=znver1-Ofast -march=znver1-O2 -ftree-vectorize -ftree-slp-vectorize-O3 -march=znver1 -flto-O2 -flto-Og1. (CC) gcc options: -pthread -lz -llzma


Phoronix Test Suite v10.8.4