AMD EPYC Compiler Tuning

GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1902194-SP-AMDEPYCCO19&grr&sro&rro.

System Details

- Processor: 2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads)
- Motherboard: Dell 02MJ3T (1.2.5 BIOS)
- Chipset: AMD Family 17h
- Memory: 16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2
- Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860
- Graphics: Matrox G200eW3
- Monitor: VE228
- Network: 2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe
- OS: Ubuntu 18.04
- Kernel: 5.0.0-050000rc6-generic (x86_64) 20190210
- Desktop: GNOME Shell 3.28.3
- Display Server: X Server
- Compiler: GCC 9.0.1 20190210
- File-System: ext4
- Screen Resolution: 1600x1200

The same system was used for all eleven compiler configurations.

Environment Details

- -O0: CXXFLAGS=-O0 CFLAGS=-O0
- -Og: CXXFLAGS=-Og CFLAGS=-Og
- -O1: CXXFLAGS=-O1 CFLAGS=-O1
- -O2: CXXFLAGS=-O2 CFLAGS=-O2
- -O2 -ftree-vectorize -ftree-slp-vectorize: CXXFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize" CFLAGS="-O2 -ftree-vectorize -ftree-slp-vectorize"
- -O2 -march=znver1: CXXFLAGS="-O2 -march=znver1" CFLAGS="-O2 -march=znver1"
- -O2 -flto: CXXFLAGS="-O2 -flto" CFLAGS="-O2 -flto"
- -O3: CXXFLAGS=-O3 CFLAGS=-O3
- -O3 -march=znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"
- -O3 -march=znver1 -flto: CXXFLAGS="-O3 -march=znver1 -flto" CFLAGS="-O3 -march=znver1 -flto"
- -Ofast -march=znver1: CXXFLAGS="-Ofast -march=znver1" CFLAGS="-Ofast -march=znver1"

Compiler Details

- --disable-multilib --enable-checking=release

Security Details

- __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
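The Environment Details amount to setting CFLAGS and CXXFLAGS to the same flag string before each build. The following is a minimal sketch of how those eleven configurations could be expressed in Python. The names `CONFIGS` and `build_env` are hypothetical, and how the benchmark harness actually consumes the variables is not shown in this export.

```python
import os

# Flag strings as listed under Environment Details. Each configuration uses
# the same string for both CFLAGS and CXXFLAGS.
CONFIGS = [
    "-O0",
    "-Og",
    "-O1",
    "-O2",
    "-O2 -ftree-vectorize -ftree-slp-vectorize",
    "-O2 -march=znver1",
    "-O2 -flto",
    "-O3",
    "-O3 -march=znver1",
    "-O3 -march=znver1 -flto",
    "-Ofast -march=znver1",
]


def build_env(flags, base=None):
    """Return a copy of an environment with CFLAGS and CXXFLAGS set to flags.

    `base` defaults to the current process environment; pass a dict to start
    from something else (hypothetical helper, not part of any benchmark tool).
    """
    env = dict(os.environ if base is None else base)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for flags in CONFIGS:
        env = build_env(flags, base={})
        print(f"CFLAGS={env['CFLAGS']!r} CXXFLAGS={env['CXXFLAGS']!r}")
```

The resulting dictionary can be passed as the `env` argument of `subprocess.run` when launching a build, so that each configuration is compiled in isolation from the others.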

Result Overview

Tests included in this result file, each run for the configurations -O0, -Og, -O1, -O2, -O2 -ftree-vectorize -ftree-slp-vectorize, -O2 -march=znver1, -O2 -flto, -O3, -O3 -march=znver1, -O3 -march=znver1 -flto and -Ofast -march=znver1:

- pgbench: Buffer Test - Normal Load - Read Write
- pgbench: Buffer Test - Single Thread - Read Write
- pgbench: Buffer Test - Normal Load - Read Only
- pgbench: Buffer Test - Single Thread - Read Only
- hint: DOUBLE
- fftw: Float + SSE - 2D FFT Size 4096
- vpxenc: vpxenc VP9 1080p Video Encode
- hint: FLOAT
- vpxenc: vpxenc VP9 1080p Video Encode
- fftw: Stock - 2D FFT Size 4096
- svt-av1: 1080p 8-bit YUV To AV1 Video Encode
- graphics-magick: Resizing
- stockfish: Total Time
- john-the-ripper: Blowfish
- graphics-magick: Noise-Gaussian
- graphics-magick: Enhanced
- graphics-magick: Sharpen
- graphics-magick: Swirl
- graphics-magick: HWB Color Space
- graphics-magick: Rotate
- john-the-ripper: Traditional DES
- himeno: Poisson Pressure Solver
- aobench: 2048 x 2048 - Total Time
- compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19
- build-php: Time To Compile
- build-imagemagick: Time To Compile
- svt-av1: 1080p 8-bit YUV To AV1 Video Encode
- scimark2: Composite
- encode-flac: WAV To FLAC
- c-ray: Total Time - 4K, 16 Rays Per Pixel
- build-apache: Time To Compile
- x265: H.265 1080p Video Encoding
- encode-mp3: WAV To MP3
- svt-vp9: 1080p 8-bit YUV To VP9 Video Encode
- hmmer: Pfam Database Search
- tjbench: Decompression Throughput
- bullet: Raytests
- x264: H.264 Video Encoding
- tscp: AI Chess Performance
- ctx-clock: Context Switch Time
- bullet: Convex Trimesh
- bullet: Prim Trimesh
- bullet: 136 Ragdolls
- bullet: 1000 Convex
- bullet: 1000 Stack
- bullet: 3000 Fall
- scimark2: Jacobi Successive Over-Relaxation
- scimark2: Dense LU Matrix Factorization
- scimark2: Sparse Matrix Multiply
- scimark2: Fast Fourier Transform
- scimark2: Monte Carlo

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 10.3 - TPS, More Is Better

- -Og: 3767 (SE +/- 47.02, N = 3)
- -Ofast -march=znver1: 4102 (SE +/- 66.04, N = 9)
- -O3 -march=znver1 -flto: 4319 (SE +/- 66.20, N = 5)
- -O3 -march=znver1: 5068 (SE +/- 50.49, N = 3)
- -O3: 4262 (SE +/- 29.58, N = 3)
- -O2 -march=znver1: 4272 (SE +/- 12.79, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 4239 (SE +/- 55.61, N = 6)
- -O2 -flto: 4095 (SE +/- 26.99, N = 3)
- -O2: 4167 (SE +/- 49.50, N = 9)
- -O1: 4301 (SE +/- 71.59, N = 9)
- -O0: 4585 (SE +/- 29.67, N = 3)

(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
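Each result is reported as a mean together with "SE +/- x, N = n". The export does not define SE; assuming it denotes the standard error of the mean over N runs (sample standard deviation divided by the square root of N), it could be computed as in the sketch below. The function name `standard_error` and the three run values are made up for illustration and are not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    This is an assumed definition; the result export only prints the value.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("at least two runs are needed to estimate the SE")
    return statistics.stdev(samples) / math.sqrt(n)


if __name__ == "__main__":
    # Hypothetical per-run TPS values, not measured data.
    runs = [4100.0, 4200.0, 4300.0]
    mean = statistics.mean(runs)
    se = standard_error(runs)
    print(f"{mean:.0f} (SE +/- {se:.2f}, N = {len(runs)})")
```

Under this reading, a larger N in the listing usually means the harness added runs because the first few varied, which is why the noisier pgbench configurations show N = 9.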

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

PostgreSQL pgbench 10.3 - TPS, More Is Better

- -Og: 1080 (SE +/- 4.15, N = 3)
- -Ofast -march=znver1: 1125 (SE +/- 19.14, N = 3)
- -O3 -march=znver1 -flto: 1074 (SE +/- 9.80, N = 3)
- -O3 -march=znver1: 1145 (SE +/- 19.20, N = 3)
- -O3: 1079 (SE +/- 12.77, N = 8)
- -O2 -march=znver1: 1060 (SE +/- 12.77, N = 9)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 1125 (SE +/- 1.22, N = 3)
- -O2 -flto: 1127 (SE +/- 3.41, N = 3)
- -O2: 1037 (SE +/- 17.89, N = 3)
- -O1: 1065 (SE +/- 8.54, N = 3)
- -O0: 886 (SE +/- 6.16, N = 3)

(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3 - TPS, More Is Better

- -Og: 507203 (SE +/- 3395.99, N = 3)
- -Ofast -march=znver1: 508384 (SE +/- 1629.04, N = 3)
- -O3 -march=znver1 -flto: 454256 (SE +/- 7546.41, N = 4)
- -O3 -march=znver1: 505031 (SE +/- 3819.62, N = 3)
- -O3: 490551 (SE +/- 8068.95, N = 9)
- -O2 -march=znver1: 510425 (SE +/- 5952.34, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 529699 (SE +/- 3875.50, N = 3)
- -O2 -flto: 520570 (SE +/- 4765.38, N = 3)
- -O2: 515340 (SE +/- 2768.55, N = 3)
- -O1: 515102 (SE +/- 6061.06, N = 3)
- -O0: 419700 (SE +/- 4794.85, N = 3)

(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Only

PostgreSQL pgbench 10.3 - TPS, More Is Better

- -Og: 13333 (SE +/- 149.12, N = 3)
- -Ofast -march=znver1: 15352 (SE +/- 125.79, N = 3)
- -O3 -march=znver1 -flto: 16012 (SE +/- 224.15, N = 6)
- -O3 -march=znver1: 15188 (SE +/- 101.16, N = 3)
- -O3: 15099 (SE +/- 32.93, N = 3)
- -O2 -march=znver1: 15111 (SE +/- 122.77, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 15353 (SE +/- 295.94, N = 3)
- -O2 -flto: 14851 (SE +/- 172.87, N = 3)
- -O2: 14931 (SE +/- 48.94, N = 3)
- -O1: 13303 (SE +/- 119.98, N = 3)
- -O0: 9063 (SE +/- 69.77, N = 3)

(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Hierarchical INTegration

Test: DOUBLE

Hierarchical INTegration 1.0 - QUIPs, More Is Better

- -Og: 597234266 (SE +/- 9099115.17, N = 9)
- -Ofast -march=znver1: 605331833 (SE +/- 7419115.18, N = 3)
- -O3 -march=znver1 -flto: 618644101 (SE +/- 7234705.24, N = 9)
- -O3 -march=znver1: 589289926 (SE +/- 7042504.19, N = 3)
- -O3: 595428047 (SE +/- 1832535.45, N = 3)
- -O2 -march=znver1: 617516626 (SE +/- 9814749.06, N = 4)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 602535297 (SE +/- 2338894.48, N = 3)
- -O2 -flto: 626640400 (SE +/- 10585229.89, N = 3)
- -O2: 599481605 (SE +/- 6546514.72, N = 3)
- -O1: 585060029 (SE +/- 1617493.63, N = 3)
- -O0: 598545342 (SE +/- 8177784.85, N = 3)

(CC) gcc options: -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6 - Mflops, More Is Better

- -Og: 12642 (SE +/- 165.58, N = 3)
- -Ofast -march=znver1: 13166 (SE +/- 160.71, N = 8)
- -O3 -march=znver1 -flto: 13110 (SE +/- 49.65, N = 3)
- -O3 -march=znver1: 12752 (SE +/- 78.62, N = 3)
- -O3: 13555 (SE +/- 115.66, N = 3)
- -O2 -march=znver1: 13346 (SE +/- 15.71, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 13285 (SE +/- 72.78, N = 3)
- -O2 -flto: 13214 (SE +/- 134.39, N = 3)
- -O2: 13391 (SE +/- 160.95, N = 3)
- -O1: 13468 (SE +/- 70.29, N = 3)
- -O0: 2193 (SE +/- 1.00, N = 3)

(CC) gcc options: -pthread -lm

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

VP9 libvpx Encoding 1.8.0 - Frames Per Second, More Is Better

- -Og: 12.52 (SE +/- 0.01, N = 3)
- -Ofast -march=znver1: 12.37 (SE +/- 0.01, N = 3)
- -O3 -march=znver1 -flto: 12.75 (SE +/- 0.00, N = 3)
- -O3 -march=znver1: 12.41 (SE +/- 0.00, N = 3)
- -O3: 12.31 (SE +/- 0.18, N = 5)
- -O2 -march=znver1: 12.42 (SE +/- 0.02, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 12.56 (SE +/- 0.02, N = 3)
- -O2: 12.54 (SE +/- 0.01, N = 3)
- -O1: 12.53 (SE +/- 0.01, N = 3)
- -O0: 12.50 (SE +/- 0.01, N = 3)

(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

Hierarchical INTegration

Test: FLOAT

Hierarchical INTegration 1.0 - QUIPs, More Is Better

- -Og: 267368671 (SE +/- 144731.32, N = 3)
- -Ofast -march=znver1: 267055407 (SE +/- 193963.83, N = 3)
- -O3 -march=znver1 -flto: 267239405 (SE +/- 67545.97, N = 3)
- -O3 -march=znver1: 268506472 (SE +/- 1208988.13, N = 3)
- -O3: 267315647 (SE +/- 219028.41, N = 3)
- -O2 -march=znver1: 267268023 (SE +/- 211192.91, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 267172145 (SE +/- 54052.05, N = 3)
- -O2 -flto: 268173400 (SE +/- 1057235.60, N = 3)
- -O2: 267311970 (SE +/- 232284.24, N = 3)
- -O1: 268455578 (SE +/- 1109543.66, N = 3)
- -O0: 267404445 (SE +/- 321625.93, N = 3)

(CC) gcc options: -O3 -march=native -lm

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

VP9 libvpx Encoding 1.8.0 - Frames Per Second, More Is Better

- -Og: 20.39 (SE +/- 0.01, N = 3)
- -Ofast -march=znver1: 20.13 (SE +/- 0.04, N = 3)
- -O3 -march=znver1 -flto: 20.86 (SE +/- 0.02, N = 3)
- -O2 -march=znver1: 20.05 (SE +/- 0.07, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 20.34 (SE +/- 0.05, N = 3)

(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6 - Mflops, More Is Better

- -Og: 4366 (SE +/- 11.16, N = 3)
- -Ofast -march=znver1: 4885 (SE +/- 10.24, N = 3)
- -O3 -march=znver1 -flto: 5571 (SE +/- 26.34, N = 3)
- -O3 -march=znver1: 5006 (SE +/- 47.88, N = 3)
- -O3: 4751 (SE +/- 41.07, N = 3)
- -O2 -march=znver1: 5074 (SE +/- 13.33, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 4805 (SE +/- 26.82, N = 3)
- -O2 -flto: 5091 (SE +/- 5.87, N = 3)
- -O2: 4625 (SE +/- 2.28, N = 3)
- -O1: 4632 (SE +/- 6.37, N = 3)
- -O0: 1708 (SE +/- 5.17, N = 3)

(CC) gcc options: -pthread -lm

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

SVT-AV1 2019-02-03 - Frames Per Second, More Is Better

- -Og: 1.73 (SE +/- 0.00, N = 3)
- -Ofast -march=znver1: 1.70 (SE +/- 0.03, N = 3)
- -O3 -march=znver1 -flto: 1.71 (SE +/- 0.01, N = 3)
- -O3 -march=znver1: 1.68 (SE +/- 0.02, N = 3)
- -O3: 1.73 (SE +/- 0.02, N = 3)
- -O2 -march=znver1: 1.70 (SE +/- 0.02, N = 6)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 1.67 (SE +/- 0.01, N = 3)
- -O2 -flto: 1.69 (SE +/- 0.03, N = 3)
- -O2: 1.69 (SE +/- 0.02, N = 8)
- -O1: 1.70 (SE +/- 0.02, N = 9)
- -O0: 1.69 (SE +/- 0.00, N = 3)

(CC) gcc options: -mavx2 -fPIE -fPIC -O2 -pie -lpthread -lm

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 120
- -Ofast -march=znver1: 124
- -O3 -march=znver1 -flto: 125
- -O3 -march=znver1: 127
- -O3: 118
- -O2 -march=znver1: 127
- -O2 -ftree-vectorize -ftree-slp-vectorize: 128
- -O2 -flto: 128
- -O2: 131
- -O1: 126
- -O0: 74

SE values reported (nine entries, not matched to configurations in this export): +/- 1.40 (N = 8), 1.32 (N = 10), 1.50 (N = 8), 1.94 (N = 5), 1.20 (N = 3), 1.53 (N = 3), 1.50 (N = 12), 2.52 (N = 3), 1.43 (N = 12)

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Stockfish

Total Time

Stockfish 9 - Nodes Per Second, More Is Better

- -Og: 105709690 (SE +/- 823190.91, N = 3)
- -Ofast -march=znver1: 106507244 (SE +/- 460638.54, N = 3)
- -O3 -march=znver1: 106497994 (SE +/- 402849.09, N = 3)
- -O3: 104121840 (SE +/- 673773.08, N = 3)
- -O2 -march=znver1: 106084276 (SE +/- 324013.38, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 104197865 (SE +/- 468403.16, N = 3)
- -O2 -flto: 104536605 (SE +/- 579693.09, N = 3)
- -O2: 104480422 (SE +/- 1016511.53, N = 3)
- -O1: 105698092 (SE +/- 549524.55, N = 3)
- -O0: 105868175 (SE +/- 1595135.10, N = 3)

(CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

John The Ripper

Test: Blowfish

John The Ripper 1.8.0-jumbo-1 - Real C/S, More Is Better

- -Og: 56453 (SE +/- 1339.16, N = 9)
- -Ofast -march=znver1: 62841 (SE +/- 1454.31, N = 11)
- -O3 -march=znver1 -flto: 58764 (SE +/- 1598.30, N = 12)
- -O3 -march=znver1: 66823 (SE +/- 1082.96, N = 12)
- -O3: 65806 (SE +/- 1049.43, N = 3)
- -O2 -march=znver1: 61309 (SE +/- 1967.27, N = 12)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 63586 (SE +/- 1953.49, N = 12)
- -O2 -flto: 65117 (SE +/- 1395.50, N = 12)
- -O2: 62718 (SE +/- 1387.83, N = 12)
- -O1: 65995 (SE +/- 1098.57, N = 4)
- -O0: 15179 (SE +/- 215.16, N = 12)

(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 168
- -Ofast -march=znver1: 187
- -O3 -march=znver1 -flto: 178
- -O3 -march=znver1: 180
- -O3: 172
- -O2 -march=znver1: 180
- -O2 -ftree-vectorize -ftree-slp-vectorize: 178
- -O2 -flto: 180
- -O2: 180
- -O1: 179
- -O0: 92

SE values reported (six entries, N = 3 each, not matched to configurations in this export): +/- 0.58, 0.58, 0.58, 0.88, 0.88, 1.20

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 173
- -Ofast -march=znver1: 193
- -O3 -march=znver1 -flto: 186
- -O3 -march=znver1: 191
- -O3: 181
- -O2 -march=znver1: 191
- -O2 -ftree-vectorize -ftree-slp-vectorize: 188
- -O2 -flto: 190
- -O2: 189
- -O1: 187
- -O0: 90

SE values reported (five entries, N = 3 each, not matched to configurations in this export): +/- 3.18, 0.33, 0.33, 0.33, 0.33

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 156
- -Ofast -march=znver1: 182
- -O3 -march=znver1 -flto: 183
- -O3 -march=znver1: 183
- -O3: 174
- -O2 -march=znver1: 183
- -O2 -ftree-vectorize -ftree-slp-vectorize: 180
- -O2 -flto: 183
- -O2: 181
- -O1: 180
- -O0: 82

SE values reported (four entries, N = 3 each, not matched to configurations in this export): +/- 0.58, 0.33, 0.33, 0.58

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 181
- -Ofast -march=znver1: 196
- -O3 -march=znver1 -flto: 194
- -O3 -march=znver1: 195
- -O3: 189
- -O2 -march=znver1: 196
- -O2 -ftree-vectorize -ftree-slp-vectorize: 196
- -O2 -flto: 196
- -O2: 195
- -O1: 194
- -O0: 96

SE values reported (five entries, N = 3 each, not matched to configurations in this export): +/- 0.67, 0.33, 0.33, 0.33, 0.33

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 195
- -Ofast -march=znver1: 209
- -O3 -march=znver1 -flto: 209
- -O3 -march=znver1: 210
- -O3: 203
- -O2 -march=znver1: 211
- -O2 -ftree-vectorize -ftree-slp-vectorize: 212
- -O2 -flto: 214
- -O2: 211
- -O1: 210
- -O0: 102

SE values reported (eight entries, N = 3 each, not matched to configurations in this export): +/- 0.33, 0.88, 0.33, 0.33, 0.58, 0.58, 0.33, 0.33

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.30 - Iterations Per Minute, More Is Better

- -Og: 181
- -Ofast -march=znver1: 189
- -O3 -march=znver1 -flto: 188
- -O3 -march=znver1: 190
- -O3: 183
- -O2 -march=znver1: 191
- -O2 -ftree-vectorize -ftree-slp-vectorize: 190
- -O2 -flto: 191
- -O2: 191
- -O1: 191
- -O0: 98

SE values reported (one entry, not matched to a configuration in this export): +/- 0.33, N = 3

(CC) gcc options: -fopenmp -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

John The Ripper

Test: Traditional DES

John The Ripper 1.8.0-jumbo-1 - Real C/S, More Is Better

- -Og: 239289333 (SE +/- 2445677.03, N = 3)
- -Ofast -march=znver1: 258770667 (SE +/- 1656338.97, N = 3)
- -O3 -march=znver1 -flto: 254777333 (SE +/- 1374420.40, N = 3)
- -O3 -march=znver1: 260019667 (SE +/- 2346357.84, N = 3)
- -O3: 253868583 (SE +/- 3859011.69, N = 12)
- -O2 -march=znver1: 255957000 (SE +/- 2041895.52, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 257058000 (SE +/- 2839112.24, N = 3)
- -O2 -flto: 260736667 (SE +/- 642920.77, N = 3)
- -O2: 257407667 (SE +/- 2178423.16, N = 3)
- -O1: 257067200 (SE +/- 2774527.11, N = 10)
- -O0: 218232000 (SE +/- 2756947.41, N = 3)

(CC) gcc options: -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - MFLOPS, More Is Better

- -Og: 772 (SE +/- 4.02, N = 3)
- -Ofast -march=znver1: 1022 (SE +/- 8.54, N = 3)
- -O3 -march=znver1 -flto: 1000 (SE +/- 2.81, N = 3)
- -O3 -march=znver1: 1011 (SE +/- 0.08, N = 3)
- -O3: 1008 (SE +/- 6.27, N = 3)
- -O2 -march=znver1: 1001 (SE +/- 2.90, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 1007 (SE +/- 7.21, N = 3)
- -O2 -flto: 1022 (SE +/- 2.58, N = 3)
- -O2: 1017 (SE +/- 5.25, N = 3)
- -O1: 785 (SE +/- 5.78, N = 3)
- -O0: 383 (SE +/- 0.11, N = 3)

(CC) gcc options: -O3 -mavx2

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Seconds, Fewer Is Better

- -Og: 77.41 (SE +/- 0.01, N = 3)
- -Ofast -march=znver1: 51.73 (SE +/- 0.01, N = 3)
- -O3 -march=znver1 -flto: 52.08 (SE +/- 0.02, N = 3)
- -O3 -march=znver1: 51.49 (SE +/- 0.02, N = 3)
- -O3: 53.53 (SE +/- 0.01, N = 3)
- -O2 -march=znver1: 54.35 (SE +/- 0.01, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 55.53 (SE +/- 0.01, N = 3)
- -O2 -flto: 55.52 (SE +/- 0.02, N = 3)
- -O2: 55.54 (SE +/- 0.03, N = 3)
- -O1: 56.61 (SE +/- 0.02, N = 3)
- -O0: 92.50 (SE +/- 0.06, N = 3)

(CC) gcc options: -lm -O3

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4 - Seconds, Fewer Is Better

- -Og: 14.39 (SE +/- 0.29, N = 12)
- -Ofast -march=znver1: 13.77 (SE +/- 0.21, N = 12)
- -O3 -march=znver1 -flto: 13.16 (SE +/- 0.24, N = 11)
- -O3 -march=znver1: 14.37 (SE +/- 0.44, N = 12)
- -O3: 13.66 (SE +/- 0.38, N = 12)
- -O2 -march=znver1: 14.71 (SE +/- 0.35, N = 12)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 13.67 (SE +/- 0.33, N = 12)
- -O2 -flto: 14.08 (SE +/- 0.31, N = 12)
- -O2: 14.48 (SE +/- 0.49, N = 9)
- -O1: 14.11 (SE +/- 0.25, N = 12)
- -O0: 23.12 (SE +/- 0.38, N = 4)

(CC) gcc options: -pthread -lz -llzma

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.1.9 - Seconds, Fewer Is Better

- -Og: 21.42 (SE +/- 0.10, N = 3)
- -O3 -march=znver1: 78.13 (SE +/- 0.33, N = 3)
- -O3: 78.19 (SE +/- 0.22, N = 3)
- -O2 -march=znver1: 51.96 (SE +/- 0.25, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 52.58 (SE +/- 0.26, N = 3)
- -O2: 52.17 (SE +/- 0.18, N = 3)
- -O1: 29.05 (SE +/- 0.06, N = 3)
- -O0: 15.19 (SE +/- 0.02, N = 3)

(CC) gcc options: -pedantic -ldl -lz -lm

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0 - Seconds, Fewer Is Better

- -Og: 7.89 (SE +/- 0.04, N = 3)
- -Ofast -march=znver1: 25.21 (SE +/- 0.34, N = 3)
- -O3 -march=znver1 -flto: 118.48 (SE +/- 0.45, N = 3)
- -O3 -march=znver1: 24.88 (SE +/- 0.30, N = 3)
- -O3: 25.06 (SE +/- 0.44, N = 3)
- -O2 -march=znver1: 23.78 (SE +/- 0.45, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 23.91 (SE +/- 0.20, N = 3)
- -O2 -flto: 98.67 (SE +/- 0.98, N = 3)
- -O2: 23.63 (SE +/- 0.10, N = 3)
- -O1: 18.42 (SE +/- 0.21, N = 8)
- -O0: 5.23 (SE +/- 0.10, N = 3)

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

SVT-AV1 2019-02-15 - Frames Per Second, More Is Better

- -Og: 5.86 (SE +/- 0.04, N = 3)
- -Ofast -march=znver1: 5.91 (SE +/- 0.05, N = 3)
- -O3 -march=znver1 -flto: 5.84 (SE +/- 0.02, N = 3)
- -O3 -march=znver1: 5.89 (SE +/- 0.04, N = 3)
- -O3: 5.90 (SE +/- 0.04, N = 3)
- -O2 -march=znver1: 5.84 (SE +/- 0.05, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 5.89 (SE +/- 0.02, N = 3)
- -O2 -flto: 5.90 (SE +/- 0.05, N = 3)
- -O2: 5.81 (SE +/- 0.03, N = 3)
- -O1: 5.87 (SE +/- 0.04, N = 3)
- -O0: 5.88 (SE +/- 0.02, N = 3)

(CC) gcc options: -mavx -fPIE -fPIC -O2 -pie -lpthread -lm

SciMark

Computational Test: Composite

SciMark 2.0 - Mflops, More Is Better

- -Og: 1205 (SE +/- 18.65, N = 5)
- -Ofast -march=znver1: 1825 (SE +/- 20.59, N = 3)
- -O3 -march=znver1 -flto: 1747 (SE +/- 35.09, N = 3)
- -O3 -march=znver1: 1961 (SE +/- 11.89, N = 3)
- -O3: 1800 (SE +/- 7.96, N = 3)
- -O2 -march=znver1: 1501 (SE +/- 23.45, N = 5)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 1724 (SE +/- 8.94, N = 3)
- -O2 -flto: 1307 (SE +/- 24.31, N = 3)
- -O2: 1369 (SE +/- 12.17, N = 3)
- -O1: 1519 (SE +/- 6.18, N = 3)
- -O0: 434 (SE +/- 5.30, N = 3)

(CC) gcc options: -lm

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 - Seconds, Fewer Is Better

- -Og: 15.58 (SE +/- 0.11, N = 5)
- -Ofast -march=znver1: 13.95 (SE +/- 0.08, N = 5)
- -O3 -march=znver1 -flto: 14.21 (SE +/- 0.10, N = 5)
- -O3 -march=znver1: 13.85 (SE +/- 0.10, N = 5)
- -O3: 13.61 (SE +/- 0.12, N = 5)
- -O2 -march=znver1: 13.89 (SE +/- 0.09, N = 5)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 13.70 (SE +/- 0.14, N = 5)
- -O2 -flto: 13.64 (SE +/- 0.11, N = 5)
- -O2: 13.65 (SE +/- 0.08, N = 5)
- -O1: 15.01 (SE +/- 0.09, N = 5)
- -O0: 96.77 (SE +/- 0.12, N = 5)

(CXX) g++ options: -fvisibility=hidden -logg -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Seconds, Fewer Is Better

- -Og: 28.64 (SE +/- 0.04, N = 3)
- -Ofast -march=znver1: 10.40 (SE +/- 0.01, N = 3)
- -O3 -march=znver1 -flto: 11.31 (SE +/- 0.03, N = 3)
- -O3 -march=znver1: 11.35 (SE +/- 0.04, N = 3)
- -O3: 12.60 (SE +/- 0.04, N = 3)
- -O2 -march=znver1: 21.58 (SE +/- 0.04, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 25.77 (SE +/- 0.06, N = 3)
- -O2 -flto: 25.96 (SE +/- 0.02, N = 3)
- -O2: 25.84 (SE +/- 0.00, N = 3)
- -O1: 28.74 (SE +/- 0.02, N = 3)
- -O0: 44.92 (SE +/- 0.09, N = 3)

(CC) gcc options: -lm -lpthread -O3
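Because some tests are "More Is Better" and others "Fewer Is Better", comparing configurations is easier after normalizing against a baseline. The sketch below does this for the C-Ray times reported for this test, using -O2 as the baseline. The choice of baseline and the helper name `speedup_vs` are assumptions made for illustration, not part of the result file.

```python
# Mean C-Ray times in seconds as reported for this test (fewer is better).
C_RAY_SECONDS = {
    "-Og": 28.64,
    "-Ofast -march=znver1": 10.40,
    "-O3 -march=znver1 -flto": 11.31,
    "-O3 -march=znver1": 11.35,
    "-O3": 12.60,
    "-O2 -march=znver1": 21.58,
    "-O2 -ftree-vectorize -ftree-slp-vectorize": 25.77,
    "-O2 -flto": 25.96,
    "-O2": 25.84,
    "-O1": 28.74,
    "-O0": 44.92,
}


def speedup_vs(baseline, results, fewer_is_better=True):
    """Return each configuration's speedup factor relative to `baseline`.

    For time-like metrics the factor is baseline / value; for throughput-like
    metrics it is value / baseline. A factor above 1.0 means faster than the
    baseline in both cases. (Hypothetical helper for illustration.)
    """
    base = results[baseline]
    if fewer_is_better:
        return {name: base / value for name, value in results.items()}
    return {name: value / base for name, value in results.items()}


if __name__ == "__main__":
    speedups = speedup_vs("-O2", C_RAY_SECONDS)
    ranked = sorted(speedups.items(), key=lambda item: item[1], reverse=True)
    for name, factor in ranked:
        print(f"{name:45s} {factor:.2f}x")
```

For throughput results such as pgbench TPS, the same helper can be called with `fewer_is_better=False`.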

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7 - Seconds, Fewer Is Better

- -Og: 14.59 (SE +/- 0.03, N = 3)
- -Ofast -march=znver1: 26.11 (SE +/- 0.08, N = 3)
- -O3 -march=znver1 -flto: 28.62 (SE +/- 0.03, N = 3)
- -O3 -march=znver1: 25.94 (SE +/- 0.08, N = 3)
- -O3: 26.08 (SE +/- 0.04, N = 3)
- -O2 -march=znver1: 23.82 (SE +/- 0.04, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 24.03 (SE +/- 0.01, N = 3)
- -O2 -flto: 26.50 (SE +/- 0.03, N = 3)
- -O2: 23.82 (SE +/- 0.09, N = 3)
- -O1: 17.51 (SE +/- 0.06, N = 3)
- -O0: 11.43 (SE +/- 0.02, N = 3)

x265

H.265 1080p Video Encoding

x265 3.0 - Frames Per Second, More Is Better

- -Og: 34.76 (SE +/- 0.58, N = 3)
- -Ofast -march=znver1: 34.91 (SE +/- 0.01, N = 3)
- -O3 -march=znver1: 35.57 (SE +/- 0.18, N = 3)
- -O3: 35.21 (SE +/- 0.41, N = 3)
- -O2 -march=znver1: 34.80 (SE +/- 0.09, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 35.41 (SE +/- 0.46, N = 3)
- -O2 -flto: 35.07 (SE +/- 0.38, N = 3)
- -O2: 34.55 (SE +/- 0.24, N = 3)
- -O1: 35.62 (SE +/- 0.37, N = 11)
- -O0: 35.00 (SE +/- 0.67, N = 3)

(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 - Seconds, Fewer Is Better

- -Og: 16.78 (SE +/- 0.01, N = 3)
- -Ofast -march=znver1: 9.80 (SE +/- 0.00, N = 3)
- -O3 -march=znver1 -flto: 10.38 (SE +/- 0.00, N = 3)
- -O3 -march=znver1: 10.57 (SE +/- 0.00, N = 3)
- -O3: 10.84 (SE +/- 0.00, N = 3)
- -O2 -march=znver1: 14.00 (SE +/- 0.01, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 10.96 (SE +/- 0.01, N = 3)
- -O2 -flto: 14.14 (SE +/- 0.01, N = 3)
- -O2: 14.07 (SE +/- 0.01, N = 3)
- -O1: 14.32 (SE +/- 0.00, N = 3)
- -O0: 41.79 (SE +/- 0.00, N = 3)

(CC) gcc options: -lm

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

SVT-VP9 2019-02-17 - Frames Per Second, More Is Better

- -Og: 92.68 (SE +/- 1.17, N = 3)
- -Ofast -march=znver1: 97.80 (SE +/- 0.30, N = 3)
- -O3 -march=znver1 -flto: 97.26 (SE +/- 0.50, N = 3)
- -O2 -march=znver1: 95.91 (SE +/- 0.18, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 94.82 (SE +/- 1.09, N = 3)
- -O2 -flto: 95.79 (SE +/- 0.75, N = 3)

(CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 - Seconds, Fewer Is Better

- -Og: 7.39 (SE +/- 0.09, N = 3)
- -Ofast -march=znver1: 6.00 (SE +/- 0.04, N = 3)
- -O3 -march=znver1 -flto: 6.16 (SE +/- 0.04, N = 3)
- -O3 -march=znver1: 6.29 (SE +/- 0.07, N = 3)
- -O3: 6.57 (SE +/- 0.02, N = 3)
- -O2 -march=znver1: 6.54 (SE +/- 0.05, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 6.82 (SE +/- 0.04, N = 3)
- -O2 -flto: 6.56 (SE +/- 0.05, N = 3)
- -O2: 6.62 (SE +/- 0.01, N = 3)
- -O1: 6.93 (SE +/- 0.06, N = 3)
- -O0: 9.02 (SE +/- 0.14, N = 3)

(CC) gcc options: -pthread -lhmmer -lsquid -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 1.5.3 - Megapixels/sec, More Is Better

- -Og: 141 (SE +/- 0.04, N = 3)
- -Ofast -march=znver1: 144 (SE +/- 0.07, N = 3)
- -O3 -march=znver1 -flto: 144 (SE +/- 0.10, N = 3)
- -O3 -march=znver1: 144 (SE +/- 0.06, N = 3)
- -O3: 141 (SE +/- 0.03, N = 3)
- -O2 -march=znver1: 142 (SE +/- 0.91, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 139 (SE +/- 0.06, N = 3)
- -O2 -flto: 140 (SE +/- 0.03, N = 3)
- -O2: 140 (SE +/- 0.67, N = 3)
- -O1: 139 (SE +/- 0.03, N = 3)
- -O0: 111 (SE +/- 0.76, N = 3)

(CC) gcc options: -lm

Bullet Physics Engine

Test: Raytests

Bullet Physics Engine 2.81 - Seconds, Fewer Is Better

- -Og: 3.11 (SE +/- 0.00, N = 3)
- -Ofast -march=znver1: 3.09 (SE +/- 0.00, N = 3)
- -O3 -march=znver1: 3.09 (SE +/- 0.00, N = 3)
- -O3: 3.11 (SE +/- 0.00, N = 3)
- -O2 -march=znver1: 3.10 (SE +/- 0.00, N = 3)
- -O2 -ftree-vectorize -ftree-slp-vectorize: 3.11 (SE +/- 0.00, N = 3)
- -O2 -flto: 3.05 (SE +/- 0.00, N = 3)
- -O2: 3.12 (SE +/- 0.00, N = 3)
- -O1: 3.12 (SE +/- 0.00, N = 3)
- -O0: 3.11 (SE +/- 0.00, N = 3)

(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

x264

H.264 Video Encoding

x264 2018-09-25 - H.264 Video Encoding (Frames Per Second, More Is Better)
  -Og: 142 (SE +/- 1.09, N = 3)
  -Ofast -march=znver1: 144 (SE +/- 0.52, N = 3)
  -O3 -march=znver1: 144 (SE +/- 1.78, N = 3)
  -O3: 147 (SE +/- 0.81, N = 3)
  -O2 -march=znver1: 144 (SE +/- 1.40, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 144 (SE +/- 0.47, N = 3)
  -O2: 144 (SE +/- 0.97, N = 3)
  -O1: 145 (SE +/- 1.49, N = 3)
  -O0: 102 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

TSCP

AI Chess Performance

TSCP 1.81 - AI Chess Performance (Nodes Per Second, More Is Better)
  -Og: 865187 (SE +/- 333.13, N = 5)
  -Ofast -march=znver1: 864373 (SE +/- 507.80, N = 5)
  -O3 -march=znver1 -flto: 863018 (SE +/- 270.20, N = 5)
  -O3 -march=znver1: 865732 (SE +/- 667.00, N = 5)
  -O3: 864915 (SE +/- 272.00, N = 5)
  -O2 -march=znver1: 864915 (SE +/- 272.00, N = 5)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 864916 (SE +/- 508.06, N = 5)
  -O2 -flto: 864101 (SE +/- 331.91, N = 5)
  -O2: 864373 (SE +/- 507.80, N = 5)
  -O1: 864102 (SE +/- 542.88, N = 5)
  -O0: 865459 (SE +/- 333.13, N = 5)
1. (CC) gcc options: -O3 -march=native

ctx_clock

Context Switch Time

ctx_clock - Context Switch Time (Clocks, Fewer Is Better)
  -Og: 132
  -O3 -march=znver1 -flto: 132
  -O2 -ftree-vectorize -ftree-slp-vectorize: 132
  -O2 -flto: 132
1. (CC) gcc options:

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 - Test: Convex Trimesh (Seconds, Fewer Is Better)
  -Og: 1.35 (SE +/- 0.00, N = 3)
  -Ofast -march=znver1: 1.32 (SE +/- 0.00, N = 3)
  -O3 -march=znver1: 1.32 (SE +/- 0.00, N = 3)
  -O3: 1.35 (SE +/- 0.00, N = 3)
  -O2 -march=znver1: 1.32 (SE +/- 0.00, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 1.35 (SE +/- 0.00, N = 3)
  -O2 -flto: 1.33 (SE +/- 0.00, N = 3)
  -O2: 1.35 (SE +/- 0.00, N = 3)
  -O1: 1.35 (SE +/- 0.00, N = 3)
  -O0: 1.35 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 - Test: Prim Trimesh (Seconds, Fewer Is Better)
  -Og: 1.11 (SE +/- 0.00, N = 3)
  -Ofast -march=znver1: 1.11 (SE +/- 0.00, N = 3)
  -O3 -march=znver1: 1.11 (SE +/- 0.00, N = 3)
  -O3: 1.11 (SE +/- 0.00, N = 3)
  -O2 -march=znver1: 1.11 (SE +/- 0.00, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 1.11 (SE +/- 0.00, N = 3)
  -O2 -flto: 1.09 (SE +/- 0.00, N = 3)
  -O2: 1.11 (SE +/- 0.00, N = 3)
  -O1: 1.11 (SE +/- 0.00, N = 3)
  -O0: 1.11 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81 - Test: 136 Ragdolls (Seconds, Fewer Is Better)
  -Og: 3.14 (SE +/- 0.00, N = 3)
  -Ofast -march=znver1: 3.06 (SE +/- 0.00, N = 3)
  -O3 -march=znver1: 3.06 (SE +/- 0.00, N = 3)
  -O3: 3.15 (SE +/- 0.01, N = 3)
  -O2 -march=znver1: 3.05 (SE +/- 0.00, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 3.15 (SE +/- 0.00, N = 3)
  -O2 -flto: 3.24 (SE +/- 0.00, N = 3)
  -O2: 3.14 (SE +/- 0.00, N = 3)
  -O1: 3.15 (SE +/- 0.01, N = 3)
  -O0: 3.14 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 - Test: 1000 Convex (Seconds, Fewer Is Better)
  -Og: 5.37 (SE +/- 0.01, N = 3)
  -Ofast -march=znver1: 5.18 (SE +/- 0.00, N = 3)
  -O3 -march=znver1: 5.18 (SE +/- 0.00, N = 3)
  -O3: 5.38 (SE +/- 0.01, N = 3)
  -O2 -march=znver1: 5.19 (SE +/- 0.01, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 5.37 (SE +/- 0.00, N = 3)
  -O2 -flto: 5.40 (SE +/- 0.01, N = 3)
  -O2: 5.37 (SE +/- 0.00, N = 3)
  -O1: 5.37 (SE +/- 0.00, N = 3)
  -O0: 5.36 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81 - Test: 1000 Stack (Seconds, Fewer Is Better)
  -Og: 6.02 (SE +/- 0.01, N = 3)
  -Ofast -march=znver1: 5.80 (SE +/- 0.01, N = 3)
  -O3 -march=znver1: 5.80 (SE +/- 0.01, N = 3)
  -O3: 6.05 (SE +/- 0.04, N = 3)
  -O2 -march=znver1: 5.80 (SE +/- 0.01, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 5.98 (SE +/- 0.01, N = 3)
  -O2 -flto: 6.32 (SE +/- 0.02, N = 3)
  -O2: 6.01 (SE +/- 0.00, N = 3)
  -O1: 6.00 (SE +/- 0.01, N = 3)
  -O0: 6.00 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 - Test: 3000 Fall (Seconds, Fewer Is Better)
  -Og: 5.15 (SE +/- 0.00, N = 3)
  -Ofast -march=znver1: 5.09 (SE +/- 0.00, N = 3)
  -O3 -march=znver1: 5.07 (SE +/- 0.01, N = 3)
  -O3: 5.16 (SE +/- 0.02, N = 3)
  -O2 -march=znver1: 5.08 (SE +/- 0.00, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 5.14 (SE +/- 0.00, N = 3)
  -O2 -flto: 5.21 (SE +/- 0.01, N = 3)
  -O2: 5.14 (SE +/- 0.00, N = 3)
  -O1: 5.16 (SE +/- 0.01, N = 3)
  -O0: 5.14 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
  -Og: 919 (SE +/- 0.31, N = 3)
  -Ofast -march=znver1: 1676 (SE +/- 0.12, N = 3)
  -O3 -march=znver1 -flto: 1675 (SE +/- 0.10, N = 3)
  -O3 -march=znver1: 1689 (SE +/- 0.21, N = 3)
  -O3: 1427 (SE +/- 0.15, N = 3)
  -O2 -march=znver1: 1016 (SE +/- 0.06, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 919 (SE +/- 0.06, N = 3)
  -O2 -flto: 918 (SE +/- 0.08, N = 3)
  -O2: 919 (SE +/- 0.06, N = 3)
  -O1: 919 (SE +/- 0.03, N = 3)
  -O0: 832 (SE +/- 0.35, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 - Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
  -Og: 2539 (SE +/- 139.26, N = 3)
  -Ofast -march=znver1: 4089 (SE +/- 107.53, N = 3)
  -O3 -march=znver1 -flto: 3300 (SE +/- 178.79, N = 3)
  -O3 -march=znver1: 4851 (SE +/- 65.35, N = 3)
  -O3: 4307 (SE +/- 42.05, N = 3)
  -O2 -march=znver1: 3231 (SE +/- 173.68, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 4396 (SE +/- 57.07, N = 3)
  -O2 -flto: 2515 (SE +/- 129.41, N = 3)
  -O2: 2609 (SE +/- 54.41, N = 3)
  -O1: 3466 (SE +/- 32.13, N = 3)
  -O0: 512 (SE +/- 30.37, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 - Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
  -Og: 2188 (SE +/- 3.66, N = 3)
  -Ofast -march=znver1: 2579 (SE +/- 12.59, N = 3)
  -O3 -march=znver1 -flto: 2052 (SE +/- 2.13, N = 3)
  -O3 -march=znver1: 2482 (SE +/- 10.26, N = 3)
  -O3: 2475 (SE +/- 3.37, N = 3)
  -O2 -march=znver1: 2584 (SE +/- 14.53, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 2515 (SE +/- 12.19, N = 3)
  -O2 -flto: 2299 (SE +/- 5.41, N = 3)
  -O2: 2527 (SE +/- 11.61, N = 3)
  -O1: 2411 (SE +/- 59.22, N = 3)
  -O0: 516 (SE +/- 5.25, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 - Computational Test: Fast Fourier Transform (Mflops, More Is Better)
  -Og: 257 (SE +/- 2.03, N = 3)
  -Ofast -march=znver1: 221 (SE +/- 1.04, N = 3)
  -O3 -march=znver1 -flto: 230 (SE +/- 1.03, N = 3)
  -O3 -march=znver1: 227 (SE +/- 0.03, N = 3)
  -O3: 232 (SE +/- 0.51, N = 3)
  -O2 -march=znver1: 229 (SE +/- 1.88, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 231 (SE +/- 0.71, N = 3)
  -O2 -flto: 232 (SE +/- 2.57, N = 3)
  -O2: 230 (SE +/- 0.61, N = 3)
  -O1: 226 (SE +/- 0.44, N = 3)
  -O0: 201 (SE +/- 1.00, N = 3)
1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0 - Computational Test: Monte Carlo (Mflops, More Is Better)
  -Og: 210 (SE +/- 0.28, N = 3)
  -Ofast -march=znver1: 561 (SE +/- 0.06, N = 3)
  -O3 -march=znver1 -flto: 1480 (SE +/- 0.33, N = 3)
  -O3 -march=znver1: 557 (SE +/- 0.03, N = 3)
  -O3: 560 (SE +/- 0.11, N = 3)
  -O2 -march=znver1: 557 (SE +/- 0.02, N = 3)
  -O2 -ftree-vectorize -ftree-slp-vectorize: 560 (SE +/- 0.22, N = 3)
  -O2 -flto: 568 (SE +/- 0.09, N = 3)
  -O2: 560 (SE +/- 0.03, N = 3)
  -O1: 576 (SE +/- 0.27, N = 3)
  -O0: 108 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -lm


Phoronix Test Suite v10.8.5