Znver2 GCC9 Compiler Tests

AMD Zen 2 GCC compiler benchmarks on Ubuntu Linux. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2003230-VE-1907117HV93&rdt&grr.

Znver2 GCC9 Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.2.0-999-generic (x86_64) 20190703GNOME Shell 3.28.3X Server 1.20.1modesetting 1.20.14.5 Mesa 18.2.2 (LLVM 7.0.0)GCC 9.1.0ext43840x2160AMD Ryzen 7 3800X 8-Core @ 4.20GHz (8 Cores / 16 Threads)ASUS TUF GAMING X570-PLUS_BR (1405 BIOS)AMD Starship/Matisse16GB480GB Force MP510 + 320GB SAMSUNG HD322HJNVIDIA GeForce GTX 960 2GB (1126/3505MHz)NVIDIA GM206 HD AudioRealtek RTL8111/8168/8411 + Broadcom BCM4360 802.11acUbuntu 20.045.4.0-18-generic (x86_64)MATE 1.24.0X Server 1.20.7NVIDIA 435.214.6.0GCC 9.3.01920x1080OpenBenchmarking.orgEnvironment Details- -O3 -march=znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- -O3 -march=znver1: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1- -O3 -march=x86-64: CXXFLAGS=-O3-march=x86-64 CFLAGS=-O3-march=x86-64Compiler Details- -O3 -march=znver2: --disable-multilib --enable-checking=release- -O3 -march=znver1: --disable-multilib --enable-checking=release- -O3 -march=x86-64: --disable-multilib --enable-checking=release- ryzen3800x4200: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- -O3 -march=znver2: Scaling Governor: acpi-cpufreq ondemand- -O3 -march=znver1: Scaling Governor: acpi-cpufreq ondemand- -O3 -march=x86-64: Scaling Governor: acpi-cpufreq ondemand- ryzen3800x4200: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013Python Details- -O3 -march=znver2: Python 2.7.15+ + Python 3.6.8- -O3 -march=znver1: Python 2.7.15+ + Python 3.6.8- -O3 -march=x86-64: Python 2.7.15+ + Python 3.6.8- ryzen3800x4200: + Python 3.8.2Security Details- -O3 -march=znver2: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled- -O3 -march=znver1: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled- -O3 -march=x86-64: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled- ryzen3800x4200: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected

Znver2 GCC9 Compiler Testsfftw: Float + SSE - 2D FFT Size 4096cpp-perf-bench: Math Libraryfftw: Stock - 2D FFT Size 4096pgbench: Buffer Test - Normal Load - Read Writebuild-llvm: Time To Compilevpxenc: vpxenc VP9 1080p Video Encodemcperf: Getpgbench: Buffer Test - Normal Load - Read Onlystockfish: Total Timemcperf: Setmkl-dnn: IP Batch 1D - f32graphics-magick: Sharpengraphics-magick: Rotategraphics-magick: Resizinghimeno: Poisson Pressure Solverbuild-php: Time To Compileredis: SETc-ray: Total Time - 4K, 16 Rays Per Pixelmkl-dnn: Deconvolution Batch deconv_1d - f32redis: GETaobench: 2048 x 2048 - Total Timecompress-7zip: Compress Speed Testjohn-the-ripper: Blowfishcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9scimark2: Compositemkl-dnn: Convolution Batch conv_alexnet - f32cpp-perf-bench: Function Objectsencode-flac: WAV To FLACencode-mp3: WAV To MP3x265: H.265 1080p Video Encodingfftw: Float + SSE - 1D FFT Size 4096x264: H.264 Video Encodingfftw: Stock - 1D FFT Size 4096bullet: Rayteststscp: AI Chess Performancebullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carlo-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420019960309.028001.6730044.32286.56175.34110755.89382751.073947172669121.64158.461942762851347.9653.412089609.4739.46216.703090850.6934.64785622023225.213700.642524.9314.568.127.0452.9156652140.96112312.0613216812.053.593.793.232422.1011431.933575.96274.11799.0719405311.777660.9029380.50284.24174.40112447.77383329.303990875160810.38159.951932602791345.9553.442080013.1239.42219.293126726.2335.14784122822125.093128.652562.8914.998.156.9853.1551757141.82114482.1313721482.133.733.983.362291.628631.933702.03260.12757.56312.017039.4330143.80281.90175.99107814.85385510.953953793060770.80152.361792612701336.7752.892074827.2443.20221.003026826.3236.49786552840125.222786.332512.0414.907.757.1653.33143.279534.702.0913339262.173.704.083.372145.576959.693762.43297.13766.8118240281.8795968.316722.385240415.041202.4194377.5264125.5660922746589163869.322.63611562612761274.46313954.8471893390.8560.31731.00872695667.5032.233545351399332.865618.82331.46815.6297.92226.53745.8254956101.978510.62.15475012714232.2291573.8625704.1892413.5094031259.58607.72750.46323.04153.28OpenBenchmarking.org

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver1ryzen3800x42004K8K12K16K20KSE +/- 71.31, N = 3SE +/- 83.01, N = 3SE +/- 314.37, N = 3199601940518240-march=znver2-march=znver1-fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math1. (CC) gcc options: -pthread -O3 -lm

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math Library-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420070140210280350SE +/- 1.69, N = 3SE +/- 5.00, N = 3SE +/- 4.58, N = 4SE +/- 0.19, N = 3309.02311.77312.01281.88-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -std=c++11

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42002K4K6K8K10KSE +/- 28.05, N = 3SE +/- 34.96, N = 3SE +/- 7.75, N = 3SE +/- 9.60, N = 38001.677660.907039.435968.30-march=znver2-march=znver1-fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math1. (CC) gcc options: -pthread -O3 -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42006K12K18K24K30KSE +/- 197.60, N = 3SE +/- 361.55, N = 15SE +/- 169.86, N = 3SE +/- 14.38, N = 330044.3229380.5030143.8016722.39-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To Compile-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420090180270360450286.56284.24281.90415.04

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42004080120160200SE +/- 0.55, N = 3SE +/- 0.98, N = 3SE +/- 0.86, N = 3SE +/- 1.23, N = 3175.34174.40175.99202.41-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Get-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420020K40K60K80K100KSE +/- 1084.23, N = 15SE +/- 1283.75, N = 15SE +/- 879.11, N = 15SE +/- 703.67, N = 14110755.89112447.77107814.8594377.50-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -lm -rdynamic

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420080K160K240K320K400KSE +/- 738.26, N = 3SE +/- 539.80, N = 3SE +/- 837.27, N = 3SE +/- 844.67, N = 3382751.07383329.30385510.95264125.57-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42009M18M27M36M45MSE +/- 265193.75, N = 3SE +/- 208989.61, N = 3SE +/- 75524.60, N = 3SE +/- 251488.84, N = 339471726399087513953793027465891-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Set-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420015K30K45K60K75KSE +/- 4868.10, N = 15SE +/- 1488.06, N = 15SE +/- 807.63, N = 4SE +/- 77.26, N = 369121.6460810.3860770.8063869.30-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -lm -rdynamic

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42004080120160200SE +/- 2.44, N = 15SE +/- 2.27, N = 15SE +/- 3.65, N = 15SE +/- 0.23, N = 3158.46159.95152.3622.64-march=znver2 - MIN: 112.01-march=znver1 - MIN: 108.37-march=x86-64 - MIN: 109.43MIN: 13.131. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42004080120160200SE +/- 0.58, N = 3194193179156-O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O2 -lgomp1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420060120180240300SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3276260261261-O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O2 -lgomp1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420060120180240300SE +/- 1.53, N = 3285279270276-O3 -march=znver2 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=znver1 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O3 -march=x86-64 -ljbig -lwebp -lwebpmux -ltiff -ljpeg -llzma -lbz2 -ldl-O2 -lgomp1. (CC) gcc options: -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420030060090012001500SE +/- 12.91, N = 3SE +/- 22.06, N = 3SE +/- 13.46, N = 3SE +/- 4.19, N = 31347.961345.951336.771274.46-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -mavx2

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42001224364860SE +/- 0.20, N = 3SE +/- 0.10, N = 3SE +/- 0.34, N = 3SE +/- 0.23, N = 353.4153.4452.8954.85-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -pedantic -ldl -lz -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200400K800K1200K1600K2000KSE +/- 26820.33, N = 15SE +/- 26581.97, N = 15SE +/- 28039.49, N = 15SE +/- 24405.05, N = 152089609.472080013.122074827.241893390.851. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42001428425670SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 339.4639.4243.2060.32-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -lm -lpthread -O3

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420050100150200250SE +/- 2.63, N = 6SE +/- 1.89, N = 11SE +/- 2.51, N = 3SE +/- 0.18, N = 3216.70219.29221.0031.01-march=znver2 - MIN: 202.77-march=znver1 - MIN: 203.58-march=x86-64 - MIN: 202.64MIN: 29.381. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200700K1400K2100K2800K3500KSE +/- 61045.92, N = 12SE +/- 62689.04, N = 13SE +/- 58033.83, N = 15SE +/- 18403.22, N = 33090850.693126726.233026826.322695667.501. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200816243240SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.32, N = 3SE +/- 0.01, N = 334.6435.1436.4932.23-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -lm -O3

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420020K40K60K80K100KSE +/- 205.86, N = 3SE +/- 193.52, N = 3SE +/- 419.64, N = 3SE +/- 103.41, N = 3785627841278655545351. (CXX) g++ options: -pipe -lpthread

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: Blowfish-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42006K12K18K24K30KSE +/- 50.35, N = 3SE +/- 51.72, N = 3SE +/- 61.75, N = 3SE +/- 8.95, N = 320232282212840113993-lbz2-lbz2-lbz21. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200816243240SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 325.2125.0925.2232.87-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64-O21. (CC) gcc options: -pthread -fvisibility=hidden

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42008001600240032004000SE +/- 7.82, N = 3SE +/- 7.70, N = 3SE +/- 27.49, N = 3SE +/- 1.56, N = 33700.643128.652786.33618.82-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42005001000150020002500SE +/- 12.02, N = 3SE +/- 19.59, N = 3SE +/- 11.23, N = 3SE +/- 0.83, N = 32524.932562.892512.04331.47-march=znver2 - MIN: 2478.96-march=znver1 - MIN: 2476.4-march=x86-64 - MIN: 2466.12MIN: 327.271. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function Objects-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420048121620SE +/- 0.16, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 314.5614.9914.9015.63-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -std=c++11

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200246810SE +/- 0.060, N = 5SE +/- 0.038, N = 5SE +/- 0.040, N = 5SE +/- 0.009, N = 58.1208.1507.7507.922-O3 -march=znver2 -logg-O3 -march=znver1 -logg-O3 -march=x86-64 -logg-O21. (CXX) g++ options: -fvisibility=hidden -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200612182430SE +/- 0.121, N = 3SE +/- 0.088, N = 4SE +/- 0.070, N = 3SE +/- 0.005, N = 37.0406.9807.16026.537-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lncurses -lm

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42001224364860SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.28, N = 352.9153.1553.3345.82-march=znver2 -lnuma-march=znver1 -lnuma-march=x86-64 -lnuma1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096-O3 -march=znver2-O3 -march=znver1ryzen3800x420012K24K36K48K60KSE +/- 647.09, N = 3SE +/- 455.36, N = 3SE +/- 114.46, N = 3566525175754956-march=znver2-march=znver1-fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math1. (CC) gcc options: -pthread -O3 -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200306090120150SE +/- 0.93, N = 3SE +/- 0.82, N = 3SE +/- 0.64, N = 3SE +/- 0.09, N = 3140.96141.82143.27101.97-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42002K4K6K8K10KSE +/- 123.02, N = 3SE +/- 53.59, N = 3SE +/- 11.52, N = 3SE +/- 2.07, N = 311231.0011448.009534.708510.60-march=znver2-march=znver1-fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math1. (CC) gcc options: -pthread -O3 -lm

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42000.48480.96961.45441.93922.424SE +/- 0.002581, N = 3SE +/- 0.018779, N = 3SE +/- 0.020748, N = 3SE +/- 0.000756, N = 32.0600002.1300002.0900002.154750-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x4200300K600K900K1200K1500KSE +/- 8868.66, N = 5SE +/- 1369.78, N = 5SE +/- 10274.76, N = 5SE +/- 1439.32, N = 51321681137214813339261271423-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -march=native

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42000.50161.00321.50482.00642.508SE +/- 0.003680, N = 3SE +/- 0.015833, N = 3SE +/- 0.029906, N = 3SE +/- 0.000665, N = 32.0500002.1300002.1700002.229157-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42000.86911.73822.60733.47644.3455SE +/- 0.007030, N = 3SE +/- 0.032731, N = 3SE +/- 0.037882, N = 3SE +/- 0.000795, N = 33.5900003.7300003.7000003.862570-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42000.94261.88522.82783.77044.713SE +/- 0.004049, N = 3SE +/- 0.033405, N = 3SE +/- 0.041553, N = 3SE +/- 0.002805, N = 33.7900003.9800004.0800004.189241-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42000.78961.57922.36883.15843.948SE +/- 0.007900, N = 3SE +/- 0.026494, N = 3SE +/- 0.035000, N = 3SE +/- 0.003247, N = 33.2300003.3600003.3700003.509403-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42005001000150020002500SE +/- 0.28, N = 3SE +/- 0.13, N = 3SE +/- 20.09, N = 3SE +/- 0.10, N = 32422.102291.622145.571259.58-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42002K4K6K8K10KSE +/- 10.78, N = 3SE +/- 25.99, N = 3SE +/- 73.09, N = 3SE +/- 4.26, N = 311431.938631.936959.69607.72-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42008001600240032004000SE +/- 48.59, N = 3SE +/- 48.59, N = 3SE +/- 48.28, N = 3SE +/- 10.72, N = 33575.963702.033762.43750.46-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x420070140210280350SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 3.16, N = 3SE +/- 0.10, N = 3274.11260.12297.13323.04-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64ryzen3800x42002004006008001000SE +/- 1.53, N = 3SE +/- 0.16, N = 3SE +/- 7.83, N = 3SE +/- 0.10, N = 3799.07757.56766.81153.28-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-641. (CC) gcc options: -lm


Phoronix Test Suite v10.8.4