GCC 9.1 Compiler Tuning Threadripper AMD znver1

AMD Ryzen Threadripper 2990WX compiler benchmarks on GCC 9.1 with Ubuntu Linux by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1905122-HV-GCC91COMP71&grr&sor.

System details (identical for all runs)

Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
Motherboard: ASUS ROG ZENITH EXTREME (1701 BIOS)
Chipset: AMD 17h
Memory: 32768MB
Disk: Samsung SSD 970 EVO 500GB
Graphics: AMD Radeon RX 64 8GB (1590/800MHz)
Audio: Realtek ALC1220
Monitor: ASUS VP28U
Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad
OS: Ubuntu 18.04
Kernel: 4.18.0-18-generic (x86_64)
Desktop: GNOME Shell 3.28.3
Display Server: X Server 1.20.1
Display Driver: amdgpu 18.1.0
OpenGL: 4.5 Mesa 18.2.8 (LLVM 7.0.0)
Compiler: GCC 9.1.0
File-System: ext4
Screen Resolution: 3840x2160

Runs compared: -O2 -march=athlon64, -O3 -march=athlon64, -O3 -march=athlon64-sse3, -O2 -march=native, -O3 -march=native, -O3 -march=native -flto, PGO, AMD Ryzen Threadripper 2990WX 32-Core

Environment Details
- -O2 -march=athlon64: CXXFLAGS="-O2 -march=athlon64" CFLAGS="-O2 -march=athlon64"
- -O3 -march=athlon64: CXXFLAGS="-O3 -march=athlon64" CFLAGS="-O3 -march=athlon64"
- -O3 -march=athlon64-sse3: CXXFLAGS="-O3 -march=athlon64-sse3" CFLAGS="-O3 -march=athlon64-sse3"
- -O2 -march=native: CXXFLAGS="-O2 -march=native" CFLAGS="-O2 -march=native"
- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
- PGO: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- AMD Ryzen Threadripper 2990WX 32-Core: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- --disable-multilib --enable-checking=release

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

Python Details
- Python 2.7.15rc1 + Python 3.6.7

Security Details
- __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
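The Environment Details above amount to exporting the same flag string as both CFLAGS and CXXFLAGS before each run. A minimal Python sketch of that setup follows, assuming the flags are space-separated as in the run labels. The names `CONFIG_FLAGS` and `build_env` are hypothetical helpers for illustration, not part of the Phoronix Test Suite.

```python
import os

# Flag strings taken from the Environment Details; the spacing is an
# assumption based on the run labels.
CONFIG_FLAGS = {
    "-O2 -march=athlon64": "-O2 -march=athlon64",
    "-O3 -march=athlon64": "-O3 -march=athlon64",
    "-O3 -march=athlon64-sse3": "-O3 -march=athlon64-sse3",
    "-O2 -march=native": "-O2 -march=native",
    "-O3 -march=native": "-O3 -march=native",
    "-O3 -march=native -flto": "-O3 -march=native -flto",
    # The PGO run lists plain -O3 -march=native in its environment.
    "PGO": "-O3 -march=native",
}


def build_env(config, base_env=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set for one run."""
    env = dict(os.environ if base_env is None else base_env)
    flags = CONFIG_FLAGS[config]
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for name in CONFIG_FLAGS:
        print(name, "->", build_env(name, base_env={})["CFLAGS"])
```

The returned dictionary could be passed as the `env` argument of `subprocess.run` when launching a build, so each configuration gets its own flags without touching the parent shell.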

Tests included in the result overview table:

cpp-perf-bench: Rand Numbers
cpp-perf-bench: Math Library
vpxenc: vpxenc VP9 1080p Video Encode
fftw: Float + SSE - 2D FFT Size 4096
hpcg
c-ray: Total Time - 4K, 16 Rays Per Pixel
fftw: Stock - 2D FFT Size 4096
pgbench: Buffer Test - Normal Load - Read Write
mbw: Memory Copy, Fixed Block Size - 8192 MiB
build-llvm: Time To Compile
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
aom-av1: AV1 Video Encoding
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19
mcperf: Add
mbw: Memory Copy - 8192 MiB
pgbench: Buffer Test - Normal Load - Read Only
mcperf: Set
nginx: Static Web Page Serving
stockfish: Total Time
cpp-perf-bench: Stepanov Vector
cpp-perf-bench: Atol
graphics-magick: Sharpen
graphics-magick: Resizing
graphics-magick: Enhanced
graphics-magick: Noise-Gaussian
graphics-magick: Swirl
graphics-magick: Rotate
graphics-magick: HWB Color Space
himeno: Poisson Pressure Solver
mcperf: Prepend
build-php: Time To Compile
mcperf: Append
t-test1: 1
aobench: 2048 x 2048 - Total Time
mcperf: Replace
scimark2: Composite
cpp-perf-bench: Ctype
luajit: Composite
build-imagemagick: Time To Compile
smallpt: Global Illumination Renderer; 128 Samples
svt-vp9: 1080p 8-bit YUV To VP9 Video Encode
cpp-perf-bench: Stepanov Abstraction
mcperf: Delete
redis: LPOP
mcperf: Get
t-test1: 2
openssl: RSA 4096-bit Performance
x265: H.265 1080p Video Encoding
encode-flac: WAV To FLAC
redis: SET
redis: SADD
redis: GET
cpp-perf-bench: Function Objects
mkl-dnn: IP Batch 1D - f32
svt-av1: 1080p 8-bit YUV To AV1 Video Encode
redis: LPUSH
svt-hevc: 1080p 8-bit YUV To HEVC Video Encode
mafft: Multiple Sequence Alignment
encode-mp3: WAV To MP3
x264: H.264 Video Encoding
bullet: Raytests
tscp: AI Chess Performance
ctx-clock: Context Switch Time
bullet: Convex Trimesh
bullet: Prim Trimesh
bullet: 136 Ragdolls
bullet: 1000 Convex
bullet: 1000 Stack
bullet: 3000 Fall
scimark2: Jacobi Successive Over-Relaxation
scimark2: Dense LU Matrix Factorization
scimark2: Sparse Matrix Multiply
scimark2: Fast Fourier Transform
scimark2: Monte Carlo
luajit: Jacobi Successive Over-Relaxation
luajit: Dense LU Matrix Factorization
luajit: Sparse Matrix Multiply
luajit: Fast Fourier Transform
luajit: Monte Carlo

Columns of the overview table: -O2 -march=athlon64, -O3 -march=athlon64, -O3 -march=athlon64-sse3, -O2 -march=native, -O3 -march=native, -O3 -march=native -flto, PGO, AMD Ryzen Threadripper 2990WX 32-Core

CppPerformanceBenchmarks

Test: Random Numbers

CppPerformanceBenchmarks 9, Test: Random Numbers
Seconds, fewer is better

-O3 -march=native -flto: 1011
-O3 -march=native: 1023
PGO: 1027
-O2 -march=native: 1041
-O3 -march=athlon64-sse3: 1055
-O3 -march=athlon64: 1070
-O2 -march=athlon64: 1072
AMD Ryzen Threadripper 2990WX 32-Core: 1154

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.05, 0.06, 0.03, 4.70, 0.04, 0.71, 0.07 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11
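Each result carries an "SE +/- x, N = n" annotation. A minimal Python sketch of how such a figure can be derived is shown here, assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of N); the export does not state the exact formula used. The run times in the example are hypothetical, not values from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples to estimate the standard error")
    return statistics.stdev(samples) / math.sqrt(n)


if __name__ == "__main__":
    # Hypothetical timings in seconds for three runs of one test (N = 3).
    runs = [1010.9, 1011.0, 1011.1]
    mean = statistics.mean(runs)
    print(f"mean = {mean:.1f} s, SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```

A small SE relative to the mean suggests the runs were consistent; a large one, as with several of the mcperf results, means differences between configurations should be read with care.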

CppPerformanceBenchmarks

Test: Math Library

CppPerformanceBenchmarks 9, Test: Math Library
Seconds, fewer is better

-O3 -march=native: 351
-O3 -march=native -flto: 352
PGO: 353
AMD Ryzen Threadripper 2990WX 32-Core: 355
-O2 -march=native: 356
-O3 -march=athlon64: 398
-O2 -march=athlon64: 399
-O3 -march=athlon64-sse3: 399

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.11, 0.73, 0.93, 1.79, 0.79, 0.87, 0.83 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

VP9 libvpx Encoding 1.8.0, vpxenc VP9 1080p Video Encode
Frames per second, more is better

-O3 -march=native: 26.37 (SE +/- 0.13, N = 3)
PGO: 26.36 (SE +/- 0.02, N = 3)
-O3 -march=athlon64: 26.12 (SE +/- 0.00, N = 3)
-O2 -march=native: 26.02 (SE +/- 0.22, N = 3)
-O3 -march=athlon64-sse3: 25.58 (SE +/- 0.40, N = 3)
-O2 -march=athlon64: 25.44 (SE +/- 0.26, N = 9)

(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, more is better

-O2 -march=native: 16263
PGO: 15287
AMD Ryzen Threadripper 2990WX 32-Core: 15098
-O3 -march=native: 14927
-O3 -march=native -flto: 14864

SE as exported (4 values for 5 runs, so they cannot be matched to runs): +/- 13.87, 152.85, 71.85, 74.46 (N = 3 each)
(CC) gcc options: -pthread -march=native -lm

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.0
GFLOP/s, more is better

-O3 -march=native -flto: 0.98
-O3 -march=athlon64: 0.92
PGO: 0.91
AMD Ryzen Threadripper 2990WX 32-Core: 0.90
-O2 -march=athlon64: 0.89
-O3 -march=athlon64-sse3: 0.85
-O3 -march=native: 0.83
-O2 -march=native: 0.81

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.02 (N = 3), 0.02 (N = 15), 0.03 (N = 12), 0.02 (N = 13), 0.01 (N = 3), 0.03 (N = 12), 0.02 (N = 15)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel
Seconds, fewer is better

-O3 -march=native -flto: 17.85
PGO: 17.96
-O3 -march=native: 18.00
-O3 -march=athlon64: 29.39
-O3 -march=athlon64-sse3: 29.53
-O2 -march=native: 34.22
-O2 -march=athlon64: 45.43
AMD Ryzen Threadripper 2990WX 32-Core: 4024.62

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.05, 0.02, 0.03, 0.04, 0.05, 0.12, 0.13 (N = 3 each)
(CC) gcc options: -lm -lpthread -O3
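To compare two runs of a test, it helps to turn the raw figures into a ratio that respects the "fewer is better" or "more is better" direction. The Python sketch below does this for the two C-Ray times above (45.43 s for -O2 -march=athlon64 and 17.85 s for -O3 -march=native -flto). The function name `speedup` and its `lower_is_better` parameter are hypothetical, chosen only for this illustration.

```python
def speedup(baseline, candidate, lower_is_better=True):
    """Return how many times better `candidate` is than `baseline`.

    For time-like metrics (fewer is better) this is baseline / candidate;
    for throughput-like metrics (more is better) it is candidate / baseline.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    return baseline / candidate if lower_is_better else candidate / baseline


if __name__ == "__main__":
    # C-Ray 1.1 total times in seconds, taken from the result above.
    c_ray = {
        "-O2 -march=athlon64": 45.43,
        "-O3 -march=native -flto": 17.85,
    }
    ratio = speedup(c_ray["-O2 -march=athlon64"], c_ray["-O3 -march=native -flto"])
    print(f"-O3 -march=native -flto vs -O2 -march=athlon64: {ratio:.2f}x")
```

For throughput results such as the FFTW Mflops figures, pass `lower_is_better=False` so that a value above 1.0 still means the candidate run is faster.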

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 4096
Mflops, more is better

-O3 -march=native -flto: 7019
PGO: 6717
-O3 -march=native: 6600
AMD Ryzen Threadripper 2990WX 32-Core: 6411
-O2 -march=native: 6335
-O3 -march=athlon64-sse3: 4463
-O3 -march=athlon64: 4361
-O2 -march=athlon64: 4342

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 8.84, 14.46, 51.66, 34.65, 1.73, 63.69, 28.80 (N = 3 each)
(CC) gcc options: -pthread -march=native -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 10.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Write
TPS, more is better

-O3 -march=native: 16339
-O3 -march=athlon64-sse3: 16252
-O3 -march=athlon64: 15616
-O3 -march=native -flto: 14989
-O2 -march=native: 13558
AMD Ryzen Threadripper 2990WX 32-Core: 6472
PGO: 6281
-O2 -march=athlon64: 5098

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 98.94 (N = 3), 51.55 (N = 3), 247.31 (N = 3), 164.04 (N = 7), 279.82 (N = 15), 93.40 (N = 4), 65.89 (N = 4)
(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB
MiB/s, more is better

-O2 -march=native: 6778
-O3 -march=athlon64: 6755
PGO: 6753
-O3 -march=native -flto: 6721
-O3 -march=native: 6685
-O3 -march=athlon64-sse3: 6681
AMD Ryzen Threadripper 2990WX 32-Core: 6596
-O2 -march=athlon64: 6526

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 38.61, 7.74, 26.90, 96.68, 27.83, 10.14, 60.66 (N = 3 each)
(CC) gcc options: -O3 -march=native

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 6.0.1, Time To Compile
Seconds, fewer is better

-O2 -march=native: 221
-O3 -march=native: 221
-O3 -march=athlon64-sse3: 224
-O3 -march=athlon64: 225
-O2 -march=athlon64: 229
AMD Ryzen Threadripper 2990WX 32-Core: 290
-O3 -march=native -flto: 951

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
Seconds, fewer is better

-O3 -march=native -flto: 25.62
-O3 -march=native: 25.70
-O3 -march=athlon64-sse3: 25.93
PGO: 26.09
-O3 -march=athlon64: 26.50
-O2 -march=athlon64: 26.75
-O2 -march=native: 26.78
AMD Ryzen Threadripper 2990WX 32-Core: 708.15

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.23 (N = 15), 0.26 (N = 15), 0.22 (N = 12), 0.07 (N = 3), 0.27 (N = 3), 0.21 (N = 15), 0.10 (N = 3)
(CC) gcc options: -pthread -fvisibility=hidden

AOM AV1

AV1 Video Encoding

AOM AV1 2019-02-11, AV1 Video Encoding
Frames per second, more is better

PGO: 0.22
-O3 -march=native -flto: 0.22
-O3 -march=native: 0.22
-O2 -march=native: 0.21
-O3 -march=athlon64-sse3: 0.21
-O3 -march=athlon64: 0.20
-O2 -march=athlon64: 0.20
AMD Ryzen Threadripper 2990WX 32-Core: 0.08

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.00 for each reported value (N = 3 each)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19
Seconds, fewer is better

PGO: 17.36
-O3 -march=athlon64: 17.74
-O3 -march=native -flto: 17.94
-O2 -march=athlon64: 18.02
-O3 -march=athlon64-sse3: 18.38
-O3 -march=native: 19.09
-O2 -march=native: 19.31
AMD Ryzen Threadripper 2990WX 32-Core: 265.79

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.45 (N = 15), 0.69 (N = 12), 0.66 (N = 15), 0.60 (N = 15), 0.66 (N = 15), 0.65 (N = 15), 0.61 (N = 15)
(CC) gcc options: -pthread -lz -llzma

Memcached mcperf

Method: Add

Memcached mcperf 1.5.10, Method: Add
Operations per second, more is better

-O2 -march=athlon64: 54600
-O3 -march=athlon64: 53106
-O3 -march=native -flto: 50055
PGO: 47774
-O3 -march=native: 44903
AMD Ryzen Threadripper 2990WX 32-Core: 35174
-O3 -march=athlon64-sse3: 34822
-O2 -march=native: 34138

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 2866.90 (N = 12), 2252.95 (N = 15), 2185.23 (N = 15), 2440.20 (N = 12), 1053.86 (N = 15), 121.25 (N = 3), 125.83 (N = 3)
(CC) gcc options: -lm -rdynamic

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08, Test: Memory Copy - Array Size: 8192 MiB
MiB/s, more is better

-O3 -march=native -flto: 13176
-O3 -march=athlon64: 12930
-O3 -march=native: 12851
-O2 -march=native: 12795
-O3 -march=athlon64-sse3: 12699
-O2 -march=athlon64: 12632
AMD Ryzen Threadripper 2990WX 32-Core: 12618
PGO: 12406

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 153.84 (N = 3), 162.89 (N = 5), 169.00 (N = 3), 167.95 (N = 5), 217.78 (N = 3), 152.24 (N = 5), 61.39 (N = 3)
(CC) gcc options: -O3 -march=native

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Only
TPS, more is better

-O3 -march=native -flto: 473620
-O3 -march=native: 466723
-O3 -march=athlon64: 463840
PGO: 460414
-O3 -march=athlon64-sse3: 459596
-O2 -march=native: 458551
-O2 -march=athlon64: 453605
AMD Ryzen Threadripper 2990WX 32-Core: 259058

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 1547.77 (N = 3), 1148.20 (N = 3), 6987.31 (N = 3), 7464.16 (N = 3), 6062.49 (N = 4), 3403.88 (N = 3), 2874.20 (N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

Memcached mcperf

Method: Set

Memcached mcperf 1.5.10, Method: Set
Operations per second, more is better

-O2 -march=athlon64: 59924
-O3 -march=athlon64-sse3: 48548
PGO: 46396
-O3 -march=native -flto: 43570
-O3 -march=native: 38646
AMD Ryzen Threadripper 2990WX 32-Core: 35239
-O3 -march=athlon64: 34880
-O2 -march=native: 34446

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 2193.95 (N = 15), 2384.39 (N = 15), 1474.59 (N = 12), 195.82 (N = 3), 877.57 (N = 15), 44.53 (N = 3), 67.69 (N = 3)
(CC) gcc options: -lm -rdynamic

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.9.9, Static Web Page Serving
Requests per second, more is better

-O3 -march=athlon64: 29726 (SE +/- 169.85, N = 3)
-O2 -march=athlon64: 29704 (SE +/- 219.17, N = 3)
-O3 -march=athlon64-sse3: 29281 (SE +/- 428.25, N = 3)
-O3 -march=native: 29274 (SE +/- 401.04, N = 4)
-O2 -march=native: 27834 (SE +/- 166.84, N = 3)
-O3 -march=native -flto: 27352 (SE +/- 43.76, N = 3)

(CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Stockfish

Total Time

Stockfish 9, Total Time
Nodes per second, more is better

-O3 -march=native: 68200164 (SE +/- 954655.13, N = 3)
PGO: 67841877 (SE +/- 458502.78, N = 3)
-O3 -march=athlon64-sse3: 67571150 (SE +/- 443906.77, N = 3)
-O3 -march=athlon64: 67513602 (SE +/- 779150.59, N = 3)
-O3 -march=native -flto: 67450689 (SE +/- 385955.10, N = 3)
-O2 -march=athlon64: 66890687 (SE +/- 226981.30, N = 3)
-O2 -march=native: 66697487 (SE +/- 526520.80, N = 3)

(CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

CppPerformanceBenchmarks

Test: Stepanov Vector

CppPerformanceBenchmarks 9, Test: Stepanov Vector
Seconds, fewer is better

-O3 -march=native -flto: 74.92
-O3 -march=native: 74.93
PGO: 75.27
-O3 -march=athlon64: 75.53
-O3 -march=athlon64-sse3: 75.56
-O2 -march=athlon64: 75.79
-O2 -march=native: 75.99
AMD Ryzen Threadripper 2990WX 32-Core: 78.14

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.01, 0.01, 0.02, 0.00, 0.01, 0.05, 0.08 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Atol

CppPerformanceBenchmarks 9, Test: Atol
Seconds, fewer is better

-O3 -march=athlon64: 68.37
-O3 -march=athlon64-sse3: 68.42
-O3 -march=native: 69.03
-O2 -march=athlon64: 69.08
-O3 -march=native -flto: 69.25
AMD Ryzen Threadripper 2990WX 32-Core: 69.25
PGO: 69.31
-O2 -march=native: 69.80

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.39, 0.38, 0.05, 0.05, 0.07, 0.17, 0.13 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.30, Operation: Sharpen
Iterations per minute, more is better

-O3 -march=native -flto: 221
PGO: 220
-O3 -march=native: 219
-O2 -march=native: 217
-O2 -march=athlon64: 192
-O3 -march=athlon64-sse3: 191
-O3 -march=athlon64: 190
AMD Ryzen Threadripper 2990WX 32-Core: 4

SE as exported (5 values for 8 runs, so they cannot be matched to runs): +/- 0.88, 0.88, 0.33, 0.67, 0.67 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.30, Operation: Resizing
Iterations per minute, more is better

PGO: 243
-O3 -march=native: 243
-O3 -march=native -flto: 240
-O2 -march=native: 238
-O2 -march=athlon64: 235
-O3 -march=athlon64: 233
-O3 -march=athlon64-sse3: 231
AMD Ryzen Threadripper 2990WX 32-Core: 31

SE as exported (4 values for 8 runs, so they cannot be matched to runs): +/- 0.67, 1.00, 0.88, 1.15 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.30, Operation: Enhanced
Iterations per minute, more is better

PGO: 234
-O3 -march=native -flto: 233
-O3 -march=native: 232
-O2 -march=native: 231
-O2 -march=athlon64: 198
-O3 -march=athlon64: 195
-O3 -march=athlon64-sse3: 194
AMD Ryzen Threadripper 2990WX 32-Core: 8

SE as exported (3 values for 8 runs, so they cannot be matched to runs): +/- 0.88, 0.58, 0.58 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.30, Operation: Noise-Gaussian
Iterations per minute, more is better

-O3 -march=native: 204
PGO: 203
-O3 -march=native -flto: 203
-O2 -march=athlon64: 202
-O2 -march=native: 200
-O3 -march=athlon64: 199
-O3 -march=athlon64-sse3: 198
AMD Ryzen Threadripper 2990WX 32-Core: 40

SE as exported (5 values for 8 runs, so they cannot be matched to runs): +/- 0.67, 0.67, 1.53, 0.67, 0.88 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.30, Operation: Swirl
Iterations per minute, more is better

PGO: 250
-O3 -march=native -flto: 250
-O3 -march=native: 247
-O2 -march=native: 245
-O2 -march=athlon64: 244
-O3 -march=athlon64-sse3: 237
-O3 -march=athlon64: 237
AMD Ryzen Threadripper 2990WX 32-Core: 26

SE as exported (3 values for 8 runs, so they cannot be matched to runs): +/- 1.00, 0.33, 0.58 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.30, Operation: Rotate
Iterations per minute, more is better

PGO: 251
-O3 -march=native: 249
-O2 -march=native: 249
-O3 -march=native -flto: 248
-O2 -march=athlon64: 243
-O3 -march=athlon64-sse3: 240
-O3 -march=athlon64: 238
AMD Ryzen Threadripper 2990WX 32-Core: 175

SE as exported (2 values for 8 runs, so they cannot be matched to runs): +/- 0.33, 0.67 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.30, Operation: HWB Color Space
Iterations per minute, more is better

PGO: 274
-O3 -march=native -flto: 274
-O3 -march=native: 272
-O2 -march=athlon64: 272
-O2 -march=native: 271
-O3 -march=athlon64-sse3: 263
-O3 -march=athlon64: 261
AMD Ryzen Threadripper 2990WX 32-Core: 59

SE as exported (2 values for 8 runs, so they cannot be matched to runs): +/- 0.33, 1.33 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver
MFLOPS, more is better

-O2 -march=athlon64: 1328
AMD Ryzen Threadripper 2990WX 32-Core: 1322
PGO: 1321
-O2 -march=native: 1319
-O3 -march=athlon64-sse3: 1316
-O3 -march=athlon64: 1316
-O3 -march=native: 1313
-O3 -march=native -flto: 1304

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 3.65, 3.47, 3.88, 4.00, 3.98, 1.73, 1.86 (N = 3 each)
(CC) gcc options: -O3 -mavx2

Memcached mcperf

Method: Prepend

Memcached mcperf 1.5.10, Method: Prepend
Operations per second, more is better

PGO: 47085
-O3 -march=athlon64-sse3: 45691
-O3 -march=native -flto: 45587
-O3 -march=native: 43824
-O3 -march=athlon64: 35968
AMD Ryzen Threadripper 2990WX 32-Core: 35579
-O2 -march=native: 35552
-O2 -march=athlon64: 35467

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 863.93 (N = 12), 116.90 (N = 3), 179.60 (N = 3), 2300.04 (N = 15), 58.44 (N = 3), 186.47 (N = 3), 109.24 (N = 3)
(CC) gcc options: -lm -rdynamic

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.1.9, Time To Compile
Seconds, fewer is better

-O2 -march=athlon64: 43.86
-O2 -march=native: 44.44
-O3 -march=athlon64-sse3: 62.93
PGO: 63.01
-O3 -march=athlon64: 63.19
-O3 -march=native: 63.25
AMD Ryzen Threadripper 2990WX 32-Core: 85.40

SE as exported (6 values for 7 runs, so they cannot be matched to runs): +/- 0.20, 0.07, 0.17, 0.10, 0.32, 0.08 (N = 3 each)
(CC) gcc options: -pedantic -ldl -lz -lm

Memcached mcperf

Method: Append

Memcached mcperf 1.5.10, Method: Append
Operations per second, more is better

-O2 -march=athlon64: 61219
-O3 -march=athlon64: 56172
PGO: 46106
-O3 -march=athlon64-sse3: 45716
-O3 -march=native -flto: 45339
-O3 -march=native: 42455
AMD Ryzen Threadripper 2990WX 32-Core: 35920
-O2 -march=native: 35058

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 2658.53 (N = 15), 3045.08 (N = 15), 123.87 (N = 3), 220.19 (N = 3), 218.35 (N = 3), 256.24 (N = 3), 168.27 (N = 3)
(CC) gcc options: -lm -rdynamic

t-test1

Threads: 1

t-test1 2017-01-13, Threads: 1
Seconds, fewer is better

-O2 -march=athlon64: 25.60
-O2 -march=native: 26.07
-O3 -march=native: 26.23
-O3 -march=native -flto: 26.31
-O3 -march=athlon64-sse3: 27.12
AMD Ryzen Threadripper 2990WX 32-Core: 27.83
PGO: 28.74
-O3 -march=athlon64: 29.66

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.10 (N = 3), 0.33 (N = 3), 0.07 (N = 3), 0.16 (N = 3), 0.34 (N = 15), 0.39 (N = 3), 0.29 (N = 9)
(CC) gcc options: -pthread

AOBench

Size: 2048 x 2048 - Total Time

AOBench, Size: 2048 x 2048 - Total Time
Seconds, fewer is better

PGO: 39.10
-O3 -march=native: 39.11
-O3 -march=native -flto: 39.92
-O2 -march=native: 42.34
AMD Ryzen Threadripper 2990WX 32-Core: 43.36
-O3 -march=athlon64-sse3: 44.98
-O3 -march=athlon64: 45.20
-O2 -march=athlon64: 47.28

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.01, 0.01, 0.03, 0.03, 0.00, 0.01, 0.01 (N = 3 each)
(CC) gcc options: -lm -O3

Memcached mcperf

Method: Replace

Memcached mcperf 1.5.10, Method: Replace
Operations per second, more is better

-O3 -march=native: 53865
PGO: 45956
-O3 -march=athlon64-sse3: 45707
-O3 -march=native -flto: 45591
-O3 -march=athlon64: 36231
AMD Ryzen Threadripper 2990WX 32-Core: 35978
-O2 -march=athlon64: 35829
-O2 -march=native: 35486

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 1741.69 (N = 12), 168.47 (N = 3), 85.02 (N = 3), 174.58 (N = 3), 365.15 (N = 3), 132.68 (N = 3), 167.62 (N = 3)
(CC) gcc options: -lm -rdynamic

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite
Mflops, more is better

PGO: 2555
-O3 -march=native -flto: 2543
-O3 -march=native: 2514
AMD Ryzen Threadripper 2990WX 32-Core: 2257
-O3 -march=athlon64-sse3: 2039
-O3 -march=athlon64: 2021
-O2 -march=native: 1981
-O2 -march=athlon64: 1782

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 2.91 (N = 3), 32.19 (N = 3), 26.08 (N = 8), 0.49 (N = 3), 25.27 (N = 5), 17.63 (N = 3), 4.62 (N = 3)
(CC) gcc options: -lm

CppPerformanceBenchmarks

Test: Ctype

CppPerformanceBenchmarks 9, Test: Ctype
Seconds, fewer is better

-O3 -march=native -flto: 32.32
-O3 -march=athlon64: 33.28
-O3 -march=athlon64-sse3: 33.31
-O2 -march=athlon64: 33.37
-O3 -march=native: 33.88
PGO: 34.02
-O2 -march=native: 34.50
AMD Ryzen Threadripper 2990WX 32-Core: 37.67

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.01, 0.02, 0.02, 0.00, 0.00, 0.01, 0.08 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

LuaJIT

Test: Composite

LuaJIT 2.1-git, Test: Composite
Mflops, more is better

PGO: 1499 (SE +/- 1.60, N = 3)
-O3 -march=native -flto: 1497 (SE +/- 0.43, N = 3)
-O3 -march=athlon64-sse3: 1496 (SE +/- 1.46, N = 3)
-O3 -march=native: 1495 (SE +/- 0.72, N = 3)
-O2 -march=athlon64: 1491 (SE +/- 4.50, N = 3)
-O3 -march=athlon64: 1489 (SE +/- 4.29, N = 3)
-O2 -march=native: 1467 (SE +/- 10.67, N = 3)

(CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0, Time To Compile
Seconds, fewer is better

-O2 -march=athlon64: 17.41
-O2 -march=native: 17.58
PGO: 19.09
-O3 -march=native: 19.15
AMD Ryzen Threadripper 2990WX 32-Core: 19.69
-O3 -march=athlon64-sse3: 19.73
-O3 -march=athlon64: 19.77
-O3 -march=native -flto: 88.89

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.21 (N = 3), 0.24 (N = 3), 0.07 (N = 3), 0.25 (N = 3), 0.03 (N = 3), 0.23 (N = 5), 1.39 (N = 3)

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0, Global Illumination Renderer; 128 Samples
Seconds, fewer is better

PGO: 3.83
-O3 -march=native -flto: 3.85
-O3 -march=native: 3.87
-O2 -march=native: 3.89
-O3 -march=athlon64: 4.51
-O3 -march=athlon64-sse3: 4.54
-O2 -march=athlon64: 4.57
AMD Ryzen Threadripper 2990WX 32-Core: 564.07

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 0.04, 0.01, 0.03, 0.01, 0.03, 0.02, 0.03 (N = 3 each)
(CXX) g++ options: -fopenmp -O3

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

SVT-VP9 2019-02-17, 1080p 8-bit YUV To VP9 Video Encode
Frames per second, more is better

-O3 -march=native -flto: 104.42
PGO: 103.97
-O3 -march=athlon64: 103.45
-O3 -march=native: 102.91
-O2 -march=athlon64: 101.13
-O2 -march=native: 100.82
-O3 -march=athlon64-sse3: 97.81
AMD Ryzen Threadripper 2990WX 32-Core: 5.07

SE as exported (7 values for 8 runs, so they cannot be matched to runs): +/- 1.19 (N = 15), 0.96 (N = 15), 1.26 (N = 15), 1.13 (N = 15), 0.98 (N = 15), 0.82 (N = 3), 0.78 (N = 3)
(CC) gcc options: -flto -fPIE -fPIC -O2 -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.org: CppPerformanceBenchmarks 9, Test: Stepanov Abstraction (Seconds, Fewer Is Better)
-O3 -march=native: 28.22
-O3 -march=native -flto: 28.25
-O3 -march=athlon64: 28.26
-O3 -march=athlon64-sse3: 28.28
-O2 -march=athlon64: 28.33
PGO: 28.36
-O2 -march=native: 28.69
AMD Ryzen Threadripper 2990WX 32-Core: 29.08
SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3; SE +/- 0.02, N = 3; SE +/- 0.13, N = 3
1. (CXX) g++ options: -O3 -std=c++11

Memcached mcperf

Method: Delete

OpenBenchmarking.org: Memcached mcperf 1.5.10, Method: Delete (Operations Per Second, More Is Better)
-O3 -march=athlon64-sse3: 69509
PGO: 68980
-O3 -march=native -flto: 68696
-O3 -march=native: 58969
-O3 -march=athlon64: 56822
AMD Ryzen Threadripper 2990WX 32-Core: 56797
-O2 -march=native: 56141
-O2 -march=athlon64: 55747
SE +/- 914.89, N = 4; SE +/- 370.63, N = 3; SE +/- 220.33, N = 3; SE +/- 772.91, N = 4; SE +/- 204.60, N = 3; SE +/- 304.30, N = 3; SE +/- 274.96, N = 3
1. (CC) gcc options: -lm -rdynamic

Redis

Test: LPOP

OpenBenchmarking.org: Redis 4.0.8, Test: LPOP (Requests Per Second, More Is Better)
-O2 -march=athlon64: 2654025
PGO: 2648345
-O3 -march=native -flto: 2616703
-O3 -march=native: 2573823
-O3 -march=athlon64: 2544762
-O3 -march=athlon64-sse3: 2506276
-O2 -march=native: 2340595
SE +/- 41423.26, N = 15; SE +/- 25803.36, N = 3; SE +/- 24550.99, N = 9; SE +/- 34288.12, N = 3; SE +/- 25886.44, N = 8; SE +/- 3626.56, N = 3; SE +/- 24290.24, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Memcached mcperf

Method: Get

OpenBenchmarking.org: Memcached mcperf 1.5.10, Method: Get (Operations Per Second, More Is Better)
-O3 -march=athlon64-sse3: 69004
-O3 -march=native -flto: 68644
PGO: 68426
-O3 -march=native: 57652
-O3 -march=athlon64: 56837
AMD Ryzen Threadripper 2990WX 32-Core: 56324
-O2 -march=native: 55791
-O2 -march=athlon64: 55647
SE +/- 286.25, N = 3; SE +/- 66.15, N = 3; SE +/- 734.34, N = 3; SE +/- 812.95, N = 3; SE +/- 29.60, N = 3; SE +/- 125.71, N = 3; SE +/- 339.02, N = 3
1. (CC) gcc options: -lm -rdynamic

t-test1

Threads: 2

OpenBenchmarking.org: t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 8.68
-O3 -march=athlon64-sse3: 8.82
-O3 -march=native -flto: 8.89
-O3 -march=native: 9.01
-O2 -march=native: 9.40
PGO: 9.41
-O3 -march=athlon64: 9.59
AMD Ryzen Threadripper 2990WX 32-Core: 14.05
SE +/- 0.10, N = 3; SE +/- 0.10, N = 3; SE +/- 0.08, N = 10; SE +/- 0.09, N = 15; SE +/- 0.12, N = 3; SE +/- 0.12, N = 3; SE +/- 0.10, N = 15
1. (CC) gcc options: -pthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.org: OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)
-O3 -march=athlon64-sse3: 5838
-O3 -march=athlon64: 5837
-O3 -march=native -flto: 5833
-O2 -march=athlon64: 5832
-O2 -march=native: 5831
PGO: 5830
-O3 -march=native: 5825
AMD Ryzen Threadripper 2990WX 32-Core: 5791
SE +/- 1.25, N = 3; SE +/- 1.40, N = 3; SE +/- 2.44, N = 3; SE +/- 0.92, N = 3; SE +/- 1.56, N = 3; SE +/- 4.62, N = 3; SE +/- 3.74, N = 3
1. (CC) gcc options: -pthread -m64 -lcrypto -ldl

x265

H.265 1080p Video Encoding

OpenBenchmarking.org: x265 3.0, H.265 1080p Video Encoding (Frames Per Second, More Is Better)
-O3 -march=athlon64: 33.89
-O3 -march=athlon64-sse3: 33.88
PGO: 33.79
-O3 -march=native: 33.76
-O3 -march=native -flto: 33.68
-O2 -march=athlon64: 33.53
-O2 -march=native: 33.44
AMD Ryzen Threadripper 2990WX 32-Core: 9.95
SE +/- 0.04, N = 3; SE +/- 0.06, N = 3; SE +/- 0.07, N = 3; SE +/- 0.09, N = 3; SE +/- 0.04, N = 3; SE +/- 0.06, N = 3; SE +/- 0.09, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.org: FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better)
PGO: 9.44
-O3 -march=native -flto: 9.48
-O3 -march=native: 9.53
-O2 -march=native: 9.78
AMD Ryzen Threadripper 2990WX 32-Core: 9.81
-O3 -march=athlon64-sse3: 15.43
-O3 -march=athlon64: 15.48
-O2 -march=athlon64: 15.58
SE +/- 0.00, N = 5; SE +/- 0.06, N = 5; SE +/- 0.01, N = 5; SE +/- 0.05, N = 5; SE +/- 0.01, N = 5; SE +/- 0.04, N = 5; SE +/- 0.01, N = 5
1. (CXX) g++ options: -fvisibility=hidden -lm

Redis

Test: SET

OpenBenchmarking.org: Redis 4.0.8, Test: SET (Requests Per Second, More Is Better)
-O3 -march=athlon64: 1840806
-O3 -march=native: 1807270
PGO: 1798800
-O3 -march=athlon64-sse3: 1755590
-O3 -march=native -flto: 1755472
-O2 -march=native: 1744979
-O2 -march=athlon64: 1730702
SE +/- 16989.46, N = 3; SE +/- 6043.61, N = 3; SE +/- 14705.69, N = 3; SE +/- 12476.39, N = 3; SE +/- 7219.96, N = 3; SE +/- 28790.51, N = 15; SE +/- 22619.48, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

OpenBenchmarking.org: Redis 4.0.8, Test: SADD (Requests Per Second, More Is Better)
-O3 -march=native -flto: 2084089
PGO: 2083430
-O3 -march=athlon64: 2051129
-O3 -march=native: 2046557
-O3 -march=athlon64-sse3: 2032921
-O2 -march=native: 2000709
-O2 -march=athlon64: 1975322
SE +/- 27875.47, N = 3; SE +/- 10024.22, N = 3; SE +/- 23560.29, N = 3; SE +/- 13234.25, N = 3; SE +/- 24795.31, N = 5; SE +/- 26134.11, N = 12; SE +/- 18375.04, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

OpenBenchmarking.org: Redis 4.0.8, Test: GET (Requests Per Second, More Is Better)
-O3 -march=athlon64: 2540398
PGO: 2533027
-O3 -march=native -flto: 2509433
-O3 -march=native: 2502182
-O3 -march=athlon64-sse3: 2461725
-O2 -march=athlon64: 2384143
-O2 -march=native: 2275019
SE +/- 15021.00, N = 3; SE +/- 26635.52, N = 12; SE +/- 36774.66, N = 3; SE +/- 11016.75, N = 3; SE +/- 28994.69, N = 3; SE +/- 39327.40, N = 3; SE +/- 25441.63, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.org: CppPerformanceBenchmarks 9, Test: Function Objects (Seconds, Fewer Is Better)
-O3 -march=native: 15.40
PGO: 15.46
-O3 -march=native -flto: 15.50
-O2 -march=native: 15.61
-O3 -march=athlon64-sse3: 16.14
-O3 -march=athlon64: 16.17
-O2 -march=athlon64: 16.20
AMD Ryzen Threadripper 2990WX 32-Core: 22.07
SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -std=c++11

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.org: MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32 (ms, Fewer Is Better)
-O3 -march=athlon64: 70.94 (MIN: 70.24)
PGO: 70.94 (MIN: 70.39)
AMD Ryzen Threadripper 2990WX 32-Core: 71.00 (MIN: 70.47)
-O3 -march=athlon64-sse3: 71.02 (MIN: 70.33)
-O3 -march=native: 71.26 (MIN: 70.22)
-O2 -march=athlon64: 71.98 (MIN: 70.78)
-O2 -march=native: 72.08 (MIN: 70.86)
SE +/- 0.14, N = 3; SE +/- 0.06, N = 3; SE +/- 0.04, N = 3; SE +/- 0.40, N = 3; SE +/- 0.09, N = 3; SE +/- 0.24, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.org: SVT-AV1 2019-03-07, 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better)
-O3 -march=native -flto: 20.41
-O3 -march=native: 20.27
PGO: 19.57
-O3 -march=athlon64: 18.84
AMD Ryzen Threadripper 2990WX 32-Core: 18.77
-O3 -march=athlon64-sse3: 18.66
-O2 -march=native: 18.63
-O2 -march=athlon64: 18.51
SE +/- 0.08, N = 3; SE +/- 0.05, N = 3; SE +/- 0.10, N = 3; SE +/- 0.11, N = 3; SE +/- 0.09, N = 3; SE +/- 0.08, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3 -pie -lpthread -lm

Redis

Test: LPUSH

OpenBenchmarking.org: Redis 4.0.8, Test: LPUSH (Requests Per Second, More Is Better)
-O3 -march=native -flto: 1549647
-O2 -march=athlon64: 1548556
-O3 -march=native: 1540257
PGO: 1532038
-O3 -march=athlon64-sse3: 1522952
-O3 -march=athlon64: 1520276
-O2 -march=native: 1463388
SE +/- 6810.29, N = 3; SE +/- 20493.00, N = 5; SE +/- 12857.88, N = 3; SE +/- 22070.89, N = 3; SE +/- 9157.67, N = 3; SE +/- 19988.57, N = 3; SE +/- 22581.44, N = 3
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.org: SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode (Frames Per Second, More Is Better)
PGO: 185.00
-O2 -march=athlon64: 172.00
-O3 -march=athlon64: 168.00
-O3 -march=athlon64-sse3: 166.00
-O3 -march=native -flto: 165.00
-O3 -march=native: 165.00
-O2 -march=native: 163.00
AMD Ryzen Threadripper 2990WX 32-Core: 23.01
SE +/- 6.47, N = 15; SE +/- 4.80, N = 15; SE +/- 2.41, N = 3; SE +/- 0.65, N = 3; SE +/- 2.04, N = 3; SE +/- 1.96, N = 3; SE +/- 3.04, N = 12
1. (CC) gcc options: -march=native -fPIE -fPIC -O2 -flto -fvisibility=hidden -pie -rdynamic -lpthread -lrt

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.org: Timed MAFFT Alignment 7.392, Multiple Sequence Alignment (Seconds, Fewer Is Better)
-O3 -march=athlon64-sse3: 2.55
-O3 -march=athlon64: 2.58
-O2 -march=native: 2.58
-O2 -march=athlon64: 2.60
PGO: 2.63
-O3 -march=native -flto: 2.65
-O3 -march=native: 2.69
AMD Ryzen Threadripper 2990WX 32-Core: 2.74
SE +/- 0.03, N = 15; SE +/- 0.02, N = 15; SE +/- 0.06, N = 15; SE +/- 0.05, N = 15; SE +/- 0.00, N = 3; SE +/- 0.03, N = 15; SE +/- 0.01, N = 3
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.org: LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)
-O3 -march=native -flto: 7.98
PGO: 7.98
-O3 -march=native: 8.00
-O3 -march=athlon64-sse3: 9.36
-O3 -march=athlon64: 9.43
-O2 -march=native: 10.80
AMD Ryzen Threadripper 2990WX 32-Core: 11.09
-O2 -march=athlon64: 11.41
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.06, N = 3; SE +/- 0.04, N = 3
1. (CC) gcc options: -lm

x264

H.264 Video Encoding

OpenBenchmarking.org: x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
-O3 -march=native: 147
-O3 -march=athlon64-sse3: 146
-O3 -march=athlon64: 146
-O2 -march=athlon64: 146
PGO: 145
-O2 -march=native: 143
SE +/- 1.46, N = 9; SE +/- 1.95, N = 5; SE +/- 0.81, N = 3; SE +/- 1.63, N = 7; SE +/- 0.95, N = 3; SE +/- 1.95, N = 3
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: Raytests (Seconds, Fewer Is Better)
-O3 -march=native -flto: 2.30
PGO: 2.34
-O3 -march=native: 2.35
-O2 -march=native: 2.38
-O3 -march=athlon64: 2.46
-O3 -march=athlon64-sse3: 2.46
-O2 -march=athlon64: 2.49
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

AI Chess Performance

OpenBenchmarking.org: TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
-O3 -march=native -flto: 1135626
-O3 -march=native: 1116747
-O3 -march=athlon64-sse3: 1115841
-O3 -march=athlon64: 1115390
PGO: 1109114
-O2 -march=native: 1102013
-O2 -march=athlon64: 1094211
AMD Ryzen Threadripper 2990WX 32-Core: 981778
SE +/- 740.45, N = 5; SE +/- 1107.42, N = 5; SE +/- 903.00, N = 5; SE +/- 1105.70, N = 5; SE +/- 2184.70, N = 5; SE +/- 2140.82, N = 5; SE +/- 5126.15, N = 5
1. (CC) gcc options: -O3 -march=native

ctx_clock

Context Switch Time

OpenBenchmarking.org: ctx_clock, Context Switch Time (Clocks, Fewer Is Better)
-O2 -march=athlon64: 150
-O3 -march=athlon64: 150
-O3 -march=athlon64-sse3: 150
-O2 -march=native: 150
-O3 -march=native: 150
-O3 -march=native -flto: 150
PGO: 150
AMD Ryzen Threadripper 2990WX 32-Core: 150

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: Convex Trimesh (Seconds, Fewer Is Better)
-O3 -march=native -flto: 0.97
-O3 -march=native: 1.00
PGO: 1.00
-O2 -march=native: 1.02
-O3 -march=athlon64: 1.06
-O3 -march=athlon64-sse3: 1.06
-O2 -march=athlon64: 1.07
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
-O3 -march=native -flto: 0.82
PGO: 0.83
-O2 -march=native: 0.84
-O3 -march=native: 0.84
-O3 -march=athlon64: 0.86
-O3 -march=athlon64-sse3: 0.86
-O2 -march=athlon64: 0.87
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
-O3 -march=native: 2.30
PGO: 2.32
-O3 -march=athlon64-sse3: 2.35
-O3 -march=athlon64: 2.36
-O2 -march=native: 2.39
-O2 -march=athlon64: 2.42
-O3 -march=native -flto: 2.44
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.03, N = 3; SE +/- 0.03, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
-O3 -march=native -flto: 3.90
-O3 -march=native: 3.97
PGO: 3.97
-O2 -march=native: 4.06
-O3 -march=athlon64: 4.45
-O3 -march=athlon64-sse3: 4.45
-O2 -march=athlon64: 4.50
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
PGO: 4.41
-O3 -march=native: 4.42
-O2 -march=native: 4.57
-O3 -march=athlon64-sse3: 4.62
-O3 -march=athlon64: 4.65
-O2 -march=athlon64: 4.68
-O3 -march=native -flto: 4.82
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.03, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.04, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.org: Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
PGO: 3.83
-O3 -march=native: 3.84
-O3 -march=athlon64-sse3: 3.91
-O3 -march=athlon64: 3.92
-O2 -march=native: 3.95
-O2 -march=athlon64: 3.96
-O3 -march=native -flto: 3.97
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.02, N = 3; SE +/- 0.03, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.org: SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
-O3 -march=native: 2218
PGO: 2208
-O3 -march=native -flto: 2202
AMD Ryzen Threadripper 2990WX 32-Core: 2190
-O3 -march=athlon64-sse3: 1842
-O3 -march=athlon64: 1842
-O2 -march=native: 1306
-O2 -march=athlon64: 1186
SE +/- 0.57, N = 3; SE +/- 0.39, N = 3; SE +/- 0.48, N = 3; SE +/- 0.37, N = 3; SE +/- 0.24, N = 3; SE +/- 7.73, N = 3; SE +/- 5.74, N = 3
1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.org: SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
PGO: 6356
-O3 -march=native: 5989
AMD Ryzen Threadripper 2990WX 32-Core: 5429
-O3 -march=native -flto: 5388
-O2 -march=native: 4507
-O3 -march=athlon64: 4274
-O3 -march=athlon64-sse3: 4239
-O2 -march=athlon64: 3593
SE +/- 11.42, N = 3; SE +/- 352.63, N = 3; SE +/- 139.92, N = 3; SE +/- 35.70, N = 3; SE +/- 1.14, N = 3; SE +/- 4.78, N = 3; SE +/- 15.82, N = 3
1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.org: SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
PGO: 3220
-O3 -march=native: 3174
AMD Ryzen Threadripper 2990WX 32-Core: 3153
-O2 -march=athlon64: 3119
-O2 -march=native: 3105
-O3 -march=athlon64-sse3: 3082
-O3 -march=native -flto: 2951
-O3 -march=athlon64: 2874
SE +/- 9.00, N = 3; SE +/- 37.52, N = 3; SE +/- 7.39, N = 3; SE +/- 52.49, N = 3; SE +/- 4.22, N = 3; SE +/- 20.52, N = 3; SE +/- 208.10, N = 3
1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.org: SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
-O3 -march=athlon64-sse3: 294
-O3 -march=athlon64: 294
-O2 -march=athlon64: 291
-O3 -march=native -flto: 270
-O3 -march=native: 270
-O2 -march=native: 265
PGO: 261
AMD Ryzen Threadripper 2990WX 32-Core: 260
SE +/- 0.67, N = 3; SE +/- 0.13, N = 3; SE +/- 0.93, N = 3; SE +/- 0.17, N = 3; SE +/- 0.11, N = 3; SE +/- 1.28, N = 3; SE +/- 0.21, N = 3
1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.org: SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
-O3 -march=native -flto: 1904
-O3 -march=athlon64-sse3: 737
-O3 -march=athlon64: 736
-O3 -march=native: 732
PGO: 728
-O2 -march=athlon64: 723
-O2 -march=native: 721
AMD Ryzen Threadripper 2990WX 32-Core: 255
SE +/- 0.38, N = 3; SE +/- 0.20, N = 3; SE +/- 0.82, N = 3; SE +/- 0.10, N = 3; SE +/- 0.27, N = 3; SE +/- 3.18, N = 3; SE +/- 3.93, N = 3
1. (CC) gcc options: -lm

LuaJIT

Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.org: LuaJIT 2.1-git, Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
PGO: 1870
-O3 -march=native -flto: 1868
-O3 -march=native: 1868
-O3 -march=athlon64-sse3: 1868
-O3 -march=athlon64: 1867
-O2 -march=athlon64: 1865
-O2 -march=native: 1830
SE +/- 0.37, N = 3; SE +/- 0.35, N = 3; SE +/- 0.27, N = 3; SE +/- 0.46, N = 3; SE +/- 0.46, N = 3; SE +/- 0.07, N = 3; SE +/- 11.29, N = 3
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Dense LU Matrix Factorization

OpenBenchmarking.org: LuaJIT 2.1-git, Test: Dense LU Matrix Factorization (Mflops, More Is Better)
PGO: 3636
-O3 -march=native -flto: 3624
-O2 -march=athlon64: 3623
-O3 -march=athlon64-sse3: 3619
-O3 -march=native: 3611
-O3 -march=athlon64: 3590
-O2 -march=native: 3550
SE +/- 8.61, N = 3; SE +/- 1.48, N = 3; SE +/- 8.06, N = 3; SE +/- 5.63, N = 3; SE +/- 2.52, N = 3; SE +/- 22.76, N = 3; SE +/- 31.00, N = 3
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Sparse Matrix Multiply

OpenBenchmarking.org: LuaJIT 2.1-git, Test: Sparse Matrix Multiply (Mflops, More Is Better)
-O3 -march=native: 1208
-O3 -march=athlon64-sse3: 1207
-O3 -march=native -flto: 1204
PGO: 1203
-O3 -march=athlon64: 1200
-O2 -march=athlon64: 1183
-O2 -march=native: 1182
SE +/- 1.12, N = 3; SE +/- 1.43, N = 3; SE +/- 1.60, N = 3; SE +/- 0.69, N = 3; SE +/- 2.17, N = 3; SE +/- 14.55, N = 3; SE +/- 7.82, N = 3
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Fast Fourier Transform

OpenBenchmarking.org: LuaJIT 2.1-git, Test: Fast Fourier Transform (Mflops, More Is Better)
-O3 -march=native -flto: 287
-O3 -march=athlon64-sse3: 287
-O3 -march=athlon64: 287
PGO: 286
-O3 -march=native: 286
-O2 -march=athlon64: 286
-O2 -march=native: 281
SE +/- 0.09, N = 3; SE +/- 0.81, N = 3; SE +/- 0.62, N = 3; SE +/- 0.27, N = 3; SE +/- 0.06, N = 3; SE +/- 0.26, N = 3; SE +/- 2.77, N = 3
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Monte Carlo

OpenBenchmarking.org: LuaJIT 2.1-git, Test: Monte Carlo (Mflops, More Is Better)
-O3 -march=native -flto: 500
PGO: 499
-O3 -march=native: 499
-O3 -march=athlon64-sse3: 499
-O3 -march=athlon64: 498
-O2 -march=athlon64: 498
-O2 -march=native: 490
SE +/- 1.65, N = 3; SE +/- 0.60, N = 3; SE +/- 0.17, N = 3; SE +/- 0.04, N = 3; SE +/- 0.46, N = 3; SE +/- 0.13, N = 3; SE +/- 2.87, N = 3
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector


Phoronix Test Suite v10.8.5