GCC 9.1 Compiler Tuning Threadripper AMD znver1

AMD Ryzen Threadripper 2990WX compiler benchmarks on GCC 9.1 with Ubuntu Linux by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1905122-HV-GCC91COMP71&grr.

GCC 9.1 Compiler Tuning Threadripper AMD znver1ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-CoreAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1701 BIOS)AMD 17h32768MBSamsung SSD 970 EVO 500GBAMD Radeon RX 64 8GB (1590/800MHz)Realtek ALC1220ASUS VP28UIntel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 18.044.18.0-18-generic (x86_64)GNOME Shell 3.28.3X Server 1.20.1amdgpu 18.1.04.5 Mesa 18.2.8 (LLVM 7.0.0)GCC 9.1.0ext43840x2160OpenBenchmarking.orgEnvironment Details- -O2 -march=athlon64: CXXFLAGS=-O2-march=athlon64 CFLAGS=-O2-march=athlon64- -O3 -march=athlon64: CXXFLAGS=-O3-march=athlon64 CFLAGS=-O3-march=athlon64- -O3 -march=athlon64-sse3: CXXFLAGS=-O3-march=athlon64-sse3 CFLAGS=-O3-march=athlon64-sse3- -O2 -march=native: CXXFLAGS=-O2-march=native CFLAGS=-O2-march=native- -O3 -march=native: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- -O3 -march=native -flto: CXXFLAGS=-O3-march=native-flto CFLAGS=-O3-march=native-flto- PGO: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native- AMD Ryzen Threadripper 2990WX 32-Core: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=nativeCompiler Details- --disable-multilib --enable-checing=releaseProcessor Details- Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.15rc1 + Python 3.6.7Security Details- __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

GCC 9.1 Compiler Tuning Threadripper AMD znver1cpp-perf-bench: Rand Numberscpp-perf-bench: Math Libraryvpxenc: vpxenc VP9 1080p Video Encodefftw: Float + SSE - 2D FFT Size 4096hpcg: c-ray: Total Time - 4K, 16 Rays Per Pixelfftw: Stock - 2D FFT Size 4096pgbench: Buffer Test - Normal Load - Read Writembw: Memory Copy, Fixed Block Size - 8192 MiBbuild-llvm: Time To Compilecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9aom-av1: AV1 Video Encodingcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19mcperf: Addmbw: Memory Copy - 8192 MiBpgbench: Buffer Test - Normal Load - Read Onlymcperf: Setnginx: Static Web Page Servingstockfish: Total Timecpp-perf-bench: Stepanov Vectorcpp-perf-bench: Atolgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: Enhancedgraphics-magick: Noise-Gaussiangraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: HWB Color Spacehimeno: Poisson Pressure Solvermcperf: Prependbuild-php: Time To Compilemcperf: Appendt-test1: 1aobench: 2048 x 2048 - Total Timemcperf: Replacescimark2: Compositecpp-perf-bench: Ctypeluajit: Compositebuild-imagemagick: Time To Compilesmallpt: Global Illumination Renderer; 128 Samplessvt-vp9: 1080p 8-bit YUV To VP9 Video Encodecpp-perf-bench: Stepanov Abstractionmcperf: Deleteredis: LPOPmcperf: Gett-test1: 2openssl: RSA 4096-bit Performancex265: H.265 1080p Video Encodingencode-flac: WAV To FLACredis: SETredis: SADDredis: GETcpp-perf-bench: Function Objectsmkl-dnn: IP Batch 1D - f32svt-av1: 1080p 8-bit YUV To AV1 Video Encoderedis: LPUSHsvt-hevc: 1080p 8-bit YUV To HEVC Video Encodemafft: Multiple Sequence Alignmentencode-mp3: WAV To MP3x264: H.264 Video Encodingbullet: Rayteststscp: AI Chess Performancectx-clock: Context Switch Timebullet: Convex Trimeshbullet: Prim Trimeshbullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carloluajit: Jacobi Successive Over-Relaxationluajit: Dense LU Matrix Factorizationluajit: Sparse Matrix Multiplyluajit: Fast Fourier Transformluajit: Monte Carlo-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core107239925.440.8945.4343425098652622926.750.2018.02546001263245360559924297046689068775.7969.0819223519820224424327213283546743.866121925.6047.2835829178233.37149117.414.57101.1328.33557472654025556478.68583233.5315.5817307021975322238414316.2071.9818.5115485561722.6011.411462.4910942111501.070.872.424.504.683.96118635933119291723186536231183286498107039826.120.9229.39436115616675522526.500.2017.74531061293046384034880297266751360275.5368.3719023319519923723826113163596863.195617229.6645.2036231202133.28148919.774.51103.4528.26568222544762568379.59583733.8915.4818408062051129254039816.1770.9418.8415202761682.589.431462.4611153901501.060.862.364.454.653.92184242742874294736186735901200287498105539925.580.8529.53446316252668122425.930.2118.38348221269945959648548292816757115075.5668.4219123119419823724026313164569162.934571627.1244.9845707203933.31149619.734.5497.8128.28695092506276690048.82583833.8815.4317555902032921246172516.1471.0218.6615229521662.559.361462.4611158411501.060.862.354.454.623.91184242393082294737186836191207287499104135626.02162630.8134.22633513558677822126.780.2119.31341381279545855134446278346669748775.9969.8021723823120024524927113193555244.443505826.0742.3435486198134.50146717.583.89100.8228.69561412340595557919.40583133.449.7817449792000709227501915.6172.0818.6314633881632.5810.801432.3811020131501.020.842.394.064.573.95130645073105265721183035501182281490102335126.37149270.8318.00660016339668522125.700.2219.09449031285146672338646292746820016474.9369.0321924323220424724927213134382463.254245526.2339.1153865251433.88149519.153.87102.9128.22589692573823576529.01582533.769.5318072702046557250218215.4071.2620.2715402571652.698.001472.3511167471501.000.842.303.974.423.842218598931742707321868361112082864991011352148640.9817.85701914989672195125.620.2217.94500551317647362043570273526745068974.9269.252212402332032502482741304455874533926.3139.9245591254332.32149788.893.85104.4228.25686962616703686448.89583333.689.4817554722084089250943315.5020.4115496471652.657.982.3011356261500.970.822.443.904.823.972202538829512701904186836241204287500102735326.36152870.9117.9667176281675326.090.2217.364777412406460414463966784187775.2769.3122024323420325025127413214708563.014610628.7439.1045956255534.02149919.093.83103.9728.36689802648345684269.41583033.799.4417988002083430253302715.4670.9419.5715320381852.637.981452.3411091141501.000.832.323.974.413.832208635632202617281870363612032864991154355150980.904024.62641164726596290708.150.08265.7935174126182590583523978.1469.25431840261755913223557985.403592027.8343.3635978225737.6719.69564.075.0729.08567975632414.0557919.959.8122.0771.0018.7723.012.7411.09981778150219054293153260255OpenBenchmarking.org

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random Numbers-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core2004006008001000SE +/- 0.07, N = 3SE +/- 0.71, N = 3SE +/- 0.04, N = 3SE +/- 4.70, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 310721070105510411023101110271154-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math Library-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core90180270360450SE +/- 0.87, N = 3SE +/- 0.79, N = 3SE +/- 0.83, N = 3SE +/- 1.79, N = 3SE +/- 0.11, N = 3SE +/- 0.73, N = 3SE +/- 0.93, N = 3399398399356351352353355-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=nativePGO612182430SE +/- 0.26, N = 9SE +/- 0.00, N = 3SE +/- 0.40, N = 3SE +/- 0.22, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 325.4426.1225.5826.0226.3726.36-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core3K6K9K12K15KSE +/- 13.87, N = 3SE +/- 71.85, N = 3SE +/- 74.46, N = 3SE +/- 152.85, N = 31626314927148641528715098-O2-O3-O3 -flto-O3-O31. (CC) gcc options: -pthread -march=native -lm

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core0.22050.4410.66150.8821.1025SE +/- 0.02, N = 13SE +/- 0.02, N = 15SE +/- 0.01, N = 3SE +/- 0.02, N = 15SE +/- 0.03, N = 12SE +/- 0.02, N = 3SE +/- 0.03, N = 120.890.920.850.810.830.980.910.90

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core9001800270036004500SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 345.4329.3929.5334.2218.0017.8517.964024.62-march=native1. (CC) gcc options: -lm -lpthread -O3

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core15003000450060007500SE +/- 28.80, N = 3SE +/- 63.69, N = 3SE +/- 1.73, N = 3SE +/- 34.65, N = 3SE +/- 51.66, N = 3SE +/- 8.84, N = 3SE +/- 14.46, N = 343424361446363356600701967176411-O2-O3-O3 -flto-O3-O31. (CC) gcc options: -pthread -march=native -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core4K8K12K16K20KSE +/- 65.89, N = 4SE +/- 247.31, N = 3SE +/- 51.55, N = 3SE +/- 279.82, N = 15SE +/- 98.94, N = 3SE +/- 164.04, N = 7SE +/- 93.40, N = 45098156161625213558163391498962816472-O2 -march=athlon64 -lpq-O3 -march=athlon64 -lpq-O3 -march=athlon64-sse3 -lpq-O2 -march=native -lpq-O3 -march=native -lpq-O3 -march=native -flto -lpq-O3 -march=native -lpq-O3 -march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core15003000450060007500SE +/- 60.66, N = 3SE +/- 7.74, N = 3SE +/- 10.14, N = 3SE +/- 38.61, N = 3SE +/- 27.83, N = 3SE +/- 96.68, N = 3SE +/- 26.90, N = 365266755668167786685672167536596-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2-flto1. (CC) gcc options: -O3 -march=native

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To Compile-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoAMD Ryzen Threadripper 2990WX 32-Core2004006008001000229225224221221951290

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core150300450600750SE +/- 0.21, N = 15SE +/- 0.27, N = 3SE +/- 0.22, N = 12SE +/- 0.10, N = 3SE +/- 0.26, N = 15SE +/- 0.23, N = 15SE +/- 0.07, N = 326.7526.5025.9326.7825.7025.6226.09708.15-O3 -march=native1. (CC) gcc options: -pthread -fvisibility=hidden

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-02-11AV1 Video Encoding-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core0.04950.0990.14850.1980.2475SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.200.200.210.210.220.220.220.08-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core60120180240300SE +/- 0.60, N = 15SE +/- 0.69, N = 12SE +/- 0.66, N = 15SE +/- 0.61, N = 15SE +/- 0.65, N = 15SE +/- 0.66, N = 15SE +/- 0.45, N = 1518.0217.7418.3819.3119.0917.9417.36265.79-O3 -march=native1. (CC) gcc options: -pthread -lz -llzma

Memcached mcperf

Method: Add

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Add-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core12K24K36K48K60KSE +/- 2866.90, N = 12SE +/- 2252.95, N = 15SE +/- 121.25, N = 3SE +/- 125.83, N = 3SE +/- 1053.86, N = 15SE +/- 2185.23, N = 15SE +/- 2440.20, N = 125460053106348223413844903500554777435174-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core3K6K9K12K15KSE +/- 152.24, N = 5SE +/- 162.89, N = 5SE +/- 217.78, N = 3SE +/- 167.95, N = 5SE +/- 169.00, N = 3SE +/- 153.84, N = 3SE +/- 61.39, N = 31263212930126991279512851131761240612618-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2-flto1. (CC) gcc options: -O3 -march=native

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core100K200K300K400K500KSE +/- 2874.20, N = 3SE +/- 6987.31, N = 3SE +/- 6062.49, N = 4SE +/- 3403.88, N = 3SE +/- 1148.20, N = 3SE +/- 1547.77, N = 3SE +/- 7464.16, N = 3453605463840459596458551466723473620460414259058-O2 -march=athlon64 -lpq-O3 -march=athlon64 -lpq-O3 -march=athlon64-sse3 -lpq-O2 -march=native -lpq-O3 -march=native -lpq-O3 -march=native -flto -lpq-O3 -march=native -lpq-O3 -march=native1. (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Set-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core13K26K39K52K65KSE +/- 2193.95, N = 15SE +/- 44.53, N = 3SE +/- 2384.39, N = 15SE +/- 67.69, N = 3SE +/- 877.57, N = 15SE +/- 195.82, N = 3SE +/- 1474.59, N = 125992434880485483444638646435704639635239-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page Serving-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto6K12K18K24K30KSE +/- 219.17, N = 3SE +/- 169.85, N = 3SE +/- 428.25, N = 3SE +/- 166.84, N = 3SE +/- 401.04, N = 4SE +/- 43.76, N = 3297042972629281278342927427352-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2-flto1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO15M30M45M60M75MSE +/- 226981.30, N = 3SE +/- 779150.59, N = 3SE +/- 443906.77, N = 3SE +/- 526520.80, N = 3SE +/- 954655.13, N = 3SE +/- 385955.10, N = 3SE +/- 458502.78, N = 366890687675136026757115066697487682001646745068967841877-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native-march=native1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov Vector-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core20406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 375.7975.5375.5675.9974.9374.9275.2778.14-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Atol-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core1632486480SE +/- 0.05, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 369.0868.3768.4269.8069.0369.2569.3169.25-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core50100150200250SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.88, N = 31921901912172192212204-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core50100150200250SE +/- 1.15, N = 3SE +/- 1.00, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 323523323123824324024331-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core50100150200250SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 31981951942312322332348-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core4080120160200SE +/- 1.53, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 320219919820020420320340-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core50100150200250SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 1.00, N = 324423723724524725025026-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core50100150200250SE +/- 0.67, N = 3SE +/- 0.33, N = 3243238240249249248251175-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color Space-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core60120180240300SE +/- 0.33, N = 3SE +/- 1.33, N = 327226126327127227427459-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core30060090012001500SE +/- 3.65, N = 3SE +/- 3.98, N = 3SE +/- 4.00, N = 3SE +/- 3.88, N = 3SE +/- 1.73, N = 3SE +/- 1.86, N = 3SE +/- 3.47, N = 313281316131613191313130413211322-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CC) gcc options: -O3 -mavx2

Memcached mcperf

Method: Prepend

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Prepend-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core10K20K30K40K50KSE +/- 109.24, N = 3SE +/- 58.44, N = 3SE +/- 116.90, N = 3SE +/- 186.47, N = 3SE +/- 2300.04, N = 15SE +/- 179.60, N = 3SE +/- 863.93, N = 123546735968456913555243824455874708535579-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=nativePGOAMD Ryzen Threadripper 2990WX 32-Core20406080100SE +/- 0.20, N = 3SE +/- 0.32, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 343.8663.1962.9344.4463.2563.0185.40-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native-O3 -march=native1. (CC) gcc options: -pedantic -ldl -lz -lm

Memcached mcperf

Method: Append

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Append-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core13K26K39K52K65KSE +/- 2658.53, N = 15SE +/- 3045.08, N = 15SE +/- 220.19, N = 3SE +/- 168.27, N = 3SE +/- 256.24, N = 3SE +/- 218.35, N = 3SE +/- 123.87, N = 36121956172457163505842455453394610635920-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core714212835SE +/- 0.10, N = 3SE +/- 0.29, N = 9SE +/- 0.34, N = 15SE +/- 0.33, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.39, N = 325.6029.6627.1226.0726.2326.3128.7427.83-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -pthread

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core1122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 347.2845.2044.9842.3439.1139.9239.1043.36-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CC) gcc options: -lm -O3

Memcached mcperf

Method: Replace

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Replace-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core12K24K36K48K60KSE +/- 132.68, N = 3SE +/- 365.15, N = 3SE +/- 85.02, N = 3SE +/- 167.62, N = 3SE +/- 1741.69, N = 12SE +/- 174.58, N = 3SE +/- 168.47, N = 33582936231457073548653865455914595635978-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core5001000150020002500SE +/- 4.62, N = 3SE +/- 25.27, N = 5SE +/- 0.49, N = 3SE +/- 17.63, N = 3SE +/- 26.08, N = 8SE +/- 32.19, N = 3SE +/- 2.91, N = 317822021203919812514254325552257-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Ctype-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core918273645SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 333.3733.2833.3134.5033.8832.3234.0237.67-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

LuaJIT

Test: Composite

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Composite-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO30060090012001500SE +/- 4.50, N = 3SE +/- 4.29, N = 3SE +/- 1.46, N = 3SE +/- 10.67, N = 3SE +/- 0.72, N = 3SE +/- 0.43, N = 3SE +/- 1.60, N = 31491148914961467149514971499-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core20406080100SE +/- 0.21, N = 3SE +/- 0.23, N = 5SE +/- 0.03, N = 3SE +/- 0.24, N = 3SE +/- 0.25, N = 3SE +/- 1.39, N = 3SE +/- 0.07, N = 317.4119.7719.7317.5819.1588.8919.0919.69

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core120240360480600SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 34.574.514.543.893.873.853.83564.07-march=native1. (CXX) g++ options: -fopenmp -O3

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video Encode-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core20406080100SE +/- 0.98, N = 15SE +/- 1.26, N = 15SE +/- 0.78, N = 3SE +/- 0.82, N = 3SE +/- 1.13, N = 15SE +/- 1.19, N = 15SE +/- 0.96, N = 15101.13103.4597.81100.82102.91104.42103.975.07-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native-O3 -march=native1. (CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov Abstraction-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core714212835SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 328.3328.2628.2828.6928.2228.2528.3629.08-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

Memcached mcperf

Method: Delete

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Delete-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core15K30K45K60K75KSE +/- 274.96, N = 3SE +/- 204.60, N = 3SE +/- 914.89, N = 4SE +/- 304.30, N = 3SE +/- 772.91, N = 4SE +/- 220.33, N = 3SE +/- 370.63, N = 35574756822695095614158969686966898056797-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOP-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO600K1200K1800K2400K3000KSE +/- 41423.26, N = 15SE +/- 25886.44, N = 8SE +/- 3626.56, N = 3SE +/- 24290.24, N = 3SE +/- 34288.12, N = 3SE +/- 24550.99, N = 9SE +/- 25803.36, N = 32654025254476225062762340595257382326167032648345-O2 -O3 -march=native -flto1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Get-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core15K30K45K60K75KSE +/- 339.02, N = 3SE +/- 29.60, N = 3SE +/- 286.25, N = 3SE +/- 125.71, N = 3SE +/- 812.95, N = 3SE +/- 66.15, N = 3SE +/- 734.34, N = 35564756837690045579157652686446842656324-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm -rdynamic

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core48121620SE +/- 0.10, N = 3SE +/- 0.10, N = 15SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 15SE +/- 0.08, N = 10SE +/- 0.12, N = 38.689.598.829.409.018.899.4114.05-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -pthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core13002600390052006500SE +/- 0.92, N = 3SE +/- 1.40, N = 3SE +/- 1.25, N = 3SE +/- 1.56, N = 3SE +/- 3.74, N = 3SE +/- 2.44, N = 3SE +/- 4.62, N = 358325837583858315825583358305791-O2 -march=athlon64 -lssl-O3 -march=athlon64 -lssl-O3 -march=athlon64-sse3 -lssl-O2 -march=native -lssl-O3 -march=native -lssl-O3 -march=native -flto -lssl-O3 -march=native -lssl-O3 -march=native1. (CC) gcc options: -pthread -m64 -lcrypto -ldl

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core816243240SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 333.5333.8933.8833.4433.7633.6833.799.95-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core48121620SE +/- 0.01, N = 5SE +/- 0.04, N = 5SE +/- 0.01, N = 5SE +/- 0.05, N = 5SE +/- 0.01, N = 5SE +/- 0.06, N = 5SE +/- 0.00, N = 515.5815.4815.439.789.539.489.449.81-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CXX) g++ options: -fvisibility=hidden -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO400K800K1200K1600K2000KSE +/- 22619.48, N = 3SE +/- 16989.46, N = 3SE +/- 12476.39, N = 3SE +/- 28790.51, N = 15SE +/- 6043.61, N = 3SE +/- 7219.96, N = 3SE +/- 14705.69, N = 31730702184080617555901744979180727017554721798800-O2 -O3 -march=native -flto1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADD-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO400K800K1200K1600K2000KSE +/- 18375.04, N = 3SE +/- 23560.29, N = 3SE +/- 24795.31, N = 5SE +/- 26134.11, N = 12SE +/- 13234.25, N = 3SE +/- 27875.47, N = 3SE +/- 10024.22, N = 31975322205112920329212000709204655720840892083430-O2 -O3 -march=native -flto1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO500K1000K1500K2000K2500KSE +/- 39327.40, N = 3SE +/- 15021.00, N = 3SE +/- 28994.69, N = 3SE +/- 25441.63, N = 3SE +/- 11016.75, N = 3SE +/- 36774.66, N = 3SE +/- 26635.52, N = 122384143254039824617252275019250218225094332533027-O2 -O3 -march=native -flto1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function Objects-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316.2016.1716.1415.6115.4015.5015.4622.07-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -std=c++11 -O3

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=nativePGOAMD Ryzen Threadripper 2990WX 32-Core1632486480SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 3SE +/- 0.24, N = 3SE +/- 0.40, N = 3SE +/- 0.06, N = 371.9870.9471.0272.0871.2670.9471.00-O2 -march=athlon64 - MIN: 70.78-march=athlon64 - MIN: 70.24-march=athlon64-sse3 - MIN: 70.33-O2 - MIN: 70.86MIN: 70.22MIN: 70.39-lrt - MIN: 70.471. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2019-03-071080p 8-bit YUV To AV1 Video Encode-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core510152025SE +/- 0.17, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 318.5118.8418.6618.6320.2720.4119.5718.77-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native-march=native1. (CXX) g++ options: -O3 -pie -lpthread -lm

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSH-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO300K600K900K1200K1500KSE +/- 20493.00, N = 5SE +/- 19988.57, N = 3SE +/- 9157.67, N = 3SE +/- 22581.44, N = 3SE +/- 12857.88, N = 3SE +/- 6810.29, N = 3SE +/- 22070.89, N = 31548556152027615229521463388154025715496471532038-O2 -O3 -march=native -flto1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video Encode-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core4080120160200SE +/- 4.80, N = 15SE +/- 2.41, N = 3SE +/- 0.65, N = 3SE +/- 3.04, N = 12SE +/- 1.96, N = 3SE +/- 2.04, N = 3SE +/- 6.47, N = 15172.00168.00166.00163.00165.00165.00185.0023.01-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O3-O3-O31. (CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence Alignment-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core0.61651.2331.84952.4663.0825SE +/- 0.05, N = 15SE +/- 0.02, N = 15SE +/- 0.03, N = 15SE +/- 0.06, N = 15SE +/- 0.01, N = 3SE +/- 0.03, N = 15SE +/- 0.00, N = 32.602.582.552.582.692.652.632.741. (CC) gcc options: -std=c99 -O3 -lm -lpthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core3691215SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.419.439.3610.808.007.987.9811.09-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=nativePGO306090120150SE +/- 1.63, N = 7SE +/- 0.81, N = 3SE +/- 1.95, N = 5SE +/- 1.95, N = 3SE +/- 1.46, N = 9SE +/- 0.95, N = 3146146146143147145-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO0.56031.12061.68092.24122.8015SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.492.462.462.382.352.302.34-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core200K400K600K800K1000KSE +/- 5126.15, N = 5SE +/- 1105.70, N = 5SE +/- 903.00, N = 5SE +/- 2140.82, N = 5SE +/- 1107.42, N = 5SE +/- 740.45, N = 5SE +/- 2184.70, N = 51094211111539011158411102013111674711356261109114981778-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2-flto1. (CC) gcc options: -O3 -march=native

ctx_clock

Context Switch Time

OpenBenchmarking.orgClocks, Fewer Is Betterctx_clockContext Switch Time-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core306090120150150150150150150150150150-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options:

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex Trimesh-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO0.24080.48160.72240.96321.204SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.071.061.061.021.000.971.00-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim Trimesh-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO0.19580.39160.58740.78320.979SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.870.860.860.840.840.820.83-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO0.5491.0981.6472.1962.745SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.422.362.352.392.302.442.32-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO1.01252.0253.03754.055.0625SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.504.454.454.063.973.903.97-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO1.08452.1693.25354.3385.4225SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.684.654.624.574.424.824.41-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO0.89331.78662.67993.57324.4665SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 33.963.923.913.953.843.973.83-O2 -march=athlon64-march=athlon64-march=athlon64-sse3-O2 -march=native-march=native-march=native -flto-march=native1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core5001000150020002500SE +/- 5.74, N = 3SE +/- 0.24, N = 3SE +/- 0.37, N = 3SE +/- 7.73, N = 3SE +/- 0.57, N = 3SE +/- 0.48, N = 3SE +/- 0.39, N = 311861842184213062218220222082190-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core14002800420056007000SE +/- 15.82, N = 3SE +/- 1.14, N = 3SE +/- 4.78, N = 3SE +/- 35.70, N = 3SE +/- 352.63, N = 3SE +/- 139.92, N = 3SE +/- 11.42, N = 335934274423945075989538863565429-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core7001400210028003500SE +/- 7.39, N = 3SE +/- 208.10, N = 3SE +/- 4.22, N = 3SE +/- 52.49, N = 3SE +/- 37.52, N = 3SE +/- 20.52, N = 3SE +/- 9.00, N = 331192874308231053174295132203153-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core60120180240300SE +/- 0.93, N = 3SE +/- 0.13, N = 3SE +/- 0.67, N = 3SE +/- 1.28, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3SE +/- 0.21, N = 3291294294265270270261260-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGOAMD Ryzen Threadripper 2990WX 32-Core400800120016002000SE +/- 3.18, N = 3SE +/- 0.82, N = 3SE +/- 0.20, N = 3SE +/- 3.93, N = 3SE +/- 0.10, N = 3SE +/- 0.38, N = 3SE +/- 0.27, N = 37237367377217321904728255-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native-O3 -march=native1. (CC) gcc options: -lm

LuaJIT

Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Jacobi Successive Over-Relaxation-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO400800120016002000SE +/- 0.07, N = 3SE +/- 0.46, N = 3SE +/- 0.46, N = 3SE +/- 11.29, N = 3SE +/- 0.27, N = 3SE +/- 0.35, N = 3SE +/- 0.37, N = 31865186718681830186818681870-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Dense LU Matrix Factorization-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO8001600240032004000SE +/- 8.06, N = 3SE +/- 22.76, N = 3SE +/- 5.63, N = 3SE +/- 31.00, N = 3SE +/- 2.52, N = 3SE +/- 1.48, N = 3SE +/- 8.61, N = 33623359036193550361136243636-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Sparse Matrix Multiply-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO30060090012001500SE +/- 14.55, N = 3SE +/- 2.17, N = 3SE +/- 1.43, N = 3SE +/- 7.82, N = 3SE +/- 1.12, N = 3SE +/- 1.60, N = 3SE +/- 0.69, N = 31183120012071182120812041203-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Fast Fourier Transform-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO60120180240300SE +/- 0.26, N = 3SE +/- 0.62, N = 3SE +/- 0.81, N = 3SE +/- 2.77, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3286287287281286287286-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Monte Carlo-O2 -march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-O2 -march=native-O3 -march=native-O3 -march=native -fltoPGO110220330440550SE +/- 0.13, N = 3SE +/- 0.46, N = 3SE +/- 0.04, N = 3SE +/- 2.87, N = 3SE +/- 0.17, N = 3SE +/- 1.65, N = 3SE +/- 0.60, N = 3498498499490499500499-march=athlon64-O3 -march=athlon64-O3 -march=athlon64-sse3-march=native-O3 -march=native-O3 -march=native -flto-O3 -march=native1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector


Phoronix Test Suite v10.8.5