GCC 9.1 Compiler Tuning Threadripper AMD znver1

AMD Ryzen Threadripper 2990WX compiler benchmarks on GCC 9.1 with Ubuntu Linux by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1905122-HV-GCC91COMP71&grr&rdt.

GCC 9.1 Compiler Tuning Threadripper AMD znver1

Configurations: -O3 -march=native; -O3 -march=athlon64-sse3; -O3 -march=athlon64; -O3 -march=native -flto; -O2 -march=native; -O2 -march=athlon64; PGO; AMD Ryzen Threadripper 2990WX 32-Core

  Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
  Motherboard: ASUS ROG ZENITH EXTREME (1701 BIOS)
  Chipset: AMD 17h
  Memory: 32768MB
  Disk: Samsung SSD 970 EVO 500GB
  Graphics: AMD Radeon RX 64 8GB (1590/800MHz)
  Audio: Realtek ALC1220
  Monitor: ASUS VP28U
  Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad
  OS: Ubuntu 18.04
  Kernel: 4.18.0-18-generic (x86_64)
  Desktop: GNOME Shell 3.28.3
  Display Server: X Server 1.20.1
  Display Driver: amdgpu 18.1.0
  OpenGL: 4.5 Mesa 18.2.8 (LLVM 7.0.0)
  Compiler: GCC 9.1.0
  File-System: ext4
  Screen Resolution: 3840x2160

Environment Details
- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- -O3 -march=athlon64-sse3: CXXFLAGS="-O3 -march=athlon64-sse3" CFLAGS="-O3 -march=athlon64-sse3"
- -O3 -march=athlon64: CXXFLAGS="-O3 -march=athlon64" CFLAGS="-O3 -march=athlon64"
- -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
- -O2 -march=native: CXXFLAGS="-O2 -march=native" CFLAGS="-O2 -march=native"
- -O2 -march=athlon64: CXXFLAGS="-O2 -march=athlon64" CFLAGS="-O2 -march=athlon64"
- PGO: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- AMD Ryzen Threadripper 2990WX 32-Core: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- --disable-multilib --enable-checking=release

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

Python Details
- Python 2.7.15rc1 + Python 3.6.7

Security Details
- __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
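As a rough illustration of the Environment Details above, the sketch below shows one way the recorded CFLAGS/CXXFLAGS pairs could be applied to a build environment. The names CONFIG_FLAGS and build_env are hypothetical and not part of the Phoronix Test Suite; the spacing between options is assumed, and the result file lists no extra flags for the PGO run, so only the recorded values appear.

```python
import os

# Flags as recorded under "Environment Details" (spacing between options assumed).
# The PGO entry only lists the base flags; any profile-related options are not
# given in the result file and are deliberately left out.
CONFIG_FLAGS = {
    "-O3 -march=native": "-O3 -march=native",
    "-O3 -march=athlon64-sse3": "-O3 -march=athlon64-sse3",
    "-O3 -march=athlon64": "-O3 -march=athlon64",
    "-O3 -march=native -flto": "-O3 -march=native -flto",
    "-O2 -march=native": "-O2 -march=native",
    "-O2 -march=athlon64": "-O2 -march=athlon64",
    "PGO": "-O3 -march=native",
}


def build_env(config, base_env=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set for `config`."""
    env = dict(os.environ if base_env is None else base_env)
    flags = CONFIG_FLAGS[config]
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    env = build_env("-O3 -march=native -flto", base_env={})
    print(env["CFLAGS"], "|", env["CXXFLAGS"])
```

The returned dictionary could be passed as the `env` argument of `subprocess.run` when invoking a build, so each configuration is compiled with its own flags without altering the parent process environment.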

[Result overview table for GCC 9.1 Compiler Tuning Threadripper AMD znver1: one row per test, one column for each of the eight configurations. Per-test values are given in the individual result sections.]

CppPerformanceBenchmarks

Test: Random Numbers

CppPerformanceBenchmarks 9 - Test: Random Numbers (Seconds, Fewer Is Better)

  -O3 -march=native: 1023
  -O3 -march=athlon64-sse3: 1055
  -O3 -march=athlon64: 1070
  -O3 -march=native -flto: 1011
  -O2 -march=native: 1041
  -O2 -march=athlon64: 1072
  PGO: 1027
  AMD Ryzen Threadripper 2990WX 32-Core: 1154

SE +/- as listed: 0.06, 0.04, 0.71, 0.05, 4.70, 0.07, 0.03 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Math Library

CppPerformanceBenchmarks 9 - Test: Math Library (Seconds, Fewer Is Better)

  -O3 -march=native: 351
  -O3 -march=athlon64-sse3: 399
  -O3 -march=athlon64: 398
  -O3 -march=native -flto: 352
  -O2 -march=native: 356
  -O2 -march=athlon64: 399
  PGO: 353
  AMD Ryzen Threadripper 2990WX 32-Core: 355

SE +/- as listed: 0.11, 0.83, 0.79, 0.73, 1.79, 0.87, 0.93 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

VP9 libvpx Encoding 1.8.0 - vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better)

  -O3 -march=native: 26.37 (SE +/- 0.13, N = 3)
  -O3 -march=athlon64-sse3: 25.58 (SE +/- 0.40, N = 3)
  -O3 -march=athlon64: 26.12 (SE +/- 0.00, N = 3)
  -O2 -march=native: 26.02 (SE +/- 0.22, N = 3)
  -O2 -march=athlon64: 25.44 (SE +/- 0.26, N = 9)
  PGO: 26.36 (SE +/- 0.02, N = 3)

(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)

  -O3 -march=native: 14927
  -O3 -march=native -flto: 14864
  -O2 -march=native: 16263
  PGO: 15287
  AMD Ryzen Threadripper 2990WX 32-Core: 15098

SE +/- as listed: 71.85, 74.46, 13.87, 152.85 (N = 3 each)
(CC) gcc options: -pthread -march=native -lm

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.0 (GFLOP/s, More Is Better)

  -O3 -march=native: 0.83
  -O3 -march=athlon64-sse3: 0.85
  -O3 -march=athlon64: 0.92
  -O3 -march=native -flto: 0.98
  -O2 -march=native: 0.81
  -O2 -march=athlon64: 0.89
  PGO: 0.91
  AMD Ryzen Threadripper 2990WX 32-Core: 0.90

SE +/- as listed: 0.03 (N = 12), 0.01 (N = 3), 0.02 (N = 15), 0.02 (N = 3), 0.02 (N = 15), 0.02 (N = 13), 0.03 (N = 12)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)

  -O3 -march=native: 18.00
  -O3 -march=athlon64-sse3: 29.53
  -O3 -march=athlon64: 29.39
  -O3 -march=native -flto: 17.85
  -O2 -march=native: 34.22
  -O2 -march=athlon64: 45.43
  PGO: 17.96
  AMD Ryzen Threadripper 2990WX 32-Core: 4024.62

SE +/- as listed: 0.03, 0.05, 0.04, 0.05, 0.12, 0.13, 0.02 (N = 3 each)
(CC) gcc options: -lm -lpthread -O3
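To make the C-Ray spread easier to compare, the sketch below normalizes each reported time against the -O3 -march=native run. The eight values are the C-Ray results reported above; the names CRAY_SECONDS and relative_to are illustrative only, and the same arithmetic would apply to any other "Fewer Is Better" test in this result file.

```python
# C-Ray 1.1 total times in seconds, as reported for each configuration.
CRAY_SECONDS = {
    "-O3 -march=native": 18.00,
    "-O3 -march=athlon64-sse3": 29.53,
    "-O3 -march=athlon64": 29.39,
    "-O3 -march=native -flto": 17.85,
    "-O2 -march=native": 34.22,
    "-O2 -march=athlon64": 45.43,
    "PGO": 17.96,
    "AMD Ryzen Threadripper 2990WX 32-Core": 4024.62,
}


def relative_to(baseline, results):
    """Divide every result by the baseline result (values > 1 mean slower)."""
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


ratios = relative_to("-O3 -march=native", CRAY_SECONDS)
for name, ratio in ratios.items():
    print(f"{name:40s} {ratio:8.2f}x")
```

For a "More Is Better" metric the ratio reads the other way around: values above 1 would indicate a faster configuration than the baseline.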

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)

  -O3 -march=native: 6600
  -O3 -march=athlon64-sse3: 4463
  -O3 -march=athlon64: 4361
  -O3 -march=native -flto: 7019
  -O2 -march=native: 6335
  -O2 -march=athlon64: 4342
  PGO: 6717
  AMD Ryzen Threadripper 2990WX 32-Core: 6411

SE +/- as listed: 51.66, 1.73, 63.69, 8.84, 34.65, 28.80, 14.46 (N = 3 each)
(CC) gcc options: -pthread -march=native -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)

  -O3 -march=native: 16339
  -O3 -march=athlon64-sse3: 16252
  -O3 -march=athlon64: 15616
  -O3 -march=native -flto: 14989
  -O2 -march=native: 13558
  -O2 -march=athlon64: 5098
  PGO: 6281
  AMD Ryzen Threadripper 2990WX 32-Core: 16472

SE +/- as listed: 98.94 (N = 3), 51.55 (N = 3), 247.31 (N = 3), 164.04 (N = 7), 279.82 (N = 15), 65.89 (N = 4), 93.40 (N = 4)
(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB (MiB/s, More Is Better)

  -O3 -march=native: 6685
  -O3 -march=athlon64-sse3: 6681
  -O3 -march=athlon64: 6755
  -O3 -march=native -flto: 6721
  -O2 -march=native: 6778
  -O2 -march=athlon64: 6526
  PGO: 6753
  AMD Ryzen Threadripper 2990WX 32-Core: 6596

SE +/- as listed: 27.83, 10.14, 7.74, 96.68, 38.61, 60.66, 26.90 (N = 3 each)
(CC) gcc options: -O3 -march=native

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 6.0.1 - Time To Compile (Seconds, Fewer Is Better)

  -O3 -march=native: 221
  -O3 -march=athlon64-sse3: 224
  -O3 -march=athlon64: 225
  -O3 -march=native -flto: 951
  -O2 -march=native: 221
  -O2 -march=athlon64: 229
  AMD Ryzen Threadripper 2990WX 32-Core: 290

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

XZ Compression 5.2.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better)

  -O3 -march=native: 25.70
  -O3 -march=athlon64-sse3: 25.93
  -O3 -march=athlon64: 26.50
  -O3 -march=native -flto: 25.62
  -O2 -march=native: 26.78
  -O2 -march=athlon64: 26.75
  PGO: 26.09
  AMD Ryzen Threadripper 2990WX 32-Core: 708.15

SE +/- as listed: 0.26 (N = 15), 0.22 (N = 12), 0.27 (N = 3), 0.23 (N = 15), 0.10 (N = 3), 0.21 (N = 15), 0.07 (N = 3)
(CC) gcc options: -pthread -fvisibility=hidden

AOM AV1

AV1 Video Encoding

AOM AV1 2019-02-11 - AV1 Video Encoding (Frames Per Second, More Is Better)

  -O3 -march=native: 0.22
  -O3 -march=athlon64-sse3: 0.21
  -O3 -march=athlon64: 0.20
  -O3 -march=native -flto: 0.22
  -O2 -march=native: 0.21
  -O2 -march=athlon64: 0.20
  PGO: 0.22
  AMD Ryzen Threadripper 2990WX 32-Core: 0.08

SE +/- as listed: 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

Zstd Compression 1.3.4 - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better)

  -O3 -march=native: 19.09
  -O3 -march=athlon64-sse3: 18.38
  -O3 -march=athlon64: 17.74
  -O3 -march=native -flto: 17.94
  -O2 -march=native: 19.31
  -O2 -march=athlon64: 18.02
  PGO: 17.36
  AMD Ryzen Threadripper 2990WX 32-Core: 265.79

SE +/- as listed: 0.65 (N = 15), 0.66 (N = 15), 0.69 (N = 12), 0.66 (N = 15), 0.61 (N = 15), 0.60 (N = 15), 0.45 (N = 15)
(CC) gcc options: -pthread -lz -llzma

Memcached mcperf

Method: Add

Memcached mcperf 1.5.10 - Method: Add (Operations Per Second, More Is Better)

  -O3 -march=native: 44903
  -O3 -march=athlon64-sse3: 34822
  -O3 -march=athlon64: 53106
  -O3 -march=native -flto: 50055
  -O2 -march=native: 34138
  -O2 -march=athlon64: 54600
  PGO: 47774
  AMD Ryzen Threadripper 2990WX 32-Core: 35174

SE +/- as listed: 1053.86 (N = 15), 121.25 (N = 3), 2252.95 (N = 15), 2185.23 (N = 15), 125.83 (N = 3), 2866.90 (N = 12), 2440.20 (N = 12)
(CC) gcc options: -lm -rdynamic

MBW

Test: Memory Copy - Array Size: 8192 MiB

MBW 2018-09-08 - Test: Memory Copy - Array Size: 8192 MiB (MiB/s, More Is Better)

  -O3 -march=native: 12851
  -O3 -march=athlon64-sse3: 12699
  -O3 -march=athlon64: 12930
  -O3 -march=native -flto: 13176
  -O2 -march=native: 12795
  -O2 -march=athlon64: 12632
  PGO: 12406
  AMD Ryzen Threadripper 2990WX 32-Core: 12618

SE +/- as listed: 169.00 (N = 3), 217.78 (N = 3), 162.89 (N = 5), 153.84 (N = 3), 167.95 (N = 5), 152.24 (N = 5), 61.39 (N = 3)
(CC) gcc options: -O3 -march=native

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 10.3 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)

  -O3 -march=native: 466723
  -O3 -march=athlon64-sse3: 459596
  -O3 -march=athlon64: 463840
  -O3 -march=native -flto: 473620
  -O2 -march=native: 458551
  -O2 -march=athlon64: 453605
  PGO: 460414
  AMD Ryzen Threadripper 2990WX 32-Core: 259058

SE +/- as listed: 1148.20 (N = 3), 6062.49 (N = 4), 6987.31 (N = 3), 1547.77 (N = 3), 3403.88 (N = 3), 2874.20 (N = 3), 7464.16 (N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm

Memcached mcperf

Method: Set

Memcached mcperf 1.5.10 - Method: Set (Operations Per Second, More Is Better)

  -O3 -march=native: 38646
  -O3 -march=athlon64-sse3: 48548
  -O3 -march=athlon64: 34880
  -O3 -march=native -flto: 43570
  -O2 -march=native: 34446
  -O2 -march=athlon64: 59924
  PGO: 46396
  AMD Ryzen Threadripper 2990WX 32-Core: 35239

SE +/- as listed: 877.57 (N = 15), 2384.39 (N = 15), 44.53 (N = 3), 195.82 (N = 3), 67.69 (N = 3), 2193.95 (N = 15), 1474.59 (N = 12)
(CC) gcc options: -lm -rdynamic

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.9.9 - Static Web Page Serving (Requests Per Second, More Is Better)

  -O3 -march=native: 29274 (SE +/- 401.04, N = 4)
  -O3 -march=athlon64-sse3: 29281 (SE +/- 428.25, N = 3)
  -O3 -march=athlon64: 29726 (SE +/- 169.85, N = 3)
  -O3 -march=native -flto: 27352 (SE +/- 43.76, N = 3)
  -O2 -march=native: 27834 (SE +/- 166.84, N = 3)
  -O2 -march=athlon64: 29704 (SE +/- 219.17, N = 3)

(CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Stockfish

Total Time

Stockfish 9 - Total Time (Nodes Per Second, More Is Better)

  -O3 -march=native: 68200164 (SE +/- 954655.13, N = 3)
  -O3 -march=athlon64-sse3: 67571150 (SE +/- 443906.77, N = 3)
  -O3 -march=athlon64: 67513602 (SE +/- 779150.59, N = 3)
  -O3 -march=native -flto: 67450689 (SE +/- 385955.10, N = 3)
  -O2 -march=native: 66697487 (SE +/- 526520.80, N = 3)
  -O2 -march=athlon64: 66890687 (SE +/- 226981.30, N = 3)
  PGO: 67841877 (SE +/- 458502.78, N = 3)

(CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

CppPerformanceBenchmarks

Test: Stepanov Vector

CppPerformanceBenchmarks 9 - Test: Stepanov Vector (Seconds, Fewer Is Better)

  -O3 -march=native: 74.93
  -O3 -march=athlon64-sse3: 75.56
  -O3 -march=athlon64: 75.53
  -O3 -march=native -flto: 74.92
  -O2 -march=native: 75.99
  -O2 -march=athlon64: 75.79
  PGO: 75.27
  AMD Ryzen Threadripper 2990WX 32-Core: 78.14

SE +/- as listed: 0.01, 0.01, 0.00, 0.01, 0.08, 0.05, 0.02 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

CppPerformanceBenchmarks

Test: Atol

CppPerformanceBenchmarks 9 - Test: Atol (Seconds, Fewer Is Better)

  -O3 -march=native: 69.03
  -O3 -march=athlon64-sse3: 68.42
  -O3 -march=athlon64: 68.37
  -O3 -march=native -flto: 69.25
  -O2 -march=native: 69.80
  -O2 -march=athlon64: 69.08
  PGO: 69.31
  AMD Ryzen Threadripper 2990WX 32-Core: 69.25

SE +/- as listed: 0.05, 0.38, 0.39, 0.07, 0.13, 0.05, 0.17 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.30 - Operation: Sharpen (Iterations Per Minute, More Is Better)

  -O3 -march=native: 219
  -O3 -march=athlon64-sse3: 191
  -O3 -march=athlon64: 190
  -O3 -march=native -flto: 221
  -O2 -march=native: 217
  -O2 -march=athlon64: 192
  PGO: 220
  AMD Ryzen Threadripper 2990WX 32-Core: 4

SE +/- as listed: 0.67, 0.67, 0.88, 0.33, 0.88 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.30 - Operation: Resizing (Iterations Per Minute, More Is Better)

  -O3 -march=native: 243
  -O3 -march=athlon64-sse3: 231
  -O3 -march=athlon64: 233
  -O3 -march=native -flto: 240
  -O2 -march=native: 238
  -O2 -march=athlon64: 235
  PGO: 243
  AMD Ryzen Threadripper 2990WX 32-Core: 31

SE +/- as listed: 1.00, 0.88, 1.15, 0.67 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.30 - Operation: Enhanced (Iterations Per Minute, More Is Better)

  -O3 -march=native: 232
  -O3 -march=athlon64-sse3: 194
  -O3 -march=athlon64: 195
  -O3 -march=native -flto: 233
  -O2 -march=native: 231
  -O2 -march=athlon64: 198
  PGO: 234
  AMD Ryzen Threadripper 2990WX 32-Core: 8

SE +/- as listed: 0.58, 0.58, 0.88 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.30 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)

  -O3 -march=native: 204
  -O3 -march=athlon64-sse3: 198
  -O3 -march=athlon64: 199
  -O3 -march=native -flto: 203
  -O2 -march=native: 200
  -O2 -march=athlon64: 202
  PGO: 203
  AMD Ryzen Threadripper 2990WX 32-Core: 40

SE +/- as listed: 0.67, 0.88, 0.67, 1.53, 0.67 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.30 - Operation: Swirl (Iterations Per Minute, More Is Better)

  -O3 -march=native: 247
  -O3 -march=athlon64-sse3: 237
  -O3 -march=athlon64: 237
  -O3 -march=native -flto: 250
  -O2 -march=native: 245
  -O2 -march=athlon64: 244
  PGO: 250
  AMD Ryzen Threadripper 2990WX 32-Core: 26

SE +/- as listed: 0.58, 1.00, 0.33 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.30 - Operation: Rotate (Iterations Per Minute, More Is Better)

  -O3 -march=native: 249
  -O3 -march=athlon64-sse3: 240
  -O3 -march=athlon64: 238
  -O3 -march=native -flto: 248
  -O2 -march=native: 249
  -O2 -march=athlon64: 243
  PGO: 251
  AMD Ryzen Threadripper 2990WX 32-Core: 175

SE +/- as listed: 0.67, 0.33 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.30 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)

  -O3 -march=native: 272
  -O3 -march=athlon64-sse3: 263
  -O3 -march=athlon64: 261
  -O3 -march=native -flto: 274
  -O2 -march=native: 271
  -O2 -march=athlon64: 272
  PGO: 274
  AMD Ryzen Threadripper 2990WX 32-Core: 59

SE +/- as listed: 1.33, 0.33 (N = 3 each)
(CC) gcc options: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - Poisson Pressure Solver (MFLOPS, More Is Better)

  -O3 -march=native: 1313
  -O3 -march=athlon64-sse3: 1316
  -O3 -march=athlon64: 1316
  -O3 -march=native -flto: 1304
  -O2 -march=native: 1319
  -O2 -march=athlon64: 1328
  PGO: 1321
  AMD Ryzen Threadripper 2990WX 32-Core: 1322

SE +/- as listed: 1.73, 4.00, 3.98, 1.86, 3.88, 3.65, 3.47 (N = 3 each)
(CC) gcc options: -O3 -mavx2

Memcached mcperf

Method: Prepend

Memcached mcperf 1.5.10 - Method: Prepend (Operations Per Second, More Is Better)

  -O3 -march=native: 43824
  -O3 -march=athlon64-sse3: 45691
  -O3 -march=athlon64: 35968
  -O3 -march=native -flto: 45587
  -O2 -march=native: 35552
  -O2 -march=athlon64: 35467
  PGO: 47085
  AMD Ryzen Threadripper 2990WX 32-Core: 35579

SE +/- as listed: 2300.04 (N = 15), 116.90 (N = 3), 58.44 (N = 3), 179.60 (N = 3), 186.47 (N = 3), 109.24 (N = 3), 863.93 (N = 12)
(CC) gcc options: -lm -rdynamic

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.1.9 - Time To Compile (Seconds, Fewer Is Better)

  -O3 -march=native: 63.25
  -O3 -march=athlon64-sse3: 62.93
  -O3 -march=athlon64: 63.19
  -O2 -march=native: 44.44
  -O2 -march=athlon64: 43.86
  PGO: 63.01
  AMD Ryzen Threadripper 2990WX 32-Core: 85.40

SE +/- as listed: 0.08, 0.17, 0.32, 0.07, 0.20, 0.10 (N = 3 each)
(CC) gcc options: -pedantic -ldl -lz -lm

Memcached mcperf

Method: Append

Memcached mcperf 1.5.10 - Method: Append (Operations Per Second, More Is Better)

  -O3 -march=native: 42455
  -O3 -march=athlon64-sse3: 45716
  -O3 -march=athlon64: 56172
  -O3 -march=native -flto: 45339
  -O2 -march=native: 35058
  -O2 -march=athlon64: 61219
  PGO: 46106
  AMD Ryzen Threadripper 2990WX 32-Core: 35920

SE +/- as listed: 256.24 (N = 3), 220.19 (N = 3), 3045.08 (N = 15), 218.35 (N = 3), 168.27 (N = 3), 2658.53 (N = 15), 123.87 (N = 3)
(CC) gcc options: -lm -rdynamic

t-test1

Threads: 1

t-test1 2017-01-13 - Threads: 1 (Seconds, Fewer Is Better)

  -O3 -march=native: 26.23
  -O3 -march=athlon64-sse3: 27.12
  -O3 -march=athlon64: 29.66
  -O3 -march=native -flto: 26.31
  -O2 -march=native: 26.07
  -O2 -march=athlon64: 25.60
  PGO: 28.74
  AMD Ryzen Threadripper 2990WX 32-Core: 27.83

SE +/- as listed: 0.07 (N = 3), 0.34 (N = 15), 0.29 (N = 9), 0.16 (N = 3), 0.33 (N = 3), 0.10 (N = 3), 0.39 (N = 3)
(CC) gcc options: -pthread

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)

  -O3 -march=native: 39.11
  -O3 -march=athlon64-sse3: 44.98
  -O3 -march=athlon64: 45.20
  -O3 -march=native -flto: 39.92
  -O2 -march=native: 42.34
  -O2 -march=athlon64: 47.28
  PGO: 39.10
  AMD Ryzen Threadripper 2990WX 32-Core: 43.36

SE +/- as listed: 0.01, 0.00, 0.01, 0.03, 0.03, 0.01, 0.01 (N = 3 each)
(CC) gcc options: -lm -O3

Memcached mcperf

Method: Replace

Memcached mcperf 1.5.10 - Method: Replace (Operations Per Second, More Is Better)

  -O3 -march=native: 53865
  -O3 -march=athlon64-sse3: 45707
  -O3 -march=athlon64: 36231
  -O3 -march=native -flto: 45591
  -O2 -march=native: 35486
  -O2 -march=athlon64: 35829
  PGO: 45956
  AMD Ryzen Threadripper 2990WX 32-Core: 35978

SE +/- as listed: 1741.69 (N = 12), 85.02 (N = 3), 365.15 (N = 3), 174.58 (N = 3), 167.62 (N = 3), 132.68 (N = 3), 168.47 (N = 3)
(CC) gcc options: -lm -rdynamic

SciMark

Computational Test: Composite

SciMark 2.0 - Computational Test: Composite (Mflops, More Is Better)

  -O3 -march=native: 2514
  -O3 -march=athlon64-sse3: 2039
  -O3 -march=athlon64: 2021
  -O3 -march=native -flto: 2543
  -O2 -march=native: 1981
  -O2 -march=athlon64: 1782
  PGO: 2555
  AMD Ryzen Threadripper 2990WX 32-Core: 2257

SE +/- as listed: 26.08 (N = 8), 0.49 (N = 3), 25.27 (N = 5), 32.19 (N = 3), 17.63 (N = 3), 4.62 (N = 3), 2.91 (N = 3)
(CC) gcc options: -lm

CppPerformanceBenchmarks

Test: Ctype

CppPerformanceBenchmarks 9 - Test: Ctype (Seconds, Fewer Is Better)

  -O3 -march=native: 33.88
  -O3 -march=athlon64-sse3: 33.31
  -O3 -march=athlon64: 33.28
  -O3 -march=native -flto: 32.32
  -O2 -march=native: 34.50
  -O2 -march=athlon64: 33.37
  PGO: 34.02
  AMD Ryzen Threadripper 2990WX 32-Core: 37.67

SE +/- as listed: 0.00, 0.02, 0.02, 0.01, 0.08, 0.00, 0.01 (N = 3 each)
(CXX) g++ options: -O3 -std=c++11

LuaJIT

Test: Composite

LuaJIT 2.1-git - Test: Composite (Mflops, More Is Better)

  -O3 -march=native: 1495 (SE +/- 0.72, N = 3)
  -O3 -march=athlon64-sse3: 1496 (SE +/- 1.46, N = 3)
  -O3 -march=athlon64: 1489 (SE +/- 4.29, N = 3)
  -O3 -march=native -flto: 1497 (SE +/- 0.43, N = 3)
  -O2 -march=native: 1467 (SE +/- 10.67, N = 3)
  -O2 -march=athlon64: 1491 (SE +/- 4.50, N = 3)
  PGO: 1499 (SE +/- 1.60, N = 3)

(CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0 - Time To Compile (Seconds, Fewer Is Better)

  -O3 -march=native: 19.15
  -O3 -march=athlon64-sse3: 19.73
  -O3 -march=athlon64: 19.77
  -O3 -march=native -flto: 88.89
  -O2 -march=native: 17.58
  -O2 -march=athlon64: 17.41
  PGO: 19.09
  AMD Ryzen Threadripper 2990WX 32-Core: 19.69

SE +/- as listed: 0.25 (N = 3), 0.03 (N = 3), 0.23 (N = 5), 1.39 (N = 3), 0.24 (N = 3), 0.21 (N = 3), 0.07 (N = 3)

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 - Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better)

  -O3 -march=native: 3.87
  -O3 -march=athlon64-sse3: 4.54
  -O3 -march=athlon64: 4.51
  -O3 -march=native -flto: 3.85
  -O2 -march=native: 3.89
  -O2 -march=athlon64: 4.57
  PGO: 3.83
  AMD Ryzen Threadripper 2990WX 32-Core: 564.07

SE +/- as listed: 0.03, 0.02, 0.03, 0.01, 0.01, 0.03, 0.04 (N = 3 each)
(CXX) g++ options: -fopenmp -O3

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

SVT-VP9 2019-02-17 - 1080p 8-bit YUV To VP9 Video Encode (Frames Per Second, More Is Better)

  -O3 -march=native: 102.91
  -O3 -march=athlon64-sse3: 97.81
  -O3 -march=athlon64: 103.45
  -O3 -march=native -flto: 104.42
  -O2 -march=native: 100.82
  -O2 -march=athlon64: 101.13
  PGO: 103.97
  AMD Ryzen Threadripper 2990WX 32-Core: 5.07

SE +/- as listed: 1.13 (N = 15), 0.78 (N = 3), 1.26 (N = 15), 1.19 (N = 15), 0.82 (N = 3), 0.98 (N = 15), 0.96 (N = 15)
(CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm

CppPerformanceBenchmarks

Test: Stepanov Abstraction

CppPerformanceBenchmarks 9 (Seconds, Fewer Is Better)
-O3 -march=native: 28.22 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 28.28 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 28.26 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 28.25 (SE +/- 0.00, N = 3)
-O2 -march=native: 28.69 (SE +/- 0.13, N = 3)
-O2 -march=athlon64: 28.33 (SE +/- 0.02, N = 3)
PGO: 28.36 (SE +/- 0.02, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 29.08
1. (CXX) g++ options: -O3 -std=c++11

Memcached mcperf

Method: Delete

Memcached mcperf 1.5.10 (Operations Per Second, More Is Better)
-O3 -march=native: 58969 (SE +/- 772.91, N = 4)
-O3 -march=athlon64-sse3: 69509 (SE +/- 914.89, N = 4)
-O3 -march=athlon64: 56822 (SE +/- 204.60, N = 3)
-O3 -march=native -flto: 68696 (SE +/- 220.33, N = 3)
-O2 -march=native: 56141 (SE +/- 304.30, N = 3)
-O2 -march=athlon64: 55747 (SE +/- 274.96, N = 3)
PGO: 68980 (SE +/- 370.63, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 56797
1. (CC) gcc options: -lm -rdynamic

Redis

Test: LPOP

Redis 4.0.8 (Requests Per Second, More Is Better)
-O3 -march=native: 2573823 (SE +/- 34288.12, N = 3)
-O3 -march=athlon64-sse3: 2506276 (SE +/- 3626.56, N = 3)
-O3 -march=athlon64: 2544762 (SE +/- 25886.44, N = 8)
-O3 -march=native -flto: 2616703 (SE +/- 24550.99, N = 9)
-O2 -march=native: 2340595 (SE +/- 24290.24, N = 3)
-O2 -march=athlon64: 2654025 (SE +/- 41423.26, N = 15)
PGO: 2648345 (SE +/- 25803.36, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Memcached mcperf

Method: Get

Memcached mcperf 1.5.10 (Operations Per Second, More Is Better)
-O3 -march=native: 57652 (SE +/- 812.95, N = 3)
-O3 -march=athlon64-sse3: 69004 (SE +/- 286.25, N = 3)
-O3 -march=athlon64: 56837 (SE +/- 29.60, N = 3)
-O3 -march=native -flto: 68644 (SE +/- 66.15, N = 3)
-O2 -march=native: 55791 (SE +/- 125.71, N = 3)
-O2 -march=athlon64: 55647 (SE +/- 339.02, N = 3)
PGO: 68426 (SE +/- 734.34, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 56324
1. (CC) gcc options: -lm -rdynamic

t-test1

Threads: 2

t-test1 2017-01-13 (Seconds, Fewer Is Better)
-O3 -march=native: 9.01 (SE +/- 0.09, N = 15)
-O3 -march=athlon64-sse3: 8.82 (SE +/- 0.10, N = 3)
-O3 -march=athlon64: 9.59 (SE +/- 0.10, N = 15)
-O3 -march=native -flto: 8.89 (SE +/- 0.08, N = 10)
-O2 -march=native: 9.40 (SE +/- 0.12, N = 3)
-O2 -march=athlon64: 8.68 (SE +/- 0.10, N = 3)
PGO: 9.41 (SE +/- 0.12, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 14.05
1. (CC) gcc options: -pthread

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 (Signs Per Second, More Is Better)
-O3 -march=native: 5825 (SE +/- 3.74, N = 3)
-O3 -march=athlon64-sse3: 5838 (SE +/- 1.25, N = 3)
-O3 -march=athlon64: 5837 (SE +/- 1.40, N = 3)
-O3 -march=native -flto: 5833 (SE +/- 2.44, N = 3)
-O2 -march=native: 5831 (SE +/- 1.56, N = 3)
-O2 -march=athlon64: 5832 (SE +/- 0.92, N = 3)
PGO: 5830 (SE +/- 4.62, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 5791
1. (CC) gcc options: -pthread -m64 -lcrypto -ldl

x265

H.265 1080p Video Encoding

x265 3.0 (Frames Per Second, More Is Better)
-O3 -march=native: 33.76 (SE +/- 0.09, N = 3)
-O3 -march=athlon64-sse3: 33.88 (SE +/- 0.06, N = 3)
-O3 -march=athlon64: 33.89 (SE +/- 0.04, N = 3)
-O3 -march=native -flto: 33.68 (SE +/- 0.04, N = 3)
-O2 -march=native: 33.44 (SE +/- 0.09, N = 3)
-O2 -march=athlon64: 33.53 (SE +/- 0.06, N = 3)
PGO: 33.79 (SE +/- 0.07, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 9.95
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 (Seconds, Fewer Is Better)
-O3 -march=native: 9.53 (SE +/- 0.01, N = 5)
-O3 -march=athlon64-sse3: 15.43 (SE +/- 0.01, N = 5)
-O3 -march=athlon64: 15.48 (SE +/- 0.04, N = 5)
-O3 -march=native -flto: 9.48 (SE +/- 0.06, N = 5)
-O2 -march=native: 9.78 (SE +/- 0.05, N = 5)
-O2 -march=athlon64: 15.58 (SE +/- 0.01, N = 5)
PGO: 9.44 (SE +/- 0.00, N = 5)
AMD Ryzen Threadripper 2990WX 32-Core: 9.81
1. (CXX) g++ options: -fvisibility=hidden -lm

Redis

Test: SET

Redis 4.0.8 (Requests Per Second, More Is Better)
-O3 -march=native: 1807270 (SE +/- 6043.61, N = 3)
-O3 -march=athlon64-sse3: 1755590 (SE +/- 12476.39, N = 3)
-O3 -march=athlon64: 1840806 (SE +/- 16989.46, N = 3)
-O3 -march=native -flto: 1755472 (SE +/- 7219.96, N = 3)
-O2 -march=native: 1744979 (SE +/- 28790.51, N = 15)
-O2 -march=athlon64: 1730702 (SE +/- 22619.48, N = 3)
PGO: 1798800 (SE +/- 14705.69, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

Redis 4.0.8 (Requests Per Second, More Is Better)
-O3 -march=native: 2046557 (SE +/- 13234.25, N = 3)
-O3 -march=athlon64-sse3: 2032921 (SE +/- 24795.31, N = 5)
-O3 -march=athlon64: 2051129 (SE +/- 23560.29, N = 3)
-O3 -march=native -flto: 2084089 (SE +/- 27875.47, N = 3)
-O2 -march=native: 2000709 (SE +/- 26134.11, N = 12)
-O2 -march=athlon64: 1975322 (SE +/- 18375.04, N = 3)
PGO: 2083430 (SE +/- 10024.22, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

Redis 4.0.8 (Requests Per Second, More Is Better)
-O3 -march=native: 2502182 (SE +/- 11016.75, N = 3)
-O3 -march=athlon64-sse3: 2461725 (SE +/- 28994.69, N = 3)
-O3 -march=athlon64: 2540398 (SE +/- 15021.00, N = 3)
-O3 -march=native -flto: 2509433 (SE +/- 36774.66, N = 3)
-O2 -march=native: 2275019 (SE +/- 25441.63, N = 3)
-O2 -march=athlon64: 2384143 (SE +/- 39327.40, N = 3)
PGO: 2533027 (SE +/- 26635.52, N = 12)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

CppPerformanceBenchmarks

Test: Function Objects

CppPerformanceBenchmarks 9 (Seconds, Fewer Is Better)
-O3 -march=native: 15.40 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 16.14 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 16.17 (SE +/- 0.02, N = 3)
-O3 -march=native -flto: 15.50 (SE +/- 0.00, N = 3)
-O2 -march=native: 15.61 (SE +/- 0.01, N = 3)
-O2 -march=athlon64: 16.20 (SE +/- 0.01, N = 3)
PGO: 15.46 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 22.07
1. (CXX) g++ options: -O3 -std=c++11

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

MKL-DNN 2019-04-16 (ms, Fewer Is Better)
-O3 -march=native: 71.26 (SE +/- 0.40, N = 3; MIN: 70.22)
-O3 -march=athlon64-sse3: 71.02 (SE +/- 0.04, N = 3; MIN: 70.33)
-O3 -march=athlon64: 70.94 (SE +/- 0.14, N = 3; MIN: 70.24)
-O2 -march=native: 72.08 (SE +/- 0.24, N = 3; MIN: 70.86)
-O2 -march=athlon64: 71.98 (SE +/- 0.09, N = 3; MIN: 70.78)
PGO: 70.94 (SE +/- 0.06, N = 3; MIN: 70.39)
AMD Ryzen Threadripper 2990WX 32-Core: 71.00 (MIN: 70.47)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

SVT-AV1 2019-03-07 (Frames Per Second, More Is Better)
-O3 -march=native: 20.27 (SE +/- 0.05, N = 3)
-O3 -march=athlon64-sse3: 18.66 (SE +/- 0.09, N = 3)
-O3 -march=athlon64: 18.84 (SE +/- 0.11, N = 3)
-O3 -march=native -flto: 20.41 (SE +/- 0.08, N = 3)
-O2 -march=native: 18.63 (SE +/- 0.08, N = 3)
-O2 -march=athlon64: 18.51 (SE +/- 0.17, N = 3)
PGO: 19.57 (SE +/- 0.10, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 18.77
1. (CXX) g++ options: -O3 -pie -lpthread -lm

Redis

Test: LPUSH

Redis 4.0.8 (Requests Per Second, More Is Better)
-O3 -march=native: 1540257 (SE +/- 12857.88, N = 3)
-O3 -march=athlon64-sse3: 1522952 (SE +/- 9157.67, N = 3)
-O3 -march=athlon64: 1520276 (SE +/- 19988.57, N = 3)
-O3 -march=native -flto: 1549647 (SE +/- 6810.29, N = 3)
-O2 -march=native: 1463388 (SE +/- 22581.44, N = 3)
-O2 -march=athlon64: 1548556 (SE +/- 20493.00, N = 5)
PGO: 1532038 (SE +/- 22070.89, N = 3)
1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

SVT-HEVC 2019-02-03 (Frames Per Second, More Is Better)
-O3 -march=native: 165.00 (SE +/- 1.96, N = 3)
-O3 -march=athlon64-sse3: 166.00 (SE +/- 0.65, N = 3)
-O3 -march=athlon64: 168.00 (SE +/- 2.41, N = 3)
-O3 -march=native -flto: 165.00 (SE +/- 2.04, N = 3)
-O2 -march=native: 163.00 (SE +/- 3.04, N = 12)
-O2 -march=athlon64: 172.00 (SE +/- 4.80, N = 15)
PGO: 185.00 (SE +/- 6.47, N = 15)
AMD Ryzen Threadripper 2990WX 32-Core: 23.01
1. (CC) gcc options: -march=native -fPIE -fPIC -O2 -flto -fvisibility=hidden -pie -rdynamic -lpthread -lrt

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 7.392 (Seconds, Fewer Is Better)
-O3 -march=native: 2.69 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 2.55 (SE +/- 0.03, N = 15)
-O3 -march=athlon64: 2.58 (SE +/- 0.02, N = 15)
-O3 -march=native -flto: 2.65 (SE +/- 0.03, N = 15)
-O2 -march=native: 2.58 (SE +/- 0.06, N = 15)
-O2 -march=athlon64: 2.60 (SE +/- 0.05, N = 15)
PGO: 2.63 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 2.74
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 (Seconds, Fewer Is Better)
-O3 -march=native: 8.00 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 9.36 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 9.43 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 7.98 (SE +/- 0.00, N = 3)
-O2 -march=native: 10.80 (SE +/- 0.06, N = 3)
-O2 -march=athlon64: 11.41 (SE +/- 0.04, N = 3)
PGO: 7.98 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 11.09
1. (CC) gcc options: -lm

x264

H.264 Video Encoding

x264 2018-09-25 (Frames Per Second, More Is Better)
-O3 -march=native: 147 (SE +/- 1.46, N = 9)
-O3 -march=athlon64-sse3: 146 (SE +/- 1.95, N = 5)
-O3 -march=athlon64: 146 (SE +/- 0.81, N = 3)
-O2 -march=native: 143 (SE +/- 1.95, N = 3)
-O2 -march=athlon64: 146 (SE +/- 1.63, N = 7)
PGO: 145 (SE +/- 0.95, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Bullet Physics Engine

Test: Raytests

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 2.35 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 2.46 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 2.46 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 2.30 (SE +/- 0.00, N = 3)
-O2 -march=native: 2.38 (SE +/- 0.00, N = 3)
-O2 -march=athlon64: 2.49 (SE +/- 0.02, N = 3)
PGO: 2.34 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

AI Chess Performance

TSCP 1.81 (Nodes Per Second, More Is Better)
-O3 -march=native: 1116747 (SE +/- 1107.42, N = 5)
-O3 -march=athlon64-sse3: 1115841 (SE +/- 903.00, N = 5)
-O3 -march=athlon64: 1115390 (SE +/- 1105.70, N = 5)
-O3 -march=native -flto: 1135626 (SE +/- 740.45, N = 5)
-O2 -march=native: 1102013 (SE +/- 2140.82, N = 5)
-O2 -march=athlon64: 1094211 (SE +/- 5126.15, N = 5)
PGO: 1109114 (SE +/- 2184.70, N = 5)
AMD Ryzen Threadripper 2990WX 32-Core: 981778
1. (CC) gcc options: -O3 -march=native

ctx_clock

Context Switch Time

ctx_clock (Clocks, Fewer Is Better)
-O3 -march=native: 150
-O3 -march=athlon64-sse3: 150
-O3 -march=athlon64: 150
-O3 -march=native -flto: 150
-O2 -march=native: 150
-O2 -march=athlon64: 150
PGO: 150
AMD Ryzen Threadripper 2990WX 32-Core: 150

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 1.00 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 1.06 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 1.06 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 0.97 (SE +/- 0.00, N = 3)
-O2 -march=native: 1.02 (SE +/- 0.00, N = 3)
-O2 -march=athlon64: 1.07 (SE +/- 0.01, N = 3)
PGO: 1.00 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 0.84 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 0.86 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 0.86 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 0.82 (SE +/- 0.00, N = 3)
-O2 -march=native: 0.84 (SE +/- 0.00, N = 3)
-O2 -march=athlon64: 0.87 (SE +/- 0.01, N = 3)
PGO: 0.83 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 2.30 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 2.35 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 2.36 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 2.44 (SE +/- 0.00, N = 3)
-O2 -march=native: 2.39 (SE +/- 0.03, N = 3)
-O2 -march=athlon64: 2.42 (SE +/- 0.03, N = 3)
PGO: 2.32 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 3.97 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 4.45 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 4.45 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 3.90 (SE +/- 0.00, N = 3)
-O2 -march=native: 4.06 (SE +/- 0.01, N = 3)
-O2 -march=athlon64: 4.50 (SE +/- 0.04, N = 3)
PGO: 3.97 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 4.42 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 4.62 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 4.65 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 4.82 (SE +/- 0.01, N = 3)
-O2 -march=native: 4.57 (SE +/- 0.03, N = 3)
-O2 -march=athlon64: 4.68 (SE +/- 0.04, N = 3)
PGO: 4.41 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 (Seconds, Fewer Is Better)
-O3 -march=native: 3.84 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 3.91 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 3.92 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 3.97 (SE +/- 0.01, N = 3)
-O2 -march=native: 3.95 (SE +/- 0.02, N = 3)
-O2 -march=athlon64: 3.96 (SE +/- 0.03, N = 3)
PGO: 3.83 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 (Mflops, More Is Better)
-O3 -march=native: 2218 (SE +/- 0.57, N = 3)
-O3 -march=athlon64-sse3: 1842 (SE +/- 0.37, N = 3)
-O3 -march=athlon64: 1842 (SE +/- 0.24, N = 3)
-O3 -march=native -flto: 2202 (SE +/- 0.48, N = 3)
-O2 -march=native: 1306 (SE +/- 7.73, N = 3)
-O2 -march=athlon64: 1186 (SE +/- 5.74, N = 3)
PGO: 2208 (SE +/- 0.39, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 2190
1. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 (Mflops, More Is Better)
-O3 -march=native: 5989 (SE +/- 352.63, N = 3)
-O3 -march=athlon64-sse3: 4239 (SE +/- 4.78, N = 3)
-O3 -march=athlon64: 4274 (SE +/- 1.14, N = 3)
-O3 -march=native -flto: 5388 (SE +/- 139.92, N = 3)
-O2 -march=native: 4507 (SE +/- 35.70, N = 3)
-O2 -march=athlon64: 3593 (SE +/- 15.82, N = 3)
PGO: 6356 (SE +/- 11.42, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 5429
1. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, More Is Better)
-O3 -march=native: 3174 (SE +/- 37.52, N = 3)
-O3 -march=athlon64-sse3: 3082 (SE +/- 4.22, N = 3)
-O3 -march=athlon64: 2874 (SE +/- 208.10, N = 3)
-O3 -march=native -flto: 2951 (SE +/- 20.52, N = 3)
-O2 -march=native: 3105 (SE +/- 52.49, N = 3)
-O2 -march=athlon64: 3119 (SE +/- 7.39, N = 3)
PGO: 3220 (SE +/- 9.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 3153
1. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 (Mflops, More Is Better)
-O3 -march=native: 270 (SE +/- 0.11, N = 3)
-O3 -march=athlon64-sse3: 294 (SE +/- 0.67, N = 3)
-O3 -march=athlon64: 294 (SE +/- 0.13, N = 3)
-O3 -march=native -flto: 270 (SE +/- 0.17, N = 3)
-O2 -march=native: 265 (SE +/- 1.28, N = 3)
-O2 -march=athlon64: 291 (SE +/- 0.93, N = 3)
PGO: 261 (SE +/- 0.21, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 260
1. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, More Is Better)
-O3 -march=native: 732 (SE +/- 0.10, N = 3)
-O3 -march=athlon64-sse3: 737 (SE +/- 0.20, N = 3)
-O3 -march=athlon64: 736 (SE +/- 0.82, N = 3)
-O3 -march=native -flto: 1904 (SE +/- 0.38, N = 3)
-O2 -march=native: 721 (SE +/- 3.93, N = 3)
-O2 -march=athlon64: 723 (SE +/- 3.18, N = 3)
PGO: 728 (SE +/- 0.27, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 255
1. (CC) gcc options: -lm

LuaJIT

Test: Jacobi Successive Over-Relaxation

LuaJIT 2.1-git (Mflops, More Is Better)
-O3 -march=native: 1868 (SE +/- 0.27, N = 3)
-O3 -march=athlon64-sse3: 1868 (SE +/- 0.46, N = 3)
-O3 -march=athlon64: 1867 (SE +/- 0.46, N = 3)
-O3 -march=native -flto: 1868 (SE +/- 0.35, N = 3)
-O2 -march=native: 1830 (SE +/- 11.29, N = 3)
-O2 -march=athlon64: 1865 (SE +/- 0.07, N = 3)
PGO: 1870 (SE +/- 0.37, N = 3)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Dense LU Matrix Factorization

LuaJIT 2.1-git (Mflops, More Is Better)
-O3 -march=native: 3611 (SE +/- 2.52, N = 3)
-O3 -march=athlon64-sse3: 3619 (SE +/- 5.63, N = 3)
-O3 -march=athlon64: 3590 (SE +/- 22.76, N = 3)
-O3 -march=native -flto: 3624 (SE +/- 1.48, N = 3)
-O2 -march=native: 3550 (SE +/- 31.00, N = 3)
-O2 -march=athlon64: 3623 (SE +/- 8.06, N = 3)
PGO: 3636 (SE +/- 8.61, N = 3)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Sparse Matrix Multiply

LuaJIT 2.1-git (Mflops, More Is Better)
-O3 -march=native: 1208 (SE +/- 1.12, N = 3)
-O3 -march=athlon64-sse3: 1207 (SE +/- 1.43, N = 3)
-O3 -march=athlon64: 1200 (SE +/- 2.17, N = 3)
-O3 -march=native -flto: 1204 (SE +/- 1.60, N = 3)
-O2 -march=native: 1182 (SE +/- 7.82, N = 3)
-O2 -march=athlon64: 1183 (SE +/- 14.55, N = 3)
PGO: 1203 (SE +/- 0.69, N = 3)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Fast Fourier Transform

LuaJIT 2.1-git (Mflops, More Is Better)
-O3 -march=native: 286 (SE +/- 0.06, N = 3)
-O3 -march=athlon64-sse3: 287 (SE +/- 0.81, N = 3)
-O3 -march=athlon64: 287 (SE +/- 0.62, N = 3)
-O3 -march=native -flto: 287 (SE +/- 0.09, N = 3)
-O2 -march=native: 281 (SE +/- 2.77, N = 3)
-O2 -march=athlon64: 286 (SE +/- 0.26, N = 3)
PGO: 286 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Monte Carlo

LuaJIT 2.1-git (Mflops, More Is Better)
-O3 -march=native: 499 (SE +/- 0.17, N = 3)
-O3 -march=athlon64-sse3: 499 (SE +/- 0.04, N = 3)
-O3 -march=athlon64: 498 (SE +/- 0.46, N = 3)
-O3 -march=native -flto: 500 (SE +/- 1.65, N = 3)
-O2 -march=native: 490 (SE +/- 2.87, N = 3)
-O2 -march=athlon64: 498 (SE +/- 0.13, N = 3)
PGO: 499 (SE +/- 0.60, N = 3)
1. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector


Phoronix Test Suite v10.8.5