AMD Ryzen Threadripper 2990WX compiler benchmarks on GCC 9.1 with Ubuntu Linux by Michael Larabel.
Environment Notes, per configuration:
-O2 -march=athlon64: CXXFLAGS="-O2 -march=athlon64" CFLAGS="-O2 -march=athlon64"
-O3 -march=athlon64: CXXFLAGS="-O3 -march=athlon64" CFLAGS="-O3 -march=athlon64"
-O3 -march=athlon64-sse3: CXXFLAGS="-O3 -march=athlon64-sse3" CFLAGS="-O3 -march=athlon64-sse3"
-O2 -march=native: CXXFLAGS="-O2 -march=native" CFLAGS="-O2 -march=native"
-O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
-O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
PGO: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Notes common to all seven configurations:
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq ondemand
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
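The configurations above differ only in the values exported through CFLAGS and CXXFLAGS. As a rough sketch, assuming the flags reach each test's build through those two environment variables, the mapping could be written as below. The `CONFIGS` table and the `build_env` helper are illustrative names, not part of the Phoronix Test Suite, and the sketch does not model how the PGO profile itself was collected, since the notes do not say.

```python
import os

# Flag sets copied from the environment notes above. PGO is listed with the
# same CFLAGS/CXXFLAGS as "-O3 -march=native"; how the profile data was
# generated is not stated in the notes, so it is not modelled here.
CONFIGS = {
    "-O2 -march=athlon64": "-O2 -march=athlon64",
    "-O3 -march=athlon64": "-O3 -march=athlon64",
    "-O3 -march=athlon64-sse3": "-O3 -march=athlon64-sse3",
    "-O2 -march=native": "-O2 -march=native",
    "-O3 -march=native": "-O3 -march=native",
    "-O3 -march=native -flto": "-O3 -march=native -flto",
    "PGO": "-O3 -march=native",
}


def build_env(config, base_env=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set."""
    env = dict(os.environ if base_env is None else base_env)
    flags = CONFIGS[config]
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for name in CONFIGS:
        env = build_env(name, base_env={})
        print(f"{name}: CFLAGS={env['CFLAGS']!r} CXXFLAGS={env['CXXFLAGS']!r}")
```

The returned dictionary could then be handed to a build command, for example through the `env` argument of `subprocess.run`.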
AMD Ryzen Threadripper 2990WX 32-Core Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads), Motherboard: ASUS ROG ZENITH EXTREME (1701 BIOS), Chipset: AMD 17h, Memory: 32768MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: AMD Radeon RX 64 8GB (1590/800MHz), Audio: Realtek ALC1220, Monitor: ASUS VP28U, Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad
OS: Ubuntu 18.04, Kernel: 4.18.0-18-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: amdgpu 18.1.0, OpenGL: 4.5 Mesa 18.2.8 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160
AOM AV1
This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.
AOM AV1 2019-02-11, AV1 Video Encoding (Frames Per Second, More Is Better)
-O2 -march=athlon64: 0.20
-O3 -march=athlon64: 0.20
-O3 -march=athlon64-sse3: 0.21
-O2 -march=native: 0.21
-O3 -march=native: 0.22
-O3 -march=native -flto: 0.22
PGO: 0.22
AMD Ryzen Threadripper 2990WX 32-Core: 0.08
Standard errors, in chart order: SE +/- 0.00 for all seven labeled bars (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native -flto -march=native -march=native
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-AV1
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file.
SVT-AV1 2019-03-07, 1080p 8-bit YUV To AV1 Video Encode (Frames Per Second, More Is Better)
-O2 -march=athlon64: 18.51
-O3 -march=athlon64: 18.84
-O3 -march=athlon64-sse3: 18.66
-O2 -march=native: 18.63
-O3 -march=native: 20.27
-O3 -march=native -flto: 20.41
PGO: 19.57
AMD Ryzen Threadripper 2990WX 32-Core: 18.77
Standard errors, in chart order: SE +/- 0.17, 0.11, 0.09, 0.08, 0.05, 0.08, 0.10 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native -flto -march=native -march=native
(CXX) g++ options: -O3 -pie -lpthread -lm
SVT-HEVC
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file.
SVT-HEVC 2019-02-03, 1080p 8-bit YUV To HEVC Video Encode (Frames Per Second, More Is Better)
-O2 -march=athlon64: 172.00
-O3 -march=athlon64: 168.00
-O3 -march=athlon64-sse3: 166.00
-O2 -march=native: 163.00
-O3 -march=native: 165.00
-O3 -march=native -flto: 165.00
PGO: 185.00
AMD Ryzen Threadripper 2990WX 32-Core: 23.01
Standard errors, in chart order: SE +/- 4.80 (N = 15), 2.41 (N = 3), 0.65 (N = 3), 3.04 (N = 12), 1.96 (N = 3), 2.04 (N = 3), 6.47 (N = 15)
Per-result compiler flags as labeled: -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O3 -O3 -O3
(CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt
SVT-VP9
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file.
SVT-VP9 2019-02-17, 1080p 8-bit YUV To VP9 Video Encode (Frames Per Second, More Is Better)
-O2 -march=athlon64: 101.13
-O3 -march=athlon64: 103.45
-O3 -march=athlon64-sse3: 97.81
-O2 -march=native: 100.82
-O3 -march=native: 102.91
-O3 -march=native -flto: 104.42
PGO: 103.97
AMD Ryzen Threadripper 2990WX 32-Core: 5.07
Standard errors, in chart order: SE +/- 0.98 (N = 15), 1.26 (N = 15), 0.78 (N = 3), 0.82 (N = 3), 1.13 (N = 15), 1.19 (N = 15), 0.96 (N = 15)
Per-result compiler flags as labeled: -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -march=native -O3 -march=native -O3 -march=native -O3 -march=native
(CC) gcc options: -O2 -fPIE -fPIC -flto -fvisibility=hidden -mavx -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding
This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video.
VP9 libvpx Encoding 1.8.0, vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better)
-O2 -march=athlon64: 25.44 (SE +/- 0.26, N = 9)
-O3 -march=athlon64: 26.12 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 25.58 (SE +/- 0.40, N = 3)
-O2 -march=native: 26.02 (SE +/- 0.22, N = 3)
-O3 -march=native: 26.37 (SE +/- 0.13, N = 3)
PGO: 26.36 (SE +/- 0.02, N = 3)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native
(CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
x264
This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file.
x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
-O2 -march=athlon64: 146 (SE +/- 1.63, N = 7)
-O3 -march=athlon64: 146 (SE +/- 0.81, N = 3)
-O3 -march=athlon64-sse3: 146 (SE +/- 1.95, N = 5)
-O2 -march=native: 143 (SE +/- 1.95, N = 3)
-O3 -march=native: 147 (SE +/- 1.46, N = 9)
PGO: 145 (SE +/- 0.95, N = 3)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
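The x264 results above sit within a few frames per second of one another, so it is worth checking whether the gaps exceed the reported standard errors. The following is a minimal sketch, assuming the six SE labels are listed in the same order as the six results and that the runs are independent; `RESULTS` and `difference_in_se_units` are illustrative names, and dividing by the combined standard error is only a rough rule of thumb, not a formal significance test.

```python
import math

# x264 results (frames per second) and standard errors from the chart above,
# assuming the SE labels appear in the same order as the results.
RESULTS = {
    "-O2 -march=athlon64": (146, 1.63),
    "-O3 -march=athlon64": (146, 0.81),
    "-O3 -march=athlon64-sse3": (146, 1.95),
    "-O2 -march=native": (143, 1.95),
    "-O3 -march=native": (147, 1.46),
    "PGO": (145, 0.95),
}


def difference_in_se_units(a, b, results=RESULTS):
    """Return (mean_b - mean_a) divided by the combined standard error."""
    mean_a, se_a = results[a]
    mean_b, se_b = results[b]
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_b - mean_a) / combined_se


if __name__ == "__main__":
    baseline = "-O2 -march=native"
    for name in RESULTS:
        if name != baseline:
            z = difference_in_se_units(baseline, name)
            print(f"{name} vs {baseline}: {z:+.2f} combined standard errors")
```

Under these assumptions every configuration lands within two combined standard errors of -O2 -march=native, which suggests the x264 differences are hard to separate from run-to-run noise.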
x265
This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file.
x265 3.0, H.265 1080p Video Encoding (Frames Per Second, More Is Better)
-O2 -march=athlon64: 33.53
-O3 -march=athlon64: 33.89
-O3 -march=athlon64-sse3: 33.88
-O2 -march=native: 33.44
-O3 -march=native: 33.76
-O3 -march=native -flto: 33.68
PGO: 33.79
AMD Ryzen Threadripper 2990WX 32-Core: 9.95
Standard errors, in chart order: SE +/- 0.06, 0.04, 0.06, 0.09, 0.09, 0.04, 0.07 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native -flto -march=native -march=native
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
High Performance Conjugate Gradient
HPCG, the High Performance Conjugate Gradient, is a newer scientific benchmark from Sandia National Labs aimed at supercomputer testing with modern real-world workloads, in contrast to HPCC.
High Performance Conjugate Gradient 3.0 (GFLOP/s, More Is Better)
-O2 -march=athlon64: 0.89
-O3 -march=athlon64: 0.92
-O3 -march=athlon64-sse3: 0.85
-O2 -march=native: 0.81
-O3 -march=native: 0.83
-O3 -march=native -flto: 0.98
PGO: 0.91
AMD Ryzen Threadripper 2990WX 32-Core: 0.90
Standard errors, in chart order: SE +/- 0.02 (N = 13), 0.02 (N = 15), 0.01 (N = 3), 0.02 (N = 15), 0.03 (N = 12), 0.02 (N = 3), 0.03 (N = 12)
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU.

GraphicsMagick 1.3.30, Operation: Swirl (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 244
-O3 -march=athlon64: 237
-O3 -march=athlon64-sse3: 237
-O2 -march=native: 245
-O3 -march=native: 247
-O3 -march=native -flto: 250
PGO: 250
AMD Ryzen Threadripper 2990WX 32-Core: 26
Standard errors, in chart order: SE +/- 0.33, 0.58, 1.00 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native

GraphicsMagick 1.3.30, Operation: Rotate (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 243
-O3 -march=athlon64: 238
-O3 -march=athlon64-sse3: 240
-O2 -march=native: 249
-O3 -march=native: 249
-O3 -march=native -flto: 248
PGO: 251
AMD Ryzen Threadripper 2990WX 32-Core: 175
Standard errors, in chart order: SE +/- 0.67, 0.33 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native

GraphicsMagick 1.3.30, Operation: Sharpen (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 192
-O3 -march=athlon64: 190
-O3 -march=athlon64-sse3: 191
-O2 -march=native: 217
-O3 -march=native: 219
-O3 -march=native -flto: 221
PGO: 220
AMD Ryzen Threadripper 2990WX 32-Core: 4
Standard errors, in chart order: SE +/- 0.67, 0.67, 0.33, 0.88, 0.88 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native

GraphicsMagick 1.3.30, Operation: Enhanced (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 198
-O3 -march=athlon64: 195
-O3 -march=athlon64-sse3: 194
-O2 -march=native: 231
-O3 -march=native: 232
-O3 -march=native -flto: 233
PGO: 234
AMD Ryzen Threadripper 2990WX 32-Core: 8
Standard errors, in chart order: SE +/- 0.58, 0.58, 0.88 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native

GraphicsMagick 1.3.30, Operation: Resizing (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 235
-O3 -march=athlon64: 233
-O3 -march=athlon64-sse3: 231
-O2 -march=native: 238
-O3 -march=native: 243
-O3 -march=native -flto: 240
PGO: 243
AMD Ryzen Threadripper 2990WX 32-Core: 31
Standard errors, in chart order: SE +/- 1.15, 1.00, 0.88, 0.67 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native

GraphicsMagick 1.3.30, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 202
-O3 -march=athlon64: 199
-O3 -march=athlon64-sse3: 198
-O2 -march=native: 200
-O3 -march=native: 204
-O3 -march=native -flto: 203
PGO: 203
AMD Ryzen Threadripper 2990WX 32-Core: 40
Standard errors, in chart order: SE +/- 1.53, 0.88, 0.67, 0.67, 0.67 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native

GraphicsMagick 1.3.30, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
-O2 -march=athlon64: 272
-O3 -march=athlon64: 261
-O3 -march=athlon64-sse3: 263
-O2 -march=native: 271
-O3 -march=native: 272
-O3 -march=native -flto: 274
PGO: 274
AMD Ryzen Threadripper 2990WX 32-Core: 59
Standard errors, in chart order: SE +/- 0.33, 1.33 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native

(CC) gcc options, all GraphicsMagick charts: -fopenmp -pthread -ljbig -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lz -lm -ldl -lpthread
FFTW
FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
-O2 -march=athlon64: 4342
-O3 -march=athlon64: 4361
-O3 -march=athlon64-sse3: 4463
-O2 -march=native: 6335
-O3 -march=native: 6600
-O3 -march=native -flto: 7019
PGO: 6717
AMD Ryzen Threadripper 2990WX 32-Core: 6411
Standard errors, in chart order: SE +/- 28.80, 63.69, 1.73, 34.65, 51.66, 8.84, 14.46 (N = 3 each)

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
-O2 -march=native: 16263
-O3 -march=native: 14927
-O3 -march=native -flto: 14864
PGO: 15287
AMD Ryzen Threadripper 2990WX 32-Core: 15098
Standard errors, in chart order: SE +/- 13.87, 71.85, 74.46, 152.85 (N = 3 each)

Per-result compiler flags as labeled, both FFTW charts: -O2 -O3 -O3 -flto -O3 -O3
(CC) gcc options, both FFTW charts: -pthread -march=native -lm
LuaJIT
This test profile is a collection of Lua scripts/benchmarks run against a locally-built copy of LuaJIT upstream.

LuaJIT 2.1-git, Test: Composite (Mflops, More Is Better)
-O2 -march=athlon64: 1491 (SE +/- 4.50, N = 3)
-O3 -march=athlon64: 1489 (SE +/- 4.29, N = 3)
-O3 -march=athlon64-sse3: 1496 (SE +/- 1.46, N = 3)
-O2 -march=native: 1467 (SE +/- 10.67, N = 3)
-O3 -march=native: 1495 (SE +/- 0.72, N = 3)
-O3 -march=native -flto: 1497 (SE +/- 0.43, N = 3)
PGO: 1499 (SE +/- 1.60, N = 3)

LuaJIT 2.1-git, Test: Monte Carlo (Mflops, More Is Better)
-O2 -march=athlon64: 498 (SE +/- 0.13, N = 3)
-O3 -march=athlon64: 498 (SE +/- 0.46, N = 3)
-O3 -march=athlon64-sse3: 499 (SE +/- 0.04, N = 3)
-O2 -march=native: 490 (SE +/- 2.87, N = 3)
-O3 -march=native: 499 (SE +/- 0.17, N = 3)
-O3 -march=native -flto: 500 (SE +/- 1.65, N = 3)
PGO: 499 (SE +/- 0.60, N = 3)

LuaJIT 2.1-git, Test: Fast Fourier Transform (Mflops, More Is Better)
-O2 -march=athlon64: 286 (SE +/- 0.26, N = 3)
-O3 -march=athlon64: 287 (SE +/- 0.62, N = 3)
-O3 -march=athlon64-sse3: 287 (SE +/- 0.81, N = 3)
-O2 -march=native: 281 (SE +/- 2.77, N = 3)
-O3 -march=native: 286 (SE +/- 0.06, N = 3)
-O3 -march=native -flto: 287 (SE +/- 0.09, N = 3)
PGO: 286 (SE +/- 0.27, N = 3)

LuaJIT 2.1-git, Test: Sparse Matrix Multiply (Mflops, More Is Better)
-O2 -march=athlon64: 1183 (SE +/- 14.55, N = 3)
-O3 -march=athlon64: 1200 (SE +/- 2.17, N = 3)
-O3 -march=athlon64-sse3: 1207 (SE +/- 1.43, N = 3)
-O2 -march=native: 1182 (SE +/- 7.82, N = 3)
-O3 -march=native: 1208 (SE +/- 1.12, N = 3)
-O3 -march=native -flto: 1204 (SE +/- 1.60, N = 3)
PGO: 1203 (SE +/- 0.69, N = 3)

LuaJIT 2.1-git, Test: Dense LU Matrix Factorization (Mflops, More Is Better)
-O2 -march=athlon64: 3623 (SE +/- 8.06, N = 3)
-O3 -march=athlon64: 3590 (SE +/- 22.76, N = 3)
-O3 -march=athlon64-sse3: 3619 (SE +/- 5.63, N = 3)
-O2 -march=native: 3550 (SE +/- 31.00, N = 3)
-O3 -march=native: 3611 (SE +/- 2.52, N = 3)
-O3 -march=native -flto: 3624 (SE +/- 1.48, N = 3)
PGO: 3636 (SE +/- 8.61, N = 3)

LuaJIT 2.1-git, Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
-O2 -march=athlon64: 1865 (SE +/- 0.07, N = 3)
-O3 -march=athlon64: 1867 (SE +/- 0.46, N = 3)
-O3 -march=athlon64-sse3: 1868 (SE +/- 0.46, N = 3)
-O2 -march=native: 1830 (SE +/- 11.29, N = 3)
-O3 -march=native: 1868 (SE +/- 0.27, N = 3)
-O3 -march=native -flto: 1868 (SE +/- 0.35, N = 3)
PGO: 1870 (SE +/- 0.37, N = 3)

Per-result compiler flags as labeled, all LuaJIT charts: -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native
(CC) gcc options, all LuaJIT charts: -lm -ldl -O2 -fomit-frame-pointer -U_FORTIFY_SOURCE -fno-stack-protector
SciMark
This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks.

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
-O2 -march=athlon64: 1782
-O3 -march=athlon64: 2021
-O3 -march=athlon64-sse3: 2039
-O2 -march=native: 1981
-O3 -march=native: 2514
-O3 -march=native -flto: 2543
PGO: 2555
AMD Ryzen Threadripper 2990WX 32-Core: 2257
Standard errors, in chart order: SE +/- 4.62 (N = 3), 25.27 (N = 5), 0.49 (N = 3), 17.63 (N = 3), 26.08 (N = 8), 32.19 (N = 3), 2.91 (N = 3)

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
-O2 -march=athlon64: 723
-O3 -march=athlon64: 736
-O3 -march=athlon64-sse3: 737
-O2 -march=native: 721
-O3 -march=native: 732
-O3 -march=native -flto: 1904
PGO: 728
AMD Ryzen Threadripper 2990WX 32-Core: 255
Standard errors, in chart order: SE +/- 3.18, 0.82, 0.20, 3.93, 0.10, 0.38, 0.27 (N = 3 each)

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
-O2 -march=athlon64: 291
-O3 -march=athlon64: 294
-O3 -march=athlon64-sse3: 294
-O2 -march=native: 265
-O3 -march=native: 270
-O3 -march=native -flto: 270
PGO: 261
AMD Ryzen Threadripper 2990WX 32-Core: 260
Standard errors, in chart order: SE +/- 0.93, 0.13, 0.67, 1.28, 0.11, 0.17, 0.21 (N = 3 each)

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
-O2 -march=athlon64: 3119
-O3 -march=athlon64: 2874
-O3 -march=athlon64-sse3: 3082
-O2 -march=native: 3105
-O3 -march=native: 3174
-O3 -march=native -flto: 2951
PGO: 3220
AMD Ryzen Threadripper 2990WX 32-Core: 3153
Standard errors, in chart order: SE +/- 7.39, 208.10, 4.22, 52.49, 37.52, 20.52, 9.00 (N = 3 each)

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
-O2 -march=athlon64: 3593
-O3 -march=athlon64: 4274
-O3 -march=athlon64-sse3: 4239
-O2 -march=native: 4507
-O3 -march=native: 5989
-O3 -march=native -flto: 5388
PGO: 6356
AMD Ryzen Threadripper 2990WX 32-Core: 5429
Standard errors, in chart order: SE +/- 15.82, 1.14, 4.78, 35.70, 352.63, 139.92, 11.42 (N = 3 each)

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
-O2 -march=athlon64: 1186
-O3 -march=athlon64: 1842
-O3 -march=athlon64-sse3: 1842
-O2 -march=native: 1306
-O3 -march=native: 2218
-O3 -march=native -flto: 2202
PGO: 2208
AMD Ryzen Threadripper 2990WX 32-Core: 2190
Standard errors, in chart order: SE +/- 5.74, 0.24, 0.37, 7.73, 0.57, 0.48, 0.39 (N = 3 each)

Per-result compiler flags as labeled, all SciMark charts: -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native
(CC) gcc options, all SciMark charts: -lm
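SciMark shows the widest spread between flag sets in this result file, so a relative-change calculation makes the composite figures easier to compare. The sketch below assumes the composite values are listed in the same order as the configurations in the chart legend; `COMPOSITE` and `percent_change` are illustrative names.

```python
# SciMark 2.0 composite Mflops from the chart above, assuming the results are
# listed in the same order as the configurations in the chart legend.
COMPOSITE = {
    "-O2 -march=athlon64": 1782,
    "-O3 -march=athlon64": 2021,
    "-O3 -march=athlon64-sse3": 2039,
    "-O2 -march=native": 1981,
    "-O3 -march=native": 2514,
    "-O3 -march=native -flto": 2543,
    "PGO": 2555,
}


def percent_change(results, baseline, candidate):
    """Return the change of `candidate` relative to `baseline`, in percent."""
    return (results[candidate] / results[baseline] - 1.0) * 100.0


if __name__ == "__main__":
    baseline = "-O2 -march=athlon64"
    for name in COMPOSITE:
        if name != baseline:
            change = percent_change(COMPOSITE, baseline, name)
            print(f"{name}: {change:+.1f}% vs {baseline}")
```

Read this way, -O3 -march=native comes out roughly 41% ahead of -O2 -march=athlon64 on the composite score, with LTO and PGO adding only a little on top.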
Himeno Benchmark
The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method.
Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
-O2 -march=athlon64: 1328
-O3 -march=athlon64: 1316
-O3 -march=athlon64-sse3: 1316
-O2 -march=native: 1319
-O3 -march=native: 1313
-O3 -march=native -flto: 1304
PGO: 1321
AMD Ryzen Threadripper 2990WX 32-Core: 1322
Standard errors, in chart order: SE +/- 3.65, 3.98, 4.00, 3.88, 1.73, 1.86, 3.47 (N = 3 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native -flto -march=native -march=native
(CC) gcc options: -O3 -mavx2
MBW
This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations.

MBW 2018-09-08, Test: Memory Copy - Array Size: 8192 MiB (MiB/s, More Is Better)
-O2 -march=athlon64: 12632
-O3 -march=athlon64: 12930
-O3 -march=athlon64-sse3: 12699
-O2 -march=native: 12795
-O3 -march=native: 12851
-O3 -march=native -flto: 13176
PGO: 12406
AMD Ryzen Threadripper 2990WX 32-Core: 12618
Standard errors, in chart order: SE +/- 152.24 (N = 5), 162.89 (N = 5), 217.78 (N = 3), 167.95 (N = 5), 169.00 (N = 3), 153.84 (N = 3), 61.39 (N = 3)

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB (MiB/s, More Is Better)
-O2 -march=athlon64: 6526
-O3 -march=athlon64: 6755
-O3 -march=athlon64-sse3: 6681
-O2 -march=native: 6778
-O3 -march=native: 6685
-O3 -march=native -flto: 6721
PGO: 6753
AMD Ryzen Threadripper 2990WX 32-Core: 6596
Standard errors, in chart order: SE +/- 60.66, 7.74, 10.14, 38.61, 27.83, 96.68, 26.90 (N = 3 each)

Per-result compiler flags as labeled, both MBW charts: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -flto
(CC) gcc options, both MBW charts: -O3 -march=native
TSCP
This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark.
TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
-O2 -march=athlon64: 1094211
-O3 -march=athlon64: 1115390
-O3 -march=athlon64-sse3: 1115841
-O2 -march=native: 1102013
-O3 -march=native: 1116747
-O3 -march=native -flto: 1135626
PGO: 1109114
AMD Ryzen Threadripper 2990WX 32-Core: 981778
Standard errors, in chart order: SE +/- 5126.15, 1105.70, 903.00, 2140.82, 1107.42, 740.45, 2184.70 (N = 5 each)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -flto
(CC) gcc options: -O3 -march=native
Stockfish
This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores.
Stockfish 9, Total Time (Nodes Per Second, More Is Better)
-O2 -march=athlon64: 66890687 (SE +/- 226981.30, N = 3)
-O3 -march=athlon64: 67513602 (SE +/- 779150.59, N = 3)
-O3 -march=athlon64-sse3: 67571150 (SE +/- 443906.77, N = 3)
-O2 -march=native: 66697487 (SE +/- 526520.80, N = 3)
-O3 -march=native: 68200164 (SE +/- 954655.13, N = 3)
-O3 -march=native -flto: 67450689 (SE +/- 385955.10, N = 3)
PGO: 67841877 (SE +/- 458502.78, N = 3)
Per-result compiler flags as labeled: -O2 -march=athlon64 -march=athlon64 -march=athlon64-sse3 -O2 -march=native -march=native -march=native -march=native
(CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto
Memcached mcperf This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto PGO AMD Ryzen Threadripper 2990WX 32-Core 12K 24K 36K 48K 60K SE +/- 2866.90, N = 12 SE +/- 2252.95, N = 15 SE +/- 121.25, N = 3 SE +/- 125.83, N = 3 SE +/- 1053.86, N = 15 SE +/- 2185.23, N = 15 SE +/- 2440.20, N = 12 54600 53106 34822 34138 44903 50055 47774 35174 -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native 1. (CC) gcc options: -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Get -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto PGO AMD Ryzen Threadripper 2990WX 32-Core 15K 30K 45K 60K 75K SE +/- 339.02, N = 3 SE +/- 29.60, N = 3 SE +/- 286.25, N = 3 SE +/- 125.71, N = 3 SE +/- 812.95, N = 3 SE +/- 66.15, N = 3 SE +/- 734.34, N = 3 55647 56837 69004 55791 57652 68644 68426 56324 -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native 1. (CC) gcc options: -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Set -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto PGO AMD Ryzen Threadripper 2990WX 32-Core 13K 26K 39K 52K 65K SE +/- 2193.95, N = 15 SE +/- 44.53, N = 3 SE +/- 2384.39, N = 15 SE +/- 67.69, N = 3 SE +/- 877.57, N = 15 SE +/- 195.82, N = 3 SE +/- 1474.59, N = 12 59924 34880 48548 34446 38646 43570 46396 35239 -O2 -march=athlon64 -O3 -march=athlon64 -O3 -march=athlon64-sse3 -O2 -march=native -O3 -march=native -O3 -march=native -flto -O3 -march=native -O3 -march=native 1. (CC) gcc options: -lm -rdynamic
Memcached mcperf 1.5.10, Method: Append (Operations Per Second, More Is Better)
-O2 -march=athlon64: 61219 (SE +/- 2658.53, N = 15)
-O3 -march=athlon64: 56172 (SE +/- 3045.08, N = 15)
-O3 -march=athlon64-sse3: 45716 (SE +/- 220.19, N = 3)
-O2 -march=native: 35058 (SE +/- 168.27, N = 3)
-O3 -march=native: 42455 (SE +/- 256.24, N = 3)
-O3 -march=native -flto: 45339 (SE +/- 218.35, N = 3)
PGO: 46106 (SE +/- 123.87, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 35920

Memcached mcperf 1.5.10, Method: Delete (Operations Per Second, More Is Better)
-O2 -march=athlon64: 55747 (SE +/- 274.96, N = 3)
-O3 -march=athlon64: 56822 (SE +/- 204.60, N = 3)
-O3 -march=athlon64-sse3: 69509 (SE +/- 914.89, N = 4)
-O2 -march=native: 56141 (SE +/- 304.30, N = 3)
-O3 -march=native: 58969 (SE +/- 772.91, N = 4)
-O3 -march=native -flto: 68696 (SE +/- 220.33, N = 3)
PGO: 68980 (SE +/- 370.63, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 56797

Memcached mcperf 1.5.10, Method: Prepend (Operations Per Second, More Is Better)
-O2 -march=athlon64: 35467 (SE +/- 109.24, N = 3)
-O3 -march=athlon64: 35968 (SE +/- 58.44, N = 3)
-O3 -march=athlon64-sse3: 45691 (SE +/- 116.90, N = 3)
-O2 -march=native: 35552 (SE +/- 186.47, N = 3)
-O3 -march=native: 43824 (SE +/- 2300.04, N = 15)
-O3 -march=native -flto: 45587 (SE +/- 179.60, N = 3)
PGO: 47085 (SE +/- 863.93, N = 12)
AMD Ryzen Threadripper 2990WX 32-Core: 35579

Memcached mcperf 1.5.10, Method: Replace (Operations Per Second, More Is Better)
-O2 -march=athlon64: 35829 (SE +/- 132.68, N = 3)
-O3 -march=athlon64: 36231 (SE +/- 365.15, N = 3)
-O3 -march=athlon64-sse3: 45707 (SE +/- 85.02, N = 3)
-O2 -march=native: 35486 (SE +/- 167.62, N = 3)
-O3 -march=native: 53865 (SE +/- 1741.69, N = 12)
-O3 -march=native -flto: 45591 (SE +/- 174.58, N = 3)
PGO: 45956 (SE +/- 168.47, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 35978

Memcached mcperf builds: (CC) gcc options: -lm -rdynamic
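Each result above carries an annotation of the form "SE +/- x, N = n". The following is a minimal sketch of how such a figure is conventionally derived, assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of N). The source does not state the exact formula used, the function name is hypothetical, and the run values are invented for illustration rather than taken from these results.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical operations-per-second readings from N = 3 runs (illustrative only).
runs = [45500.0, 45700.0, 45900.0]

mean = statistics.mean(runs)
se = standard_error(runs)

# Report in the same shape as the annotations in the result listing.
print(f"{mean:.0f} ops/s (SE +/- {se:.2f}, N = {len(runs)})")
```

Under this reading, a large SE together with a raised N (for example N = 15 rather than 3) suggests the runs varied enough that more repetitions were recorded, so such results deserve extra caution when comparing configurations.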
Redis

Redis is an open-source data structure server.

Redis 4.0.8, Test: LPOP (Requests Per Second, More Is Better)
-O2 -march=athlon64: 2654025 (SE +/- 41423.26, N = 15)
-O3 -march=athlon64: 2544762 (SE +/- 25886.44, N = 8)
-O3 -march=athlon64-sse3: 2506276 (SE +/- 3626.56, N = 3)
-O2 -march=native: 2340595 (SE +/- 24290.24, N = 3)
-O3 -march=native: 2573823 (SE +/- 34288.12, N = 3)
-O3 -march=native -flto: 2616703 (SE +/- 24550.99, N = 9)
PGO: 2648345 (SE +/- 25803.36, N = 3)

Redis 4.0.8, Test: SADD (Requests Per Second, More Is Better)
-O2 -march=athlon64: 1975322 (SE +/- 18375.04, N = 3)
-O3 -march=athlon64: 2051129 (SE +/- 23560.29, N = 3)
-O3 -march=athlon64-sse3: 2032921 (SE +/- 24795.31, N = 5)
-O2 -march=native: 2000709 (SE +/- 26134.11, N = 12)
-O3 -march=native: 2046557 (SE +/- 13234.25, N = 3)
-O3 -march=native -flto: 2084089 (SE +/- 27875.47, N = 3)
PGO: 2083430 (SE +/- 10024.22, N = 3)

Redis 4.0.8, Test: LPUSH (Requests Per Second, More Is Better)
-O2 -march=athlon64: 1548556 (SE +/- 20493.00, N = 5)
-O3 -march=athlon64: 1520276 (SE +/- 19988.57, N = 3)
-O3 -march=athlon64-sse3: 1522952 (SE +/- 9157.67, N = 3)
-O2 -march=native: 1463388 (SE +/- 22581.44, N = 3)
-O3 -march=native: 1540257 (SE +/- 12857.88, N = 3)
-O3 -march=native -flto: 1549647 (SE +/- 6810.29, N = 3)
PGO: 1532038 (SE +/- 22070.89, N = 3)

Redis 4.0.8, Test: GET (Requests Per Second, More Is Better)
-O2 -march=athlon64: 2384143 (SE +/- 39327.40, N = 3)
-O3 -march=athlon64: 2540398 (SE +/- 15021.00, N = 3)
-O3 -march=athlon64-sse3: 2461725 (SE +/- 28994.69, N = 3)
-O2 -march=native: 2275019 (SE +/- 25441.63, N = 3)
-O3 -march=native: 2502182 (SE +/- 11016.75, N = 3)
-O3 -march=native -flto: 2509433 (SE +/- 36774.66, N = 3)
PGO: 2533027 (SE +/- 26635.52, N = 12)

Redis 4.0.8, Test: SET (Requests Per Second, More Is Better)
-O2 -march=athlon64: 1730702 (SE +/- 22619.48, N = 3)
-O3 -march=athlon64: 1840806 (SE +/- 16989.46, N = 3)
-O3 -march=athlon64-sse3: 1755590 (SE +/- 12476.39, N = 3)
-O2 -march=native: 1744979 (SE +/- 28790.51, N = 15)
-O3 -march=native: 1807270 (SE +/- 6043.61, N = 3)
-O3 -march=native -flto: 1755472 (SE +/- 7219.96, N = 3)
PGO: 1798800 (SE +/- 14705.69, N = 3)

Redis builds: (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
NGINX Benchmark

This is a test of ab, the Apache Benchmark program, running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests with 500 requests being carried out concurrently.

NGINX Benchmark 1.9.9, Static Web Page Serving (Requests Per Second, More Is Better)
-O2 -march=athlon64: 29704 (SE +/- 219.17, N = 3)
-O3 -march=athlon64: 29726 (SE +/- 169.85, N = 3)
-O3 -march=athlon64-sse3: 29281 (SE +/- 428.25, N = 3)
-O2 -march=native: 27834 (SE +/- 166.84, N = 3)
-O3 -march=native: 29274 (SE +/- 401.04, N = 4)
-O3 -march=native -flto: 27352 (SE +/- 43.76, N = 3)
(CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL.

OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)
-O2 -march=athlon64: 5832 (SE +/- 0.92, N = 3)
-O3 -march=athlon64: 5837 (SE +/- 1.40, N = 3)
-O3 -march=athlon64-sse3: 5838 (SE +/- 1.25, N = 3)
-O2 -march=native: 5831 (SE +/- 1.56, N = 3)
-O3 -march=native: 5825 (SE +/- 3.74, N = 3)
-O3 -march=native -flto: 5833 (SE +/- 2.44, N = 3)
PGO: 5830 (SE +/- 4.62, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 5791
(CC) gcc options: -pthread -m64 -lcrypto -ldl; -lssl is additionally listed for every result except the last.
PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench.

PostgreSQL pgbench 10.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better)
-O2 -march=athlon64: 453605 (SE +/- 2874.20, N = 3)
-O3 -march=athlon64: 463840 (SE +/- 6987.31, N = 3)
-O3 -march=athlon64-sse3: 459596 (SE +/- 6062.49, N = 4)
-O2 -march=native: 458551 (SE +/- 3403.88, N = 3)
-O3 -march=native: 466723 (SE +/- 1148.20, N = 3)
-O3 -march=native -flto: 473620 (SE +/- 1547.77, N = 3)
PGO: 460414 (SE +/- 7464.16, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 259058

PostgreSQL pgbench 10.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
-O2 -march=athlon64: 5098 (SE +/- 65.89, N = 4)
-O3 -march=athlon64: 15616 (SE +/- 247.31, N = 3)
-O3 -march=athlon64-sse3: 16252 (SE +/- 51.55, N = 3)
-O2 -march=native: 13558 (SE +/- 279.82, N = 15)
-O3 -march=native: 16339 (SE +/- 98.94, N = 3)
-O3 -march=native -flto: 14989 (SE +/- 164.04, N = 7)
PGO: 6281 (SE +/- 93.40, N = 4)
AMD Ryzen Threadripper 2990WX 32-Core: 6472

PostgreSQL pgbench builds: (CC) gcc options: -fno-strict-aliasing -fwrapv -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm; -lpq is additionally listed for every result except the last.
ctx_clock

Ctx_clock is a simple test program to measure the context switch time in clock cycles.

ctx_clock, Context Switch Time (Clocks, Fewer Is Better)
-O2 -march=athlon64: 150
-O3 -march=athlon64: 150
-O3 -march=athlon64-sse3: 150
-O2 -march=native: 150
-O3 -march=native: 150
-O3 -march=native -flto: 150
PGO: 150
AMD Ryzen Threadripper 2990WX 32-Core: 150
MKL-DNN

This is a test of Intel MKL-DNN, the Intel Math Kernel Library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported.

MKL-DNN 2019-04-16, Harness: IP Batch 1D - Data Type: f32 (ms, Fewer Is Better)
-O2 -march=athlon64: 71.98 (SE +/- 0.09, N = 3; MIN: 70.78)
-O3 -march=athlon64: 70.94 (SE +/- 0.14, N = 3; MIN: 70.24)
-O3 -march=athlon64-sse3: 71.02 (SE +/- 0.04, N = 3; MIN: 70.33)
-O2 -march=native: 72.08 (SE +/- 0.24, N = 3; MIN: 70.86)
-O3 -march=native: 71.26 (SE +/- 0.40, N = 3; MIN: 70.22)
PGO: 70.94 (SE +/- 0.06, N = 3; MIN: 70.39)
AMD Ryzen Threadripper 2990WX 32-Core: 71.00 (MIN: 70.47)
(CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl; -lrt is additionally listed for the last result.
t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome.

t-test1 2017-01-13, Threads: 1 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 25.60 (SE +/- 0.10, N = 3)
-O3 -march=athlon64: 29.66 (SE +/- 0.29, N = 9)
-O3 -march=athlon64-sse3: 27.12 (SE +/- 0.34, N = 15)
-O2 -march=native: 26.07 (SE +/- 0.33, N = 3)
-O3 -march=native: 26.23 (SE +/- 0.07, N = 3)
-O3 -march=native -flto: 26.31 (SE +/- 0.16, N = 3)
PGO: 28.74 (SE +/- 0.39, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 27.83

t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 8.68 (SE +/- 0.10, N = 3)
-O3 -march=athlon64: 9.59 (SE +/- 0.10, N = 15)
-O3 -march=athlon64-sse3: 8.82 (SE +/- 0.10, N = 3)
-O2 -march=native: 9.40 (SE +/- 0.12, N = 3)
-O3 -march=native: 9.01 (SE +/- 0.09, N = 15)
-O3 -march=native -flto: 8.89 (SE +/- 0.08, N = 10)
PGO: 9.41 (SE +/- 0.12, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 14.05

t-test1 builds: (CC) gcc options: -pthread
Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences.

Timed MAFFT Alignment 7.392, Multiple Sequence Alignment (Seconds, Fewer Is Better)
-O2 -march=athlon64: 2.60 (SE +/- 0.05, N = 15)
-O3 -march=athlon64: 2.58 (SE +/- 0.02, N = 15)
-O3 -march=athlon64-sse3: 2.55 (SE +/- 0.03, N = 15)
-O2 -march=native: 2.58 (SE +/- 0.06, N = 15)
-O3 -march=native: 2.69 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 2.65 (SE +/- 0.03, N = 15)
PGO: 2.63 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 2.74
(CC) gcc options: -std=c99 -O3 -lm -lpthread
Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick.

Timed ImageMagick Compilation 6.9.0, Time To Compile (Seconds, Fewer Is Better)
-O2 -march=athlon64: 17.41 (SE +/- 0.21, N = 3)
-O3 -march=athlon64: 19.77 (SE +/- 0.23, N = 5)
-O3 -march=athlon64-sse3: 19.73 (SE +/- 0.03, N = 3)
-O2 -march=native: 17.58 (SE +/- 0.24, N = 3)
-O3 -march=native: 19.15 (SE +/- 0.25, N = 3)
-O3 -march=native -flto: 88.89 (SE +/- 1.39, N = 3)
PGO: 19.09 (SE +/- 0.07, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 19.69
Timed PHP Compilation

This test times how long it takes to build PHP 5 with the Zend engine.

Timed PHP Compilation 7.1.9, Time To Compile (Seconds, Fewer Is Better)
-O2 -march=athlon64: 43.86 (SE +/- 0.20, N = 3)
-O3 -march=athlon64: 63.19 (SE +/- 0.32, N = 3)
-O3 -march=athlon64-sse3: 62.93 (SE +/- 0.17, N = 3)
-O2 -march=native: 44.44 (SE +/- 0.07, N = 3)
-O3 -march=native: 63.25 (SE +/- 0.08, N = 3)
PGO: 63.01 (SE +/- 0.10, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 85.40
(CC) gcc options: -pedantic -ldl -lz -lm
C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image.

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
-O2 -march=athlon64: 45.43 (SE +/- 0.13, N = 3)
-O3 -march=athlon64: 29.39 (SE +/- 0.04, N = 3)
-O3 -march=athlon64-sse3: 29.53 (SE +/- 0.05, N = 3)
-O2 -march=native: 34.22 (SE +/- 0.12, N = 3)
-O3 -march=native: 18.00 (SE +/- 0.03, N = 3)
-O3 -march=native -flto: 17.85 (SE +/- 0.05, N = 3)
PGO: 17.96 (SE +/- 0.02, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 4024.62
(CC) gcc options: -lm -lpthread -O3; one result is additionally annotated -march=native.
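To put the C-Ray timings in relative terms, here is a small sketch that converts "fewer is better" times into a speedup factor against a chosen baseline. It assumes the results are listed in the same order as the configuration legend, which is how the figures were matched to flags here. The helper names are hypothetical, and only four of the configurations are included.

```python
def speedup(baseline_seconds, candidate_seconds):
    """How many times faster the candidate is when lower times are better."""
    if baseline_seconds <= 0 or candidate_seconds <= 0:
        raise ValueError("times must be positive")
    return baseline_seconds / candidate_seconds


def percent_time_saved(baseline_seconds, candidate_seconds):
    """Share of the baseline run time that the candidate avoids, in percent."""
    if baseline_seconds <= 0:
        raise ValueError("baseline time must be positive")
    return 100.0 * (baseline_seconds - candidate_seconds) / baseline_seconds


# C-Ray total times in seconds, matched to flags by legend order (an assumption).
c_ray_seconds = {
    "-O2 -march=athlon64": 45.43,
    "-O3 -march=athlon64": 29.39,
    "-O2 -march=native": 34.22,
    "-O3 -march=native": 18.00,
}

baseline = c_ray_seconds["-O2 -march=athlon64"]
for config, seconds in c_ray_seconds.items():
    factor = speedup(baseline, seconds)
    saved = percent_time_saved(baseline, seconds)
    print(f"{config}: {factor:.2f}x vs baseline, {saved:.1f}% less time")
```

For throughput-style results where more is better, the ratio would be inverted (candidate divided by baseline).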
Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library.

Smallpt 1.0, Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better)
-O2 -march=athlon64: 4.57 (SE +/- 0.03, N = 3)
-O3 -march=athlon64: 4.51 (SE +/- 0.03, N = 3)
-O3 -march=athlon64-sse3: 4.54 (SE +/- 0.02, N = 3)
-O2 -march=native: 3.89 (SE +/- 0.01, N = 3)
-O3 -march=native: 3.87 (SE +/- 0.03, N = 3)
-O3 -march=native -flto: 3.85 (SE +/- 0.01, N = 3)
PGO: 3.83 (SE +/- 0.04, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 564.07
(CXX) g++ options: -fopenmp -O3; one result is additionally annotated -march=native.
AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048.

AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
-O2 -march=athlon64: 47.28 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 45.20 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 44.98 (SE +/- 0.00, N = 3)
-O2 -march=native: 42.34 (SE +/- 0.03, N = 3)
-O3 -march=native: 39.11 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 39.92 (SE +/- 0.03, N = 3)
PGO: 39.10 (SE +/- 0.01, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 43.36
(CC) gcc options: -lm -O3
Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine.

Bullet Physics Engine 2.81, Test: Raytests (Seconds, Fewer Is Better)
-O2 -march=athlon64: 2.49 (SE +/- 0.02, N = 3)
-O3 -march=athlon64: 2.46 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 2.46 (SE +/- 0.00, N = 3)
-O2 -march=native: 2.38 (SE +/- 0.00, N = 3)
-O3 -march=native: 2.35 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 2.30 (SE +/- 0.00, N = 3)
PGO: 2.34 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
-O2 -march=athlon64: 3.96 (SE +/- 0.03, N = 3)
-O3 -march=athlon64: 3.92 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 3.91 (SE +/- 0.00, N = 3)
-O2 -march=native: 3.95 (SE +/- 0.02, N = 3)
-O3 -march=native: 3.84 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 3.97 (SE +/- 0.01, N = 3)
PGO: 3.83 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
-O2 -march=athlon64: 4.68 (SE +/- 0.04, N = 3)
-O3 -march=athlon64: 4.65 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 4.62 (SE +/- 0.01, N = 3)
-O2 -march=native: 4.57 (SE +/- 0.03, N = 3)
-O3 -march=native: 4.42 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 4.82 (SE +/- 0.01, N = 3)
PGO: 4.41 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
-O2 -march=athlon64: 4.50 (SE +/- 0.04, N = 3)
-O3 -march=athlon64: 4.45 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 4.45 (SE +/- 0.01, N = 3)
-O2 -march=native: 4.06 (SE +/- 0.01, N = 3)
-O3 -march=native: 3.97 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 3.90 (SE +/- 0.00, N = 3)
PGO: 3.97 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
-O2 -march=athlon64: 2.42 (SE +/- 0.03, N = 3)
-O3 -march=athlon64: 2.36 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 2.35 (SE +/- 0.00, N = 3)
-O2 -march=native: 2.39 (SE +/- 0.03, N = 3)
-O3 -march=native: 2.30 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 2.44 (SE +/- 0.00, N = 3)
PGO: 2.32 (SE +/- 0.01, N = 3)

Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
-O2 -march=athlon64: 0.87 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 0.86 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 0.86 (SE +/- 0.00, N = 3)
-O2 -march=native: 0.84 (SE +/- 0.00, N = 3)
-O3 -march=native: 0.84 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 0.82 (SE +/- 0.00, N = 3)
PGO: 0.83 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: Convex Trimesh (Seconds, Fewer Is Better)
-O2 -march=athlon64: 1.07 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 1.06 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 1.06 (SE +/- 0.00, N = 3)
-O2 -march=native: 1.02 (SE +/- 0.00, N = 3)
-O3 -march=native: 1.00 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 0.97 (SE +/- 0.00, N = 3)
PGO: 1.00 (SE +/- 0.00, N = 3)

Bullet Physics Engine builds: (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU
XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression.

XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 26.75 (SE +/- 0.21, N = 15)
-O3 -march=athlon64: 26.50 (SE +/- 0.27, N = 3)
-O3 -march=athlon64-sse3: 25.93 (SE +/- 0.22, N = 12)
-O2 -march=native: 26.78 (SE +/- 0.10, N = 3)
-O3 -march=native: 25.70 (SE +/- 0.26, N = 15)
-O3 -march=native -flto: 25.62 (SE +/- 0.23, N = 15)
PGO: 26.09 (SE +/- 0.07, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 708.15
(CC) gcc options: -pthread -fvisibility=hidden; one result is additionally annotated -O3 -march=native.
Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression.

Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 18.02 (SE +/- 0.60, N = 15)
-O3 -march=athlon64: 17.74 (SE +/- 0.69, N = 12)
-O3 -march=athlon64-sse3: 18.38 (SE +/- 0.66, N = 15)
-O2 -march=native: 19.31 (SE +/- 0.61, N = 15)
-O3 -march=native: 19.09 (SE +/- 0.65, N = 15)
-O3 -march=native -flto: 17.94 (SE +/- 0.66, N = 15)
PGO: 17.36 (SE +/- 0.45, N = 15)
AMD Ryzen Threadripper 2990WX 32-Core: 265.79
(CC) gcc options: -pthread -lz -llzma; one result is additionally annotated -O3 -march=native.
FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times.

FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better)
-O2 -march=athlon64: 15.58 (SE +/- 0.01, N = 5)
-O3 -march=athlon64: 15.48 (SE +/- 0.04, N = 5)
-O3 -march=athlon64-sse3: 15.43 (SE +/- 0.01, N = 5)
-O2 -march=native: 9.78 (SE +/- 0.05, N = 5)
-O3 -march=native: 9.53 (SE +/- 0.01, N = 5)
-O3 -march=native -flto: 9.48 (SE +/- 0.06, N = 5)
PGO: 9.44 (SE +/- 0.00, N = 5)
AMD Ryzen Threadripper 2990WX 32-Core: 9.81
(CXX) g++ options: -fvisibility=hidden -lm
LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format.

LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)
-O2 -march=athlon64: 11.41 (SE +/- 0.04, N = 3)
-O3 -march=athlon64: 9.43 (SE +/- 0.01, N = 3)
-O3 -march=athlon64-sse3: 9.36 (SE +/- 0.00, N = 3)
-O2 -march=native: 10.80 (SE +/- 0.06, N = 3)
-O3 -march=native: 8.00 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 7.98 (SE +/- 0.00, N = 3)
PGO: 7.98 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 11.09
(CC) gcc options: -lm
CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks.

CppPerformanceBenchmarks 9, Test: Atol (Seconds, Fewer Is Better)
-O2 -march=athlon64: 69.08 (SE +/- 0.05, N = 3)
-O3 -march=athlon64: 68.37 (SE +/- 0.39, N = 3)
-O3 -march=athlon64-sse3: 68.42 (SE +/- 0.38, N = 3)
-O2 -march=native: 69.80 (SE +/- 0.13, N = 3)
-O3 -march=native: 69.03 (SE +/- 0.05, N = 3)
-O3 -march=native -flto: 69.25 (SE +/- 0.07, N = 3)
PGO: 69.31 (SE +/- 0.17, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 69.25

CppPerformanceBenchmarks 9, Test: Ctype (Seconds, Fewer Is Better)
-O2 -march=athlon64: 33.37 (SE +/- 0.00, N = 3)
-O3 -march=athlon64: 33.28 (SE +/- 0.02, N = 3)
-O3 -march=athlon64-sse3: 33.31 (SE +/- 0.02, N = 3)
-O2 -march=native: 34.50 (SE +/- 0.08, N = 3)
-O3 -march=native: 33.88 (SE +/- 0.00, N = 3)
-O3 -march=native -flto: 32.32 (SE +/- 0.01, N = 3)
PGO: 34.02 (SE +/- 0.01, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 37.67

CppPerformanceBenchmarks 9, Test: Math Library (Seconds, Fewer Is Better)
-O2 -march=athlon64: 399 (SE +/- 0.87, N = 3)
-O3 -march=athlon64: 398 (SE +/- 0.79, N = 3)
-O3 -march=athlon64-sse3: 399 (SE +/- 0.83, N = 3)
-O2 -march=native: 356 (SE +/- 1.79, N = 3)
-O3 -march=native: 351 (SE +/- 0.11, N = 3)
-O3 -march=native -flto: 352 (SE +/- 0.73, N = 3)
PGO: 353 (SE +/- 0.93, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 355

CppPerformanceBenchmarks 9, Test: Random Numbers (Seconds, Fewer Is Better)
-O2 -march=athlon64: 1072 (SE +/- 0.07, N = 3)
-O3 -march=athlon64: 1070 (SE +/- 0.71, N = 3)
-O3 -march=athlon64-sse3: 1055 (SE +/- 0.04, N = 3)
-O2 -march=native: 1041 (SE +/- 4.70, N = 3)
-O3 -march=native: 1023 (SE +/- 0.06, N = 3)
-O3 -march=native -flto: 1011 (SE +/- 0.05, N = 3)
PGO: 1027 (SE +/- 0.03, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 1154

CppPerformanceBenchmarks 9, Test: Stepanov Vector (Seconds, Fewer Is Better)
-O2 -march=athlon64: 75.79 (SE +/- 0.05, N = 3)
-O3 -march=athlon64: 75.53 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 75.56 (SE +/- 0.01, N = 3)
-O2 -march=native: 75.99 (SE +/- 0.08, N = 3)
-O3 -march=native: 74.93 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 74.92 (SE +/- 0.01, N = 3)
PGO: 75.27 (SE +/- 0.02, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 78.14

CppPerformanceBenchmarks 9, Test: Function Objects (Seconds, Fewer Is Better)
-O2 -march=athlon64: 16.20 (SE +/- 0.01, N = 3)
-O3 -march=athlon64: 16.17 (SE +/- 0.02, N = 3)
-O3 -march=athlon64-sse3: 16.14 (SE +/- 0.01, N = 3)
-O2 -march=native: 15.61 (SE +/- 0.01, N = 3)
-O3 -march=native: 15.40 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 15.50 (SE +/- 0.00, N = 3)
PGO: 15.46 (SE +/- 0.00, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 22.07

CppPerformanceBenchmarks 9, Test: Stepanov Abstraction (Seconds, Fewer Is Better)
-O2 -march=athlon64: 28.33 (SE +/- 0.02, N = 3)
-O3 -march=athlon64: 28.26 (SE +/- 0.00, N = 3)
-O3 -march=athlon64-sse3: 28.28 (SE +/- 0.01, N = 3)
-O2 -march=native: 28.69 (SE +/- 0.13, N = 3)
-O3 -march=native: 28.22 (SE +/- 0.01, N = 3)
-O3 -march=native -flto: 28.25 (SE +/- 0.00, N = 3)
PGO: 28.36 (SE +/- 0.02, N = 3)
AMD Ryzen Threadripper 2990WX 32-Core: 29.08

CppPerformanceBenchmarks builds: (CXX) g++ options: -std=c++11 -O3
Environment notes and test start times for each configuration:
-O2 -march=athlon64: CXXFLAGS="-O2 -march=athlon64" CFLAGS="-O2 -march=athlon64". Testing initiated at 11 May 2019 17:06 by user phoronix.
-O3 -march=athlon64: CXXFLAGS="-O3 -march=athlon64" CFLAGS="-O3 -march=athlon64". Testing initiated at 9 May 2019 13:16 by user phoronix.
-O3 -march=athlon64-sse3: CXXFLAGS="-O3 -march=athlon64-sse3" CFLAGS="-O3 -march=athlon64-sse3". Testing initiated at 9 May 2019 08:33 by user phoronix.
-O2 -march=native: CXXFLAGS="-O2 -march=native" CFLAGS="-O2 -march=native". Testing initiated at 10 May 2019 06:28 by user phoronix.
-O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native". Testing initiated at 8 May 2019 19:49 by user phoronix.
-O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto". Testing initiated at 9 May 2019 21:12 by user phoronix.
PGO: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native". Testing initiated at 12 May 2019 09:23 by user phoronix.
AMD Ryzen Threadripper 2990WX 32-Core: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native". Testing initiated at 12 May 2019 14:52 by user phoronix.

Notes shared by all configurations:
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq ondemand
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

AMD Ryzen Threadripper 2990WX 32-Core
Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads), Motherboard: ASUS ROG ZENITH EXTREME (1701 BIOS), Chipset: AMD 17h, Memory: 32768MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: AMD Radeon RX 64 8GB (1590/800MHz), Audio: Realtek ALC1220, Monitor: ASUS VP28U, Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad
OS: Ubuntu 18.04, Kernel: 4.18.0-18-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: amdgpu 18.1.0, OpenGL: 4.5 Mesa 18.2.8 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160
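The environment notes above set CFLAGS and CXXFLAGS to the same flag string for each configuration. As a rough sketch of how one might reproduce that setup when driving a build from a script, the helper below copies an environment and sets both variables. The function names and the idea of invoking an arbitrary build command are assumptions for illustration. The source does not show how the builds were actually launched, and the PGO configuration would need additional profiling steps that are not covered here.

```python
import os
import subprocess

# Flag strings for the non-PGO configurations named in the result listing.
CONFIGS = [
    "-O2 -march=athlon64",
    "-O3 -march=athlon64",
    "-O3 -march=athlon64-sse3",
    "-O2 -march=native",
    "-O3 -march=native",
    "-O3 -march=native -flto",
]


def flags_env(flags, base=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set to flags."""
    env = dict(os.environ if base is None else base)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


def run_build(command, flags):
    """Run a build command (hypothetical, e.g. ["make"]) under the given flags."""
    return subprocess.run(command, env=flags_env(flags), check=True)


# Show what each configuration would export; no build is started here.
for flags in CONFIGS:
    env = flags_env(flags, base={})
    print(f'CFLAGS="{env["CFLAGS"]}" CXXFLAGS="{env["CXXFLAGS"]}"')
```

Keeping the two variables identical mirrors the notes in this result file; a project whose build system ignores these variables would need its own mechanism for passing the flags.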