Threadripper 2990WX Thread Scaling

AMD Ryzen Threadripper 2990WX 32-Core testing with different core/thread counts. Benchmarks by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1808163-RA-TR2SCALIN95&gru&sro.

Threadripper 2990WX Thread ScalingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 ThreadsAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (1 Core)ASUS ROG ZENITH EXTREME (0064 BIOS)AMD Family 17h32768MBSamsung SSD 970 EVO 500GBAMD Radeon RX Vega 8192MBRealtek ALC1220DELL P2415QIntel I211 Gigabit Connection + Qualcomm Atheros QCA6174 802.11ac WirelessUbuntu 18.044.18.0-999-generic (x86_64) 20180805GNOME Shell 3.28.2X Server 1.19.6amdgpu 18.0.14.5 Mesa 18.0.5 (LLVM 6.0.0)GCC 7.3.0ext43840x2160AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (2 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (4 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (8 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (16 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores)AMD Radeon RX Vega 8064MBAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)AMD Radeon RX Vega 8192MBOpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemandGraphics Details- GLAMORSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp Protection

Threadripper 2990WX Thread Scalingsysbench: CPUindigobench: Bedroomindigobench: Supercarstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthjohn-the-ripper: Blowfishebizzy: openssl: RSA 4096-bit Performancenpb: EP.Cbuild-linux-kernel: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timem-queens: Time To Solveblender: Barbershop - CPU-Onlyv-ray: CPU1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads2065.060.110.2523417642589854166090996212.6742.08625.13655.74538.04892.50761.834136.770.230.51464735151533693306177694422.7384.13320.88327.35267.95447.21381.557880.260.430.97896390696729816036343462811.33160.75171.22172.28141.32235.563276.02204.0415350.930.781.74165453211805550188116710581582.67313.3292.0388.3173.36121.781801.18108.6029095.431.433.1629758991329556231481011452953002.27594.5054.3046.5939.1865.30982.1660.2653346.192.575.6753378671590586633091115554725514.601092.6535.2525.5622.5136.34553.2435.9056727.783.076.5666760558767280574468213677175869.632160.7531.7224.6317.3824.79402.2530.75OpenBenchmarking.org

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads12K24K36K48K60KSE +/- 0.87, N = 3SE +/- 8.26, N = 3SE +/- 0.73, N = 3SE +/- 32.79, N = 3SE +/- 1.58, N = 3SE +/- 65.28, N = 3SE +/- 5.20, N = 32065.0629095.434136.7753346.197880.2656727.7815350.931. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

IndigoBench

Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Bedroom1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads0.69081.38162.07242.76323.454SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 6SE +/- 0.00, N = 30.111.430.232.570.433.070.78

IndigoBench

Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Supercar1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.20, N = 6SE +/- 0.00, N = 30.253.160.515.670.976.561.74

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads14M28M42M56M70MSE +/- 2751.54, N = 3SE +/- 86990.41, N = 3SE +/- 21923.77, N = 3SE +/- 217995.10, N = 3SE +/- 71881.58, N = 3SE +/- 651663.27, N = 3SE +/- 86158.77, N = 3234176429758991464735153378671896390666760558165453211. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2017-09-191024 Hash Memory, 26 Depth1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads16M32M48M64M80MSE +/- 619.33, N = 3SE +/- 313307.04, N = 3SE +/- 44526.21, N = 3SE +/- 116017.90, N = 3SE +/- 67053.30, N = 3SE +/- 298662.02, N = 3SE +/- 133076.25, N = 325898543295562351533695905866396729817672805718055501

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Blowfish1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads10K20K30K40K50KSE +/- 18.33, N = 3SE +/- 14.17, N = 3SE +/- 14.73, N = 3SE +/- 565.26, N = 3SE +/- 74.63, N = 3SE +/- 245.29, N = 3SE +/- 372.22, N = 616601481033063091160364468288111. (CC) gcc options: -lssl -lcrypto -fopenmp -pthread -lm -lz -ldl -lcrypt -lbz2

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.31 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads300K600K900K1200K1500KSE +/- 250.34, N = 3SE +/- 1118.92, N = 3SE +/- 3326.82, N = 3SE +/- 22021.37, N = 6SE +/- 282.86, N = 3SE +/- 24098.90, N = 6SE +/- 738.51, N = 3909961145295177694155547234346213677176710581. (CC) gcc options: -pthread -lpthread -O3 -march=native

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit Performance1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads13002600390052006500SE +/- 0.09, N = 3SE +/- 0.23, N = 3SE +/- 3.02, N = 3SE +/- 2.44, N = 3SE +/- 0.24, N = 3SE +/- 1.87, N = 3SE +/- 0.38, N = 3212.673002.27422.735514.60811.335869.631582.671. (CC) gcc options: -O3 -pthread -m64 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.C1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads5001000150020002500SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.29, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 1.71, N = 3SE +/- 0.09, N = 342.08594.5084.131092.65160.752160.75313.321. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.13Time To Compile1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads130260390520650SE +/- 0.88, N = 3SE +/- 0.87, N = 3SE +/- 0.79, N = 3SE +/- 0.47, N = 6SE +/- 0.92, N = 3SE +/- 0.43, N = 6SE +/- 0.98, N = 3625.1354.30320.8835.25171.2231.7292.03

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads140280420560700SE +/- 0.65, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3655.7446.59327.3525.56172.2824.6388.311. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Time1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads120240360480600SE +/- 0.57, N = 3SE +/- 0.02, N = 3SE +/- 0.46, N = 3SE +/- 0.15, N = 3SE +/- 0.63, N = 3SE +/- 0.05, N = 3SE +/- 0.43, N = 3538.0439.18267.9522.51141.3217.3873.361. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To Solve1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads2004006008001000SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.22, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3892.5065.30447.2136.34235.5624.79121.781. (CXX) g++ options: -fopenmp -O2 -march=native

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-Only16 Threads32 Threads4 Threads64 Threads8 Threads7001400210028003500982.16553.243276.02402.251801.18

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CPU1 Thread16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads160320480640800SE +/- 1.89, N = 3SE +/- 0.16, N = 3SE +/- 1.08, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.34, N = 3761.8360.26381.5535.90204.0430.75108.60


Phoronix Test Suite v10.8.4