Threadripper 2990WX Thread Scaling

AMD Ryzen Threadripper 2990WX 32-Core testing with different core/thread counts. Benchmarks by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1808163-RA-TR2SCALIN95.

Threadripper 2990WX Thread ScalingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 ThreadsAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (1 Core)ASUS ROG ZENITH EXTREME (0064 BIOS)AMD Family 17h32768MBSamsung SSD 970 EVO 500GBAMD Radeon RX Vega 8192MBRealtek ALC1220DELL P2415QIntel I211 Gigabit Connection + Qualcomm Atheros QCA6174 802.11ac WirelessUbuntu 18.044.18.0-999-generic (x86_64) 20180805GNOME Shell 3.28.2X Server 1.19.6amdgpu 18.0.14.5 Mesa 18.0.5 (LLVM 6.0.0)GCC 7.3.0ext43840x2160AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (2 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (4 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (8 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (16 Cores)AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores)AMD Radeon RX Vega 8064MBAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)AMD Radeon RX Vega 8192MBOpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemandGraphics Details- GLAMORSecurity Details- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp Protection

Threadripper 2990WX Thread Scalingindigobench: Bedroomindigobench: Supercarnpb: EP.Cjohn-the-ripper: Blowfishstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthebizzy: build-linux-kernel: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timem-queens: Time To Solveopenssl: RSA 4096-bit Performancesysbench: CPUblender: Barbershop - CPU-Onlyv-ray: CPU1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads0.110.2542.0816602341764258985490996625.13655.74538.04892.50212.672065.06761.830.230.5184.13330646473515153369177694320.88327.35267.95447.21422.734136.77381.550.430.97160.75603689639069672981343462171.22172.28141.32235.56811.337880.263276.02204.040.781.74313.328811165453211805550167105892.0388.3173.36121.781582.6715350.931801.18108.601.433.16594.50148102975899132955623114529554.3046.5939.1865.303002.2729095.43982.1660.262.575.671092.65309115337867159058663155547235.2525.5622.5136.345514.6053346.19553.2435.903.076.562160.75446826676055876728057136771731.7224.6317.3824.795869.6356727.78402.2530.75OpenBenchmarking.org

IndigoBench

Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Bedroom1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads0.69081.38162.07242.76323.454SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 60.110.230.430.781.432.573.07

IndigoBench

Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: Supercar1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 60.250.510.971.743.165.676.56

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.C1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads5001000150020002500SE +/- 0.12, N = 3SE +/- 0.29, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.17, N = 3SE +/- 1.71, N = 342.0884.13160.75313.32594.501092.652160.751. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0-jumbo-1Test: Blowfish1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads10K20K30K40K50KSE +/- 18.33, N = 3SE +/- 14.73, N = 3SE +/- 74.63, N = 3SE +/- 372.22, N = 6SE +/- 14.17, N = 3SE +/- 565.26, N = 3SE +/- 245.29, N = 316603306603688111481030911446821. (CC) gcc options: -lssl -lcrypto -fopenmp -pthread -lm -lz -ldl -lcrypt -lbz2

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads14M28M42M56M70MSE +/- 2751.54, N = 3SE +/- 21923.77, N = 3SE +/- 71881.58, N = 3SE +/- 86158.77, N = 3SE +/- 86990.41, N = 3SE +/- 217995.10, N = 3SE +/- 651663.27, N = 3234176446473518963906165453212975899153378671667605581. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2017-09-191024 Hash Memory, 26 Depth1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads16M32M48M64M80MSE +/- 619.33, N = 3SE +/- 44526.21, N = 3SE +/- 67053.30, N = 3SE +/- 133076.25, N = 3SE +/- 313307.04, N = 3SE +/- 116017.90, N = 3SE +/- 298662.02, N = 325898545153369967298118055501329556235905866376728057

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.31 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads300K600K900K1200K1500KSE +/- 250.34, N = 3SE +/- 3326.82, N = 3SE +/- 282.86, N = 3SE +/- 738.51, N = 3SE +/- 1118.92, N = 3SE +/- 22021.37, N = 6SE +/- 24098.90, N = 6909961776943434626710581145295155547213677171. (CC) gcc options: -pthread -lpthread -O3 -march=native

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.13Time To Compile1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads130260390520650SE +/- 0.88, N = 3SE +/- 0.79, N = 3SE +/- 0.92, N = 3SE +/- 0.98, N = 3SE +/- 0.87, N = 3SE +/- 0.47, N = 6SE +/- 0.43, N = 6625.13320.88171.2292.0354.3035.2531.72

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads140280420560700SE +/- 0.65, N = 3SE +/- 0.13, N = 3SE +/- 0.29, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3655.74327.35172.2888.3146.5925.5624.631. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Time1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads120240360480600SE +/- 0.57, N = 3SE +/- 0.46, N = 3SE +/- 0.63, N = 3SE +/- 0.43, N = 3SE +/- 0.02, N = 3SE +/- 0.15, N = 3SE +/- 0.05, N = 3538.04267.95141.3273.3639.1822.5117.381. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To Solve1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads2004006008001000SE +/- 0.27, N = 3SE +/- 0.22, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 3892.50447.21235.56121.7865.3036.3424.791. (CXX) g++ options: -fopenmp -O2 -march=native

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.0fRSA 4096-bit Performance1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads13002600390052006500SE +/- 0.09, N = 3SE +/- 3.02, N = 3SE +/- 0.24, N = 3SE +/- 0.38, N = 3SE +/- 0.23, N = 3SE +/- 2.44, N = 3SE +/- 1.87, N = 3212.67422.73811.331582.673002.275514.605869.631. (CC) gcc options: -O3 -pthread -m64 -lssl -lcrypto -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads12K24K36K48K60KSE +/- 0.87, N = 3SE +/- 0.73, N = 3SE +/- 1.58, N = 3SE +/- 5.20, N = 3SE +/- 8.26, N = 3SE +/- 32.79, N = 3SE +/- 65.28, N = 32065.064136.777880.2615350.9329095.4353346.1956727.781. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-Only4 Threads8 Threads16 Threads32 Threads64 Threads70014002100280035003276.021801.18982.16553.24402.25

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CPU1 Thread2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads160320480640800SE +/- 1.89, N = 3SE +/- 1.08, N = 3SE +/- 0.04, N = 3SE +/- 0.34, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 3761.83381.55204.04108.6060.2635.9030.75


Phoronix Test Suite v10.8.4