P620_Workstation_mixed

AMD Ryzen Threadripper PRO 3955WX 16-Cores testing with a LENOVO 1046 (S07KT45A BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2308027-DT9-P620WORK24&grs.

P620 Workstation mixed run1

Processor: AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads)
Motherboard: LENOVO 1046 (S07KT45A BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BL7 + 2000GB Seagate ST2000DM008-2UB1
Graphics: NVIDIA RTX A4000 16GB
Audio: NVIDIA Device 228b
Monitor: PHL 243V7
Network: Aquantia AQC107 NBase-T/IEEE
OS: Ubuntu 20.04
Kernel: 5.16.12-usb32-p620 (x86_64)
Desktop: GNOME Shell 3.36.9
Display Server: X Server 1.20.13
Display Driver: NVIDIA 525.125.06
OpenCL: OpenCL 3.0 CUDA 12.0.151
Vulkan: 1.3.224
Compiler: GCC 9.4.0 + CUDA 11.4
File-System: ext4
Screen Resolution: 1920x1080

- Transparent Huge Pages: madvise
- NVM_CD_FLAGS=
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- NONE / errors=remount-ro,relatime,rw / Block Size: 4096
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x830104d
- GPU Compute Cores: 6144
- OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu120.04.1)
- Python 3.8.10
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

P620 Workstation mixed run1

ai-benchmark: Device AI Score: 3451
ai-benchmark: Device Training Score: 1626
ai-benchmark: Device Inference Score: 1825
rocksdb: Rand Fill: 750783
memtier-benchmark: Redis - 100 - 1:1: 1798738.16
deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream (ms/batch): 98.5707
deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream (items/sec): 81.0613
sqlite-speedtest: Timed Time - Size 1,000: 62.223
daphne: OpenMP - Points2Image: 22101.677369479
redis: SET - 50: 1537423.62
redis: GET - 50: 2313152.67
memcached: 1:100: 3101333.67
arrayfire: Conjugate Gradient CPU: 28.53
arrayfire: BLAS CPU: 436.921
amg: 526662133
dolfyn: Computational Fluid Dynamics: 16.629
cloverleaf: Lagrangian-Eulerian Hydrodynamics: 51.54
sqlite: 32: 497.949
sqlite: 1: 97.844
cassandra: Mixed 1:1: 120637
askap: tConvolve MPI - Gridding: 7219.46
askap: tConvolve MPI - Degridding: 6844.50
dbench: 256: 3887.76
fio: Rand Read - Linux AIO - No - Yes - 1MB - 1 - /data: (no value in the export)

AI Benchmark Alpha

Device AI Score

AI Benchmark Alpha 0.1.2 (Score, More Is Better)
P620 Workstation mixed run1: 3451 (SE +/- 12.90, N = 3)

AI Benchmark Alpha

Device Training Score

AI Benchmark Alpha 0.1.2 (Score, More Is Better)
P620 Workstation mixed run1: 1626 (SE +/- 9.02, N = 3)

AI Benchmark Alpha

Device Inference Score

AI Benchmark Alpha 0.1.2 (Score, More Is Better)
P620 Workstation mixed run1: 1825 (SE +/- 5.13, N = 3)

RocksDB

Test: Random Fill

RocksDB 8.0 (Op/s, More Is Better)
P620 Workstation mixed run1: 750783 (SE +/- 1196.96, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
P620 Workstation mixed run1: 1798738.16 (SE +/- 20265.20, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (ms/batch, Fewer Is Better)
P620 Workstation mixed run1: 98.57 (SE +/- 0.47, N = 3)

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.5 (items/sec, More Is Better)
P620 Workstation mixed run1: 81.06 (SE +/- 0.40, N = 3)

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 (Seconds, Fewer Is Better)
P620 Workstation mixed run1: 62.22 (SE +/- 0.37, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

Darmstadt Automotive Parallel Heterogeneous Suite 2021.11.02 (Test Cases Per Minute, More Is Better)
P620 Workstation mixed run1: 22101.68 (SE +/- 157.14, N = 3)
1. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

Redis

Test: SET - Parallel Connections: 50

Redis 7.0.4 (Requests Per Second, More Is Better)
P620 Workstation mixed run1: 1537423.62 (SE +/- 5633.42, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET - Parallel Connections: 50

Redis 7.0.4 (Requests Per Second, More Is Better)
P620 Workstation mixed run1: 2313152.67 (SE +/- 683.48, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
P620 Workstation mixed run1: 3101333.67 (SE +/- 32684.97, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

ArrayFire

Test: Conjugate Gradient CPU

ArrayFire 3.7 (ms, Fewer Is Better)
P620 Workstation mixed run1: 28.53 (SE +/- 0.48, N = 3)
1. (CXX) g++ options: -rdynamic

ArrayFire

Test: BLAS CPU

ArrayFire 3.7 (GFLOPS, More Is Better)
P620 Workstation mixed run1: 436.92 (SE +/- 2.08, N = 3)
1. (CXX) g++ options: -rdynamic

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
P620 Workstation mixed run1: 526662133 (SE +/- 1372469.30, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

Dolfyn

Computational Fluid Dynamics

Dolfyn 0.527 (Seconds, Fewer Is Better)
P620 Workstation mixed run1: 16.63 (SE +/- 0.02, N = 3)

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

CloverLeaf (Seconds, Fewer Is Better)
P620 Workstation mixed run1: 51.54 (SE +/- 0.22, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

SQLite

Threads / Copies: 32

SQLite 3.41.2 (Seconds, Fewer Is Better)
P620 Workstation mixed run1: 497.95 (SE +/- 12.60, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

SQLite

Threads / Copies: 1

SQLite 3.41.2 (Seconds, Fewer Is Better)
P620 Workstation mixed run1: 97.84 (SE +/- 0.65, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

Apache Cassandra

Test: Mixed 1:1

Apache Cassandra 4.1.3 (Op/s, More Is Better)
P620 Workstation mixed run1: 120637 (SE +/- 5223.45, N = 3)

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 (Mpix/sec, More Is Better)
P620 Workstation mixed run1: 7219.46 (SE +/- 684.88, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 (Mpix/sec, More Is Better)
P620 Workstation mixed run1: 6844.50 (SE +/- 310.13, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Dbench

Client Count: 256

Dbench 4.0 (MB/s, More Is Better)
P620 Workstation mixed run1: 3887.76 (SE +/- 157.90, N = 3)
1. (CC) gcc options: -lpopt -O2
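
The "SE +/-" figures above are easier to compare across tests when expressed relative to the reported result. The following is a minimal sketch, assuming "SE" is the standard error over the N = 3 runs; the function name relative_se_percent and the selection of four tests are illustrative choices, and the numbers are copied from the results in this report.

```python
# Relative standard error (SE as a percentage of the reported result)
# for a few tests from this report. Values are copied from the results above.
# Assumption: "SE" is the standard error over the N = 3 runs.

results = {
    "ASKAP tConvolve MPI - Gridding (Mpix/sec)": (7219.46, 684.88),
    "Apache Cassandra Mixed 1:1 (Op/s)": (120637, 5223.45),
    "Dbench Client Count 256 (MB/s)": (3887.76, 157.90),
    "Redis GET - 50 (Requests Per Second)": (2313152.67, 683.48),
}


def relative_se_percent(value, se):
    """Return the standard error as a percentage of the reported value."""
    return 100.0 * se / value


# Rank tests from most to least run-to-run variation.
ranked = sorted(
    results.items(),
    key=lambda item: relative_se_percent(*item[1]),
    reverse=True,
)

for name, (value, se) in ranked:
    print(f"{name}: {relative_se_percent(value, se):.2f}%")
```

A high percentage suggests the three runs disagreed noticeably, so small differences against another system on that test should be read with caution.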


Phoronix Test Suite v10.8.5