P620_Workstation_mixed AMD Ryzen Threadripper PRO 3955WX 16-Cores testing with a LENOVO 1046 (S07KT45A BIOS) and NVIDIA RTX A4000 16GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2308027-DT9-P620WORK24 .
P620_Workstation_mixed Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution P620 Workstation mixed run1 AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads) LENOVO 1046 (S07KT45A BIOS) AMD Starship/Matisse 128GB 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BL7 + 2000GB Seagate ST2000DM008-2UB1 NVIDIA RTX A4000 16GB NVIDIA Device 228b PHL 243V7 Aquantia AQC107 NBase-T/IEEE Ubuntu 20.04 5.16.12-usb32-p620 (x86_64) GNOME Shell 3.36.9 X Server 1.20.13 NVIDIA 525.125.06 OpenCL 3.0 CUDA 12.0.151 1.3.224 GCC 9.4.0 + CUDA 11.4 ext4 1920x1080 OpenBenchmarking.org - Transparent Huge Pages: madvise - NVM_CD_FLAGS= - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x830104d - GPU Compute Cores: 6144 - OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu120.04.1) - Python 3.8.10 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
P620_Workstation_mixed sqlite: 1 sqlite: 32 dbench: 256 cloverleaf: Lagrangian-Eulerian Hydrodynamics dolfyn: Computational Fluid Dynamics amg: arrayfire: BLAS CPU arrayfire: Conjugate Gradient CPU memcached: 1:100 redis: GET - 50 redis: SET - 50 askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding daphne: OpenMP - Points2Image sqlite-speedtest: Timed Time - Size 1,000 deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream memtier-benchmark: Redis - 100 - 1:1 cassandra: Mixed 1:1 rocksdb: Rand Fill ai-benchmark: Device Inference Score ai-benchmark: Device Training Score ai-benchmark: Device AI Score influxdb: 4 - 10000 - 2,5000,1 - 10000 P620 Workstation mixed run1 97.844 497.949 3887.76 51.54 16.629 526662133 436.921 28.53 3101333.67 2313152.67 1537423.62 6844.50 7219.46 22101.677369479 62.223 81.0613 98.5707 1798738.16 120637 750783 1825 1626 3451 OpenBenchmarking.org
SQLite Threads / Copies: 1 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 1 P620 Workstation mixed run1 20 40 60 80 100 SE +/- 0.65, N = 3 97.84 1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread
SQLite Threads / Copies: 32 OpenBenchmarking.org Seconds, Fewer Is Better SQLite 3.41.2 Threads / Copies: 32 P620 Workstation mixed run1 110 220 330 440 550 SE +/- 12.60, N = 3 497.95 1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread
Dbench Client Count: 256 OpenBenchmarking.org MB/s, More Is Better Dbench 4.0 Client Count: 256 P620 Workstation mixed run1 800 1600 2400 3200 4000 SE +/- 157.90, N = 3 3887.76 1. (CC) gcc options: -lpopt -O2
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics P620 Workstation mixed run1 12 24 36 48 60 SE +/- 0.22, N = 3 51.54 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics P620 Workstation mixed run1 4 8 12 16 20 SE +/- 0.02, N = 3 16.63
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 P620 Workstation mixed run1 110M 220M 330M 440M 550M SE +/- 1372469.30, N = 3 526662133 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
ArrayFire Test: BLAS CPU OpenBenchmarking.org GFLOPS, More Is Better ArrayFire 3.7 Test: BLAS CPU P620 Workstation mixed run1 90 180 270 360 450 SE +/- 2.08, N = 3 436.92 1. (CXX) g++ options: -rdynamic
ArrayFire Test: Conjugate Gradient CPU OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient CPU P620 Workstation mixed run1 7 14 21 28 35 SE +/- 0.48, N = 3 28.53 1. (CXX) g++ options: -rdynamic
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 P620 Workstation mixed run1 700K 1400K 2100K 2800K 3500K SE +/- 32684.97, N = 3 3101333.67 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis Test: GET - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 50 P620 Workstation mixed run1 500K 1000K 1500K 2000K 2500K SE +/- 683.48, N = 3 2313152.67 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: SET - Parallel Connections: 50 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 50 P620 Workstation mixed run1 300K 600K 900K 1200K 1500K SE +/- 5633.42, N = 3 1537423.62 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding P620 Workstation mixed run1 1500 3000 4500 6000 7500 SE +/- 310.13, N = 3 6844.50 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding P620 Workstation mixed run1 1500 3000 4500 6000 7500 SE +/- 684.88, N = 3 7219.46 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite 2021.11.02 Backend: OpenMP - Kernel: Points2Image P620 Workstation mixed run1 5K 10K 15K 20K 25K SE +/- 157.14, N = 3 22101.68 1. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 P620 Workstation mixed run1 14 28 42 56 70 SE +/- 0.37, N = 3 62.22 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream P620 Workstation mixed run1 20 40 60 80 100 SE +/- 0.40, N = 3 81.06
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream P620 Workstation mixed run1 20 40 60 80 100 SE +/- 0.47, N = 3 98.57
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 P620 Workstation mixed run1 400K 800K 1200K 1600K 2000K SE +/- 20265.20, N = 3 1798738.16 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Apache Cassandra Test: Mixed 1:1 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.1.3 Test: Mixed 1:1 P620 Workstation mixed run1 30K 60K 90K 120K 150K SE +/- 5223.45, N = 3 120637
RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Fill P620 Workstation mixed run1 160K 320K 480K 640K 800K SE +/- 1196.96, N = 3 750783 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
AI Benchmark Alpha Device Inference Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Inference Score P620 Workstation mixed run1 400 800 1200 1600 2000 SE +/- 5.13, N = 3 1825
AI Benchmark Alpha Device Training Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device Training Score P620 Workstation mixed run1 300 600 900 1200 1500 SE +/- 9.02, N = 3 1626
AI Benchmark Alpha Device AI Score OpenBenchmarking.org Score, More Is Better AI Benchmark Alpha 0.1.2 Device AI Score P620 Workstation mixed run1 700 1400 2100 2800 3500 SE +/- 12.90, N = 3 3451
Phoronix Test Suite v10.8.5