3900XT new AMD Ryzen 9 3900XT 12-Core testing with a MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.94 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009306-PTS-3900XTNE63&rdt&grr .
3900XT new Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 9 3900XT 12-Core @ 3.80GHz (12 Cores / 24 Threads) MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.94 BIOS) AMD Starship/Matisse 16GB 500GB Seagate FireCuda 520 SSD ZP500GM30002 AMD Radeon RX 56/64 8GB (1630/945MHz) AMD Vega 10 HDMI Audio DELL P2415Q Realtek Device 2600 + Realtek Device 3000 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921 GNOME Shell 3.36.4 X Server 1.20.8 amdgpu 19.1.0 4.6 Mesa 20.3.0-devel (git-31f75aa 2020-08-28 focal-oibaf-ppa) (LLVM 10.0.1) GCC 9.3.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701021 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3900XT new caffe: GoogleNet - CPU - 1000 lczero: Eigen caffe: AlexNet - CPU - 1000 lczero: BLAS lczero: Rand hint: FLOAT caffe: GoogleNet - CPU - 200 gromacs: Water Benchmark caffe: GoogleNet - CPU - 100 byte: Dhrystone 2 mlpack: scikit_qda hmmer: Pfam Database Search caffe: AlexNet - CPU - 200 couchdb: 100 - 1000 - 24 keydb: sockperf: Latency Under Load caffe: AlexNet - CPU - 100 mlpack: scikit_ica mlpack: scikit_linearridgeregression mlpack: scikit_svm sockperf: Throughput dolfyn: Computational Fluid Dynamics sockperf: Latency Ping Pong mafft: Multiple Sequence Alignment - LSU RNA ffte: N=256, 3D Complex FFT Routine 1 2 3 1284480 523 528137 510 227059 407906402.25320 258801 1.135 129203 45808420.1 68.44 108.104 105960 90.139 608002.72 15.212 52869 50.52 1.98 19.05 710528 15.546 3.661 8.724 36875.652239711 1286650 530 529184 523 226411 405600052.70625 258640 1.129 129793 45364247.3 67.80 108.294 106125 90.736 604670.43 13.285 53080 50.86 1.97 18.94 695133 15.197 3.637 8.889 35829.728846853 1285700 518 530599 503 225691 407175465.36738 258885 1.132 129288 44411668.6 68.68 108.706 106244 93.564 604177.49 13.922 53105 50.49 2.01 19.03 699390 15.302 3.628 8.876 35762.100933593 OpenBenchmarking.org
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 1 2 3 300K 600K 900K 1200K 1500K SE +/- 2177.02, N = 3 SE +/- 1680.90, N = 3 SE +/- 2589.31, N = 3 1284480 1286650 1285700 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Eigen 1 2 3 110 220 330 440 550 SE +/- 5.33, N = 9 SE +/- 3.28, N = 3 523 530 518 1. (CXX) g++ options: -flto -pthread
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 1 2 3 110K 220K 330K 440K 550K SE +/- 449.04, N = 3 SE +/- 436.12, N = 3 SE +/- 599.92, N = 3 528137 529184 530599 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: BLAS 1 2 3 110 220 330 440 550 SE +/- 5.84, N = 3 SE +/- 7.55, N = 4 SE +/- 6.33, N = 3 510 523 503 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Random OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: Random 1 2 3 50K 100K 150K 200K 250K SE +/- 275.62, N = 3 SE +/- 207.81, N = 3 SE +/- 939.75, N = 3 227059 226411 225691 1. (CXX) g++ options: -flto -pthread
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT 1 2 3 90M 180M 270M 360M 450M SE +/- 4471620.37, N = 7 SE +/- 5128941.40, N = 5 SE +/- 929009.38, N = 3 407906402.25 405600052.71 407175465.37 1. (CC) gcc options: -O3 -march=native -lm
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 3 60K 120K 180K 240K 300K SE +/- 324.47, N = 3 SE +/- 373.68, N = 3 SE +/- 958.11, N = 3 258801 258640 258885 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
GROMACS Water Benchmark OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2020.3 Water Benchmark 1 2 3 0.2554 0.5108 0.7662 1.0216 1.277 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 1.135 1.129 1.132 1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 3 30K 60K 90K 120K 150K SE +/- 87.09, N = 3 SE +/- 179.48, N = 3 SE +/- 132.79, N = 3 129203 129793 129288 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 3.6 Computational Test: Dhrystone 2 1 2 3 10M 20M 30M 40M 50M SE +/- 358755.52, N = 3 SE +/- 196551.91, N = 3 SE +/- 302167.74, N = 3 45808420.1 45364247.3 44411668.6
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda 1 2 3 15 30 45 60 75 SE +/- 0.18, N = 3 SE +/- 0.07, N = 3 SE +/- 0.73, N = 3 68.44 67.80 68.68
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.27, N = 3 SE +/- 0.13, N = 3 108.10 108.29 108.71 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 3 20K 40K 60K 80K 100K SE +/- 161.11, N = 3 SE +/- 40.32, N = 3 SE +/- 195.80, N = 3 105960 106125 106244 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Apache CouchDB Bulk Size: 100 - Inserts: 1000 - Rounds: 24 OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.1.1 Bulk Size: 100 - Inserts: 1000 - Rounds: 24 1 2 3 20 40 60 80 100 SE +/- 0.59, N = 3 SE +/- 0.45, N = 3 SE +/- 1.23, N = 3 90.14 90.74 93.56 1. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 1 2 3 130K 260K 390K 520K 650K SE +/- 1075.08, N = 3 SE +/- 997.28, N = 3 SE +/- 1294.03, N = 3 608002.72 604670.43 604177.49 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Sockperf Test: Latency Under Load OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Under Load 1 2 3 4 8 12 16 20 SE +/- 0.67, N = 25 SE +/- 0.93, N = 25 SE +/- 0.77, N = 25 15.21 13.29 13.92 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 3 11K 22K 33K 44K 55K SE +/- 12.02, N = 3 SE +/- 61.41, N = 3 SE +/- 59.27, N = 3 52869 53080 53105 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica 1 2 3 11 22 33 44 55 SE +/- 0.86, N = 3 SE +/- 0.73, N = 3 SE +/- 0.37, N = 3 50.52 50.86 50.49
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression 1 2 3 0.4523 0.9046 1.3569 1.8092 2.2615 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1.98 1.97 2.01
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.21, N = 3 SE +/- 0.25, N = 3 19.05 18.94 19.03
Sockperf Test: Throughput OpenBenchmarking.org Messages Per Second, More Is Better Sockperf 3.4 Test: Throughput 1 2 3 150K 300K 450K 600K 750K SE +/- 7858.40, N = 7 SE +/- 2830.13, N = 5 SE +/- 6061.64, N = 5 710528 695133 699390 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 1 2 3 4 8 12 16 20 SE +/- 0.15, N = 3 SE +/- 0.12, N = 3 SE +/- 0.22, N = 3 15.55 15.20 15.30
Sockperf Test: Latency Ping Pong OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.4 Test: Latency Ping Pong 1 2 3 0.8237 1.6474 2.4711 3.2948 4.1185 SE +/- 0.045, N = 5 SE +/- 0.041, N = 5 SE +/- 0.042, N = 5 3.661 3.637 3.628 1. (CXX) g++ options: --param -O3 -rdynamic -ldl -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 2 4 6 8 10 SE +/- 0.035, N = 3 SE +/- 0.015, N = 3 SE +/- 0.051, N = 3 8.724 8.889 8.876 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 2 3 8K 16K 24K 32K 40K SE +/- 25.13, N = 3 SE +/- 57.85, N = 3 SE +/- 33.44, N = 3 36875.65 35829.73 35762.10 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Phoronix Test Suite v10.8.5