machine learning threadripper

AMD Ryzen Threadripper 3970X 32-Core testing with a ASUS ROG ZENITH II EXTREME (0702 BIOS) and AMD Radeon VII 16GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2002242-PTS-MACHINEL24&grt.

machine learning threadripperProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-CoreAMD Ryzen Threadripper 3970X 32-Core @ 3.70GHz (32 Cores / 64 Threads)ASUS ROG ZENITH II EXTREME (0702 BIOS)AMD Starship/Matisse64GB1000GB Force MP600AMD Radeon VII 16GB (1802/1001MHz)AMD Vega 20 HDMI AudioASUS MG28UAquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723Ubuntu 19.105.3.0-40-generic (x86_64)GNOME Shell 3.34.1X Server 1.20.5amdgpu 19.0.14.5 Mesa 19.2.8 (LLVM 9.0.0)GCC 9.2.1 20191008ext43840x2160OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025- GLAMOR- Python 2.7.17 + Python 3.7.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected

machine learning threadripperlczero: BLASmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: IP Batch 1D - u8s8f32mkl-dnn: IP Batch All - u8s8f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Convolution Batch conv_3d - u8s8f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Convolution Batch conv_all - u8s8f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32mkl-dnn: Recurrent Neural Network Training - f32mkl-dnn: Convolution Batch conv_alexnet - u8s8f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32mlpack: scikit_icamlpack: scikit_qdamlpack: scikit_svmmlpack: scikit_linearridgeregressionnumpy: rbenchmark: scikit-learn: AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core2193.941.6737011.088820.9272130.0084.86201783.2533771.871.845962.6715599.282012473.01272.241391.812331.37194.6591380.1442.3428692.15445.2540.8211.481.44418.580.28377.904OpenBenchmarking.org

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.23.2Backend: BLASAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core5001000150020002500SE +/- 28.97, N = 152193.941. (CXX) g++ options: -lpthread

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core0.37660.75321.12981.50641.883SE +/- 0.00773, N = 31.67370MIN: 1.591. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch All - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core3691215SE +/- 0.05, N = 311.09MIN: 10.51. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core510152025SE +/- 0.03, N = 320.93MIN: 20.191. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch All - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch All - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core306090120150SE +/- 0.18, N = 3130.01MIN: 126.31. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_3d - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core1.0942.1883.2824.3765.47SE +/- 0.00956, N = 34.86201MIN: 4.691. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_all - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core2004006008001000SE +/- 4.73, N = 3783.25MIN: 771.681. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_3d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_3d - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core8001600240032004000SE +/- 3.72, N = 33771.87MIN: 3759.531. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core0.41530.83061.24591.66122.0765SE +/- 0.00210, N = 31.84596MIN: 1.81. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_3d - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core0.60111.20221.80332.40443.0055SE +/- 0.02056, N = 32.67155MIN: 2.581. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core20406080100SE +/- 0.60, N = 399.28MIN: 97.331. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_all - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_all - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core3K6K9K12K15KSE +/- 7.92, N = 312473.0MIN: 123311. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_all - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core30060090012001500SE +/- 3.81, N = 31272.24MIN: 1250.511. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core30060090012001500SE +/- 0.99, N = 31391.81MIN: 1385.921. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core5001000150020002500SE +/- 14.76, N = 32331.37MIN: 2300.361. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Recurrent Neural Network Training - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Recurrent Neural Network Training - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core4080120160200SE +/- 1.39, N = 14194.66MIN: 190.331. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core30060090012001500SE +/- 4.23, N = 31380.14MIN: 1356.841. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core1020304050SE +/- 0.28, N = 342.34MIN: 41.481. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core150300450600750SE +/- 2.69, N = 2692.15MIN: 675.461. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

Mlpack Benchmark

Benchmark: scikit_ica

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_icaAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core1020304050SE +/- 0.06, N = 345.25

Mlpack Benchmark

Benchmark: scikit_qda

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qdaAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core918273645SE +/- 0.42, N = 340.82

Mlpack Benchmark

Benchmark: scikit_svm

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svmAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core3691215SE +/- 0.03, N = 311.48

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregressionAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core0.3240.6480.9721.2961.62SE +/- 0.02, N = 31.44

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core90180270360450SE +/- 2.37, N = 3418.58

R Benchmark

OpenBenchmarking.orgSeconds, Fewer Is BetterR BenchmarkAMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core0.06380.12760.19140.25520.319SE +/- 0.0011, N = 30.28371. R scripting front-end version 3.6.1 (2019-07-05)

Scikit-Learn

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 0.22.1AMD Radeon VII - AMD Ryzen Threadripper 3970X 32-Core246810SE +/- 0.015, N = 37.904


Phoronix Test Suite v10.8.4