3950x dec AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VII HERO (WI-FI) (3103 BIOS) and Sapphire AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012157-HA-3950XDEC971&grr&rdt .
3950x dec Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VII HERO (WI-FI) (3103 BIOS) AMD Starship/Matisse 16GB Samsung SSD 970 EVO 250GB Sapphire AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1260/1750MHz) AMD Ellesmere HDMI Audio VA2431 Intel I211 + Realtek RTL8822BE 802.11a/b/g/n/ac Ubuntu 20.04 5.9.0-050900rc6daily20200922-generic (x86_64) 20200921 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.8 (LLVM 10.0.0) 1.2.128 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3950x dec build-clash: Time To Compile brl-cad: VGR Performance Metric asmfish: 1024 Hash Memory, 26 Depth hmmer: Pfam Database Search onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU node-web-tooling: onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU build-ffmpeg: Time To Compile phpbench: PHP Benchmark Suite coremark: CoreMark Size 666 - Iterations Per Second onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: IP Shapes 3D - f32 - CPU mafft: Multiple Sequence Alignment - LSU RNA onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU 1 2 3 366.192 249006 51966295 115.161 5327.85 5304.22 5311.92 2710.91 2733.87 2710.37 11.96 4.71895 34.330 705881 713404.983064 2.89332 4.70277 1.35280 0.714224 1.47960 0.790724 11.4339 8.858 23.2275 20.6660 2.65751 4.38482 367.003 249959 51820731 115.181 5323.43 5316.18 5322.96 2711.83 2706.16 2709.23 11.93 4.23541 34.755 693793 714327.500795 2.89290 4.71650 1.35161 0.712063 1.48291 0.786084 11.4350 8.863 23.2799 20.6996 2.65509 4.36270 366.172 250105 51975038 115.308 5297.88 5282.14 5279.63 2721.00 2714.91 2716.59 11.97 4.26586 34.189 695212 716302.447971 2.89103 4.70463 1.35269 0.718305 1.47932 0.784875 11.4372 8.837 23.2590 20.7479 2.65554 4.37791 OpenBenchmarking.org
Timed Clash Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Clash Compilation Time To Compile 1 2 3 80 160 240 320 400 SE +/- 0.54, N = 3 SE +/- 0.52, N = 3 SE +/- 1.11, N = 3 366.19 367.00 366.17
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 3 50K 100K 150K 200K 250K 249006 249959 250105 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 1 2 3 11M 22M 33M 44M 55M SE +/- 324144.28, N = 3 SE +/- 254669.84, N = 3 SE +/- 601509.90, N = 3 51966295 51820731 51975038
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 115.16 115.18 115.31 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 39.67, N = 3 SE +/- 15.09, N = 3 SE +/- 20.74, N = 3 5327.85 5323.43 5297.88 MIN: 5237.75 MIN: 5262.63 MIN: 5226.98 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 18.58, N = 3 SE +/- 16.28, N = 3 SE +/- 6.44, N = 3 5304.22 5316.18 5282.14 MIN: 5245.38 MIN: 5237.81 MIN: 5224.5 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 1100 2200 3300 4400 5500 SE +/- 16.55, N = 3 SE +/- 7.33, N = 3 SE +/- 17.92, N = 3 5311.92 5322.96 5279.63 MIN: 5240.85 MIN: 5257.18 MIN: 5205.19 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 0.37, N = 3 SE +/- 5.17, N = 3 SE +/- 3.18, N = 3 2710.91 2711.83 2721.00 MIN: 2691.88 MIN: 2680.26 MIN: 2701.56 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 25.75, N = 3 SE +/- 3.71, N = 3 SE +/- 2.74, N = 3 2733.87 2706.16 2714.91 MIN: 2687.15 MIN: 2683.86 MIN: 2698.95 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 2.36, N = 3 SE +/- 3.30, N = 3 SE +/- 8.61, N = 3 2710.37 2709.23 2716.59 MIN: 2688.35 MIN: 2684.34 MIN: 2691.78 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 11.96 11.93 11.97 1. Nodejs
v10.19.0
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 1.0618 2.1236 3.1854 4.2472 5.309 SE +/- 0.44661, N = 15 SE +/- 0.01173, N = 3 SE +/- 0.03504, N = 3 4.71895 4.23541 4.26586 MIN: 4.11 MIN: 4.11 MIN: 4.11 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 3 8 16 24 32 40 SE +/- 0.18, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 34.33 34.76 34.19
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 150K 300K 450K 600K 750K SE +/- 10502.27, N = 4 SE +/- 6513.43, N = 3 SE +/- 5597.47, N = 3 705881 693793 695212
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 3 150K 300K 450K 600K 750K SE +/- 2137.78, N = 3 SE +/- 2327.47, N = 3 SE +/- 2555.39, N = 3 713404.98 714327.50 716302.45 1. (CC) gcc options: -O2 -lrt" -lrt
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 0.651 1.302 1.953 2.604 3.255 SE +/- 0.00694, N = 3 SE +/- 0.00273, N = 3 SE +/- 0.00445, N = 3 2.89332 2.89290 2.89103 MIN: 2.83 MIN: 2.83 MIN: 2.83 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 1.0612 2.1224 3.1836 4.2448 5.306 SE +/- 0.00164, N = 3 SE +/- 0.01087, N = 3 SE +/- 0.00569, N = 3 4.70277 4.71650 4.70463 MIN: 4.48 MIN: 4.5 MIN: 4.47 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.3044 0.6088 0.9132 1.2176 1.522 SE +/- 0.00224, N = 3 SE +/- 0.00188, N = 3 SE +/- 0.00141, N = 3 1.35280 1.35161 1.35269 MIN: 1.32 MIN: 1.32 MIN: 1.32 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 0.1616 0.3232 0.4848 0.6464 0.808 SE +/- 0.000810, N = 3 SE +/- 0.001576, N = 3 SE +/- 0.001157, N = 3 0.714224 0.712063 0.718305 MIN: 0.69 MIN: 0.69 MIN: 0.69 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.3337 0.6674 1.0011 1.3348 1.6685 SE +/- 0.00272, N = 3 SE +/- 0.00248, N = 3 SE +/- 0.00325, N = 3 1.47960 1.48291 1.47932 MIN: 1.44 MIN: 1.43 MIN: 1.44 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.1779 0.3558 0.5337 0.7116 0.8895 SE +/- 0.005023, N = 3 SE +/- 0.004136, N = 3 SE +/- 0.004562, N = 3 0.790724 0.786084 0.784875 MIN: 0.74 MIN: 0.73 MIN: 0.73 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 11.43 11.44 11.44 MIN: 11.32 MIN: 11.31 MIN: 11.33 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 2 4 6 8 10 SE +/- 0.028, N = 3 SE +/- 0.084, N = 3 SE +/- 0.029, N = 3 8.858 8.863 8.837 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 23.23 23.28 23.26 MIN: 22.8 MIN: 22.87 MIN: 22.87 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 20.67 20.70 20.75 MIN: 20.07 MIN: 20.31 MIN: 20.09 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.5979 1.1958 1.7937 2.3916 2.9895 SE +/- 0.00692, N = 3 SE +/- 0.00062, N = 3 SE +/- 0.00049, N = 3 2.65751 2.65509 2.65554 MIN: 2.57 MIN: 2.57 MIN: 2.58 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 0.9866 1.9732 2.9598 3.9464 4.933 SE +/- 0.01858, N = 3 SE +/- 0.01332, N = 3 SE +/- 0.01020, N = 3 4.38482 4.36270 4.37791 MIN: 4.27 MIN: 4.25 MIN: 4.26 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Phoronix Test Suite v10.8.5