3800XT Sat AMD Ryzen 7 3800XT 8-Core testing with a MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.MS BIOS) and Sapphire AMD Radeon HD 4650 on Debian 10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012128-HA-3800XTSAT00&rdt&gru .
3800XT Sat Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Display Server Display Driver Compiler File-System Screen Resolution 1 2 3 AMD Ryzen 7 3800XT 8-Core @ 3.90GHz (8 Cores / 16 Threads) MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.MS BIOS) AMD Starship/Matisse 16GB 128GB INTEL SSDPEKKW128G7 Sapphire AMD Radeon HD 4650 AMD RV710/730 Intel I211 Debian 10 4.19.0-13-amd64 (x86_64) X Server 1.20.4 modesetting 1.20.4 GCC 8.3.0 ext4 1024x768 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Disk Details - NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Python Details - Python 2.7.16 + Python 3.7.3 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3800XT Sat graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space coremark: CoreMark Size 666 - Iterations Per Second compilebench: Compile compilebench: Initial Create compilebench: Read Compiled Tree brl-cad: VGR Performance Metric onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU hmmer: Pfam Database Search mafft: Multiple Sequence Alignment - LSU RNA build-ffmpeg: Time To Compile sqlite-speedtest: Timed Time - Size 1,000 1 2 3 575 849 140 213 1163 267 1366 383032.552244 844.54 669.49 4171.63 128159 4.48965 8.53716 2.52954 2.21579 22.2703 4.90950 6.46414 20.0259 6.15886 5.07364 3769.89 2568.38 3771.02 2576.03 4.86878 3770.04 2534.48 2.79548 98.329 11.629 47.869 54.909 573 846 140 213 1165 267 1366 383539.030274 853.74 702.73 4088.29 127169 4.49317 8.53066 2.52521 2.21160 22.2539 4.93803 6.47656 20.0112 6.12533 5.12100 3739.63 2579.11 3764.83 2545.55 4.88063 3768.24 2569.94 2.78793 98.413 11.539 47.241 56.328 573 851 140 214 1164 268 1366 860.27 738.06 4131.97 4.50496 8.62494 2.53552 2.21205 22.2635 4.91964 6.47200 20.0379 6.12171 5.02533 3753.45 2541.56 3747.44 2571.76 4.86841 3752.15 2566.17 2.77572 98.563 11.536 OpenBenchmarking.org
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl 1 2 3 120 240 360 480 600 SE +/- 0.88, N = 3 SE +/- 1.86, N = 3 SE +/- 2.91, N = 3 575 573 573 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate 1 2 3 200 400 600 800 1000 SE +/- 3.06, N = 3 SE +/- 1.86, N = 3 SE +/- 1.15, N = 3 849 846 851 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen 1 2 3 30 60 90 120 150 140 140 140 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced 1 2 3 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 213 213 214 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing 1 2 3 300 600 900 1200 1500 SE +/- 2.08, N = 3 SE +/- 0.67, N = 3 1163 1165 1164 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian 1 2 3 60 120 180 240 300 267 267 268 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space 1 2 3 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 1.20, N = 3 SE +/- 0.67, N = 3 1366 1366 1366 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 80K 160K 240K 320K 400K SE +/- 5758.12, N = 3 SE +/- 1349.13, N = 3 383032.55 383539.03 1. (CC) gcc options: -O2 -lrt" -lrt
Compile Bench Test: Compile OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile 1 2 3 200 400 600 800 1000 SE +/- 13.31, N = 9 SE +/- 3.60, N = 3 SE +/- 2.82, N = 3 844.54 853.74 860.27
Compile Bench Test: Initial Create OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Initial Create 1 2 3 160 320 480 640 800 SE +/- 31.98, N = 3 SE +/- 28.61, N = 3 SE +/- 10.61, N = 3 669.49 702.73 738.06
Compile Bench Test: Read Compiled Tree OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Read Compiled Tree 1 2 3 900 1800 2700 3600 4500 SE +/- 1.89, N = 3 SE +/- 16.29, N = 3 SE +/- 21.72, N = 3 4171.63 4088.29 4131.97
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 30K 60K 90K 120K 150K 128159 127169 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 1.0136 2.0272 3.0408 4.0544 5.068 SE +/- 0.00257, N = 3 SE +/- 0.00277, N = 3 SE +/- 0.01172, N = 3 4.48965 4.49317 4.50496 MIN: 4.37 MIN: 4.37 MIN: 4.4 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01048, N = 3 SE +/- 0.01733, N = 3 SE +/- 0.09450, N = 3 8.53716 8.53066 8.62494 MIN: 8.22 MIN: 8.26 MIN: 8.21 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.5705 1.141 1.7115 2.282 2.8525 SE +/- 0.01920, N = 3 SE +/- 0.01188, N = 3 SE +/- 0.02336, N = 3 2.52954 2.52521 2.53552 MIN: 2.47 MIN: 2.45 MIN: 2.46 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.4986 0.9972 1.4958 1.9944 2.493 SE +/- 0.00129, N = 3 SE +/- 0.00062, N = 3 SE +/- 0.00084, N = 3 2.21579 2.21160 2.21205 MIN: 2.15 MIN: 2.15 MIN: 2.15 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 22.27 22.25 22.26 MIN: 22.06 MIN: 22.01 MIN: 22.07 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 1.1111 2.2222 3.3333 4.4444 5.5555 SE +/- 0.00858, N = 3 SE +/- 0.03223, N = 3 SE +/- 0.01083, N = 3 4.90950 4.93803 4.91964 MIN: 4.81 MIN: 4.8 MIN: 4.82 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00490, N = 3 SE +/- 0.00280, N = 3 SE +/- 0.00848, N = 3 6.46414 6.47656 6.47200 MIN: 6.27 MIN: 6.29 MIN: 6.24 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 20.03 20.01 20.04 MIN: 19.61 MIN: 19.67 MIN: 19.59 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.10060, N = 3 SE +/- 0.04876, N = 3 SE +/- 0.03095, N = 3 6.15886 6.12533 6.12171 MIN: 5.84 MIN: 5.89 MIN: 5.87 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 1.1522 2.3044 3.4566 4.6088 5.761 SE +/- 0.02389, N = 3 SE +/- 0.06972, N = 3 SE +/- 0.00124, N = 3 5.07364 5.12100 5.02533 MIN: 4.89 MIN: 4.87 MIN: 4.88 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 800 1600 2400 3200 4000 SE +/- 29.93, N = 3 SE +/- 10.99, N = 3 SE +/- 24.47, N = 3 3769.89 3739.63 3753.45 MIN: 3713.41 MIN: 3685.61 MIN: 3695.8 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 35.65, N = 3 SE +/- 10.92, N = 3 SE +/- 10.80, N = 3 2568.38 2579.11 2541.56 MIN: 2521.85 MIN: 2557.19 MIN: 2508.49 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 800 1600 2400 3200 4000 SE +/- 13.24, N = 3 SE +/- 13.00, N = 3 SE +/- 12.83, N = 3 3771.02 3764.83 3747.44 MIN: 3719.14 MIN: 3699.18 MIN: 3684.06 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 14.58, N = 3 SE +/- 5.82, N = 3 SE +/- 3.99, N = 3 2576.03 2545.55 2571.76 MIN: 2532.36 MIN: 2511.67 MIN: 2531.02 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 1.0981 2.1962 3.2943 4.3924 5.4905 SE +/- 0.00398, N = 3 SE +/- 0.00199, N = 3 SE +/- 0.00553, N = 3 4.86878 4.88063 4.86841 MIN: 4.7 MIN: 4.81 MIN: 4.79 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 800 1600 2400 3200 4000 SE +/- 28.06, N = 3 SE +/- 14.92, N = 3 SE +/- 13.59, N = 3 3770.04 3768.24 3752.15 MIN: 3712.68 MIN: 3711.2 MIN: 3695.13 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 600 1200 1800 2400 3000 SE +/- 3.40, N = 3 SE +/- 16.22, N = 3 SE +/- 20.57, N = 3 2534.48 2569.94 2566.17 MIN: 2508.43 MIN: 2528.06 MIN: 2529.35 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.629 1.258 1.887 2.516 3.145 SE +/- 0.01991, N = 3 SE +/- 0.01568, N = 3 SE +/- 0.00253, N = 3 2.79548 2.78793 2.77572 MIN: 2.63 MIN: 2.63 MIN: 2.65 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 20 40 60 80 100 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.13, N = 3 98.33 98.41 98.56 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 11.63 11.54 11.54 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 11 22 33 44 55 SE +/- 0.75, N = 3 SE +/- 0.03, N = 3 47.87 47.24
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 1 2 13 26 39 52 65 SE +/- 0.26, N = 3 SE +/- 0.63, N = 7 54.91 56.33 1. (CC) gcc options: -O2 -ldl -lz -lpthread
Phoronix Test Suite v10.8.5