3300X oneDNN SVT Stuff

AMD Ryzen 3 3300X 4-Core testing with a MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NR BIOS) and AMD FirePro V3800 512MB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103158-HA-3300XONED31&grr&rdt&rro.

3300X oneDNN SVT StuffProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution123AMD Ryzen 3 3300X 4-Core @ 3.80GHz (4 Cores / 8 Threads)MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NR BIOS)AMD Starship/Matisse8GB256GB INTEL SSDPEKKW256G7AMD FirePro V3800 512MBAMD Redwood HDMI AudioDELL S2409WRealtek RTL8111/8168/8411Ubuntu 20.045.9.0-rc5-14sep-patch (x86_64) 20200914GNOME Shell 3.36.4X Server 1.20.93.3 Mesa 20.0.8 (LLVM 10.0.0)GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

3300X oneDNN SVT Stuffsvt-hevc: 1 - Bosphorus 1080ponednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUsysbench: CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsvt-hevc: 7 - Bosphorus 1080ponednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUsvt-vp9: Visual Quality Optimized - Bosphorus 1080psysbench: RAM / Memoryonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080ponednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU1234.036030.316032.295993.828918.823109.223101.943113.9311.043206.797256.780514.917535.028276.0988962.0210.40172.7383987.7616393.9522.656421.7320108.00110.67133.1911.33159.157954.046010.236025.135983.998918.553118.203107.963110.2213.00956.788506.750964.908485.033406.1162161.9110.47062.7170988.0316345.4622.534521.8783109.62110.57133.0411.34719.172904.046015.976008.785981.798921.343119.403113.523102.4513.09196.771096.786004.881155.016536.1214161.9210.24412.6883587.6716159.0822.615021.7457109.10111.25133.2011.29569.15620OpenBenchmarking.org

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p3210.9091.8182.7273.6364.545SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.044.044.031. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU32113002600390052006500SE +/- 17.06, N = 3SE +/- 7.25, N = 3SE +/- 2.70, N = 36015.976010.236030.31MIN: 5932.88MIN: 5940.42MIN: 5964.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU32113002600390052006500SE +/- 9.07, N = 3SE +/- 1.80, N = 3SE +/- 9.85, N = 36008.786025.136032.29MIN: 5954.04MIN: 5961.24MIN: 5963.681. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU32113002600390052006500SE +/- 15.20, N = 3SE +/- 10.53, N = 3SE +/- 24.14, N = 35981.795983.995993.82MIN: 5900.15MIN: 5903.92MIN: 5896.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU3212K4K6K8K10KSE +/- 0.88, N = 3SE +/- 2.00, N = 3SE +/- 2.49, N = 38921.348918.558918.821. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU3217001400210028003500SE +/- 4.28, N = 3SE +/- 6.83, N = 3SE +/- 5.50, N = 33119.403118.203109.22MIN: 3075.08MIN: 3081.35MIN: 3068.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU3217001400210028003500SE +/- 9.34, N = 3SE +/- 2.08, N = 3SE +/- 11.59, N = 33113.523107.963101.94MIN: 3078.02MIN: 3073MIN: 3058.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU3217001400210028003500SE +/- 6.91, N = 3SE +/- 2.85, N = 3SE +/- 4.61, N = 33102.453110.223113.93MIN: 3067.54MIN: 3064.84MIN: 3078.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3213691215SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.48, N = 1513.0913.0111.04MIN: 8.86MIN: 8.85MIN: 8.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU321246810SE +/- 0.01342, N = 3SE +/- 0.02051, N = 3SE +/- 0.01459, N = 36.771096.788506.79725MIN: 6.54MIN: 6.55MIN: 6.541. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU321246810SE +/- 0.03383, N = 3SE +/- 0.00381, N = 3SE +/- 0.00993, N = 36.786006.750966.78051MIN: 6.48MIN: 6.52MIN: 6.541. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3211.10642.21283.31924.42565.532SE +/- 0.00783, N = 3SE +/- 0.02509, N = 3SE +/- 0.02875, N = 34.881154.908484.91753MIN: 4.74MIN: 4.73MIN: 4.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU3211.13252.2653.39754.535.6625SE +/- 0.00923, N = 3SE +/- 0.01604, N = 3SE +/- 0.01117, N = 35.016535.033405.02827MIN: 4.88MIN: 4.9MIN: 4.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU321246810SE +/- 0.02421, N = 3SE +/- 0.02430, N = 3SE +/- 0.02165, N = 36.121416.116216.09889MIN: 5.92MIN: 5.92MIN: 5.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p3211428425670SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 361.9261.9162.021. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU3213691215SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 310.2410.4710.40MIN: 10.04MIN: 10.32MIN: 10.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3210.61611.23221.84832.46443.0805SE +/- 0.02173, N = 3SE +/- 0.01683, N = 3SE +/- 0.02085, N = 32.688352.717092.73839MIN: 2.34MIN: 2.39MIN: 2.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p32120406080100SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.12, N = 387.6788.0387.761. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory3214K8K12K16K20KSE +/- 257.28, N = 3SE +/- 36.56, N = 3SE +/- 34.72, N = 316159.0816345.4616393.951. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU321510152025SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 322.6222.5322.66MIN: 21.87MIN: 21.82MIN: 21.831. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU321510152025SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 321.7521.8821.73MIN: 21.34MIN: 21.34MIN: 21.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p32120406080100SE +/- 0.22, N = 3SE +/- 0.39, N = 3SE +/- 1.12, N = 3109.10109.62108.001. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p32120406080100SE +/- 0.05, N = 3SE +/- 0.16, N = 3SE +/- 0.17, N = 3111.25110.57110.671. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p321306090120150SE +/- 0.09, N = 3SE +/- 0.44, N = 3SE +/- 0.16, N = 3133.20133.04133.191. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU3213691215SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 311.3011.3511.33MIN: 11MIN: 11.05MIN: 11.071. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.05208, N = 3SE +/- 0.02925, N = 3SE +/- 0.04201, N = 39.156209.172909.15795MIN: 8.83MIN: 8.82MIN: 8.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl


Phoronix Test Suite v10.8.5