3300X oneDNN SVT Stuff

AMD Ryzen 3 3300X 4-Core testing with a MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NR BIOS) and AMD FirePro V3800 512MB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2103158-HA-3300XONED31
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 4 Tests
Creator Workloads 3 Tests
Encoding 2 Tests
Multi-Core 4 Tests
Server CPU Tests 4 Tests
Video Encoding 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
March 15 2021
  51 Minutes
2
March 15 2021
  47 Minutes
3
March 15 2021
  47 Minutes
Invert Hiding All Results Option
  49 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


3300X oneDNN SVT StuffProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution123AMD Ryzen 3 3300X 4-Core @ 3.80GHz (4 Cores / 8 Threads)MSI B350M GAMING PRO (MS-7A39) v1.0 (2.NR BIOS)AMD Starship/Matisse8GB256GB INTEL SSDPEKKW256G7AMD FirePro V3800 512MBAMD Redwood HDMI AudioDELL S2409WRealtek RTL8111/8168/8411Ubuntu 20.045.9.0-rc5-14sep-patch (x86_64) 20200914GNOME Shell 3.36.4X Server 1.20.93.3 Mesa 20.0.8 (LLVM 10.0.0)GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

123Result OverviewPhoronix Test Suite100%100%100%101%oneDNNSysbenchSVT-VP9SVT-HEVC

3300X oneDNN SVT Stuffsvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080ponednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsysbench: RAM / Memorysysbench: CPU1234.0362.02133.19108.00110.6787.766.7805110.40174.917532.7383921.732011.0432011.331522.65646.797259.157955993.823109.226032.293101.945.028276030.313113.936.0988916393.958918.824.0461.91133.04109.62110.5788.036.7509610.47064.908482.7170921.878313.009511.347122.53456.788509.172905983.993118.206025.133107.965.033406010.233110.226.1162116345.468918.554.0461.92133.20109.10111.2587.676.7860010.24414.881152.6883521.745713.091911.295622.61506.771099.156205981.793119.406008.783113.525.016536015.973102.456.1214116159.088921.34OpenBenchmarking.org

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p1230.9091.8182.7273.6364.545SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.034.044.041. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p123246810Min: 4.02 / Avg: 4.03 / Max: 4.05Min: 4.02 / Avg: 4.04 / Max: 4.06Min: 4.02 / Avg: 4.04 / Max: 4.071. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p1231428425670SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 362.0261.9161.921. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p1231224364860Min: 62 / Avg: 62.02 / Max: 62.07Min: 61.84 / Avg: 61.91 / Max: 61.96Min: 61.74 / Avg: 61.92 / Max: 62.061. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p123306090120150SE +/- 0.16, N = 3SE +/- 0.44, N = 3SE +/- 0.09, N = 3133.19133.04133.201. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p123306090120150Min: 132.95 / Avg: 133.19 / Max: 133.48Min: 132.36 / Avg: 133.04 / Max: 133.87Min: 133.04 / Avg: 133.2 / Max: 133.331. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p12320406080100SE +/- 1.12, N = 3SE +/- 0.39, N = 3SE +/- 0.22, N = 3108.00109.62109.101. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p12320406080100Min: 105.86 / Avg: 108 / Max: 109.66Min: 109.22 / Avg: 109.62 / Max: 110.4Min: 108.87 / Avg: 109.1 / Max: 109.531. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p12320406080100SE +/- 0.17, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 3110.67110.57111.251. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p12320406080100Min: 110.34 / Avg: 110.67 / Max: 110.91Min: 110.27 / Avg: 110.57 / Max: 110.83Min: 111.18 / Avg: 111.25 / Max: 111.361. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p12320406080100SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.09, N = 387.7688.0387.671. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p12320406080100Min: 87.64 / Avg: 87.76 / Max: 87.99Min: 87.84 / Avg: 88.03 / Max: 88.41Min: 87.51 / Avg: 87.67 / Max: 87.831. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU123246810SE +/- 0.00993, N = 3SE +/- 0.00381, N = 3SE +/- 0.03383, N = 36.780516.750966.78600MIN: 6.54MIN: 6.52MIN: 6.481. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215Min: 6.76 / Avg: 6.78 / Max: 6.8Min: 6.74 / Avg: 6.75 / Max: 6.76Min: 6.74 / Avg: 6.79 / Max: 6.851. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 310.4010.4710.24MIN: 10.22MIN: 10.32MIN: 10.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215Min: 10.35 / Avg: 10.4 / Max: 10.48Min: 10.44 / Avg: 10.47 / Max: 10.53Min: 10.2 / Avg: 10.24 / Max: 10.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1231.10642.21283.31924.42565.532SE +/- 0.02875, N = 3SE +/- 0.02509, N = 3SE +/- 0.00783, N = 34.917534.908484.88115MIN: 4.74MIN: 4.73MIN: 4.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU123246810Min: 4.89 / Avg: 4.92 / Max: 4.98Min: 4.88 / Avg: 4.91 / Max: 4.96Min: 4.87 / Avg: 4.88 / Max: 4.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.61611.23221.84832.46443.0805SE +/- 0.02085, N = 3SE +/- 0.01683, N = 3SE +/- 0.02173, N = 32.738392.717092.68835MIN: 2.39MIN: 2.39MIN: 2.341. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU123246810Min: 2.71 / Avg: 2.74 / Max: 2.78Min: 2.7 / Avg: 2.72 / Max: 2.75Min: 2.67 / Avg: 2.69 / Max: 2.731. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 321.7321.8821.75MIN: 21.32MIN: 21.34MIN: 21.341. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025Min: 21.7 / Avg: 21.73 / Max: 21.79Min: 21.81 / Avg: 21.88 / Max: 21.95Min: 21.7 / Avg: 21.75 / Max: 21.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.48, N = 15SE +/- 0.03, N = 3SE +/- 0.05, N = 311.0413.0113.09MIN: 8.76MIN: 8.85MIN: 8.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU12348121620Min: 9.08 / Avg: 11.04 / Max: 13.02Min: 12.95 / Avg: 13.01 / Max: 13.05Min: 13.01 / Avg: 13.09 / Max: 13.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 311.3311.3511.30MIN: 11.07MIN: 11.05MIN: 111. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1233691215Min: 11.26 / Avg: 11.33 / Max: 11.47Min: 11.27 / Avg: 11.35 / Max: 11.49Min: 11.2 / Avg: 11.3 / Max: 11.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 322.6622.5322.62MIN: 21.83MIN: 21.82MIN: 21.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025Min: 22.59 / Avg: 22.66 / Max: 22.74Min: 22.34 / Avg: 22.53 / Max: 22.64Min: 22.58 / Avg: 22.62 / Max: 22.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01459, N = 3SE +/- 0.02051, N = 3SE +/- 0.01342, N = 36.797256.788506.77109MIN: 6.54MIN: 6.55MIN: 6.541. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215Min: 6.77 / Avg: 6.8 / Max: 6.82Min: 6.76 / Avg: 6.79 / Max: 6.83Min: 6.74 / Avg: 6.77 / Max: 6.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.04201, N = 3SE +/- 0.02925, N = 3SE +/- 0.05208, N = 39.157959.172909.15620MIN: 8.8MIN: 8.82MIN: 8.831. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215Min: 9.08 / Avg: 9.16 / Max: 9.23Min: 9.13 / Avg: 9.17 / Max: 9.23Min: 9.1 / Avg: 9.16 / Max: 9.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12313002600390052006500SE +/- 24.14, N = 3SE +/- 10.53, N = 3SE +/- 15.20, N = 35993.825983.995981.79MIN: 5896.31MIN: 5903.92MIN: 5900.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12310002000300040005000Min: 5950.38 / Avg: 5993.82 / Max: 6033.8Min: 5966.51 / Avg: 5983.99 / Max: 6002.9Min: 5955.11 / Avg: 5981.79 / Max: 6007.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1237001400210028003500SE +/- 5.50, N = 3SE +/- 6.83, N = 3SE +/- 4.28, N = 33109.223118.203119.40MIN: 3068.21MIN: 3081.35MIN: 3075.081. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1235001000150020002500Min: 3099.36 / Avg: 3109.22 / Max: 3118.36Min: 3104.55 / Avg: 3118.2 / Max: 3125.45Min: 3111.02 / Avg: 3119.4 / Max: 3125.111. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12313002600390052006500SE +/- 9.85, N = 3SE +/- 1.80, N = 3SE +/- 9.07, N = 36032.296025.136008.78MIN: 5963.68MIN: 5961.24MIN: 5954.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12310002000300040005000Min: 6020.56 / Avg: 6032.29 / Max: 6051.86Min: 6022.49 / Avg: 6025.13 / Max: 6028.57Min: 5993.1 / Avg: 6008.78 / Max: 6024.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1237001400210028003500SE +/- 11.59, N = 3SE +/- 2.08, N = 3SE +/- 9.34, N = 33101.943107.963113.52MIN: 3058.41MIN: 3073MIN: 3078.021. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1235001000150020002500Min: 3078.76 / Avg: 3101.94 / Max: 3113.78Min: 3104.8 / Avg: 3107.96 / Max: 3111.88Min: 3096.2 / Avg: 3113.52 / Max: 3128.261. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1231.13252.2653.39754.535.6625SE +/- 0.01117, N = 3SE +/- 0.01604, N = 3SE +/- 0.00923, N = 35.028275.033405.01653MIN: 4.89MIN: 4.9MIN: 4.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810Min: 5.01 / Avg: 5.03 / Max: 5.04Min: 5 / Avg: 5.03 / Max: 5.05Min: 5 / Avg: 5.02 / Max: 5.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12313002600390052006500SE +/- 2.70, N = 3SE +/- 7.25, N = 3SE +/- 17.06, N = 36030.316010.236015.97MIN: 5964.79MIN: 5940.42MIN: 5932.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000Min: 6025.83 / Avg: 6030.31 / Max: 6035.16Min: 6000.78 / Avg: 6010.23 / Max: 6024.49Min: 5987.84 / Avg: 6015.97 / Max: 6046.771. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1237001400210028003500SE +/- 4.61, N = 3SE +/- 2.85, N = 3SE +/- 6.91, N = 33113.933110.223102.45MIN: 3078.41MIN: 3064.84MIN: 3067.541. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1235001000150020002500Min: 3104.72 / Avg: 3113.93 / Max: 3118.64Min: 3106.4 / Avg: 3110.22 / Max: 3115.79Min: 3090.9 / Avg: 3102.45 / Max: 3114.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.02165, N = 3SE +/- 0.02430, N = 3SE +/- 0.02421, N = 36.098896.116216.12141MIN: 5.9MIN: 5.92MIN: 5.921. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810Min: 6.07 / Avg: 6.1 / Max: 6.14Min: 6.07 / Avg: 6.12 / Max: 6.15Min: 6.07 / Avg: 6.12 / Max: 6.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory1234K8K12K16K20KSE +/- 34.72, N = 3SE +/- 36.56, N = 3SE +/- 257.28, N = 316393.9516345.4616159.081. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory1233K6K9K12K15KMin: 16345.76 / Avg: 16393.95 / Max: 16461.34Min: 16273.07 / Avg: 16345.46 / Max: 16390.56Min: 15644.57 / Avg: 16159.08 / Max: 16422.391. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU1232K4K6K8K10KSE +/- 2.49, N = 3SE +/- 2.00, N = 3SE +/- 0.88, N = 38918.828918.558921.341. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU12315003000450060007500Min: 8916.1 / Avg: 8918.82 / Max: 8923.79Min: 8915.6 / Avg: 8918.55 / Max: 8922.35Min: 8920.07 / Avg: 8921.34 / Max: 8923.041. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm