9400F mar

Intel Core i5-9400F testing with a MSI B360M GAMING PLUS (MS-7B19) v1.0 (1.10 BIOS) and MSI NVIDIA NV106 1GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103186-HA-9400FMAR607&gru.

9400F marProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution1234Intel Core i5-9400F @ 4.10GHz (6 Cores)MSI B360M GAMING PLUS (MS-7B19) v1.0 (1.10 BIOS)Intel Cannon Lake PCH16GB256GB SAMSUNG MZVPW256HEGL-000H7MSI NVIDIA NV106 1GBRealtek ALC887-VDG237HLIntel I219-VUbuntu 20.045.9.0-050900rc7daily20200928-generic (x86_64) 20200927GNOME Shell 3.36.0X Server 1.20.7nouveau4.3 Mesa 20.0.2GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xca - Thermald 1.9.1Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Vulnerable: No microcode + tsx_async_abort: Not affected Python Details- 2, 3, 4: Python 3.8.2

9400F marsysbench: CPUaom-av1: Speed 0 Two-Passaom-av1: Speed 4 Two-Passaom-av1: Speed 6 Realtimeaom-av1: Speed 6 Two-Passaom-av1: Speed 8 Realtimesvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDsysbench: RAM / Memoryonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUmnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3incompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 192 Cells Per Directionastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3build-mesa: Time To Compilebuild-nodejs: Time To Compile12348221.430.194.7816.3613.2379.674.3963.38130.50114.66115.1093.792.540.873.603.7412977.845.748859.338063.315672.3583524.09707.286559.4294118.16393.114476.317854687.022712.794693.822710.164.438294688.652716.894.958665.30226.9463.3473.50931.71550.8227946455.4354148.142527.2154208.431132.5209.75353.510106.51599.744890.1838047.960.194.7616.2813.2178.924.3963.31130.96114.39115.0393.462.550.873.603.7412774.426.0874512.51333.319902.5185224.93127.252099.5591319.04593.137836.248564739.812746.264750.442750.234.591474745.822750.924.963045.28526.9303.3723.50731.69952.4945691457.6870538.143227.2210208.472232.5679.77253.532106.46998.712890.0078211.920.194.7616.2813.2178.854.3963.26130.43114.20114.4093.812.540.883.63.7413109.866.1053512.67193.317942.5160325.02457.248629.5081019.15373.125496.287764742.252755.814748.242758.354.592404750.152756.814.958665.30727.0183.3963.52231.83351.3005333457.6131908.148027.2310208.541832.5509.76753.528106.53598.7698218.830.194.7616.2413.278.364.3863.20130.15113.07113.3493.232.540.873.603.7313237.066.0884113.58883.322072.5081825.15557.257609.3941919.52273.118566.210444775.312785.404822.052776.614.587974784.782778.134.961075.27226.9843.3893.50231.58352.8742867459.6867178.148427.2295208.473632.5859.79553.553106.51999.020891.236OpenBenchmarking.org

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU12342K4K6K8K10KSE +/- 0.76, N = 3SE +/- 111.87, N = 4SE +/- 2.08, N = 3SE +/- 2.96, N = 38221.438047.968211.928218.831. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 0 Two-Pass12340.04280.08560.12840.17120.214SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.190.190.190.191. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 4 Two-Pass12341.07552.1513.22654.3025.3775SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.784.764.764.761. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Realtime123448121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 316.3616.2816.2816.241. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Two-Pass12343691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313.2313.2113.2113.201. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 8 Realtime123420406080100SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 379.6778.9278.8578.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p12340.98781.97562.96343.95124.939SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.394.394.394.381. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p12341428425670SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 363.3863.3163.2663.201. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p1234306090120150SE +/- 0.37, N = 3SE +/- 0.44, N = 3SE +/- 0.42, N = 3SE +/- 0.41, N = 3130.50130.96130.43130.151. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p1234306090120150SE +/- 0.39, N = 3SE +/- 0.46, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 3114.66114.39114.20113.071. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p1234306090120150SE +/- 0.33, N = 3SE +/- 0.23, N = 3SE +/- 0.04, N = 3SE +/- 0.26, N = 3115.10115.03114.40113.341. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p123420406080100SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.12, N = 393.7993.4693.8193.231. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya12340.57381.14761.72142.29522.869SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.542.552.542.541. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom12340.1980.3960.5940.7920.99SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.870.870.880.871. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets12340.811.622.433.244.05SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.603.603.603.601. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID12340.84151.6832.52453.3664.2075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.743.743.743.731. (CXX) g++ options: -O3 -pthread

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory12343K6K9K12K15KSE +/- 61.19, N = 3SE +/- 109.75, N = 3SE +/- 133.79, N = 3SE +/- 53.34, N = 312977.8412774.4213109.8613237.061. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1234246810SE +/- 0.04456, N = 3SE +/- 0.03402, N = 3SE +/- 0.04121, N = 3SE +/- 0.02504, N = 35.748856.087456.105356.08841MIN: 5.6MIN: 5.94MIN: 5.93MIN: 5.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12343691215SE +/- 0.04042, N = 3SE +/- 0.05232, N = 3SE +/- 0.04540, N = 3SE +/- 0.07677, N = 39.3380612.5133012.6719013.58880MIN: 9.17MIN: 12.3MIN: 12.47MIN: 13.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU12340.74751.4952.24252.993.7375SE +/- 0.01477, N = 3SE +/- 0.01778, N = 3SE +/- 0.01808, N = 3SE +/- 0.01663, N = 33.315673.319903.317943.32207MIN: 3.25MIN: 3.25MIN: 3.25MIN: 3.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU12340.56671.13341.70012.26682.8335SE +/- 0.01460, N = 3SE +/- 0.01713, N = 3SE +/- 0.01626, N = 3SE +/- 0.01463, N = 32.358352.518522.516032.50818MIN: 2.3MIN: 2.46MIN: 2.46MIN: 2.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1234612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 324.1024.9325.0225.16MIN: 23.6MIN: 24.61MIN: 24.73MIN: 24.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1234246810SE +/- 0.02301, N = 3SE +/- 0.01513, N = 3SE +/- 0.01801, N = 3SE +/- 0.01383, N = 37.286557.252097.248627.25760MIN: 7.16MIN: 7.16MIN: 7.16MIN: 7.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12343691215SE +/- 0.06124, N = 3SE +/- 0.01638, N = 3SE +/- 0.06496, N = 3SE +/- 0.01310, N = 39.429419.559139.508109.39419MIN: 9.24MIN: 9.34MIN: 9.23MIN: 9.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1234510152025SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 318.1619.0519.1519.52MIN: 17.89MIN: 18.69MIN: 18.77MIN: 19.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU12340.7061.4122.1182.8243.53SE +/- 0.00994, N = 3SE +/- 0.01469, N = 3SE +/- 0.00251, N = 3SE +/- 0.00550, N = 33.114473.137833.125493.11856MIN: 3.08MIN: 3.09MIN: 3.09MIN: 3.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1234246810SE +/- 0.02640, N = 3SE +/- 0.00509, N = 3SE +/- 0.02902, N = 3SE +/- 0.02410, N = 36.317856.248566.287766.21044MIN: 6.22MIN: 6.11MIN: 6.15MIN: 6.081. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU123410002000300040005000SE +/- 9.86, N = 3SE +/- 2.77, N = 3SE +/- 6.95, N = 3SE +/- 8.03, N = 34687.024739.814742.254775.31MIN: 4552.07MIN: 4612.81MIN: 4622.61MIN: 4644.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12346001200180024003000SE +/- 1.94, N = 3SE +/- 5.11, N = 3SE +/- 0.72, N = 3SE +/- 11.46, N = 32712.792746.262755.812785.40MIN: 2634.02MIN: 2668.84MIN: 2681.59MIN: 2704.421. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU123410002000300040005000SE +/- 3.50, N = 3SE +/- 4.18, N = 3SE +/- 2.95, N = 3SE +/- 19.23, N = 34693.824750.444748.244822.05MIN: 4566.24MIN: 4617.15MIN: 4623.59MIN: 4660.461. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12346001200180024003000SE +/- 5.07, N = 3SE +/- 2.90, N = 3SE +/- 2.26, N = 3SE +/- 2.56, N = 32710.162750.232758.352776.61MIN: 2633.06MIN: 2673.4MIN: 2683.59MIN: 2704.961. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU12341.03332.06663.09994.13325.1665SE +/- 0.01052, N = 3SE +/- 0.00822, N = 3SE +/- 0.00804, N = 3SE +/- 0.01165, N = 34.438294.591474.592404.58797MIN: 4.36MIN: 4.52MIN: 4.52MIN: 4.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123410002000300040005000SE +/- 5.69, N = 3SE +/- 8.86, N = 3SE +/- 5.69, N = 3SE +/- 5.35, N = 34688.654745.824750.154784.78MIN: 4561.77MIN: 4614.86MIN: 4624.59MIN: 4663.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12346001200180024003000SE +/- 1.04, N = 3SE +/- 2.48, N = 3SE +/- 2.36, N = 3SE +/- 3.78, N = 32716.892750.922756.812778.13MIN: 2639.02MIN: 2677.79MIN: 2683.92MIN: 2704.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU12341.11672.23343.35014.46685.5835SE +/- 0.01510, N = 3SE +/- 0.00833, N = 3SE +/- 0.01296, N = 3SE +/- 0.01510, N = 34.958664.963044.958664.96107MIN: 4.89MIN: 4.89MIN: 4.89MIN: 4.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.012341.19412.38823.58234.77645.9705SE +/- 0.068, N = 3SE +/- 0.041, N = 3SE +/- 0.031, N = 3SE +/- 0.031, N = 35.3025.2855.3075.272MIN: 5.09 / MAX: 15.69MIN: 5.07 / MAX: 16.99MIN: 5.1 / MAX: 17.06MIN: 5.07 / MAX: 15.441. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-501234612182430SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 326.9526.9327.0226.98MIN: 26.78 / MAX: 42.17MIN: 26.76 / MAX: 36.61MIN: 26.85 / MAX: 36.71MIN: 26.7 / MAX: 37.051. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_22412340.76411.52822.29233.05643.8205SE +/- 0.032, N = 3SE +/- 0.024, N = 3SE +/- 0.018, N = 3SE +/- 0.029, N = 33.3473.3723.3963.389MIN: 3.18 / MAX: 13.7MIN: 3.21 / MAX: 14.08MIN: 3.24 / MAX: 13.3MIN: 3.23 / MAX: 14.61. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.012340.79251.5852.37753.173.9625SE +/- 0.003, N = 3SE +/- 0.011, N = 3SE +/- 0.002, N = 3SE +/- 0.008, N = 33.5093.5073.5223.502MIN: 3.45 / MAX: 13.33MIN: 3.45 / MAX: 5.69MIN: 3.46 / MAX: 10.92MIN: 3.44 / MAX: 4.791. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v31234714212835SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 331.7231.7031.8331.58MIN: 31.44 / MAX: 40.88MIN: 31.46 / MAX: 41.89MIN: 31.51 / MAX: 42.73MIN: 31.39 / MAX: 41.381. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction12341224364860SE +/- 0.57, N = 3SE +/- 0.05, N = 3SE +/- 0.59, N = 3SE +/- 0.02, N = 350.8252.4951.3052.871. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 192 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 192 Cells Per Direction1234100200300400500SE +/- 0.04, N = 3SE +/- 1.04, N = 3SE +/- 0.78, N = 3SE +/- 1.01, N = 3455.44457.69457.61459.691. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium1234246810SE +/- 0.0093, N = 3SE +/- 0.0128, N = 3SE +/- 0.0136, N = 3SE +/- 0.0037, N = 38.14258.14328.14808.14841. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough1234612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 327.2227.2227.2327.231. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive123450100150200250SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3208.43208.47208.54208.471. (CXX) g++ options: -O3 -flto -pthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S1234816243240SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 332.5232.5732.5532.591. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 012343691215SE +/- 0.009, N = 3SE +/- 0.008, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 39.7539.7729.7679.7951. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 212341224364860SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 353.5153.5353.5353.551. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3123420406080100SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3106.52106.47106.54106.521. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile123420406080100SE +/- 1.18, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 399.7498.7198.7799.02

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile1242004006008001000SE +/- 0.38, N = 3SE +/- 0.12, N = 3SE +/- 0.18, N = 3890.18890.01891.24


Phoronix Test Suite v10.8.4