Xeon E3 1280 v5 m

Intel Xeon E3-1280 v5 testing with an MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS) and ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103199-IB-XEONE312846&sor.

Xeon E3 1280 v5 m: system details for runs 1, 2, 3

Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (4 Cores / 8 Threads)
Motherboard: MSI Z170A SLI PLUS (MS-7998) v1.0 (2.A0 BIOS)
Chipset: Intel Xeon E3-1200 v5/E3-1500
Memory: 32GB
Disk: 256GB TOSHIBA RD400
Graphics: ASUS AMD Radeon HD 7850 / R7 265 R9 270 1024SP
Audio: Realtek ALC1150
Monitor: VA2431
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc2daily20200826-generic (x86_64) 20200825
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.9
OpenGL: 4.5 Mesa 20.0.8 (LLVM 10.0.0)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0xe2
- Thermald 1.9.1

Python Details
- Python 3.8.5

Security Details
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
- mds: Mitigation of Clear buffers; SMT vulnerable
- meltdown: Mitigation of PTI
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
- srbds: Mitigation of Microcode
- tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Xeon E3 1280 v5 m: results overview

Test | 1 | 2 | 3
incompact3d: input.i3d 129 Cells Per Direction | 59.7731183 | 59.8612671 | 59.8264809
incompact3d: input.i3d 193 Cells Per Direction | 203.088511 | 202.932215 | 203.135991
simdjson: Kostya | 2.36 | 2.36 | 2.36
simdjson: LargeRand | 0.88 | 0.88 | 0.88
simdjson: PartialTweets | 3.52 | 3.53 | 3.52
simdjson: DistinctUserID | 3.99 | 3.98 | 3.98
aom-av1: Speed 0 Two-Pass | 0.16 | 0.16 | 0.16
aom-av1: Speed 4 Two-Pass | 3.79 | 3.79 | 3.80
aom-av1: Speed 6 Realtime | 12.31 | 12.33 | 12.35
aom-av1: Speed 6 Two-Pass | 9.87 | 9.86 | 9.84
aom-av1: Speed 8 Realtime | 62.44 | 62.54 | 62.45
svt-hevc: 1 - Bosphorus 1080p | 3.08 | 3.08 | 3.08
svt-hevc: 7 - Bosphorus 1080p | 47.85 | 47.78 | 47.79
svt-hevc: 10 - Bosphorus 1080p | 101.85 | 101.85 | 101.98
svt-vp9: VMAF Optimized - Bosphorus 1080p | 87.85 | 87.66 | 87.74
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 88.01 | 88.04 | 88.06
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 69.82 | 70.00 | 69.87
build-mesa: Time To Compile | 126.263 | 126.186 | 126.386
build-nodejs: Time To Compile | 1106.502 | 1106.636 | 1106.682
onednn: IP Shapes 1D - f32 - CPU | 7.98948 | 8.01298 | 7.97862
onednn: IP Shapes 3D - f32 - CPU | 12.2753 | 12.1013 | 12.0587
onednn: IP Shapes 1D - u8s8f32 - CPU | 3.65702 | 3.66132 | 3.65956
onednn: IP Shapes 3D - u8s8f32 - CPU | 3.28942 | 3.28091 | 3.20399
onednn: Convolution Batch Shapes Auto - f32 - CPU | 20.9020 | 20.9333 | 20.9031
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 14.4552 | 14.4014 | 14.4296
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 14.4010 | 14.5108 | 14.4641
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 20.4976 | 20.5233 | 20.5150
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 4.77871 | 4.76344 | 4.78576
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 8.07519 | 8.10192 | 8.08423
onednn: Recurrent Neural Network Training - f32 - CPU | 7395.55 | 7399.68 | 7394.30
onednn: Recurrent Neural Network Inference - f32 - CPU | 3948.98 | 3955.17 | 3951.90
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 7393.22 | 7401.56 | 7396.59
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 3952.65 | 3952.08 | 3955.00
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 5.42817 | 5.42101 | 5.38817
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 7395.13 | 7396.46 | 7408.91
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 3955.37 | 3954.96 | 3957.84
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 5.91706 | 5.90636 | 5.90709
astcenc: Medium | 8.9180 | 8.9380 | 8.9296
astcenc: Thorough | 33.0108 | 32.9899 | 33.0202
astcenc: Exhaustive | 256.2135 | 256.0918 | 256.1716
basis: ETC1S | 36.774 | 36.835 | 36.839
basis: UASTC Level 0 | 11.006 | 10.997 | 10.992
basis: UASTC Level 2 | 72.898 | 72.911 | 72.894
basis: UASTC Level 3 | 144.169 | 144.079 | 144.079
mnn: SqueezeNetV1.0 | 7.550 | 7.479 | 7.458
mnn: resnet-v2-50 | 45.714 | 45.354 | 45.280
mnn: MobileNetV2_224 | 4.033 | 4.011 | 4.001
mnn: mobilenet-v1-1.0 | 4.577 | 4.545 | 4.534
mnn: inception-v3 | 55.425 | 54.835 | 55.042
sysbench: RAM / Memory | 16690.54 | 16726.66 | 16845.49
sysbench: CPU | 7854.28 | 7845.28 | 7843.34
stockfish: Total Time | 10343841 | 10338531 | 10254227

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
1: 59.77 (SE +/- 0.01, N = 3)
3: 59.83 (SE +/- 0.02, N = 3)
2: 59.86 (SE +/- 0.02, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
2: 202.93 (SE +/- 0.09, N = 3)
1: 203.09 (SE +/- 0.09, N = 3)
3: 203.14 (SE +/- 0.13, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

simdjson

Throughput Test: Kostya

simdjson 0.8.2 - GB/s, More Is Better
3: 2.36 (SE +/- 0.00, N = 3)
2: 2.36 (SE +/- 0.00, N = 3)
1: 2.36 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 - GB/s, More Is Better
3: 0.88 (SE +/- 0.00, N = 3)
2: 0.88 (SE +/- 0.00, N = 3)
1: 0.88 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2 - GB/s, More Is Better
2: 3.53 (SE +/- 0.00, N = 3)
3: 3.52 (SE +/- 0.00, N = 3)
1: 3.52 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2 - GB/s, More Is Better
1: 3.99 (SE +/- 0.00, N = 3)
3: 3.98 (SE +/- 0.00, N = 3)
2: 3.98 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass

AOM AV1 2.1-rc - Frames Per Second, More Is Better
3: 0.16 (SE +/- 0.00, N = 8)
2: 0.16 (SE +/- 0.00, N = 3)
1: 0.16 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

AOM AV1 2.1-rc - Frames Per Second, More Is Better
3: 3.80 (SE +/- 0.00, N = 3)
2: 3.79 (SE +/- 0.01, N = 3)
1: 3.79 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

AOM AV1 2.1-rc - Frames Per Second, More Is Better
3: 12.35 (SE +/- 0.02, N = 3)
2: 12.33 (SE +/- 0.01, N = 3)
1: 12.31 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

AOM AV1 2.1-rc - Frames Per Second, More Is Better
1: 9.87 (SE +/- 0.01, N = 3)
2: 9.86 (SE +/- 0.00, N = 3)
3: 9.84 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

AOM AV1 2.1-rc - Frames Per Second, More Is Better
2: 62.54 (SE +/- 0.14, N = 3)
3: 62.45 (SE +/- 0.08, N = 3)
1: 62.44 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 - Frames Per Second, More Is Better
3: 3.08 (SE +/- 0.00, N = 3)
2: 3.08 (SE +/- 0.00, N = 3)
1: 3.08 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 - Frames Per Second, More Is Better
1: 47.85 (SE +/- 0.03, N = 3)
3: 47.79 (SE +/- 0.08, N = 3)
2: 47.78 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 - Frames Per Second, More Is Better
3: 101.98 (SE +/- 0.10, N = 3)
2: 101.85 (SE +/- 0.10, N = 3)
1: 101.85 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 - Frames Per Second, More Is Better
1: 87.85 (SE +/- 0.10, N = 3)
3: 87.74 (SE +/- 0.04, N = 3)
2: 87.66 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 - Frames Per Second, More Is Better
3: 88.06 (SE +/- 0.17, N = 3)
2: 88.04 (SE +/- 0.06, N = 3)
1: 88.01 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 - Frames Per Second, More Is Better
2: 70.00 (SE +/- 0.07, N = 3)
3: 69.87 (SE +/- 0.05, N = 3)
1: 69.82 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0 - Seconds, Fewer Is Better
2: 126.19 (SE +/- 0.05, N = 3)
1: 126.26 (SE +/- 0.14, N = 3)
3: 126.39 (SE +/- 0.09, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 - Seconds, Fewer Is Better
1: 1106.50 (SE +/- 0.06, N = 3)
2: 1106.64 (SE +/- 0.31, N = 3)
3: 1106.68 (SE +/- 0.28, N = 3)

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
3: 7.97862 (SE +/- 0.00688, N = 3; MIN: 7.79)
1: 7.98948 (SE +/- 0.01467, N = 3; MIN: 7.81)
2: 8.01298 (SE +/- 0.02581, N = 3; MIN: 7.85)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
3: 12.06 (SE +/- 0.01, N = 3; MIN: 11.91)
2: 12.10 (SE +/- 0.02, N = 3; MIN: 11.78)
1: 12.28 (SE +/- 0.01, N = 3; MIN: 12.09)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 3.65702 (SE +/- 0.00650, N = 3; MIN: 3.61)
3: 3.65956 (SE +/- 0.00493, N = 3; MIN: 3.62)
2: 3.66132 (SE +/- 0.00209, N = 3; MIN: 3.61)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
3: 3.20399 (SE +/- 0.00597, N = 3; MIN: 3.13)
2: 3.28091 (SE +/- 0.01431, N = 3; MIN: 3.2)
1: 3.28942 (SE +/- 0.00631, N = 3; MIN: 3.22)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 20.90 (SE +/- 0.01, N = 3; MIN: 20.84)
3: 20.90 (SE +/- 0.01, N = 3; MIN: 20.81)
2: 20.93 (SE +/- 0.04, N = 3; MIN: 20.82)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
2: 14.40 (SE +/- 0.05, N = 3; MIN: 10.62)
3: 14.43 (SE +/- 0.02, N = 3; MIN: 10.61)
1: 14.46 (SE +/- 0.04, N = 3; MIN: 10.63)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 14.40 (SE +/- 0.02, N = 3; MIN: 14.25)
3: 14.46 (SE +/- 0.02, N = 3; MIN: 14.29)
2: 14.51 (SE +/- 0.03, N = 3; MIN: 14.31)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 20.50 (SE +/- 0.01, N = 3; MIN: 20.32)
3: 20.52 (SE +/- 0.02, N = 3; MIN: 20.32)
2: 20.52 (SE +/- 0.02, N = 3; MIN: 20.29)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
2: 4.76344 (SE +/- 0.00463, N = 3; MIN: 4.73)
1: 4.77871 (SE +/- 0.01143, N = 3; MIN: 4.73)
3: 4.78576 (SE +/- 0.01370, N = 3; MIN: 4.73)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 8.07519 (SE +/- 0.01132, N = 3; MIN: 8.01)
3: 8.08423 (SE +/- 0.01391, N = 3; MIN: 8.03)
2: 8.10192 (SE +/- 0.01834, N = 3; MIN: 8.04)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
3: 7394.30 (SE +/- 2.94, N = 3; MIN: 7382.12)
1: 7395.55 (SE +/- 4.38, N = 3; MIN: 7383.32)
2: 7399.68 (SE +/- 4.06, N = 3; MIN: 7381.45)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 3948.98 (SE +/- 1.93, N = 3; MIN: 3941.23)
3: 3951.90 (SE +/- 1.51, N = 3; MIN: 3944.83)
2: 3955.17 (SE +/- 7.15, N = 3; MIN: 3940.03)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 7393.22 (SE +/- 1.70, N = 3; MIN: 7380.82)
3: 7396.59 (SE +/- 3.85, N = 3; MIN: 7381.63)
2: 7401.56 (SE +/- 2.46, N = 3; MIN: 7389.32)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
2: 3952.08 (SE +/- 2.76, N = 3; MIN: 3941.26)
1: 3952.65 (SE +/- 2.90, N = 3; MIN: 3942.22)
3: 3955.00 (SE +/- 2.54, N = 3; MIN: 3946.66)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
3: 5.38817 (SE +/- 0.00691, N = 3; MIN: 5.33)
2: 5.42101 (SE +/- 0.00660, N = 3; MIN: 5.36)
1: 5.42817 (SE +/- 0.00412, N = 3; MIN: 5.36)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
1: 7395.13 (SE +/- 2.52, N = 3; MIN: 7381.08)
2: 7396.46 (SE +/- 5.16, N = 3; MIN: 7381.2)
3: 7408.91 (SE +/- 15.57, N = 3; MIN: 7383.82)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
2: 3954.96 (SE +/- 0.47, N = 3; MIN: 3947.75)
1: 3955.37 (SE +/- 2.53, N = 3; MIN: 3945.55)
3: 3957.84 (SE +/- 4.66, N = 3; MIN: 3944.06)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
2: 5.90636 (SE +/- 0.00128, N = 3; MIN: 5.87)
3: 5.90709 (SE +/- 0.00186, N = 3; MIN: 5.87)
1: 5.91706 (SE +/- 0.00433, N = 3; MIN: 5.87)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 - Seconds, Fewer Is Better
1: 8.9180 (SE +/- 0.0175, N = 3)
3: 8.9296 (SE +/- 0.0096, N = 3)
2: 8.9380 (SE +/- 0.0040, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 - Seconds, Fewer Is Better
2: 32.99 (SE +/- 0.01, N = 3)
1: 33.01 (SE +/- 0.01, N = 3)
3: 33.02 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4 - Seconds, Fewer Is Better
2: 256.09 (SE +/- 0.00, N = 3)
3: 256.17 (SE +/- 0.05, N = 3)
1: 256.21 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Basis Universal

Settings: ETC1S

Basis Universal 1.13 - Seconds, Fewer Is Better
1: 36.77 (SE +/- 0.03, N = 3)
2: 36.84 (SE +/- 0.05, N = 3)
3: 36.84 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 - Seconds, Fewer Is Better
3: 10.99 (SE +/- 0.01, N = 3)
2: 11.00 (SE +/- 0.01, N = 3)
1: 11.01 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 - Seconds, Fewer Is Better
3: 72.89 (SE +/- 0.01, N = 3)
1: 72.90 (SE +/- 0.02, N = 3)
2: 72.91 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 - Seconds, Fewer Is Better
2: 144.08 (SE +/- 0.00, N = 3)
3: 144.08 (SE +/- 0.02, N = 3)
1: 144.17 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
3: 7.458 (SE +/- 0.011, N = 3; MIN: 7.38 / MAX: 29.38)
2: 7.479 (SE +/- 0.012, N = 3; MIN: 7.4 / MAX: 11.93)
1: 7.550 (SE +/- 0.015, N = 3; MIN: 7.38 / MAX: 30.78)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
3: 45.28 (SE +/- 0.14, N = 3; MIN: 44.97 / MAX: 67.28)
2: 45.35 (SE +/- 0.09, N = 3; MIN: 45.03 / MAX: 69.72)
1: 45.71 (SE +/- 0.15, N = 3; MIN: 44.4 / MAX: 74.55)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
3: 4.001 (SE +/- 0.018, N = 3; MIN: 3.91 / MAX: 27.34)
2: 4.011 (SE +/- 0.012, N = 3; MIN: 3.92 / MAX: 27.37)
1: 4.033 (SE +/- 0.014, N = 3; MIN: 3.94 / MAX: 25.54)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
3: 4.534 (SE +/- 0.005, N = 3; MIN: 4.49 / MAX: 8.91)
2: 4.545 (SE +/- 0.006, N = 3; MIN: 4.48 / MAX: 28.03)
1: 4.577 (SE +/- 0.013, N = 3; MIN: 4.48 / MAX: 26.26)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
2: 54.84 (SE +/- 0.49, N = 3; MIN: 53.62 / MAX: 78.53)
3: 55.04 (SE +/- 0.56, N = 3; MIN: 53.69 / MAX: 79.04)
1: 55.43 (SE +/- 0.21, N = 3; MIN: 54.13 / MAX: 79.1)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Sysbench

Test: RAM / Memory

Sysbench 1.0.20 - MiB/sec, More Is Better
3: 16845.49 (SE +/- 83.93, N = 3)
2: 16726.66 (SE +/- 70.80, N = 3)
1: 16690.54 (SE +/- 100.94, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

Sysbench 1.0.20 - Events Per Second, More Is Better
1: 7854.28 (SE +/- 0.37, N = 3)
2: 7845.28 (SE +/- 0.62, N = 3)
3: 7843.34 (SE +/- 0.29, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Stockfish

Total Time

Stockfish 13 - Nodes Per Second, More Is Better
1: 10343841 (SE +/- 62662.98, N = 3)
2: 10338531 (SE +/- 5126.49, N = 3)
3: 10254227 (SE +/- 48708.15, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver


Phoronix Test Suite v10.8.4