nn gravity too

Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2106185-IB-NNGRAVITY72&grt.

nn gravity tooProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution123Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads)Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS)Intel Cannon Lake PCH64GBSamsung SSD 970 EVO Plus 250GBNVIDIA Quadro RTX 4000 8GBIntel Cannon Lake PCH cAVSVE228Intel I219-LM + Intel I210Ubuntu 20.105.8.0-41-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.9NVIDIA 460.32.034.6.0OpenCL 1.2 CUDA 11.2.1091.2.155GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3 Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

nn gravity toogravitymark: 800 x 600 - OpenGLgravitymark: 800 x 600 - Vulkangravitymark: 1024 x 768 - OpenGLgravitymark: 1024 x 768 - Vulkangravitymark: 1280 x 1024 - OpenGLgravitymark: 1280 x 1024 - Vulkangravitymark: 1920 x 1080 - OpenGLgravitymark: 1920 x 1080 - Vulkanmnn: mobilenetV3mnn: squeezenetv1.1mnn: resnet-v2-50mnn: SqueezeNetV1.0mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1123136.2157.9123.3140.3110.1129.298.6114.21.9043.98729.1655.8853.2202.74539.06117.794.934.163.573.916.371.6314.7064.4815.6313.5226.3127.2619.659.313120.889293.18962.971269.273135.3157.4120.7139.1108.2128.893.4107.62.0054.09030.1176.3143.5943.04540.70418.625.044.003.513.826.661.6215.1967.8516.6813.8327.7429.9320.949.733147.923293.47363.426268.105132.7150.3113.7137.4106.4126.491.8106.21.9684.13230.6306.1813.4833.14340.09418.475.053.993.533.866.651.6616.1069.1816.4313.7027.8329.4420.9110.043140.922293.97563.611268.400OpenBenchmarking.org

GravityMark

Resolution: 800 x 600 - Renderer: OpenGL

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 800 x 600 - Renderer: OpenGL123306090120150SE +/- 1.07, N = 3SE +/- 1.05, N = 3SE +/- 0.75, N = 3136.2135.3132.7

GravityMark

Resolution: 800 x 600 - Renderer: Vulkan

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 800 x 600 - Renderer: Vulkan123306090120150SE +/- 0.47, N = 3SE +/- 0.64, N = 3SE +/- 1.91, N = 3157.9157.4150.3

GravityMark

Resolution: 1024 x 768 - Renderer: OpenGL

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1024 x 768 - Renderer: OpenGL123306090120150SE +/- 1.77, N = 3SE +/- 2.00, N = 3SE +/- 0.40, N = 3123.3120.7113.7

GravityMark

Resolution: 1024 x 768 - Renderer: Vulkan

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1024 x 768 - Renderer: Vulkan123306090120150SE +/- 0.77, N = 3SE +/- 0.35, N = 3SE +/- 0.30, N = 3140.3139.1137.4

GravityMark

Resolution: 1280 x 1024 - Renderer: OpenGL

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1280 x 1024 - Renderer: OpenGL12320406080100SE +/- 0.49, N = 3SE +/- 0.27, N = 3SE +/- 0.24, N = 3110.1108.2106.4

GravityMark

Resolution: 1280 x 1024 - Renderer: Vulkan

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1280 x 1024 - Renderer: Vulkan123306090120150SE +/- 0.15, N = 3SE +/- 0.42, N = 3SE +/- 0.27, N = 3129.2128.8126.4

GravityMark

Resolution: 1920 x 1080 - Renderer: OpenGL

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1920 x 1080 - Renderer: OpenGL12320406080100SE +/- 0.15, N = 3SE +/- 0.20, N = 3SE +/- 0.12, N = 398.693.491.8

GravityMark

Resolution: 1920 x 1080 - Renderer: Vulkan

OpenBenchmarking.orgFrames Per Second, More Is BetterGravityMark 1.1bResolution: 1920 x 1080 - Renderer: Vulkan123306090120150SE +/- 0.27, N = 3SE +/- 0.32, N = 3SE +/- 0.35, N = 3114.2107.6106.2

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenetV31230.45110.90221.35331.80442.2555SE +/- 0.036, N = 12SE +/- 0.024, N = 5SE +/- 0.026, N = 31.9042.0051.968MIN: 1.64 / MAX: 60.45MIN: 1.65 / MAX: 17.33MIN: 1.68 / MAX: 3.081. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: squeezenetv1.11230.92971.85942.78913.71884.6485SE +/- 0.064, N = 12SE +/- 0.020, N = 5SE +/- 0.039, N = 33.9874.0904.132MIN: 3.52 / MAX: 64.16MIN: 3.37 / MAX: 16.23MIN: 3.52 / MAX: 16.21. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: resnet-v2-50123714212835SE +/- 0.28, N = 12SE +/- 0.69, N = 5SE +/- 0.70, N = 329.1730.1230.63MIN: 24.94 / MAX: 173.42MIN: 24.97 / MAX: 132.38MIN: 24.92 / MAX: 190.051. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: SqueezeNetV1.0123246810SE +/- 0.065, N = 12SE +/- 0.072, N = 5SE +/- 0.019, N = 35.8856.3146.181MIN: 5.4 / MAX: 62.33MIN: 5.39 / MAX: 18.8MIN: 5.37 / MAX: 35.961. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: MobileNetV2_2241230.80871.61742.42613.23484.0435SE +/- 0.015, N = 12SE +/- 0.073, N = 5SE +/- 0.029, N = 33.2203.5943.483MIN: 2.96 / MAX: 14.31MIN: 2.97 / MAX: 70.94MIN: 2.99 / MAX: 15.891. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenet-v1-1.01230.70721.41442.12162.82883.536SE +/- 0.005, N = 12SE +/- 0.022, N = 5SE +/- 0.097, N = 32.7453.0453.143MIN: 2.61 / MAX: 6.6MIN: 2.63 / MAX: 40.53MIN: 2.6 / MAX: 65.361. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: inception-v3123918273645SE +/- 0.15, N = 12SE +/- 0.25, N = 5SE +/- 0.14, N = 339.0640.7040.09MIN: 37.3 / MAX: 163.69MIN: 36.22 / MAX: 159.23MIN: 37.86 / MAX: 95.031. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mobilenet123510152025SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 317.7918.6218.47MIN: 17.41 / MAX: 18.31MIN: 17.39 / MAX: 33.09MIN: 17.35 / MAX: 42.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v2-v2 - Model: mobilenet-v21231.13632.27263.40894.54525.6815SE +/- 0.08, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 34.935.045.05MIN: 4.43 / MAX: 8.48MIN: 4.45 / MAX: 7.77MIN: 4.46 / MAX: 13.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU-v3-v3 - Model: mobilenet-v31230.9361.8722.8083.7444.68SE +/- 0.25, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 34.164.003.99MIN: 3.68 / MAX: 54.3MIN: 3.54 / MAX: 5.25MIN: 3.54 / MAX: 11.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: shufflenet-v21230.80331.60662.40993.21324.0165SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 33.573.513.53MIN: 3.32 / MAX: 4.25MIN: 3.15 / MAX: 4.81MIN: 3.16 / MAX: 6.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: mnasnet1230.87981.75962.63943.51924.399SE +/- 0.04, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 33.913.823.86MIN: 3.65 / MAX: 4.27MIN: 3.4 / MAX: 5.01MIN: 3.43 / MAX: 5.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: efficientnet-b0123246810SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 36.376.666.65MIN: 6.15 / MAX: 6.86MIN: 5.88 / MAX: 10.22MIN: 5.87 / MAX: 7.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: blazeface1230.37350.7471.12051.4941.8675SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 31.631.621.66MIN: 1.53 / MAX: 1.88MIN: 1.44 / MAX: 2.18MIN: 1.46 / MAX: 11.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: googlenet12348121620SE +/- 0.11, N = 3SE +/- 0.18, N = 3SE +/- 1.09, N = 314.7015.1916.10MIN: 14.14 / MAX: 23MIN: 13.48 / MAX: 26.33MIN: 13.82 / MAX: 148.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: vgg161231530456075SE +/- 0.06, N = 3SE +/- 0.36, N = 3SE +/- 1.56, N = 364.4867.8569.18MIN: 64.25 / MAX: 65.85MIN: 66.53 / MAX: 91.9MIN: 66.51 / MAX: 245.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet1812348121620SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.17, N = 315.6316.6816.43MIN: 15.27 / MAX: 16.32MIN: 16 / MAX: 25.72MIN: 15.36 / MAX: 17.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: alexnet12348121620SE +/- 0.39, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 313.5213.8313.70MIN: 12.8 / MAX: 109.15MIN: 12.92 / MAX: 27.92MIN: 12.92 / MAX: 23.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: resnet50123714212835SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 326.3127.7427.83MIN: 25.61 / MAX: 27.16MIN: 26.51 / MAX: 42.46MIN: 26.66 / MAX: 37.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: yolov4-tiny123714212835SE +/- 0.06, N = 3SE +/- 0.84, N = 3SE +/- 0.96, N = 327.2629.9329.44MIN: 26.99 / MAX: 29.89MIN: 27.64 / MAX: 158.01MIN: 27.58 / MAX: 149.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: squeezenet_ssd123510152025SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.23, N = 319.6520.9420.91MIN: 19.3 / MAX: 20.1MIN: 19.59 / MAX: 104.32MIN: 19.51 / MAX: 28.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210525Target: CPU - Model: regnety_400m1233691215SE +/- 0.06, N = 3SE +/- 0.23, N = 3SE +/- 0.35, N = 39.319.7310.04MIN: 9.08 / MAX: 9.84MIN: 8.67 / MAX: 12.44MIN: 8.68 / MAX: 74.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNet1237001400210028003500SE +/- 8.70, N = 3SE +/- 13.56, N = 3SE +/- 24.54, N = 33120.893147.923140.92MIN: 3039.95 / MAX: 3494.45MIN: 3048 / MAX: 3606.13MIN: 3041.73 / MAX: 3462.91. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v212360120180240300SE +/- 1.90, N = 3SE +/- 0.28, N = 3SE +/- 1.34, N = 3293.19293.47293.98MIN: 288.6 / MAX: 381.12MIN: 289.28 / MAX: 346.47MIN: 290.22 / MAX: 384.551. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v21231428425670SE +/- 0.16, N = 3SE +/- 0.05, N = 3SE +/- 0.48, N = 362.9763.4363.61MIN: 62.31 / MAX: 64.38MIN: 62.48 / MAX: 69.72MIN: 62.55 / MAX: 72.621. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.112360120180240300SE +/- 0.30, N = 3SE +/- 0.21, N = 3SE +/- 0.32, N = 3269.27268.11268.40MIN: 266.86 / MAX: 289.03MIN: 267.39 / MAX: 270.38MIN: 267.63 / MAX: 288.891. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl


Phoronix Test Suite v10.8.4