res.txt

2 x Intel Xeon Gold 6244 testing with a Dell 060K5C (2.4.1 BIOS) and NVIDIA Quadro GV100 32GB on Ubuntu 20.04.6 LTS via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402069-NE-RESTXT08482.

res.txt: system details

Runs: 2024-02-05 13:53, 2024-02-05 13:58, 2024-02-05 18:03, 2024-02-05 18:36, 2024-02-06 08:56

Processor: 2 x Intel Xeon Gold 6244 @ 4.40GHz (16 Cores / 32 Threads)
Motherboard: Dell 060K5C (2.4.1 BIOS)
Memory: 128GB
Disk: PM981a NVMe SAMSUNG 2048GB + 4 x 8002GB TOSHIBA MG06ACA8
Graphics: NVIDIA Quadro GV100 32GB
OS: Ubuntu 20.04.6 LTS
Kernel: 3.10.0-1160.95.1.el7.x86_64 (x86_64)
Display Driver: NVIDIA
Vulkan: 1.1.182
Compiler: GCC 9.4.0 + CUDA 12.0
File-System: xfs
Screen Resolution: 800x600

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-9QDOt0/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- 2024-02-05 13:53: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003604
- 2024-02-05 13:58: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003604
- 2024-02-05 18:03: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x5003604
- 2024-02-05 18:36: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x5003604
- 2024-02-06 08:56: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x5003604

Python Details
- Python 3.8.10

res.txt: results summary

Units: ncnn results are in ms (fewer is better); plaidml results are in FPS (more is better).
Runs 2024-02-05 13:53 and 2024-02-05 18:03 are listed in the export without values. A "-" marks a test with no value for that run.

Test                                            2024-02-05 13:58  2024-02-05 18:36  2024-02-06 08:56
ncnn: CPU - mobilenet                                      16.92             29.71             17.13
ncnn: CPU-v2-v2 - mobilenet-v2                              7.18            373.65              7.03
ncnn: CPU-v3-v3 - mobilenet-v3                              6.72            459.15              6.57
ncnn: CPU - shufflenet-v2                                   7.38            181.58              7.22
ncnn: CPU - mnasnet                                         6.41            131.20              6.38
ncnn: CPU - efficientnet-b0                                 8.47              8.53              8.37
ncnn: CPU - blazeface                                       2.91              2.83              3.02
ncnn: CPU - googlenet                                      16.21            120.62             19.81
ncnn: CPU - vgg16                                          43.20            594.96             78.54
ncnn: CPU - resnet18                                        9.98            104.14              9.95
ncnn: CPU - alexnet                                         6.80             74.46              6.72
ncnn: CPU - resnet50                                       18.46           1279.57             17.98
ncnn: CPU - yolov4-tiny                                    30.89            254.60             42.97
ncnn: CPU - squeezenet_ssd                                 15.34           1029.95             16.01
ncnn: CPU - regnety_400m                                   21.42             25.33             20.72
ncnn: CPU - vision_transformer                             67.77           1647.77             87.12
ncnn: CPU - FastestDet                                      8.27           1112.16              8.09
ncnn: Vulkan GPU - mobilenet                               16.73            181.76             18.16
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2                       7.06             84.63              6.98
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3                       6.67             81.53              6.53
ncnn: Vulkan GPU - shufflenet-v2                            7.52              7.29              7.21
ncnn: Vulkan GPU - mnasnet                                  6.46              6.35              6.28
ncnn: Vulkan GPU - efficientnet-b0                          8.69              8.46              8.21
ncnn: Vulkan GPU - blazeface                                3.01              2.94              2.99
ncnn: Vulkan GPU - googlenet                               16.68             16.22             16.08
ncnn: Vulkan GPU - vgg16                                   43.35            501.67             85.16
ncnn: Vulkan GPU - resnet18                                10.01              9.85              9.69
ncnn: Vulkan GPU - alexnet                                  6.65              6.83              6.64
ncnn: Vulkan GPU - resnet50                                18.15            565.39             18.03
ncnn: Vulkan GPU - yolov4-tiny                             31.54            421.96             38.80
ncnn: Vulkan GPU - squeezenet_ssd                          15.46            244.36             14.93
ncnn: Vulkan GPU - regnety_400m                            21.47            112.44             20.71
ncnn: Vulkan GPU - vision_transformer                      68.48           1263.09             84.98
ncnn: Vulkan GPU - FastestDet                               8.16            654.73              7.32
plaidml: No - Training - ResNet 50 - CPU                       -                 -              0.37
plaidml: No - Inference - ResNet 50 - CPU                      -                 -              5.03
plaidml: Yes - Inference - NASNer Large - CPU                  -                 -                 -
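The 2024-02-05 18:36 run differs sharply from the other two on many tests, and a per-test ratio makes this easier to see than raw milliseconds. The following is a minimal sketch, not part of the Phoronix Test Suite: the `results` dictionary and the `slowdown` helper are hypothetical names introduced here, and only three CPU-target rows from the summary values above are copied in for brevity.

```python
# Mean latencies in ms (fewer is better) for three ncnn CPU tests,
# copied by hand from the summary values above. Keys are run names.
results = {
    "mobilenet": {
        "2024-02-05 13:58": 16.92,
        "2024-02-05 18:36": 29.71,
        "2024-02-06 08:56": 17.13,
    },
    "efficientnet-b0": {
        "2024-02-05 13:58": 8.47,
        "2024-02-05 18:36": 8.53,
        "2024-02-06 08:56": 8.37,
    },
    "resnet50": {
        "2024-02-05 13:58": 18.46,
        "2024-02-05 18:36": 1279.57,
        "2024-02-06 08:56": 17.98,
    },
}


def slowdown(results, baseline, other):
    """Return {test: other / baseline}; a ratio above 1 means `other` was slower."""
    return {name: runs[other] / runs[baseline] for name, runs in results.items()}


ratios = slowdown(results, "2024-02-05 13:58", "2024-02-05 18:36")

# Print the tests from most to least affected.
for name, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:16s} {ratio:8.2f}x")
```

A ratio near 1 means the two runs agree. Large ratios paired with the large SE values reported for the same run suggest a few extreme samples rather than a uniform slowdown, though the export alone does not establish the cause.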

NCNN

Target: CPU - Model: mobilenet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 16.92 (SE +/- 0.30, N = 15, MIN: 13.75 / MAX: 53.31)
  2024-02-05 18:36: 29.71 (SE +/- 12.39, N = 9, MIN: 15.53 / MAX: 1342.94)
  2024-02-06 08:56: 17.13 (SE +/- 0.29, N = 11, MIN: 14.45 / MAX: 20.43)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 7.18 (SE +/- 0.12, N = 15)
  2024-02-05 18:36: 373.65 (SE +/- 175.05, N = 9)
  2024-02-06 08:56: 7.03 (SE +/- 0.05, N = 11)
  MIN: 5.54 / MAX: 1167 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.72 (SE +/- 0.08, N = 15)
  2024-02-05 18:36: 459.15 (SE +/- 224.10, N = 9)
  2024-02-06 08:56: 6.57 (SE +/- 0.04, N = 11)
  MIN: 5.6 / MAX: 1369.7 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 7.38 (SE +/- 0.04, N = 15)
  2024-02-05 18:36: 181.58 (SE +/- 174.29, N = 9)
  2024-02-06 08:56: 7.22 (SE +/- 0.03, N = 11)
  MIN: 6.15 / MAX: 1598.4 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.41 (SE +/- 0.09, N = 15)
  2024-02-05 18:36: 131.20 (SE +/- 124.99, N = 9)
  2024-02-06 08:56: 6.38 (SE +/- 0.10, N = 11)
  MIN: 4.8 / MAX: 1145.53 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 8.47 (SE +/- 0.10, N = 15, MIN: 7.07 / MAX: 42.9)
  2024-02-05 18:36: 8.53 (SE +/- 0.14, N = 9, MIN: 7.16 / MAX: 13.3)
  2024-02-06 08:56: 8.37 (SE +/- 0.07, N = 11, MIN: 8.01 / MAX: 11.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 2.91 (SE +/- 0.08, N = 15, MIN: 2.16 / MAX: 5.98)
  2024-02-05 18:36: 2.83 (SE +/- 0.12, N = 9, MIN: 2.13 / MAX: 5.08)
  2024-02-06 08:56: 3.02 (SE +/- 0.02, N = 11, MIN: 2.84 / MAX: 22.6)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 16.21 (SE +/- 0.28, N = 15, MIN: 12.89 / MAX: 38.38)
  2024-02-05 18:36: 120.62 (SE +/- 104.57, N = 9, MIN: 13.63 / MAX: 1852.64)
  2024-02-06 08:56: 19.81 (SE +/- 3.51, N = 11, MIN: 15.64 / MAX: 1809.29)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 43.20 (SE +/- 0.31, N = 15)
  2024-02-05 18:36: 594.96 (SE +/- 34.48, N = 9)
  2024-02-06 08:56: 78.54 (SE +/- 4.72, N = 11)
  MIN: 39.51 / MAX: 679.59 and MIN: 39.02 / MAX: 663.86 (reported for two runs; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 9.98 (SE +/- 0.16, N = 15, MIN: 8.07 / MAX: 48.75)
  2024-02-05 18:36: 104.14 (SE +/- 94.16, N = 9, MIN: 8.47 / MAX: 867.98)
  2024-02-06 08:56: 9.95 (SE +/- 0.14, N = 11, MIN: 9.3 / MAX: 13.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.80 (SE +/- 0.08, N = 15)
  2024-02-05 18:36: 74.46 (SE +/- 46.29, N = 9)
  2024-02-06 08:56: 6.72 (SE +/- 0.05, N = 11)
  MIN: 6.37 / MAX: 397.08 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 18.46 (SE +/- 0.18, N = 15)
  2024-02-05 18:36: 1279.57 (SE +/- 237.42, N = 9)
  2024-02-06 08:56: 17.98 (SE +/- 0.20, N = 11)
  MIN: 17.05 / MAX: 1680.15 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 30.89 (SE +/- 0.37, N = 15, MIN: 25.55 / MAX: 393.68)
  2024-02-05 18:36: 254.60 (SE +/- 110.09, N = 9, MIN: 26.15 / MAX: 793.43)
  2024-02-06 08:56: 42.97 (SE +/- 5.42, N = 11, MIN: 25.46 / MAX: 767.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 15.34 (SE +/- 0.25, N = 15)
  2024-02-05 18:36: 1029.95 (SE +/- 320.29, N = 9)
  2024-02-06 08:56: 16.01 (SE +/- 0.62, N = 11)
  MIN: 13 / MAX: 1860.66 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 21.42 (SE +/- 0.17, N = 15, MIN: 18.61 / MAX: 50.31)
  2024-02-05 18:36: 25.33 (SE +/- 4.36, N = 9, MIN: 18.49 / MAX: 8765.4)
  2024-02-06 08:56: 20.72 (SE +/- 0.08, N = 11, MIN: 20.11 / MAX: 89.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vision_transformer

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 67.77 (SE +/- 0.47, N = 15)
  2024-02-05 18:36: 1647.77 (SE +/- 36.79, N = 9)
  2024-02-06 08:56: 87.12 (SE +/- 7.53, N = 11)
  MIN: 60.22 / MAX: 1773.37 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: FastestDet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 8.27 (SE +/- 0.33, N = 15)
  2024-02-05 18:36: 1112.16 (SE +/- 194.42, N = 9)
  2024-02-06 08:56: 8.09 (SE +/- 0.36, N = 11)
  MIN: 5.89 / MAX: 1712.78 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 16.73 (SE +/- 0.31, N = 15)
  2024-02-05 18:36: 181.76 (SE +/- 164.65, N = 8)
  2024-02-06 08:56: 18.16 (SE +/- 1.91, N = 9)
  MIN: 14.72 / MAX: 1345.89 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 7.06 (SE +/- 0.08, N = 15)
  2024-02-05 18:36: 84.63 (SE +/- 77.74, N = 8)
  2024-02-06 08:56: 6.98 (SE +/- 0.02, N = 9)
  MIN: 5.23 / MAX: 1181.69 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.67 (SE +/- 0.05, N = 15)
  2024-02-05 18:36: 81.53 (SE +/- 74.95, N = 8)
  2024-02-06 08:56: 6.53 (SE +/- 0.03, N = 9)
  MIN: 5.52 / MAX: 1367.48 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 7.52 (SE +/- 0.06, N = 15, MIN: 6.49 / MAX: 70.78)
  2024-02-05 18:36: 7.29 (SE +/- 0.11, N = 8, MIN: 6.09 / MAX: 9.72)
  2024-02-06 08:56: 7.21 (SE +/- 0.03, N = 9, MIN: 6.9 / MAX: 9.44)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.46 (SE +/- 0.06, N = 15, MIN: 5.63 / MAX: 121.66)
  2024-02-05 18:36: 6.35 (SE +/- 0.16, N = 8, MIN: 5.01 / MAX: 10.28)
  2024-02-06 08:56: 6.28 (SE +/- 0.03, N = 9, MIN: 6.07 / MAX: 8.66)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 8.69 (SE +/- 0.12, N = 15, MIN: 7.5 / MAX: 90.56)
  2024-02-05 18:36: 8.46 (SE +/- 0.18, N = 8, MIN: 7.17 / MAX: 12.2)
  2024-02-06 08:56: 8.21 (SE +/- 0.03, N = 9, MIN: 7.9 / MAX: 14.11)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 3.01 (SE +/- 0.04, N = 15, MIN: 2.56 / MAX: 5.33)
  2024-02-05 18:36: 2.94 (SE +/- 0.10, N = 8, MIN: 2.2 / MAX: 5.14)
  2024-02-06 08:56: 2.99 (SE +/- 0.01, N = 9, MIN: 2.86 / MAX: 5.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 16.68 (SE +/- 0.13, N = 15, MIN: 14.68 / MAX: 41.63)
  2024-02-05 18:36: 16.22 (SE +/- 0.28, N = 8, MIN: 13.66 / MAX: 19.49)
  2024-02-06 08:56: 16.08 (SE +/- 0.06, N = 9, MIN: 15.44 / MAX: 19.02)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 43.35 (SE +/- 0.55, N = 15)
  2024-02-05 18:36: 501.67 (SE +/- 94.33, N = 8)
  2024-02-06 08:56: 85.16 (SE +/- 8.40, N = 9)
  MIN: 38.99 / MAX: 680.92 and MIN: 38.78 / MAX: 657.39 (reported for two runs; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 10.01 (SE +/- 0.11, N = 15, MIN: 8.29 / MAX: 66.34)
  2024-02-05 18:36: 9.85 (SE +/- 0.18, N = 8, MIN: 8.54 / MAX: 12.61)
  2024-02-06 08:56: 9.69 (SE +/- 0.04, N = 9, MIN: 9.24 / MAX: 12.4)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 6.65 (SE +/- 0.05, N = 15, MIN: 5.83 / MAX: 17.95)
  2024-02-05 18:36: 6.83 (SE +/- 0.05, N = 8, MIN: 6.38 / MAX: 9.35)
  2024-02-06 08:56: 6.64 (SE +/- 0.02, N = 8, MIN: 6.38 / MAX: 44.69)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 18.15 (SE +/- 0.16, N = 15)
  2024-02-05 18:36: 565.39 (SE +/- 236.61, N = 8)
  2024-02-06 08:56: 18.03 (SE +/- 0.37, N = 9)
  MIN: 16.76 / MAX: 1675.59 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 31.54 (SE +/- 0.99, N = 15)
  2024-02-05 18:36: 421.96 (SE +/- 120.86, N = 8)
  2024-02-06 08:56: 38.80 (SE +/- 3.53, N = 9)
  MIN: 26.32 / MAX: 796.07 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 15.46 (SE +/- 0.25, N = 15)
  2024-02-05 18:36: 244.36 (SE +/- 228.09, N = 8)
  2024-02-06 08:56: 14.93 (SE +/- 0.41, N = 9)
  MIN: 13.25 / MAX: 1854.12 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 21.47 (SE +/- 0.21, N = 15, MIN: 18.78 / MAX: 94.51)
  2024-02-05 18:36: 112.44 (SE +/- 91.36, N = 8, MIN: 18.72 / MAX: 8799.8)
  2024-02-06 08:56: 20.71 (SE +/- 0.04, N = 9, MIN: 20.11 / MAX: 57.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 68.48 (SE +/- 0.79, N = 15)
  2024-02-05 18:36: 1263.09 (SE +/- 261.92, N = 8)
  2024-02-06 08:56: 84.98 (SE +/- 7.30, N = 9)
  MIN: 59.44 / MAX: 1769.56 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20230517 (ms, Fewer Is Better)
  2024-02-05 13:58: 8.16 (SE +/- 0.29, N = 15)
  2024-02-05 18:36: 654.73 (SE +/- 284.19, N = 8)
  2024-02-06 08:56: 7.32 (SE +/- 0.31, N = 9)
  MIN: 5.88 / MAX: 1709.7 (reported for one run; the export does not say which)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

PlaidML

FP16: No - Mode: Training - Network: ResNet 50 - Device: CPU

PlaidML (FPS, More Is Better)
  2024-02-06 08:56: 0.37 (SE +/- 0.00, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

PlaidML (FPS, More Is Better)
  2024-02-06 08:56: 5.03 (SE +/- 0.02, N = 3)


Phoronix Test Suite v10.8.4