phoronix_test_gpu.txt

Docker testing on Ubuntu 20.04.3 LTS via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2202135-NE-PHORONIXT96&grr.

phoronix_test_gpu.txt: system details for "NVIDIA GeForce RTX 3090"

Processor: Intel Core i9-10900X @ 4.50GHz (10 Cores / 20 Threads)
Motherboard: ASUS PRIME X299-A II (0702 BIOS)
Memory: 64GB
Disk: 1000GB Western Digital WDS100T2B0B + 4001GB Western Digital WD4003FFBX-6
Graphics: NVIDIA GeForce RTX 3090 24GB
Audio: Realtek ALC1220
OS: Ubuntu 20.04.3 LTS
Kernel: 5.11.0-27-generic (x86_64)
Display Driver: NVIDIA
Compiler: GCC 9.3.0
File-System: overlayfs
Screen Resolution: 1920x1080
System Layer: Docker

- Transparent Huge Pages: madvise
- GCC configure options: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: intel_pstate powersave (EPP: balance_performance)
- CPU Microcode: 0x5003102
- BAR1 / Visible vRAM Size: 256 MiB
- Python 3.8.10
- Security mitigations: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

phoronix_test_gpu.txt: result overview

Test                                        NVIDIA GeForce RTX 3090
ncnn: Vulkan GPU - regnety_400m             120.10
ncnn: Vulkan GPU - squeezenet_ssd           391.18
ncnn: Vulkan GPU - yolov4-tiny              544.47
ncnn: Vulkan GPU - resnet50                 769.86
ncnn: Vulkan GPU - alexnet                  250.27
ncnn: Vulkan GPU - resnet18                 309.63
ncnn: Vulkan GPU - vgg16                    1634.15
ncnn: Vulkan GPU - googlenet                302.88
ncnn: Vulkan GPU - blazeface                32.43
ncnn: Vulkan GPU - efficientnet-b0          137.37
ncnn: Vulkan GPU - mnasnet                  84.19
ncnn: Vulkan GPU - shufflenet-v2            68.27
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3       80.92
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2       81.42
ncnn: Vulkan GPU - mobilenet                267.63
blender: Barbershop - NVIDIA OptiX          1756.70
blender: Classroom - NVIDIA OptiX           425.78
blender: Pabellon Barcelona - NVIDIA OptiX  535.88
blender: Fishy Cat - NVIDIA OptiX           214.67
blender: BMW27 - NVIDIA OptiX               148.74
viennacl: CPU BLAS - dGEMV-N                13.3
viennacl: CPU BLAS - dGEMM-TT               28.4
viennacl: CPU BLAS - dGEMM-TN               28.3
viennacl: CPU BLAS - dGEMM-NT               28.2
viennacl: CPU BLAS - dGEMM-NN               27.4
viennacl: CPU BLAS - dGEMV-T                14.9
viennacl: CPU BLAS - dDOT                   9.67
viennacl: CPU BLAS - dAXPY                  19.3
viennacl: CPU BLAS - dCOPY                  13.8
viennacl: CPU BLAS - sDOT                   6.04
viennacl: CPU BLAS - sAXPY                  16.6
viennacl: CPU BLAS - sCOPY                  9.96
blender: Barbershop - CUDA                  61.51
blender: BMW27 - CUDA                       9.06
blender: Pabellon Barcelona - CUDA          38.19
blender: Fishy Cat - CUDA                   19.13
blender: Classroom - CUDA                   14.34
neatbench: GPU                              3090
mixbench: NVIDIA CUDA - Integer             (no value in this export)

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 120.10 (SE +/- 1.82, N = 9; MIN: 110.16 / MAX: 234.91)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 391.18 (SE +/- 4.69, N = 9; MIN: 357.37 / MAX: 579.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 544.47 (SE +/- 8.76, N = 9; MIN: 503.92 / MAX: 790.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 769.86 (SE +/- 8.07, N = 9; MIN: 735.36 / MAX: 1189.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 250.27 (SE +/- 3.85, N = 9; MIN: 230.93 / MAX: 398.79)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 309.63 (SE +/- 5.68, N = 9; MIN: 287.89 / MAX: 489.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 1634.15 (SE +/- 98.85, N = 9; MIN: 1446.18 / MAX: 220919.6)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 302.88 (SE +/- 4.26, N = 9; MIN: 276.38 / MAX: 451.14)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 32.43 (SE +/- 0.79, N = 9; MIN: 21.87 / MAX: 83.55)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 137.37 (SE +/- 2.04, N = 9; MIN: 122.44 / MAX: 260.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 84.19 (SE +/- 1.42, N = 9; MIN: 75.32 / MAX: 182.36)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 68.27 (SE +/- 1.05, N = 9; MIN: 61.33 / MAX: 153.26)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 80.92 (SE +/- 1.16, N = 9; MIN: 72.94 / MAX: 185.99)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 81.42 (SE +/- 1.18, N = 9; MIN: 73.42 / MAX: 193.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20210720, ms (fewer is better)
NVIDIA GeForce RTX 3090: 267.63 (SE +/- 4.62, N = 9; MIN: 238.39 / MAX: 459.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
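
The NCNN means span very different magnitudes, so the raw SE figures are hard to compare across models. The following is a minimal Python sketch, not part of the Phoronix Test Suite output, that expresses each SE as a percentage of its mean. The means and SE values are copied from the NCNN results above; the names `ncnn_results`, `relative_se_percent`, and `ranked` are illustrative assumptions.

```python
# Mean latency (ms) and standard error (ms), copied from the NCNN results above.
ncnn_results = {
    "regnety_400m": (120.10, 1.82),
    "squeezenet_ssd": (391.18, 4.69),
    "yolov4-tiny": (544.47, 8.76),
    "resnet50": (769.86, 8.07),
    "alexnet": (250.27, 3.85),
    "resnet18": (309.63, 5.68),
    "vgg16": (1634.15, 98.85),
    "googlenet": (302.88, 4.26),
    "blazeface": (32.43, 0.79),
    "efficientnet-b0": (137.37, 2.04),
    "mnasnet": (84.19, 1.42),
    "shufflenet-v2": (68.27, 1.05),
    "mobilenet-v3": (80.92, 1.16),
    "mobilenet-v2": (81.42, 1.18),
    "mobilenet": (267.63, 4.62),
}


def relative_se_percent(mean_ms, se_ms):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se_ms / mean_ms


# Rank models from the largest relative SE to the smallest.
ranked = sorted(
    ((model, relative_se_percent(mean, se)) for model, (mean, se) in ncnn_results.items()),
    key=lambda item: item[1],
    reverse=True,
)

for model, pct in ranked:
    print(f"{model:16s} SE = {pct:.2f}% of mean")
```

Ranked this way, vgg16 comes first, with an SE of roughly 6% of its mean.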

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 1756.70 (SE +/- 2.18, N = 3)

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 425.78 (SE +/- 4.59, N = 5)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 535.88 (SE +/- 0.49, N = 3)

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 214.67 (SE +/- 0.92, N = 3)

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 148.74 (SE +/- 0.29, N = 3)

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 13.3 (SE +/- 1.19, N = 11)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1, GFLOPs/s (more is better)
NVIDIA GeForce RTX 3090: 28.4 (SE +/- 0.59, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1, GFLOPs/s (more is better)
NVIDIA GeForce RTX 3090: 28.3 (SE +/- 0.53, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1, GFLOPs/s (more is better)
NVIDIA GeForce RTX 3090: 28.2 (SE +/- 0.46, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1, GFLOPs/s (more is better)
NVIDIA GeForce RTX 3090: 27.4 (SE +/- 0.64, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 14.9 (SE +/- 2.03, N = 10)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 9.67 (SE +/- 0.53, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 19.3 (SE +/- 0.94, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 13.8 (SE +/- 0.84, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 6.04 (SE +/- 0.58, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 16.6 (SE +/- 2.41, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1, GB/s (more is better)
NVIDIA GeForce RTX 3090: 9.96 (SE +/- 1.10, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Blender

Blend File: Barbershop - Compute: CUDA

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 61.51 (SE +/- 0.75, N = 4)

Blender

Blend File: BMW27 - Compute: CUDA

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 9.06 (SE +/- 0.14, N = 15)

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 38.19 (SE +/- 0.19, N = 3)

Blender

Blend File: Fishy Cat - Compute: CUDA

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 19.13 (SE +/- 0.06, N = 3)

Blender

Blend File: Classroom - Compute: CUDA

Blender 3.0, Seconds (fewer is better)
NVIDIA GeForce RTX 3090: 14.34 (SE +/- 0.05, N = 3)
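
The five Blender scenes were rendered with both the NVIDIA OptiX and the CUDA backend, so the two sets of times can be compared per scene. The following is a minimal Python sketch, not part of the Phoronix Test Suite output, that divides each OptiX time by the matching CUDA time. The times are copied from the Blender results above; the names `blender_seconds` and `optix_to_cuda_ratio` are illustrative assumptions, and the sketch does not attempt to explain the differences.

```python
# Render times in seconds as (NVIDIA OptiX, CUDA), copied from the Blender results above.
blender_seconds = {
    "Barbershop": (1756.70, 61.51),
    "Classroom": (425.78, 14.34),
    "Pabellon Barcelona": (535.88, 38.19),
    "Fishy Cat": (214.67, 19.13),
    "BMW27": (148.74, 9.06),
}

# Ratio above 1 means the OptiX run took longer than the CUDA run for that scene.
optix_to_cuda_ratio = {
    scene: optix / cuda for scene, (optix, cuda) in blender_seconds.items()
}

for scene, ratio in optix_to_cuda_ratio.items():
    optix, cuda = blender_seconds[scene]
    print(f"{scene:20s} OptiX {optix:8.2f} s  CUDA {cuda:6.2f} s  ratio {ratio:5.1f}x")
```

In this result file every ratio is above 1, meaning each scene took longer with OptiX than with CUDA.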

NeatBench

Acceleration: GPU

NeatBench 5, FPS (more is better)
NVIDIA GeForce RTX 3090: 3090


Phoronix Test Suite v10.8.4