stargazer-nvidia-test

2 x Intel Xeon E5-2640 v4 testing with a Supermicro X10DRG-O+-CPU v1.00 (2.0c BIOS) and MSI NVIDIA GeForce GTX 1080 Ti 11GB on Ubuntu 18.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103180-HA-STARGAZER99.

stargazer-nvidia-testProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay ServerDisplay DriverOpenCLVulkanCompilerFile-SystemScreen ResolutionMSI NVIDIA GeForce GTX 1080 Ti2 x Intel Xeon E5-2640 v4 @ 3.40GHz (20 Cores / 40 Threads)Supermicro X10DRG-O+-CPU v1.00 (2.0c BIOS)Intel Xeon E7 v4/Xeon24 x 32 GB DDR4-1600MT/s HMA84GR7AFR4N-UH1920GB INTEL SSDSC2KB01MSI NVIDIA GeForce GTX 1080 Ti 11GBNVIDIA GP102 HDMI Audio2 x Broadcom NetXtreme II BCM57810 10 + 2 x Intel I350Ubuntu 18.045.4.0-67-generic (x86_64)X ServerNVIDIAOpenCL 1.2 CUDA 11.2.1361.2.155GCC 7.5.0 + CUDA 11.1ext41024x768OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xb000038- Python 2.7.17 + Python 3.6.9- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

stargazer-nvidia-testmixbench: OpenCL - Integermixbench: NVIDIA CUDA - Integermixbench: OpenCL - Double Precisionmixbench: OpenCL - Single Precisionmixbench: NVIDIA CUDA - Half Precisionmixbench: NVIDIA CUDA - Double Precisionmixbench: NVIDIA CUDA - Single Precisionviennacl: OpenCL LU Factorizationcl-mem: Copycl-mem: Readcl-mem: Writefahbench: rodinia: OpenCL Particle Filterarrayfire: Conjugate Gradient OpenCLfinancebench: Black-Scholes OpenCLncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory BandwidthMSI NVIDIA GeForce GTX 1080 Ti3022.393337.10348.589440.16215.58427.4111702.3141.0898306.0367.3211.0190.60268.4663.43977.027166651.39355.69377.80322.05349.05489.13127.35457.54455.08363.46188.25826.70451.41537.264229.231279.947362.82423.70355.52OpenBenchmarking.org

Mixbench

Backend: OpenCL - Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: IntegerMSI NVIDIA GeForce GTX 1080 Ti6001200180024003000SE +/- 71.53, N = 123022.391. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerMSI NVIDIA GeForce GTX 1080 Ti7001400210028003500SE +/- 5.36, N = 33337.101. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Double PrecisionMSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 6.68, N = 15348.581. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Single PrecisionMSI NVIDIA GeForce GTX 1080 Ti2K4K6K8K10KSE +/- 225.98, N = 159440.161. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Half Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionMSI NVIDIA GeForce GTX 1080 Ti50100150200250SE +/- 0.12, N = 3215.581. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionMSI NVIDIA GeForce GTX 1080 Ti90180270360450SE +/- 0.94, N = 3427.411. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionMSI NVIDIA GeForce GTX 1080 Ti3K6K9K12K15KSE +/- 94.68, N = 311702.311. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationMSI NVIDIA GeForce GTX 1080 Ti918273645SE +/- 0.76, N = 1541.091. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyMSI NVIDIA GeForce GTX 1080 Ti70140210280350SE +/- 0.10, N = 3306.01. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadMSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 0.37, N = 3367.31. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteMSI NVIDIA GeForce GTX 1080 Ti50100150200250SE +/- 2.38, N = 3211.01. (CC) gcc options: -O2 -flto -lOpenCL

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2MSI NVIDIA GeForce GTX 1080 Ti4080120160200SE +/- 2.00, N = 3190.60

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterMSI NVIDIA GeForce GTX 1080 Ti246810SE +/- 0.080, N = 68.4661. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLMSI NVIDIA GeForce GTX 1080 Ti0.77381.54762.32143.09523.869SE +/- 0.022, N = 33.4391. (CXX) g++ options: -rdynamic

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLMSI NVIDIA GeForce GTX 1080 Ti20406080100SE +/- 8.68, N = 1277.031. (CXX) g++ options: -O3 -march=native -fopenmp

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenetMSI NVIDIA GeForce GTX 1080 Ti140280420560700SE +/- 175.62, N = 9651.39MIN: 24.83 / MAX: 2529.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2MSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 193.36, N = 9355.69MIN: 9.22 / MAX: 2167.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3MSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 196.61, N = 9377.80MIN: 8.13 / MAX: 2429.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v2MSI NVIDIA GeForce GTX 1080 Ti70140210280350SE +/- 152.49, N = 9322.05MIN: 9.06 / MAX: 2212.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnetMSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 177.07, N = 9349.05MIN: 8.47 / MAX: 2348.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0MSI NVIDIA GeForce GTX 1080 Ti110220330440550SE +/- 210.61, N = 9489.13MIN: 11.4 / MAX: 2409.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazefaceMSI NVIDIA GeForce GTX 1080 Ti306090120150SE +/- 84.25, N = 9127.35MIN: 4.33 / MAX: 1156.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenetMSI NVIDIA GeForce GTX 1080 Ti100200300400500SE +/- 199.92, N = 9457.54MIN: 21.92 / MAX: 3198.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16MSI NVIDIA GeForce GTX 1080 Ti100200300400500SE +/- 27.68, N = 9455.08MIN: 49.07 / MAX: 966.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18MSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 62.04, N = 9363.46MIN: 17.57 / MAX: 1420.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnetMSI NVIDIA GeForce GTX 1080 Ti4080120160200SE +/- 27.13, N = 9188.25MIN: 12.25 / MAX: 482.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet50MSI NVIDIA GeForce GTX 1080 Ti2004006008001000SE +/- 180.53, N = 9826.70MIN: 30.68 / MAX: 3046.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tinyMSI NVIDIA GeForce GTX 1080 Ti100200300400500SE +/- 66.15, N = 9451.41MIN: 38.22 / MAX: 1367.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssdMSI NVIDIA GeForce GTX 1080 Ti120240360480600SE +/- 156.67, N = 9537.26MIN: 28.19 / MAX: 2768.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400mMSI NVIDIA GeForce GTX 1080 Ti9001800270036004500SE +/- 1659.34, N = 94229.23MIN: 48.3 / MAX: 20492.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTMSI NVIDIA GeForce GTX 1080 Ti30060090012001500SE +/- 203.28, N = 151279.941. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatMSI NVIDIA GeForce GTX 1080 Ti16003200480064008000SE +/- 1049.15, N = 127362.821. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleMSI NVIDIA GeForce GTX 1080 Ti90180270360450SE +/- 4.67, N = 3423.701. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthMSI NVIDIA GeForce GTX 1080 Ti80160240320400SE +/- 0.47, N = 3355.521. (CXX) g++ options: -O3 -rdynamic -lOpenCL


Phoronix Test Suite v10.8.4