nvidia-tests

AMD FX-8350 Eight-Core testing with a ASUS SABERTOOTH 990FX R2.0 (2901 BIOS) and MSI NVIDIA GeForce GT 1030 2GB on Gentoo/Linux via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2111171-TJ-NVIDIATES55.

nvidia-tests ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDisplay ServerDisplay DriverOpenCLVulkanCompilerFile-SystemScreen ResolutionDesktopOpenGLGentooGentoo_OCWin10_OCAMD FX-8350 Eight-Core @ 4.00GHz (4 Cores / 8 Threads)ASUS SABERTOOTH 990FX R2.0 (2901 BIOS)AMD RD9x0/RX98016GB1000GB CT1000MX500SSD1 + 3001GB TOSHIBA DT01ACA3 + 2000GB Seagate ST2000DL003-9VT1 + 500GB CT500MX500SSD1MSI NVIDIA GeForce GT 1030 2GBRealtek ALC892G27QCRealtek RTL8111/8168/8411Gentoo/Linux5.15.2-gentoo-x86_64 (x86_64)X Server 1.20.13NVIDIAOpenCL 3.0 CUDA 11.5.1001.2.186GCC 11.2.0 + Clang 13.0.0 + LLVM 13.0.0 + CUDA 11.5ext41280x1024MSI NVIDIA GeForce GT 1030 2GB OCKDE Plasma 5.22.54.6.02560x14402 x 8192 MB 800MHz Kingston466GB CT500MX5 00SSD1 SATA Disk + 2795GB TOSHIBA DT01ACA300 SATA Disk + 932GB CT1000MX 500SSD1 SATA Disk + 1863GB ST2000DL 003-9VT166 SATA DiskNVIDIA GeForce GT 1030 2GBNVIDIA HD Audio + Realtek HD Audio + NVIDIA Virtual Audio Device (Wave Extensible) (WDM) + HD Webcam C510G7QCVirtualBox Host-Only + Symantec TAP Driver + RAS Async + Bluetooth Device (Personal Area ) + Bluetooth Device (Personal Area ) #2Microsoft Windows 10 Pro Build 1904310.0 (x86_64)496.49 (30.0.14.9649)OpenCL 3.0 CUDA 11.5.76 + OpenCL 1.2 AMD-APP (937.2)GCC 8.3.0 + Clang 6.0.0NTFSOpenBenchmarking.orgKernel Details- Gentoo, Gentoo_OC: Transparent Huge Pages: madviseCompiler Details- Gentoo, Gentoo_OC: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/11.2.0 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0 --disable-esp --disable-fixed-point --disable-libada --disable-libssp --disable-libunwind-exceptions --disable-libvtv --disable-systemtap --disable-valgrind-annotations --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/11.2.0/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/man --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/11.2.0/python --without-isl --without-zstd Processor Details- Gentoo: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852- Gentoo_OC: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x6000852- Win10_OC: CPU Microcode: 5208000600000000Graphics Details- Gentoo: BAR1 / Visible vRAM Size: 256 MiBSecurity Details- Gentoo: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected- Gentoo_OC: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected- Win10_OC: __user pointer sanitization: Disabled + Retpoline: Full + IBPB: AlwaysEnvironment Details- Win10_OC: windows_tracing_flags=3

nvidia-tests cl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferfahbench: luxcorerender: DLSC - GPUluxcorerender: Danish Mood - GPUluxcorerender: Orange Juice - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mindigobench: CPU - Bedroomindigobench: CPU - Supercarindigobench: OpenCL GPU - Bedroomindigobench: OpenCL GPU - Supercarlczero: BLASlczero: Eigenlczero: OpenCLlczero: CUDA + cuDNNneatbench: CPUneatbench: GPUoctanebench: Total Scorerealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yesvkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4vkpeak: int32-scalarvkpeak: int32-vec4vkresample: 2x - Doublevkresample: 2x - Singlewaifu2x-ncnn: 2x - 3 - Nowaifu2x-ncnn: 2x - 3 - YesGentooGentoo_OCWin10_OC37.340.739.46.43353.721139.8241.2739.521.691.4925.73800.30.080.330.111.680.520.150.760.182.1160.0618.1614.7111.4616.0623.553.8355.53400.8058.3635.84118.69123.2059.6249.2621.125.946.894.426.1811.032.3814.6671.5512.5119.8725.5037.7024.367.8348.854.351.36.37385.061243.1445.5752.311.691.4931.65200.340.090.390.132.1121.115.065.864.095.659.602.3513.4958.0811.2715.9622.8935.7822.966.860.4871.2261.0953.122153248762015.87103024.61260774.730564.8551429.711431.5446.0246.01487.20452.104.91710.0094.89928.68726.41650.320.060.380.112.080.480.110.680.161.330.4521.1441.1033.1616824325.86103024.58316273.507563.7141410.301411.5745.4045.4480.54445.740.145159.5145.29928.643OpenBenchmarking.org

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGentooGentoo_OC1122334455SE +/- 0.00, N = 3SE +/- 0.00, N = 337.348.81. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGentooGentoo_OC1224364860SE +/- 0.00, N = 3SE +/- 0.00, N = 340.754.31. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGentooGentoo_OC1224364860SE +/- 0.00, N = 3SE +/- 0.00, N = 339.451.31. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGentooGentoo_OC246810SE +/- 0.03, N = 3SE +/- 0.01, N = 36.436.371. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGentooGentoo_OC80160240320400SE +/- 6.73, N = 15SE +/- 9.00, N = 15353.72385.061. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGentooGentoo_OC30060090012001500SE +/- 26.36, N = 15SE +/- 40.31, N = 151139.821243.141. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGentooGentoo_OC1020304050SE +/- 0.17, N = 3SE +/- 0.44, N = 341.2745.571. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGentooGentoo_OC1224364860SE +/- 0.10, N = 3SE +/- 0.14, N = 339.5252.311. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferGentooGentoo_OC0.38030.76061.14091.52121.9015SE +/- 0.00, N = 3SE +/- 0.00, N = 31.691.691. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferGentooGentoo_OC0.33530.67061.00591.34121.6765SE +/- 0.00, N = 3SE +/- 0.00, N = 31.491.491. (CXX) g++ options: -O3 -rdynamic -lOpenCL

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2GentooGentoo_OCWin10_OC714212835SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.23, N = 325.7431.6526.42

LuxCoreRender

Scene: DLSC - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: GPUGentooGentoo_OCWin10_OC0.07650.1530.22950.3060.3825SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.300.340.32MIN: 0.24 / MAX: 0.31MIN: 0.29 / MAX: 0.35MIN: 0.31

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: GPUGentooGentoo_OCWin10_OC0.02030.04060.06090.08120.1015SE +/- 0.00, N = 15SE +/- 0.00, N = 15SE +/- 0.01, N = 150.080.090.06MAX: 0.17MAX: 0.2MAX: 0.16

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: GPUGentooGentoo_OCWin10_OC0.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.390.38MIN: 0.02 / MAX: 0.4MIN: 0.02 / MAX: 0.44MIN: 0.36 / MAX: 0.39

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: GPUGentooGentoo_OCWin10_OC0.02930.05860.08790.11720.1465SE +/- 0.00, N = 13SE +/- 0.00, N = 3SE +/- 0.00, N = 30.110.130.11MAX: 0.18MAX: 0.21MIN: 0.06 / MAX: 0.16

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: GPUGentooGentoo_OCWin10_OC0.47480.94961.42441.89922.374SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 31.682.112.08MIN: 0.74 / MAX: 1.8MIN: 0.9 / MAX: 2.25MIN: 2.03 / MAX: 2.11

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: CPUGentooWin10_OC0.1170.2340.3510.4680.585SE +/- 0.01, N = 15SE +/- 0.00, N = 30.520.48MIN: 0.47 / MAX: 0.54

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: CPUGentooWin10_OC0.03380.06760.10140.13520.169SE +/- 0.01, N = 15SE +/- 0.01, N = 120.150.11MIN: 0.03 / MAX: 0.32MIN: 0.02 / MAX: 0.23

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: CPUGentooWin10_OC0.1710.3420.5130.6840.855SE +/- 0.01, N = 15SE +/- 0.00, N = 30.760.68MIN: 0.62 / MAX: 0.83MIN: 0.67

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: CPUGentooWin10_OC0.04050.0810.12150.1620.2025SE +/- 0.00, N = 15SE +/- 0.00, N = 150.180.16MIN: 0.03 / MAX: 0.32MIN: 0.1 / MAX: 0.21

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: CPUGentooWin10_OC0.47480.94961.42441.89922.374SE +/- 0.02, N = 3SE +/- 0.01, N = 32.111.33MIN: 2.07 / MAX: 2.14MIN: 1.32 / MAX: 1.36

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetGentoo1326395265SE +/- 0.13, N = 360.06MIN: 59.06 / MAX: 66.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v2-v2 - Model: mobilenet-v2Gentoo48121620SE +/- 0.10, N = 318.16MIN: 17.53 / MAX: 24.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v3-v3 - Model: mobilenet-v3Gentoo48121620SE +/- 0.05, N = 314.71MIN: 14.46 / MAX: 21.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Gentoo3691215SE +/- 0.05, N = 311.46MIN: 11.21 / MAX: 18.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetGentoo48121620SE +/- 0.01, N = 316.06MIN: 15.83 / MAX: 22.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Gentoo612182430SE +/- 0.12, N = 323.55MIN: 22.86 / MAX: 30.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceGentoo0.86181.72362.58543.44724.309SE +/- 0.01, N = 33.83MIN: 3.71 / MAX: 7.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetGentoo1224364860SE +/- 0.05, N = 355.53MIN: 54.84 / MAX: 62.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: vgg16Gentoo90180270360450SE +/- 0.10, N = 3400.80MIN: 398.23 / MAX: 409.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet18Gentoo1326395265SE +/- 0.04, N = 358.36MIN: 57.89 / MAX: 64.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetGentoo816243240SE +/- 0.03, N = 335.84MIN: 35.14 / MAX: 41.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Gentoo306090120150SE +/- 0.17, N = 3118.69MIN: 117.93 / MAX: 125.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyGentoo306090120150SE +/- 0.07, N = 3123.20MIN: 121.61 / MAX: 130.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdGentoo1326395265SE +/- 0.22, N = 359.62MIN: 58.66 / MAX: 66.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mGentoo1122334455SE +/- 0.06, N = 349.26MIN: 48.84 / MAX: 57.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetGentooGentoo_OC510152025SE +/- 0.41, N = 15SE +/- 0.70, N = 1221.1221.11MIN: 16.16 / MAX: 40.01MIN: 14.97 / MAX: 34.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2GentooGentoo_OC1.33652.6734.00955.3466.6825SE +/- 0.10, N = 15SE +/- 0.14, N = 125.945.06MIN: 4.9 / MAX: 11.53MIN: 4.38 / MAX: 8.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3GentooGentoo_OC246810SE +/- 0.11, N = 15SE +/- 0.15, N = 126.895.86MIN: 5.58 / MAX: 12.25MIN: 4.99 / MAX: 10.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2GentooGentoo_OC0.99451.9892.98353.9784.9725SE +/- 0.11, N = 15SE +/- 0.12, N = 124.424.09MIN: 3.91 / MAX: 9.88MIN: 3.64 / MAX: 7.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetGentooGentoo_OC246810SE +/- 0.13, N = 15SE +/- 0.15, N = 126.185.65MIN: 5.08 / MAX: 10.21MIN: 4.55 / MAX: 11.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0GentooGentoo_OC3691215SE +/- 0.09, N = 15SE +/- 0.16, N = 1211.039.60MIN: 8.25 / MAX: 26.42MIN: 7.33 / MAX: 15.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceGentooGentoo_OC0.53551.0711.60652.1422.6775SE +/- 0.17, N = 15SE +/- 0.20, N = 122.382.35MIN: 1.7 / MAX: 8.13MIN: 1.63 / MAX: 9.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetGentooGentoo_OC48121620SE +/- 0.11, N = 15SE +/- 0.12, N = 1214.6613.49MIN: 11.86 / MAX: 22.78MIN: 10.68 / MAX: 20.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16GentooGentoo_OC1632486480SE +/- 0.10, N = 15SE +/- 0.06, N = 1271.5558.08MIN: 68.13 / MAX: 94.57MIN: 54.62 / MAX: 81.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18GentooGentoo_OC3691215SE +/- 0.09, N = 15SE +/- 0.13, N = 1212.5111.27MIN: 10.3 / MAX: 21.73MIN: 9.16 / MAX: 33.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetGentooGentoo_OC510152025SE +/- 0.03, N = 15SE +/- 0.05, N = 1219.8715.96MIN: 18.06 / MAX: 30.37MIN: 14.07 / MAX: 21.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50GentooGentoo_OC612182430SE +/- 0.05, N = 15SE +/- 0.07, N = 1225.5022.89MIN: 22.49 / MAX: 36.56MIN: 19.87 / MAX: 37.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyGentooGentoo_OC918273645SE +/- 0.63, N = 15SE +/- 1.03, N = 1237.7035.78MIN: 28.5 / MAX: 63MIN: 26.39 / MAX: 56.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdGentooGentoo_OC612182430SE +/- 0.24, N = 15SE +/- 0.18, N = 1224.3622.96MIN: 18.49 / MAX: 37.19MIN: 16.96 / MAX: 331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mGentooGentoo_OC246810SE +/- 0.12, N = 15SE +/- 0.15, N = 127.836.86MIN: 6.27 / MAX: 16.64MIN: 5.7 / MAX: 11.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomWin10_OCGentoo_OC0.10960.21920.32880.43840.548SE +/- 0.002, N = 3SE +/- 0.002, N = 30.4520.487

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarWin10_OCGentoo_OC0.27590.55180.82771.10361.3795SE +/- 0.013, N = 4SE +/- 0.003, N = 31.1441.226

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomWin10_OCGentoo_OC0.24820.49640.74460.99281.241SE +/- 0.001, N = 3SE +/- 0.002, N = 31.1031.095

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarWin10_OCGentoo_OC0.71121.42242.13362.84483.556SE +/- 0.003, N = 3SE +/- 0.016, N = 33.1613.122

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASWin10_OC150300450600750SE +/- 2.96, N = 3682

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenWin10_OCGentoo_OC90180270360450SE +/- 0.67, N = 34321531. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLGentoo_OC5001000150020002500SE +/- 10.37, N = 324871. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: CUDA + cuDNN

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: CUDA + cuDNNGentoo_OC13002600390052006500SE +/- 21.79, N = 362011. (CXX) g++ options: -flto -pthread

NeatBench

Acceleration: CPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPUWin10_OCGentoo_OC1.32082.64163.96245.28326.604SE +/- 0.55, N = 16SE +/- 0.55, N = 165.865.87

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPUWin10_OCGentoo_OC200400600800100010301030

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreWin10_OCGentoo_OC61218243024.5824.61

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoWin10_OCGentoo_OC20406080100SE +/- 0.14, N = 3SE +/- 0.01, N = 373.5174.73

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesWin10_OCGentoo_OC120240360480600SE +/- 0.05, N = 3SE +/- 0.20, N = 3563.71564.86

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarWin10_OCGentoo_OC30060090012001500SE +/- 0.01, N = 3SE +/- 0.03, N = 31410.301429.71

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4Win10_OCGentoo_OC30060090012001500SE +/- 0.06, N = 3SE +/- 0.09, N = 31411.571431.54

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarWin10_OCGentoo_OC1020304050SE +/- 0.01, N = 3SE +/- 0.00, N = 345.4046.02

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4Win10_OCGentoo_OC1020304050SE +/- 0.00, N = 345.4046.01

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarWin10_OCGentoo_OC110220330440550SE +/- 0.06, N = 3SE +/- 0.07, N = 3480.54487.20

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4Win10_OCGentoo_OC100200300400500SE +/- 0.02, N = 3SE +/- 0.01, N = 3445.74452.10

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleWin10_OCGentoo_OC1.10632.21263.31894.42525.5315SE +/- 0.109, N = 12SE +/- 0.365, N = 150.1454.9171. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleWin10_OCGentoo_OC4080120160200SE +/- 0.06, N = 3SE +/- 0.00, N = 3159.5110.011. (CXX) g++ options: -O3 -pthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: NoWin10_OCGentoo_OC1.19232.38463.57694.76925.9615SE +/- 0.047, N = 13SE +/- 0.059, N = 85.2994.899

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesWin10_OCGentoo_OC714212835SE +/- 0.09, N = 3SE +/- 0.03, N = 328.6428.69


Phoronix Test Suite v10.8.4