GTX 950 CompuLab Airtop Intel Core i7-5775C testing with a CompuLab v1.0 (ARTP-3.1.0.637.0.0 X64 BIOS) and eVGA NVIDIA GeForce GTX 950 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2011049-FI-GTX950COM84&sor&grr .
GTX 950 CompuLab Airtop Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Run 1 2 3 Intel Core i7-5775C @ 3.70GHz (4 Cores / 8 Threads) CompuLab v1.0 (ARTP-3.1.0.637.0.0 X64 BIOS) Intel Broadwell-U DMI 16GB 256GB ADATA SP310 eVGA NVIDIA GeForce GTX 950 2GB (1151/3304MHz) Realtek ALC886 G237HL Intel I218-LM + Intel I210 + Intel 7260 Ubuntu 20.10 5.8.0-26-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 NVIDIA 455.28 4.6.0 OpenCL 1.2 CUDA 11.1.96 1.2.142 GCC 10.2.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x22 - Thermald 2.3 OpenCL Details - GPU Compute Cores: 768 Python Details - Run 1: Python 3.8.6 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
GTX 950 CompuLab Airtop blender: Barbershop - CUDA blender: Pabellon Barcelona - NVIDIA OptiX redshift: blender: Pabellon Barcelona - CUDA realsr-ncnn: 4x - Yes blender: Classroom - NVIDIA OptiX blender: Classroom - CUDA blender: Fishy Cat - NVIDIA OptiX blender: Fishy Cat - CUDA blender: BMW27 - NVIDIA OptiX blender: BMW27 - CUDA lczero: OpenCL luxcorerender-cl: LuxCore Benchmark luxcorerender-cl: Food octanebench: Total Score luxcorerender-cl: DLSC fahbench: luxcorerender-cl: Rainbow Colors and Prism vkfft: realsr-ncnn: 4x - No ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU - squeezenet namd-cuda: ATPase Simulation - 327,506 Atoms clpeak: Double-Precision Double waifu2x-ncnn: 2x - 3 - Yes cl-mem: Copy cl-mem: Read cl-mem: Write mandelgpu: GPU hashcat: SHA-512 hashcat: MD5 hashcat: SHA1 clpeak: Single-Precision Float viennacl: OpenCL LU Factorization arrayfire: Conjugate Gradient OpenCL waifu2x-ncnn: 2x - 3 - No clpeak: Integer Compute INT financebench: Black-Scholes OpenCL neatbench: GPU clpeak: Global Memory Bandwidth Run 1 2 3 3167.76 1960.71 2131 1896.87 404.653 1021.55 894.18 715.41 646.84 349.29 308.30 3444 0.71 0.32 43.928409 0.90 49.3974 3.10 6633 47.241 20.05 14.34 8.81 6.24 34.00 8.57 1.01 5.78 3.35 2.62 3.77 3.21 10.84 12.44 0.65245 66.66 20.251 79.2 90.8 88 54591952.5 181900000 4834000000 1687633333 1870.18 35.6619 8.296 3.280 542.12 29.131933 9.52 90.68 3297.83 2301.38 2131 2114.70 405.516 1099.24 912.44 949.11 827.47 403.46 381.11 3413 0.72 0.33 43.603681 0.93 49.7603 3.16 6618 47.240 20.10 15.11 8.86 6.37 35.21 8.56 1.01 5.84 3.36 2.64 3.78 3.22 10.92 12.58 0.69626 65.62 20.266 79.2 90.8 87.8 53723378.2 171366667 4835500000 1687600000 1859.87 35.5972 8.297 3.284 531.80 26.676667 9.39 89.32 3297.57 2301.04 2176 2109.62 413.812 1100.13 912.76 950.93 827.39 403.51 381.65 3337 0.71 0.31 42.382028 0.91 49.0599 3.09 6466 48.298 20.04 15.75 9.01 6.60 36.20 8.90 1.10 6.06 3.47 2.80 3.98 3.33 11.19 12.91 0.71381 65.83 20.640 79.2 90.8 87.8 54824518.7 184033333 4706166667 1659433333 1842.12 35.3600 8.420 3.380 532.00 27.372667 9.38 89.59 OpenBenchmarking.org
Blender Blend File: Barbershop - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: CUDA Run 1 3 2 700 1400 2100 2800 3500 SE +/- 33.13, N = 9 SE +/- 0.19, N = 3 SE +/- 0.54, N = 3 3167.76 3297.57 3297.83
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Run 1 3 2 500 1000 1500 2000 2500 SE +/- 0.08, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 1960.71 2301.04 2301.38
RedShift Demo OpenBenchmarking.org Seconds, Fewer Is Better RedShift Demo 3.0 Run 1 2 3 500 1000 1500 2000 2500 SE +/- 2.03, N = 3 SE +/- 2.19, N = 3 2131 2131 2176
Blender Blend File: Pabellon Barcelona - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: CUDA Run 1 3 2 500 1000 1500 2000 2500 SE +/- 5.87, N = 3 SE +/- 2.34, N = 3 SE +/- 2.93, N = 3 1896.87 2109.62 2114.70
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Run 1 2 3 90 180 270 360 450 SE +/- 6.37, N = 9 SE +/- 6.32, N = 9 SE +/- 6.35, N = 9 404.65 405.52 413.81
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: NVIDIA OptiX Run 1 2 3 200 400 600 800 1000 SE +/- 2.26, N = 3 SE +/- 0.58, N = 3 SE +/- 0.08, N = 3 1021.55 1099.24 1100.13
Blender Blend File: Classroom - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: CUDA Run 1 2 3 200 400 600 800 1000 SE +/- 1.20, N = 3 SE +/- 0.89, N = 3 SE +/- 0.60, N = 3 894.18 912.44 912.76
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: NVIDIA OptiX Run 1 2 3 200 400 600 800 1000 SE +/- 0.10, N = 3 SE +/- 1.20, N = 3 SE +/- 0.39, N = 3 715.41 949.11 950.93
Blender Blend File: Fishy Cat - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: CUDA Run 1 3 2 200 400 600 800 1000 SE +/- 0.04, N = 3 SE +/- 0.31, N = 3 SE +/- 0.15, N = 3 646.84 827.39 827.47
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: NVIDIA OptiX Run 1 2 3 90 180 270 360 450 SE +/- 7.22, N = 9 SE +/- 0.19, N = 3 SE +/- 0.14, N = 3 349.29 403.46 403.51
Blender Blend File: BMW27 - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CUDA Run 1 2 3 80 160 240 320 400 SE +/- 1.08, N = 3 SE +/- 0.44, N = 3 SE +/- 0.54, N = 3 308.30 381.11 381.65
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: OpenCL Run 1 2 3 700 1400 2100 2800 3500 SE +/- 50.87, N = 3 SE +/- 24.89, N = 3 SE +/- 43.33, N = 3 3444 3413 3337 1. (CXX) g++ options: -flto -pthread
LuxCoreRender OpenCL Scene: LuxCore Benchmark OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: LuxCore Benchmark 2 3 Run 1 0.162 0.324 0.486 0.648 0.81 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 12 0.72 0.71 0.71 MIN: 0.14 / MAX: 0.86 MIN: 0.14 / MAX: 0.85 MIN: 0.07 / MAX: 0.86
LuxCoreRender OpenCL Scene: Food OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: Food 2 Run 1 3 0.0743 0.1486 0.2229 0.2972 0.3715 SE +/- 0.00, N = 3 SE +/- 0.01, N = 12 SE +/- 0.00, N = 3 0.33 0.32 0.31 MIN: 0.11 / MAX: 0.41 MIN: 0.06 / MAX: 0.42 MIN: 0.1 / MAX: 0.4
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 2020.1 Total Score Run 1 2 3 10 20 30 40 50 43.93 43.60 42.38
LuxCoreRender OpenCL Scene: DLSC OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: DLSC 2 3 Run 1 0.2093 0.4186 0.6279 0.8372 1.0465 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 12 0.93 0.91 0.90 MIN: 0.85 / MAX: 0.95 MIN: 0.83 / MAX: 0.93 MIN: 0.19 / MAX: 0.94
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 2 Run 1 3 11 22 33 44 55 SE +/- 0.16, N = 3 SE +/- 0.17, N = 3 SE +/- 0.06, N = 3 49.76 49.40 49.06
LuxCoreRender OpenCL Scene: Rainbow Colors and Prism OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender OpenCL 2.3 Scene: Rainbow Colors and Prism 2 Run 1 3 0.711 1.422 2.133 2.844 3.555 SE +/- 0.01, N = 3 SE +/- 0.06, N = 12 SE +/- 0.00, N = 3 3.16 3.10 3.09 MIN: 2.2 / MAX: 3.3 MIN: 0.69 / MAX: 3.3 MIN: 2.41 / MAX: 3.21
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 2020-09-29 Run 1 2 3 1400 2800 4200 5600 7000 SE +/- 32.85, N = 3 SE +/- 18.01, N = 3 SE +/- 16.86, N = 3 6633 6618 6466
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No 2 Run 1 3 11 22 33 44 55 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 47.24 47.24 48.30
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: yolov4-tiny 3 Run 1 2 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 20.04 20.05 20.10 MIN: 17.33 / MAX: 29.73 MIN: 15.64 / MAX: 21.87 MIN: 18.89 / MAX: 21.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet50 Run 1 2 3 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 14.34 15.11 15.75 MIN: 13.73 / MAX: 16.79 MIN: 13.88 / MAX: 17.23 MIN: 14.11 / MAX: 20.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: alexnet Run 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 8.81 8.86 9.01 MIN: 8.56 / MAX: 11.16 MIN: 8.62 / MAX: 10.61 MIN: 8.66 / MAX: 12.62 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: resnet18 Run 1 2 3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.24 6.37 6.60 MIN: 6.11 / MAX: 8.26 MIN: 6.02 / MAX: 8.14 MIN: 6.16 / MAX: 10.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: vgg16 Run 1 2 3 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 34.00 35.21 36.20 MIN: 33.54 / MAX: 36.65 MIN: 33.68 / MAX: 38.75 MIN: 34.35 / MAX: 41.96 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: googlenet 2 Run 1 3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.56 8.57 8.90 MIN: 8.39 / MAX: 9.07 MIN: 8.28 / MAX: 11.09 MIN: 8.49 / MAX: 10.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: blazeface Run 1 2 3 0.2475 0.495 0.7425 0.99 1.2375 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 1.01 1.01 1.10 MIN: 1 / MAX: 1.23 MAX: 1.16 MIN: 1.01 / MAX: 3.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: efficientnet-b0 Run 1 2 3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 5.78 5.84 6.06 MIN: 5.55 / MAX: 9.45 MIN: 5.6 / MAX: 8.14 MIN: 5.63 / MAX: 9.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mnasnet Run 1 2 3 0.7808 1.5616 2.3424 3.1232 3.904 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 3.35 3.36 3.47 MIN: 3.29 / MAX: 4.84 MIN: 3.28 / MAX: 4.97 MIN: 3.27 / MAX: 5.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: shufflenet-v2 Run 1 2 3 0.63 1.26 1.89 2.52 3.15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.62 2.64 2.80 MIN: 2.51 / MAX: 4.09 MIN: 2.55 / MAX: 4.15 MIN: 2.55 / MAX: 5.67 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Run 1 2 3 0.8955 1.791 2.6865 3.582 4.4775 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 3.77 3.78 3.98 MIN: 3.67 / MAX: 5.48 MIN: 3.67 / MAX: 5.63 MIN: 3.67 / MAX: 7.06 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Run 1 2 3 0.7493 1.4986 2.2479 2.9972 3.7465 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 3.21 3.22 3.33 MIN: 3.13 / MAX: 4.86 MIN: 3.14 / MAX: 4.81 MIN: 3.13 / MAX: 5.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: mobilenet Run 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 10.84 10.92 11.19 MIN: 9.36 / MAX: 12.61 MIN: 9.24 / MAX: 13.32 MIN: 9.62 / MAX: 15.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20200916 Target: Vulkan GPU - Model: squeezenet Run 1 2 3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 12.44 12.58 12.91 MIN: 11.77 / MAX: 12.86 MIN: 11.71 / MAX: 15.31 MIN: 11.86 / MAX: 18.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NAMD CUDA ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms Run 1 2 3 0.1606 0.3212 0.4818 0.6424 0.803 SE +/- 0.00639, N = 3 SE +/- 0.00141, N = 3 SE +/- 0.00135, N = 3 0.65245 0.69626 0.71381
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Run 1 3 2 15 30 45 60 75 SE +/- 0.05, N = 3 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 66.66 65.83 65.62 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Run 1 2 3 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 SE +/- 0.09, N = 3 20.25 20.27 20.64
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy 3 2 Run 1 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 79.2 79.2 79.2 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read 3 2 Run 1 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 90.8 90.8 90.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Run 1 3 2 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 88.0 87.8 87.8 1. (CC) gcc options: -O2 -flto -lOpenCL
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU 3 Run 1 2 12M 24M 36M 48M 60M SE +/- 364793.78, N = 3 SE +/- 69191.69, N = 3 SE +/- 291084.10, N = 3 54824518.7 54591952.5 53723378.2 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 3 Run 1 2 40M 80M 120M 160M 200M SE +/- 635959.47, N = 3 SE +/- 1660177.84, N = 15 SE +/- 536449.23, N = 3 184033333 181900000 171366667
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 2 Run 1 3 1000M 2000M 3000M 4000M 5000M SE +/- 8524670.08, N = 3 SE +/- 26534003.34, N = 3 SE +/- 29849195.04, N = 3 4835500000 4834000000 4706166667
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 Run 1 2 3 400M 800M 1200M 1600M 2000M SE +/- 6483140.53, N = 3 SE +/- 5008991.91, N = 3 SE +/- 3933757.04, N = 3 1687633333 1687600000 1659433333
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Run 1 2 3 400 800 1200 1600 2000 SE +/- 31.67, N = 15 SE +/- 25.82, N = 15 SE +/- 33.84, N = 15 1870.18 1859.87 1842.12 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
ViennaCL OpenCL LU Factorization OpenBenchmarking.org GFLOPS, More Is Better ViennaCL 1.4.2 OpenCL LU Factorization Run 1 2 3 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 35.66 35.60 35.36 1. (CXX) g++ options: -rdynamic -lOpenCL
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL Run 1 2 3 2 4 6 8 10 SE +/- 0.004, N = 3 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 8.296 8.297 8.420 1. (CXX) g++ options: -rdynamic
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Run 1 2 3 0.7605 1.521 2.2815 3.042 3.8025 SE +/- 0.012, N = 3 SE +/- 0.006, N = 3 SE +/- 0.013, N = 3 3.280 3.284 3.380
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Run 1 3 2 120 240 360 480 600 SE +/- 6.15, N = 3 SE +/- 4.47, N = 3 SE +/- 6.20, N = 3 542.12 532.00 531.80 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-06-06 Benchmark: Black-Scholes OpenCL 2 3 Run 1 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.00, N = 3 SE +/- 0.38, N = 15 26.68 27.37 29.13 1. (CXX) g++ options: -O3 -lOpenCL
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU Run 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.52 9.39 9.38
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Run 1 3 2 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 90.68 89.59 89.32 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4