GTX 950 CompuLab Airtop
Intel Core i7-5775C testing with a CompuLab v1.0 (ARTP-3.1.0.637.0.0 X64 BIOS) and eVGA NVIDIA GeForce GTX 950 2GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2011049-FI-GTX950COM84&rdt
GTX 950 CompuLab Airtop: system details for Run 1, 2, 3
    Processor: Intel Core i7-5775C @ 3.70GHz (4 Cores / 8 Threads)
    Motherboard: CompuLab v1.0 (ARTP-3.1.0.637.0.0 X64 BIOS)
    Chipset: Intel Broadwell-U DMI
    Memory: 16GB
    Disk: 256GB ADATA SP310
    Graphics: eVGA NVIDIA GeForce GTX 950 2GB (1151/3304MHz)
    Audio: Realtek ALC886
    Monitor: G237HL
    Network: Intel I218-LM + Intel I210 + Intel 7260
    OS: Ubuntu 20.10
    Kernel: 5.8.0-26-generic (x86_64)
    Desktop: GNOME Shell 3.38.1
    Display Server: X Server 1.20.9
    Display Driver: NVIDIA 455.28
    OpenGL: 4.6.0
    OpenCL: OpenCL 1.2 CUDA 11.1.96
    Vulkan: 1.2.142
    Compiler: GCC 10.2.0
    File-System: ext4
    Screen Resolution: 1920x1080
Compiler Details
    --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details
    Scaling Governor: intel_cpufreq ondemand
    CPU Microcode: 0x22
    Thermald 2.3
OpenCL Details
    GPU Compute Cores: 768
Python Details
    Run 1: Python 3.8.6
Security Details
    itlb_multihit: KVM: Mitigation of VMX disabled
    l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
    mds: Mitigation of Clear buffers; SMT vulnerable
    meltdown: Mitigation of PTI
    spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
    spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
    spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
    srbds: Mitigation of Microcode
    tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
GTX 950 CompuLab Airtop: results overview
Test                                            Run 1         Run 2         Run 3
realsr-ncnn: 4x - No                            47.241        47.240        48.298
realsr-ncnn: 4x - Yes                           404.653       405.516       413.812
waifu2x-ncnn: 2x - 3 - No                       3.280         3.284         3.380
waifu2x-ncnn: 2x - 3 - Yes                      20.251        20.266        20.640
vkfft:                                          6633          6618          6466
hashcat: MD5                                    4834000000    4835500000    4706166667
hashcat: SHA1                                   1687633333    1687600000    1659433333
hashcat: SHA-512                                181900000     171366667     184033333
financebench: Black-Scholes OpenCL              29.131933     26.676667     27.372667
viennacl: OpenCL LU Factorization               35.6619       35.5972       35.3600
cl-mem: Copy                                    79.2          79.2          79.2
cl-mem: Read                                    90.8          90.8          90.8
cl-mem: Write                                   88            87.8          87.8
namd-cuda: ATPase Simulation - 327,506 Atoms    0.65245       0.69626       0.71381
octanebench: Total Score                        43.928409     43.603681     42.382028
redshift:                                       2131          2131          2176
luxcorerender-cl: DLSC                          0.90          0.93          0.91
luxcorerender-cl: Food                          0.32          0.33          0.31
luxcorerender-cl: LuxCore Benchmark             0.71          0.72          0.71
luxcorerender-cl: Rainbow Colors and Prism      3.10          3.16          3.09
fahbench:                                       49.3974       49.7603       49.0599
lczero: OpenCL                                  3444          3413          3337
arrayfire: Conjugate Gradient OpenCL            8.296         8.297         8.420
ncnn: Vulkan GPU - squeezenet                   12.44         12.58         12.91
ncnn: Vulkan GPU - mobilenet                    10.84         10.92         11.19
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2           3.21          3.22          3.33
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3           3.77          3.78          3.98
ncnn: Vulkan GPU - shufflenet-v2                2.62          2.64          2.80
ncnn: Vulkan GPU - mnasnet                      3.35          3.36          3.47
ncnn: Vulkan GPU - efficientnet-b0              5.78          5.84          6.06
ncnn: Vulkan GPU - blazeface                    1.01          1.01          1.10
ncnn: Vulkan GPU - googlenet                    8.57          8.56          8.90
ncnn: Vulkan GPU - vgg16                        34.00         35.21         36.20
ncnn: Vulkan GPU - resnet18                     6.24          6.37          6.60
ncnn: Vulkan GPU - alexnet                      8.81          8.86          9.01
ncnn: Vulkan GPU - resnet50                     14.34         15.11         15.75
ncnn: Vulkan GPU - yolov4-tiny                  20.05         20.10         20.04
blender: BMW27 - CUDA                           308.30        381.11        381.65
blender: Classroom - CUDA                       894.18        912.44        912.76
blender: Fishy Cat - CUDA                       646.84        827.47        827.39
blender: Barbershop - CUDA                      3167.76       3297.83       3297.57
blender: BMW27 - NVIDIA OptiX                   349.29        403.46        403.51
blender: Classroom - NVIDIA OptiX               1021.55       1099.24       1100.13
blender: Fishy Cat - NVIDIA OptiX               715.41        949.11        950.93
blender: Pabellon Barcelona - CUDA              1896.87       2114.70       2109.62
blender: Pabellon Barcelona - NVIDIA OptiX      1960.71       2301.38       2301.04
mandelgpu: GPU                                  54591952.5    53723378.2    54824518.7
clpeak: Integer Compute INT                     542.12        531.80        532.00
clpeak: Single-Precision Float                  1870.18       1859.87       1842.12
clpeak: Double-Precision Double                 66.66         65.62         65.83
clpeak: Global Memory Bandwidth                 90.68         89.32         89.59
neatbench: GPU                                  9.52          9.39          9.38
RealSR-NCNN 20200818 - Scale: 4x - TAA: No (Seconds, Fewer Is Better)
    Run 1: 47.24, Run 2: 47.24, Run 3: 48.30
    SE +/- 0.04, N = 3; SE +/- 0.02, N = 3; SE +/- 0.07, N = 3
RealSR-NCNN 20200818 - Scale: 4x - TAA: Yes (Seconds, Fewer Is Better)
    Run 1: 404.65, Run 2: 405.52, Run 3: 413.81
    SE +/- 6.37, N = 9; SE +/- 6.32, N = 9; SE +/- 6.35, N = 9
Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: No (Seconds, Fewer Is Better)
    Run 1: 3.280, Run 2: 3.284, Run 3: 3.380
    SE +/- 0.012, N = 3; SE +/- 0.006, N = 3; SE +/- 0.013, N = 3
Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: Yes (Seconds, Fewer Is Better)
    Run 1: 20.25, Run 2: 20.27, Run 3: 20.64
    SE +/- 0.10, N = 3; SE +/- 0.09, N = 3; SE +/- 0.09, N = 3
VkFFT 2020-09-29 (Benchmark Score, More Is Better)
    Run 1: 6633, Run 2: 6618, Run 3: 6466
    SE +/- 32.85, N = 3; SE +/- 18.01, N = 3; SE +/- 16.86, N = 3
Hashcat 6.1.1 - Benchmark: MD5 (H/s, More Is Better)
    Run 1: 4834000000, Run 2: 4835500000, Run 3: 4706166667
    SE +/- 26534003.34, N = 3; SE +/- 8524670.08, N = 3; SE +/- 29849195.04, N = 3
Hashcat 6.1.1 - Benchmark: SHA1 (H/s, More Is Better)
    Run 1: 1687633333, Run 2: 1687600000, Run 3: 1659433333
    SE +/- 6483140.53, N = 3; SE +/- 5008991.91, N = 3; SE +/- 3933757.04, N = 3
Hashcat 6.1.1 - Benchmark: SHA-512 (H/s, More Is Better)
    Run 1: 181900000, Run 2: 171366667, Run 3: 184033333
    SE +/- 1660177.84, N = 15; SE +/- 536449.23, N = 3; SE +/- 635959.47, N = 3
FinanceBench 2016-06-06 - Benchmark: Black-Scholes OpenCL (ms, Fewer Is Better)
    Run 1: 29.13, Run 2: 26.68, Run 3: 27.37
    SE +/- 0.38, N = 15; SE +/- 0.08, N = 3; SE +/- 0.00, N = 3
    (CXX) g++ options: -O3 -lOpenCL
ViennaCL 1.4.2 - OpenCL LU Factorization (GFLOPS, More Is Better)
    Run 1: 35.66, Run 2: 35.60, Run 3: 35.36
    SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
    (CXX) g++ options: -rdynamic -lOpenCL
cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
    Run 1: 79.2, Run 2: 79.2, Run 3: 79.2
    SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
cl-mem 2017-01-13 - Benchmark: Read (GB/s, More Is Better)
    Run 1: 90.8, Run 2: 90.8, Run 3: 90.8
    SE +/- 0.00, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
cl-mem 2017-01-13 - Benchmark: Write (GB/s, More Is Better)
    Run 1: 88.0, Run 2: 87.8, Run 3: 87.8
    SE +/- 0.03, N = 3; SE +/- 0.07, N = 3
    All cl-mem results: (CC) gcc options: -O2 -flto -lOpenCL
NAMD CUDA 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
    Run 1: 0.65245, Run 2: 0.69626, Run 3: 0.71381
    SE +/- 0.00639, N = 3; SE +/- 0.00141, N = 3; SE +/- 0.00135, N = 3
OctaneBench 2020.1 - Total Score (Score, More Is Better)
    Run 1: 43.93, Run 2: 43.60, Run 3: 42.38
RedShift Demo 3.0 (Seconds, Fewer Is Better)
    Run 1: 2131, Run 2: 2131, Run 3: 2176
    SE +/- 2.03, N = 3; SE +/- 2.19, N = 3
LuxCoreRender OpenCL 2.3 - Scene: DLSC (M samples/sec, More Is Better)
    Run 1: 0.90, Run 2: 0.93, Run 3: 0.91
    SE +/- 0.02, N = 12; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
    MIN: 0.19 / MAX: 0.94; MIN: 0.85 / MAX: 0.95; MIN: 0.83 / MAX: 0.93
LuxCoreRender OpenCL 2.3 - Scene: Food (M samples/sec, More Is Better)
    Run 1: 0.32, Run 2: 0.33, Run 3: 0.31
    SE +/- 0.01, N = 12; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
    MIN: 0.06 / MAX: 0.42; MIN: 0.11 / MAX: 0.41; MIN: 0.1 / MAX: 0.4
LuxCoreRender OpenCL 2.3 - Scene: LuxCore Benchmark (M samples/sec, More Is Better)
    Run 1: 0.71, Run 2: 0.72, Run 3: 0.71
    SE +/- 0.02, N = 12; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
    MIN: 0.07 / MAX: 0.86; MIN: 0.14 / MAX: 0.86; MIN: 0.14 / MAX: 0.85
LuxCoreRender OpenCL 2.3 - Scene: Rainbow Colors and Prism (M samples/sec, More Is Better)
    Run 1: 3.10, Run 2: 3.16, Run 3: 3.09
    SE +/- 0.06, N = 12; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
    MIN: 0.69 / MAX: 3.3; MIN: 2.2 / MAX: 3.3; MIN: 2.41 / MAX: 3.21
FAHBench 2.3.2 (Ns Per Day, More Is Better)
    Run 1: 49.40, Run 2: 49.76, Run 3: 49.06
    SE +/- 0.17, N = 3; SE +/- 0.16, N = 3; SE +/- 0.06, N = 3
LeelaChessZero 0.26 - Backend: OpenCL (Nodes Per Second, More Is Better)
    Run 1: 3444, Run 2: 3413, Run 3: 3337
    SE +/- 50.87, N = 3; SE +/- 24.89, N = 3; SE +/- 43.33, N = 3
    (CXX) g++ options: -flto -pthread
ArrayFire 3.7 - Test: Conjugate Gradient OpenCL (ms, Fewer Is Better)
    Run 1: 8.296, Run 2: 8.297, Run 3: 8.420
    SE +/- 0.004, N = 3; SE +/- 0.002, N = 3; SE +/- 0.003, N = 3
    (CXX) g++ options: -rdynamic
NCNN 20200916 - Target: Vulkan GPU - Model: squeezenet (ms, Fewer Is Better)
    Run 1: 12.44, Run 2: 12.58, Run 3: 12.91
    SE +/- 0.02, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
    MIN: 11.77 / MAX: 12.86; MIN: 11.71 / MAX: 15.31; MIN: 11.86 / MAX: 18.76
NCNN 20200916 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
    Run 1: 10.84, Run 2: 10.92, Run 3: 11.19
    SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.02, N = 3
    MIN: 9.36 / MAX: 12.61; MIN: 9.24 / MAX: 13.32; MIN: 9.62 / MAX: 15.49
NCNN 20200916 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
    Run 1: 3.21, Run 2: 3.22, Run 3: 3.33
    SE +/- 0.02, N = 3; SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
    MIN: 3.13 / MAX: 4.86; MIN: 3.14 / MAX: 4.81; MIN: 3.13 / MAX: 5.7
NCNN 20200916 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
    Run 1: 3.77, Run 2: 3.78, Run 3: 3.98
    SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.03, N = 3
    MIN: 3.67 / MAX: 5.48; MIN: 3.67 / MAX: 5.63; MIN: 3.67 / MAX: 7.06
NCNN 20200916 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
    Run 1: 2.62, Run 2: 2.64, Run 3: 2.80
    SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
    MIN: 2.51 / MAX: 4.09; MIN: 2.55 / MAX: 4.15; MIN: 2.55 / MAX: 5.67
NCNN 20200916 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
    Run 1: 3.35, Run 2: 3.36, Run 3: 3.47
    SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
    MIN: 3.29 / MAX: 4.84; MIN: 3.28 / MAX: 4.97; MIN: 3.27 / MAX: 5.37
NCNN 20200916 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
    Run 1: 5.78, Run 2: 5.84, Run 3: 6.06
    SE +/- 0.03, N = 3; SE +/- 0.05, N = 3; SE +/- 0.01, N = 3
    MIN: 5.55 / MAX: 9.45; MIN: 5.6 / MAX: 8.14; MIN: 5.63 / MAX: 9.36
NCNN 20200916 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
    Run 1: 1.01, Run 2: 1.01, Run 3: 1.10
    SE +/- 0.01, N = 3; SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
    MIN: 1 / MAX: 1.23; MAX: 1.16; MIN: 1.01 / MAX: 3.24
NCNN 20200916 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
    Run 1: 8.57, Run 2: 8.56, Run 3: 8.90
    SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
    MIN: 8.28 / MAX: 11.09; MIN: 8.39 / MAX: 9.07; MIN: 8.49 / MAX: 10.93
NCNN 20200916 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
    Run 1: 34.00, Run 2: 35.21, Run 3: 36.20
    SE +/- 0.01, N = 3; SE +/- 0.03, N = 3; SE +/- 0.03, N = 3
    MIN: 33.54 / MAX: 36.65; MIN: 33.68 / MAX: 38.75; MIN: 34.35 / MAX: 41.96
NCNN 20200916 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
    Run 1: 6.24, Run 2: 6.37, Run 3: 6.60
    SE +/- 0.00, N = 3; SE +/- 0.02, N = 3; SE +/- 0.01, N = 3
    MIN: 6.11 / MAX: 8.26; MIN: 6.02 / MAX: 8.14; MIN: 6.16 / MAX: 10.35
NCNN 20200916 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
    Run 1: 8.81, Run 2: 8.86, Run 3: 9.01
    SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
    MIN: 8.56 / MAX: 11.16; MIN: 8.62 / MAX: 10.61; MIN: 8.66 / MAX: 12.62
NCNN 20200916 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
    Run 1: 14.34, Run 2: 15.11, Run 3: 15.75
    SE +/- 0.04, N = 3; SE +/- 0.03, N = 3; SE +/- 0.05, N = 3
    MIN: 13.73 / MAX: 16.79; MIN: 13.88 / MAX: 17.23; MIN: 14.11 / MAX: 20.92
NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
    Run 1: 20.05, Run 2: 20.10, Run 3: 20.04
    SE +/- 0.02, N = 3; SE +/- 0.03, N = 3; SE +/- 0.06, N = 3
    MIN: 15.64 / MAX: 21.87; MIN: 18.89 / MAX: 21.77; MIN: 17.33 / MAX: 29.73
    All NCNN results: (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender 2.90 - Blend File: BMW27 - Compute: CUDA (Seconds, Fewer Is Better)
    Run 1: 308.30, Run 2: 381.11, Run 3: 381.65
    SE +/- 1.08, N = 3; SE +/- 0.44, N = 3; SE +/- 0.54, N = 3
Blender 2.90 - Blend File: Classroom - Compute: CUDA (Seconds, Fewer Is Better)
    Run 1: 894.18, Run 2: 912.44, Run 3: 912.76
    SE +/- 1.20, N = 3; SE +/- 0.89, N = 3; SE +/- 0.60, N = 3
Blender 2.90 - Blend File: Fishy Cat - Compute: CUDA (Seconds, Fewer Is Better)
    Run 1: 646.84, Run 2: 827.47, Run 3: 827.39
    SE +/- 0.04, N = 3; SE +/- 0.15, N = 3; SE +/- 0.31, N = 3
Blender 2.90 - Blend File: Barbershop - Compute: CUDA (Seconds, Fewer Is Better)
    Run 1: 3167.76, Run 2: 3297.83, Run 3: 3297.57
    SE +/- 33.13, N = 9; SE +/- 0.54, N = 3; SE +/- 0.19, N = 3
Blender 2.90 - Blend File: BMW27 - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
    Run 1: 349.29, Run 2: 403.46, Run 3: 403.51
    SE +/- 7.22, N = 9; SE +/- 0.19, N = 3; SE +/- 0.14, N = 3
Blender 2.90 - Blend File: Classroom - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
    Run 1: 1021.55, Run 2: 1099.24, Run 3: 1100.13
    SE +/- 2.26, N = 3; SE +/- 0.58, N = 3; SE +/- 0.08, N = 3
Blender 2.90 - Blend File: Fishy Cat - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
    Run 1: 715.41, Run 2: 949.11, Run 3: 950.93
    SE +/- 0.10, N = 3; SE +/- 1.20, N = 3; SE +/- 0.39, N = 3
Blender 2.90 - Blend File: Pabellon Barcelona - Compute: CUDA (Seconds, Fewer Is Better)
    Run 1: 1896.87, Run 2: 2114.70, Run 3: 2109.62
    SE +/- 5.87, N = 3; SE +/- 2.93, N = 3; SE +/- 2.34, N = 3
Blender 2.90 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
    Run 1: 1960.71, Run 2: 2301.38, Run 3: 2301.04
    SE +/- 0.08, N = 3; SE +/- 0.17, N = 3; SE +/- 0.19, N = 3
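In every Blender scene above, Run 1 finished faster than Runs 2 and 3. The export does not say why, so the following is only a minimal sketch for quantifying that gap. The dictionary name `blender_seconds` and the helper `percent_change` are hypothetical names introduced for this illustration; the numbers are the mean render times copied from the Blender 2.90 results above.

```python
# Mean Blender 2.90 render times in seconds (Run 1, Run 2, Run 3),
# copied from the results above. Fewer is better.
# The names used here are illustrative and not part of the Phoronix Test Suite.
blender_seconds = {
    "BMW27 - CUDA": (308.30, 381.11, 381.65),
    "Classroom - CUDA": (894.18, 912.44, 912.76),
    "Fishy Cat - CUDA": (646.84, 827.47, 827.39),
    "Barbershop - CUDA": (3167.76, 3297.83, 3297.57),
    "BMW27 - NVIDIA OptiX": (349.29, 403.46, 403.51),
    "Classroom - NVIDIA OptiX": (1021.55, 1099.24, 1100.13),
    "Fishy Cat - NVIDIA OptiX": (715.41, 949.11, 950.93),
    "Pabellon Barcelona - CUDA": (1896.87, 2114.70, 2109.62),
    "Pabellon Barcelona - NVIDIA OptiX": (1960.71, 2301.38, 2301.04),
}


def percent_change(baseline, value):
    """Return the change of `value` relative to `baseline`, in percent."""
    return (value - baseline) / baseline * 100.0


# Print how much longer Runs 2 and 3 took compared with Run 1.
for scene, (run1, run2, run3) in blender_seconds.items():
    print(
        f"{scene:36s} "
        f"Run 2: {percent_change(run1, run2):+6.1f}%  "
        f"Run 3: {percent_change(run1, run3):+6.1f}%"
    )
```

A positive percentage means the later run took longer than Run 1. Because these are "Fewer Is Better" timings, the same helper applied to a "More Is Better" metric would need the opposite reading.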
MandelGPU 1.3pts1 - OpenCL Device: GPU (Samples/sec, More Is Better)
    Run 1: 54591952.5, Run 2: 53723378.2, Run 3: 54824518.7
    SE +/- 69191.69, N = 3; SE +/- 291084.10, N = 3; SE +/- 364793.78, N = 3
    (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
clpeak - OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
    Run 1: 542.12, Run 2: 531.80, Run 3: 532.00
    SE +/- 6.15, N = 3; SE +/- 6.20, N = 3; SE +/- 4.47, N = 3
clpeak - OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
    Run 1: 1870.18, Run 2: 1859.87, Run 3: 1842.12
    SE +/- 31.67, N = 15; SE +/- 25.82, N = 15; SE +/- 33.84, N = 15
clpeak - OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
    Run 1: 66.66, Run 2: 65.62, Run 3: 65.83
    SE +/- 0.05, N = 3; SE +/- 0.07, N = 3; SE +/- 0.15, N = 3
clpeak - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
    Run 1: 90.68, Run 2: 89.32, Run 3: 89.59
    SE +/- 0.03, N = 3; SE +/- 0.07, N = 3; SE +/- 0.06, N = 3
    All clpeak results: (CXX) g++ options: -O3 -rdynamic -lOpenCL
NeatBench 5 - Acceleration: GPU (FPS, More Is Better)
    Run 1: 9.52, Run 2: 9.39, Run 3: 9.38
    SE +/- 0.00, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
Phoronix Test Suite v10.8.4