compulab-airtop-3-rtx-4000-compute

Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2010311-FI-COMPULABA24&sro&grs.

Runs: 1, 1a, 2, 1b, 1c, 1d, 1e, NVIDIA Quadro RTX 4000, RTX 4000, NVIDIA RTX 4000

Processor: Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads)
Motherboard: Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 64GB
Disk: Samsung SSD 970 EVO Plus 250GB
Graphics: NVIDIA Quadro RTX 4000 8GB (1005/6500MHz)
Audio: Intel Cannon Lake PCH cAVS
Monitor: VE228
Network: Intel I219-LM + Intel I210
OS: Ubuntu 20.10
Kernel: 5.8.0-26-generic (x86_64)
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
Display Driver: NVIDIA 455.28
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.1.96
Vulkan: 1.2.142
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Graphics entries that differ in later runs, in export order (the run each entry belongs to is not preserved): NVIDIA Quadro RTX 4000 8GB (300/405MHz), NVIDIA Quadro RTX 4000 8GB (1005/6500MHz), NVIDIA Quadro RTX 4000 8GB (300/405MHz), NVIDIA Quadro RTX 4000 8GB (1005/6500MHz)

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 2.3

OpenCL Details: GPU Compute Cores: 2304

Python Details: 1, 1a, 1b, 1d, 1e, NVIDIA Quadro RTX 4000, RTX 4000, NVIDIA RTX 4000: Python 3.8.6

Security Details:
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  srbds: Mitigation of TSX disabled
  tsx_async_abort: Mitigation of TSX disabled

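Result overview. A dash marks a test for which a run has no value. The export header also lists a run named 2, but no values for it appear in the export.

Test | 1 | 1a | 1b | 1c | 1d | 1e | NVIDIA Quadro RTX 4000 | RTX 4000 | NVIDIA RTX 4000
realsr-ncnn: 4x - Yes | 80.445 | 81.129 | 81.544 | 82.464 | 84.174 | 81.174 | 85.926 | 87.793 | 80.549
clpeak: Single-Precision Float | - | - | - | - | - | 6033.10 | 6004.48 | 6536.45 | -
hashcat: 7-Zip | 446900 | 444300 | 442667 | - | 433300 | 443000 | 424633 | 417767 | 446400
hashcat: MD5 | 25121966667 | 24944700000 | 24838966667 | - | 24259033333 | 24876900000 | 23840966667 | 23506866667 | 25075466667
realsr-ncnn: 4x - No | 12.552 | 12.604 | 12.665 | 12.701 | 12.978 | 12.580 | 13.078 | 13.350 | 12.494
waifu2x-ncnn: 2x - 3 - Yes | 5.502 | 5.551 | 5.572 | 5.671 | 5.694 | 5.571 | 5.785 | 5.861 | 5.540
hashcat: SHA1 | 8704500000 | 8642000000 | 8615933333 | - | 8426866667 | 8633500000 | 8254033333 | 8181566667 | 8700533333
hashcat: TrueCrypt RIPEMD160 + XTS | 327233 | 326300 | 324033 | - | 317867 | 324267 | 312233 | 309400 | 327567
hashcat: SHA-512 | 1102266667 | 1095266667 | 1091033333 | - | 1070066667 | 1092200000 | 1049733333 | 1041500000 | 1100433333
clpeak: Integer Compute INT | - | - | - | - | - | 5712.25 | 5741.92 | 6013.59 | -
ncnn: Vulkan GPU - resnet50 | - | - | - | - | - | 3.89 | 3.93 | 4.08 | -
vkfft | 25694 | 25585 | 25486 | - | 25027 | 25457 | 24684 | 24538 | 25593
ncnn: Vulkan GPU - alexnet | - | - | - | - | - | 2.18 | 2.21 | 2.27 | -
ncnn: Vulkan GPU - vgg16 | - | - | - | - | - | 8.79 | 9.14 | 9.02 | -
redshift | - | - | - | - | 382 | 381 | 391 | 393 | 379
blender: Classroom - CUDA | - | - | - | - | - | 218.66 | 224.85 | 223.81 | -
ncnn: Vulkan GPU - googlenet | - | - | - | - | - | 3.32 | 3.40 | 3.37 | -
ncnn: Vulkan GPU - shufflenet-v2 | - | - | - | - | - | 1.33 | 1.35 | 1.36 | -
blender: Classroom - NVIDIA OptiX | - | - | - | - | - | 115.78 | 118.39 | 117.20 | -
cl-mem: Write | 325.5 | 320.4 | 321.1 | - | 321.1 | 322.5 | 318.4 | 319.4 | 323.0
blender: Barbershop - CUDA | - | - | - | - | - | 756.64 | 771.19 | 764.30 | -
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | - | - | - | - | - | 1.71 | 1.74 | 1.74 | -
ncnn: Vulkan GPU - blazeface | - | - | - | - | - | 0.63 | 0.63 | 0.64 | -
cl-mem: Copy | 283.0 | 282.6 | 282.1 | - | 281.1 | 282.1 | 278.7 | 278.8 | 282.1
clpeak: Global Memory Bandwidth | - | - | - | - | - | 346.09 | 340.91 | 342.37 | -
ncnn: Vulkan GPU - mnasnet | - | - | - | - | - | 1.52 | 1.54 | 1.54 | -
blender: Fishy Cat - NVIDIA OptiX | - | - | - | - | - | 58.38 | 59.12 | 58.69 | -
plaidml: No - Inference - DenseNet 201 - OpenCL | - | - | - | - | - | 140.54 | 138.91 | 139.15 | -
blender: Pabellon Barcelona - NVIDIA OptiX | - | - | - | - | - | 160.82 | 160.95 | 159.09 | -
blender: Barbershop - NVIDIA OptiX | - | - | - | - | - | 1307.24 | 1321.89 | 1307.37 | -
blender: Fishy Cat - CUDA | - | - | - | - | - | 112.74 | 114.00 | 113.88 | -
ncnn: Vulkan GPU - efficientnet-b0 | - | - | - | - | - | 2.73 | 2.75 | 2.76 | -
blender: BMW27 - CUDA | - | - | - | - | - | 57.80 | 58.10 | 58.41 | -
plaidml: No - Inference - Mobilenet - OpenCL | - | - | - | - | - | 1490.38 | 1475.98 | 1480.20 | -
plaidml: Yes - Inference - Mobilenet - OpenCL | - | - | - | - | - | 1843.84 | 1834.20 | 1829.24 | -
ncnn: Vulkan GPU - squeezenet | - | - | - | - | - | 3.77 | 3.78 | 3.80 | -
viennacl: OpenCL LU Factorization | 68.2883 | 68.4737 | 68.2546 | - | 68.3795 | 68.4059 | 68.0188 | 68.0204 | 68.2894
plaidml: No - Inference - IMDB LSTM - OpenCL | - | - | - | - | - | 423.50 | 421.03 | 420.80 | -
fahbench | - | - | - | - | 191.6264 | 191.8417 | 190.7594 | 190.8199 | -
mandelgpu: GPU | - | - | - | - | - | 248122412.9 | 248177151.4 | 246857018.0 | -
arrayfire: Conjugate Gradient OpenCL | - | - | - | - | - | 2.247 | 2.255 | 2.257 | -
blender: Pabellon Barcelona - CUDA | - | - | - | - | - | 459.35 | 460.64 | 459.08 | -
cl-mem: Read | 379.7 | 379.2 | 379.2 | - | 379.3 | 379.3 | 379.3 | 379.3 | 379.3
clpeak: Double-Precision Double | - | - | - | - | - | 259.66 | 259.50 | 259.33 | -
financebench: Black-Scholes OpenCL | 14.034 | 14.032 | 14.037 | - | 14.036 | 14.034 | 14.034 | 14.034 | 14.032
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | - | - | - | - | - | 1.48 | 1.48 | 1.48 | -
neatbench: GPU | - | - | - | - | - | 31.0 | 30.9 | 30.2 | -
blender: BMW27 - NVIDIA OptiX | - | - | - | - | - | 32.14 | 29.23 | 29.14 | -
ncnn: Vulkan GPU - yolov4-tiny | - | - | - | - | - | 8.62 | 8.33 | 8.22 | -
ncnn: Vulkan GPU - resnet18 | - | - | - | - | - | 1.8 | 1.95 | 1.77 | -
ncnn: Vulkan GPU - mobilenet | - | - | - | - | - | 4.66 | 4.67 | 4.86 | -
luxcorerender-cl: Rainbow Colors and Prism | - | - | - | - | 10.42 | 10.78 | 10.79 | 10.73 | -
luxcorerender-cl: LuxCore Benchmark | - | - | - | - | 3.40 | 3.50 | 3.47 | 3.47 | -
luxcorerender-cl: Food | - | - | - | - | 1.52 | 1.57 | 1.55 | 1.55 | -
luxcorerender-cl: DLSC | - | - | - | - | 3.99 | 4.09 | 4.01 | 4.02 | -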

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818 (Seconds, Fewer Is Better)
1: 80.45 (SE +/- 0.35, N = 3)
1a: 81.13 (SE +/- 0.36, N = 3)
1b: 81.54 (SE +/- 0.40, N = 3)
1c: 82.46 (SE +/- 0.40, N = 3)
1d: 84.17 (SE +/- 0.36, N = 3)
1e: 81.17 (SE +/- 0.36, N = 3)
NVIDIA Quadro RTX 4000: 85.93 (SE +/- 0.45, N = 3)
NVIDIA RTX 4000: 80.55 (SE +/- 0.37, N = 3)
RTX 4000: 87.79 (SE +/- 0.27, N = 3)
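Each result in this export carries a figure of the form "SE +/- 0.35, N = 3". As a rough illustration of how such a figure can be read, the sketch below computes a standard error of the mean from N run times. The three run times are hypothetical, since the export does not include per-run samples, and the use of the sample standard deviation is an assumption about the convention rather than something this export confirms.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical run times in seconds; NOT taken from this result file.
runs = [80.0, 80.5, 81.0]
mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs agreed closely, so differences between systems that are several times larger than the SE are less likely to be run-to-run noise.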

clpeak

OpenCL Test: Single-Precision Float

clpeak (GFLOPS, More Is Better)
1e: 6033.10 (SE +/- 35.64, N = 3)
NVIDIA Quadro RTX 4000: 6004.48 (SE +/- 55.06, N = 3)
RTX 4000: 6536.45 (SE +/- 97.61, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL
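Because some tests here are "More Is Better" and others "Fewer Is Better", comparing two runs needs a direction-aware calculation. The following is a minimal sketch, not part of the Phoronix Test Suite; the function name relative_gain is hypothetical. It uses the clpeak Single-Precision Float figures for 1e and RTX 4000 and the RealSR-NCNN 4x TAA: Yes figures for 1 and RTX 4000 from this result file.

```python
def relative_gain(baseline, candidate, more_is_better):
    """Percent by which `candidate` is better than `baseline`.

    A negative result means the candidate is worse than the baseline.
    """
    if more_is_better:
        ratio = candidate / baseline   # higher score is better
    else:
        ratio = baseline / candidate   # lower time is better
    return (ratio - 1.0) * 100.0


# clpeak Single-Precision Float (GFLOPS, More Is Better): 1e vs RTX 4000
clpeak_gain = relative_gain(6033.10, 6536.45, more_is_better=True)

# RealSR-NCNN 4x TAA: Yes (Seconds, Fewer Is Better): 1 vs RTX 4000
realsr_gain = relative_gain(80.45, 87.79, more_is_better=False)

print(f"clpeak: {clpeak_gain:+.2f}%  RealSR-NCNN: {realsr_gain:+.2f}%")
```

With these inputs the same run (RTX 4000) comes out ahead in one test and behind in the other, which is why the direction flag matters when summarizing a mixed set of tests.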

Hashcat

Benchmark: 7-Zip

Hashcat 6.1.1 (H/s, More Is Better)
1: 446900 (SE +/- 208.17, N = 3)
1a: 444300 (SE +/- 556.78, N = 3)
1b: 442667 (SE +/- 202.76, N = 3)
1d: 433300 (SE +/- 321.46, N = 3)
1e: 443000 (SE +/- 1365.04, N = 3)
NVIDIA Quadro RTX 4000: 424633 (SE +/- 463.08, N = 3)
NVIDIA RTX 4000: 446400 (SE +/- 200.00, N = 3)
RTX 4000: 417767 (SE +/- 233.33, N = 3)

Hashcat

Benchmark: MD5

Hashcat 6.1.1 (H/s, More Is Better)
1: 25121966667 (SE +/- 25031801.99, N = 3)
1a: 24944700000 (SE +/- 2051828.45, N = 3)
1b: 24838966667 (SE +/- 12651789.51, N = 3)
1d: 24259033333 (SE +/- 2643440.52, N = 3)
1e: 24876900000 (SE +/- 12698162.60, N = 3)
NVIDIA Quadro RTX 4000: 23840966667 (SE +/- 13574649.58, N = 3)
NVIDIA RTX 4000: 25075466667 (SE +/- 24626025.08, N = 3)
RTX 4000: 23506866667 (SE +/- 2355372.11, N = 3)
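The Hashcat results are reported in raw H/s, which makes figures such as 25121966667 hard to compare at a glance. As a small reading aid, the sketch below scales a raw rate to a decimal unit. The helper name format_hash_rate and the unit labels are illustrative choices, not output of Hashcat or the Phoronix Test Suite.

```python
def format_hash_rate(hashes_per_second):
    """Scale a raw H/s figure to the largest decimal unit below 1000."""
    units = ["H/s", "kH/s", "MH/s", "GH/s", "TH/s"]
    value = float(hashes_per_second)
    for unit in units:
        # Stop once the value fits, or when no larger unit is left.
        if value < 1000.0 or unit == units[-1]:
            return f"{value:.2f} {unit}"
        value /= 1000.0


# Run 1 figures from this result file.
md5_rate = format_hash_rate(25121966667)   # Hashcat MD5
sevenzip_rate = format_hash_rate(446900)   # Hashcat 7-Zip
print(md5_rate, "|", sevenzip_rate)
```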

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818 (Seconds, Fewer Is Better)
1: 12.55 (SE +/- 0.02, N = 3)
1a: 12.60 (SE +/- 0.02, N = 3)
1b: 12.67 (SE +/- 0.04, N = 3)
1c: 12.70 (SE +/- 0.03, N = 3)
1d: 12.98 (SE +/- 0.00, N = 3)
1e: 12.58 (SE +/- 0.01, N = 3)
NVIDIA Quadro RTX 4000: 13.08 (SE +/- 0.02, N = 3)
NVIDIA RTX 4000: 12.49 (SE +/- 0.01, N = 3)
RTX 4000: 13.35 (SE +/- 0.01, N = 3)

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818 (Seconds, Fewer Is Better)
1: 5.502 (SE +/- 0.008, N = 3)
1a: 5.551 (SE +/- 0.023, N = 3)
1b: 5.572 (SE +/- 0.022, N = 3)
1c: 5.671 (SE +/- 0.047, N = 3)
1d: 5.694 (SE +/- 0.008, N = 3)
1e: 5.571 (SE +/- 0.017, N = 3)
NVIDIA Quadro RTX 4000: 5.785 (SE +/- 0.018, N = 3)
NVIDIA RTX 4000: 5.540 (SE +/- 0.041, N = 3)
RTX 4000: 5.861 (SE +/- 0.014, N = 3)

Hashcat

Benchmark: SHA1

Hashcat 6.1.1 (H/s, More Is Better)
1: 8704500000 (SE +/- 9832090.32, N = 3)
1a: 8642000000 (SE +/- 6847870.72, N = 3)
1b: 8615933333 (SE +/- 3773739.67, N = 3)
1d: 8426866667 (SE +/- 7846938.54, N = 3)
1e: 8633500000 (SE +/- 6005275.46, N = 3)
NVIDIA Quadro RTX 4000: 8254033333 (SE +/- 5691026.07, N = 3)
NVIDIA RTX 4000: 8700533333 (SE +/- 2630800.47, N = 3)
RTX 4000: 8181566667 (SE +/- 6590228.46, N = 3)

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.1.1 (H/s, More Is Better)
1: 327233
1a: 326300
1b: 324033
1d: 317867
1e: 324267
NVIDIA Quadro RTX 4000: 312233
NVIDIA RTX 4000: 327567
RTX 4000: 309400
SE values as exported (6 entries for 8 runs, so the run each belongs to is not preserved; N = 3 for each): 533.33, 185.59, 88.19, 233.33, 317.98, 683.94

Hashcat

Benchmark: SHA-512

Hashcat 6.1.1 (H/s, More Is Better)
1: 1102266667 (SE +/- 819213.72, N = 3)
1a: 1095266667 (SE +/- 491030.66, N = 3)
1b: 1091033333 (SE +/- 643773.60, N = 3)
1d: 1070066667 (SE +/- 1017076.42, N = 3)
1e: 1092200000 (SE +/- 1021436.90, N = 3)
NVIDIA Quadro RTX 4000: 1049733333 (SE +/- 1260070.54, N = 3)
NVIDIA RTX 4000: 1100433333 (SE +/- 240370.09, N = 3)
RTX 4000: 1041500000 (SE +/- 953939.20, N = 3)

clpeak

OpenCL Test: Integer Compute INT

clpeak (GIOPS, More Is Better)
1e: 5712.25 (SE +/- 68.26, N = 12)
NVIDIA Quadro RTX 4000: 5741.92 (SE +/- 46.13, N = 3)
RTX 4000: 6013.59 (SE +/- 102.19, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20200916 (ms, Fewer Is Better)
1e: 3.89 (SE +/- 0.01, N = 3; MIN: 3.86 / MAX: 3.99)
NVIDIA Quadro RTX 4000: 3.93 (SE +/- 0.01, N = 3; MIN: 3.91 / MAX: 4.04)
RTX 4000: 4.08 (SE +/- 0.14, N = 3; MIN: 3.92 / MAX: 40.55)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VkFFT

VkFFT 2020-09-29 (Benchmark Score, More Is Better)
1: 25694
1a: 25585
1b: 25486
1d: 25027
1e: 25457
NVIDIA Quadro RTX 4000: 24684
NVIDIA RTX 4000: 25593
RTX 4000: 24538
SE values as exported (7 entries for 8 runs, so the run each belongs to is not preserved; N = 3 for each): 28.39, 16.51, 27.82, 17.21, 32.54, 20.11, 4.04

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20200916 (ms, Fewer Is Better)
1e: 2.18 (SE +/- 0.02, N = 3; MIN: 1.91 / MAX: 11.43)
NVIDIA Quadro RTX 4000: 2.21 (SE +/- 0.01, N = 3; MIN: 1.91 / MAX: 6.96)
RTX 4000: 2.27 (SE +/- 0.03, N = 3; MIN: 2.15 / MAX: 23.82)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20200916 (ms, Fewer Is Better)
1e: 8.79 (SE +/- 0.04, N = 3; MIN: 8.1 / MAX: 20.83)
NVIDIA Quadro RTX 4000: 9.14 (SE +/- 0.13, N = 3; MIN: 8.49 / MAX: 36.48)
RTX 4000: 9.02 (SE +/- 0.04, N = 3; MIN: 8.35 / MAX: 20.34)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RedShift Demo

RedShift Demo 3.0 (Seconds, Fewer Is Better)
1d: 382 (SE +/- 2.60, N = 3)
1e: 381 (SE +/- 2.31, N = 3)
NVIDIA Quadro RTX 4000: 391 (SE +/- 4.63, N = 3)
NVIDIA RTX 4000: 379 (SE +/- 2.33, N = 3)
RTX 4000: 393 (SE +/- 4.91, N = 3)

Blender

Blend File: Classroom - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
1e: 218.66 (SE +/- 1.49, N = 3)
NVIDIA Quadro RTX 4000: 224.85 (SE +/- 3.62, N = 3)
RTX 4000: 223.81 (SE +/- 3.26, N = 3)

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20200916 (ms, Fewer Is Better)
1e: 3.32 (SE +/- 0.00, N = 3; MIN: 3.29 / MAX: 3.43)
NVIDIA Quadro RTX 4000: 3.40 (SE +/- 0.02, N = 3; MIN: 3.33 / MAX: 20.26)
RTX 4000: 3.37 (SE +/- 0.00, N = 3; MIN: 3.35 / MAX: 3.44)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20200916 (ms, Fewer Is Better)
1e: 1.33 (SE +/- 0.00, N = 3; MIN: 1.32 / MAX: 1.4)
NVIDIA Quadro RTX 4000: 1.35 (SE +/- 0.00, N = 3; MIN: 1.33 / MAX: 1.4)
RTX 4000: 1.36 (SE +/- 0.00, N = 3; MIN: 1.34 / MAX: 1.41)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
1e: 115.78 (SE +/- 0.87, N = 3)
NVIDIA Quadro RTX 4000: 118.39 (SE +/- 0.55, N = 3)
RTX 4000: 117.20 (SE +/- 0.57, N = 3)

cl-mem

Benchmark: Write

cl-mem 2017-01-13 (GB/s, More Is Better)
1: 325.5 (SE +/- 1.79, N = 3)
1a: 320.4 (SE +/- 1.48, N = 3)
1b: 321.1 (SE +/- 0.78, N = 3)
1d: 321.1 (SE +/- 0.96, N = 3)
1e: 322.5 (SE +/- 0.58, N = 3)
NVIDIA Quadro RTX 4000: 318.4 (SE +/- 2.17, N = 3)
NVIDIA RTX 4000: 323.0 (SE +/- 1.47, N = 3)
RTX 4000: 319.4 (SE +/- 1.44, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

Blender

Blend File: Barbershop - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
1e: 756.64 (SE +/- 2.80, N = 3)
NVIDIA Quadro RTX 4000: 771.19 (SE +/- 1.15, N = 3)
RTX 4000: 764.30 (SE +/- 0.87, N = 3)

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20200916 (ms, Fewer Is Better)
1e: 1.71 (SE +/- 0.00, N = 3; MIN: 1.7 / MAX: 1.75)
NVIDIA Quadro RTX 4000: 1.74 (SE +/- 0.00, N = 3; MIN: 1.73 / MAX: 1.81)
RTX 4000: 1.74 (SE +/- 0.00, N = 3; MIN: 1.73 / MAX: 1.8)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20200916 (ms, Fewer Is Better)
1e: 0.63 (SE +/- 0.00, N = 3; MIN: 0.62 / MAX: 0.68)
NVIDIA Quadro RTX 4000: 0.63 (SE +/- 0.01, N = 3; MIN: 0.62 / MAX: 0.65)
RTX 4000: 0.64 (SE +/- 0.00, N = 3; MIN: 0.62 / MAX: 0.84)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

cl-mem

Benchmark: Copy

cl-mem 2017-01-13 (GB/s, More Is Better)
1: 283.0 (SE +/- 0.26, N = 3)
1a: 282.6 (SE +/- 0.06, N = 3)
1b: 282.1 (SE +/- 0.12, N = 3)
1d: 281.1 (SE +/- 0.20, N = 3)
1e: 282.1 (SE +/- 0.12, N = 3)
NVIDIA Quadro RTX 4000: 278.7 (SE +/- 0.20, N = 3)
NVIDIA RTX 4000: 282.1 (SE +/- 0.28, N = 3)
RTX 4000: 278.8 (SE +/- 0.12, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak (GBPS, More Is Better)
1e: 346.09 (SE +/- 4.72, N = 3)
NVIDIA Quadro RTX 4000: 340.91 (SE +/- 4.44, N = 3)
RTX 4000: 342.37 (SE +/- 5.13, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20200916 (ms, Fewer Is Better)
1e: 1.52 (SE +/- 0.00, N = 3; MIN: 1.5 / MAX: 1.56)
NVIDIA Quadro RTX 4000: 1.54 (SE +/- 0.00, N = 3; MIN: 1.53 / MAX: 1.63)
RTX 4000: 1.54 (SE +/- 0.00, N = 3; MIN: 1.53 / MAX: 1.63)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
1e: 58.38 (SE +/- 0.21, N = 3)
NVIDIA Quadro RTX 4000: 59.12 (SE +/- 0.24, N = 3)
RTX 4000: 58.69 (SE +/- 0.22, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

PlaidML (FPS, More Is Better)
1e: 140.54 (SE +/- 0.18, N = 3)
NVIDIA Quadro RTX 4000: 138.91 (SE +/- 0.11, N = 3)
RTX 4000: 139.15 (SE +/- 0.12, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
1e: 160.82 (SE +/- 0.15, N = 3)
NVIDIA Quadro RTX 4000: 160.95 (SE +/- 0.11, N = 3)
RTX 4000: 159.09 (SE +/- 0.48, N = 3)

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
1e: 1307.24 (SE +/- 4.68, N = 3)
NVIDIA Quadro RTX 4000: 1321.89 (SE +/- 1.05, N = 3)
RTX 4000: 1307.37 (SE +/- 0.53, N = 3)

Blender

Blend File: Fishy Cat - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
1e: 112.74 (SE +/- 0.28, N = 3)
NVIDIA Quadro RTX 4000: 114.00 (SE +/- 0.16, N = 3)
RTX 4000: 113.88 (SE +/- 0.21, N = 3)

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20200916 (ms, Fewer Is Better)
1e: 2.73 (SE +/- 0.01, N = 3; MIN: 2.7 / MAX: 8.24)
NVIDIA Quadro RTX 4000: 2.75 (SE +/- 0.00, N = 3; MIN: 2.74 / MAX: 3.38)
RTX 4000: 2.76 (SE +/- 0.00, N = 3; MIN: 2.75 / MAX: 3.23)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: BMW27 - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
1e: 57.80 (SE +/- 0.09, N = 3)
NVIDIA Quadro RTX 4000: 58.10 (SE +/- 0.14, N = 3)
RTX 4000: 58.41 (SE +/- 0.10, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML (FPS, More Is Better)
1e: 1490.38 (SE +/- 5.16, N = 3)
NVIDIA Quadro RTX 4000: 1475.98 (SE +/- 5.69, N = 3)
RTX 4000: 1480.20 (SE +/- 5.32, N = 3)

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML (FPS, More Is Better)
1e: 1843.84 (SE +/- 9.92, N = 3)
NVIDIA Quadro RTX 4000: 1834.20 (SE +/- 2.10, N = 3)
RTX 4000: 1829.24 (SE +/- 8.76, N = 3)

NCNN

Target: Vulkan GPU - Model: squeezenet

NCNN 20200916 (ms, Fewer Is Better)
1e: 3.77 (SE +/- 0.01, N = 3; MIN: 3.71 / MAX: 3.87)
NVIDIA Quadro RTX 4000: 3.78 (SE +/- 0.01, N = 3; MIN: 3.72 / MAX: 3.84)
RTX 4000: 3.80 (SE +/- 0.01, N = 3; MIN: 3.74 / MAX: 10.31)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

OpenCL LU Factorization

ViennaCL 1.4.2 (GFLOPS, More Is Better)
1: 68.29 (SE +/- 0.12, N = 3)
1a: 68.47 (SE +/- 0.12, N = 3)
1b: 68.25 (SE +/- 0.28, N = 3)
1d: 68.38 (SE +/- 0.06, N = 3)
1e: 68.41 (SE +/- 0.02, N = 3)
NVIDIA Quadro RTX 4000: 68.02 (SE +/- 0.02, N = 3)
NVIDIA RTX 4000: 68.29 (SE +/- 0.02, N = 3)
RTX 4000: 68.02 (SE +/- 0.06, N = 3)
(CXX) g++ options: -rdynamic -lOpenCL

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

PlaidML (FPS, More Is Better)
1e: 423.50 (SE +/- 0.45, N = 3)
NVIDIA Quadro RTX 4000: 421.03 (SE +/- 1.26, N = 3)
RTX 4000: 420.80 (SE +/- 0.32, N = 3)

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
1d: 191.63 (SE +/- 0.40, N = 3)
1e: 191.84 (SE +/- 0.32, N = 3)
NVIDIA Quadro RTX 4000: 190.76 (SE +/- 0.27, N = 3)
RTX 4000: 190.82 (SE +/- 0.37, N = 3)

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 (Samples/sec, More Is Better)
1e: 248122412.9 (SE +/- 711502.39, N = 3)
NVIDIA Quadro RTX 4000: 248177151.4 (SE +/- 308768.05, N = 3)
RTX 4000: 246857018.0 (SE +/- 540319.59, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ArrayFire

Test: Conjugate Gradient OpenCL

ArrayFire 3.7 (ms, Fewer Is Better)
1e: 2.247 (SE +/- 0.008, N = 3)
NVIDIA Quadro RTX 4000: 2.255 (SE +/- 0.012, N = 3)
RTX 4000: 2.257 (SE +/- 0.007, N = 3)
(CXX) g++ options: -rdynamic

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
1e: 459.35 (SE +/- 1.17, N = 3)
NVIDIA Quadro RTX 4000: 460.64 (SE +/- 1.47, N = 3)
RTX 4000: 459.08 (SE +/- 0.47, N = 3)

cl-mem

Benchmark: Read

cl-mem 2017-01-13 (GB/s, More Is Better)
1: 379.7 (SE +/- 0.03, N = 3)
1a: 379.2 (SE +/- 0.06, N = 3)
1b: 379.2 (SE +/- 0.03, N = 3)
1d: 379.3 (SE +/- 0.00, N = 3)
1e: 379.3 (SE +/- 0.03, N = 3)
NVIDIA Quadro RTX 4000: 379.3 (SE +/- 0.00, N = 3)
NVIDIA RTX 4000: 379.3 (SE +/- 0.03, N = 3)
RTX 4000: 379.3 (SE +/- 0.00, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

clpeak (GFLOPS, More Is Better)
1e: 259.66 (SE +/- 0.31, N = 3)
NVIDIA Quadro RTX 4000: 259.50 (SE +/- 0.18, N = 3)
RTX 4000: 259.33 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-06-06 (ms, Fewer Is Better)
1: 14.03 (SE +/- 0.00, N = 3)
1a: 14.03 (SE +/- 0.00, N = 3)
1b: 14.04 (SE +/- 0.00, N = 3)
1d: 14.04 (SE +/- 0.00, N = 3)
1e: 14.03 (SE +/- 0.00, N = 3)
NVIDIA Quadro RTX 4000: 14.03 (SE +/- 0.00, N = 3)
NVIDIA RTX 4000: 14.03 (SE +/- 0.00, N = 3)
RTX 4000: 14.03 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -lOpenCL

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20200916 (ms, Fewer Is Better)
1e: 1.48 (SE +/- 0.03, N = 3; MIN: 1.44 / MAX: 20.23)
NVIDIA Quadro RTX 4000: 1.48 (SE +/- 0.00, N = 3; MIN: 1.46 / MAX: 1.5)
RTX 4000: 1.48 (SE +/- 0.00, N = 3; MIN: 1.47 / MAX: 1.54)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NeatBench

Acceleration: GPU

NeatBench 5 (FPS, More Is Better)
1e: 31.0 (SE +/- 0.66, N = 15)
NVIDIA Quadro RTX 4000: 30.9 (SE +/- 0.69, N = 15)
RTX 4000: 30.2 (SE +/- 0.63, N = 15)

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
1e: 32.14 (SE +/- 3.24, N = 15)
NVIDIA Quadro RTX 4000: 29.23 (SE +/- 0.09, N = 3)
RTX 4000: 29.14 (SE +/- 0.06, N = 3)

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20200916 (ms, Fewer Is Better)
1e: 8.62 (SE +/- 0.37, N = 3; MIN: 8.1 / MAX: 74.77)
NVIDIA Quadro RTX 4000: 8.33 (SE +/- 0.09, N = 3; MIN: 8.13 / MAX: 55.28)
RTX 4000: 8.22 (SE +/- 0.00, N = 3; MIN: 8.15 / MAX: 8.57)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20200916 (ms, Fewer Is Better)
1e: 1.80 (SE +/- 0.05, N = 2; MIN: 1.69 / MAX: 21.82)
NVIDIA Quadro RTX 4000: 1.95 (SE +/- 0.13, N = 3; MIN: 1.7 / MAX: 20.49)
RTX 4000: 1.77 (SE +/- 0.04, N = 3; MIN: 1.71 / MAX: 24.18)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20200916 (ms, Fewer Is Better)
1e: 4.66 (SE +/- 0.01, N = 3; MIN: 4.6 / MAX: 4.86)
NVIDIA Quadro RTX 4000: 4.67 (SE +/- 0.01, N = 3; MIN: 4.64 / MAX: 4.75)
RTX 4000: 4.86 (SE +/- 0.17, N = 3; MIN: 4.64 / MAX: 71.23)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LuxCoreRender OpenCL

Scene: Rainbow Colors and Prism

LuxCoreRender OpenCL 2.3 (M samples/sec, More Is Better)
1d: 10.42 (SE +/- 0.34, N = 12; MIN: 3.45 / MAX: 11.19)
1e: 10.78 (SE +/- 0.02, N = 3; MIN: 10.09 / MAX: 11.23)
NVIDIA Quadro RTX 4000: 10.79 (SE +/- 0.06, N = 3; MIN: 10.45 / MAX: 11.21)
RTX 4000: 10.73 (SE +/- 0.02, N = 3; MIN: 9.75 / MAX: 11.24)

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

LuxCoreRender OpenCL 2.3 (M samples/sec, More Is Better)
1d: 3.40 (SE +/- 0.07, N = 12; MIN: 0.17 / MAX: 3.97)
1e: 3.50 (SE +/- 0.02, N = 3; MIN: 0.27 / MAX: 4)
NVIDIA Quadro RTX 4000: 3.47 (SE +/- 0.01, N = 3; MIN: 0.27 / MAX: 3.96)
RTX 4000: 3.47 (SE +/- 0.02, N = 3; MIN: 0.33 / MAX: 3.96)

LuxCoreRender OpenCL

Scene: Food

LuxCoreRender OpenCL 2.3 (M samples/sec, More Is Better)
1d: 1.52 (SE +/- 0.04, N = 12; MIN: 0.14 / MAX: 1.88)
1e: 1.57 (SE +/- 0.01, N = 3; MIN: 0.26 / MAX: 1.89)
NVIDIA Quadro RTX 4000: 1.55 (SE +/- 0.01, N = 3; MIN: 0.25 / MAX: 1.85)
RTX 4000: 1.55 (SE +/- 0.01, N = 3; MIN: 0.26 / MAX: 1.86)

LuxCoreRender OpenCL

Scene: DLSC

LuxCoreRender OpenCL 2.3 (M samples/sec, More Is Better)
1d: 3.99 (SE +/- 0.08, N = 12; MIN: 1.12 / MAX: 4.22)
1e: 4.09 (SE +/- 0.01, N = 3; MIN: 3.82 / MAX: 4.25)
NVIDIA Quadro RTX 4000: 4.01 (SE +/- 0.02, N = 3; MIN: 3.83 / MAX: 4.21)
RTX 4000: 4.02 (SE +/- 0.01, N = 3; MIN: 3.82 / MAX: 4.2)


Phoronix Test Suite v10.8.4