RTX 3070 Compute

AMD Ryzen 9 5900X 12-Core testing with an ASUS ROG CROSSHAIR VIII HERO (3402 BIOS) and NVIDIA GeForce RTX 3070 8GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104078-IB-RTX3070CO18&grs.

RTX 3070 Compute (runs 1, 2, 3)

Processor: AMD Ryzen 9 5900X 12-Core @ 3.70GHz (12 Cores / 24 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (3402 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16GB
Disk: 1000GB Sabrent Rocket 4.0 Plus + 2000GB
Graphics: NVIDIA GeForce RTX 3070 8GB
Audio: NVIDIA Device 228b
Monitor: ASUS VP28U
Network: Realtek RTL8125 2.5GbE + Intel I211
OS: Ubuntu 20.04
Kernel: 5.8.0-48-generic (x86_64)
Desktop: GNOME Shell 3.36.7
Display Server: X Server 1.20.9
Display Driver: NVIDIA 460.67
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.2.162
Vulkan: 1.2.155
Compiler: GCC 9.3.0 + CUDA 11.2
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009

OpenCL Details: GPU Compute Cores: 5888

Python Details: 1: Python 3.8.5

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

RTX 3070 Compute: result overview for runs 1, 2 and 3 (each test's values are listed individually below).

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

ms, Fewer Is Better (NCNN 20201218)
Run 1: 14.49 (SE +/- 0.15, N = 3; MIN: 13.53 / MAX: 35.44)
Run 2: 15.45 (SE +/- 0.12, N = 3; MIN: 13.94 / MAX: 74.44)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

ms, Fewer Is Better (NCNN 20201218)
Run 1: 12.85 (SE +/- 0.08, N = 3; MIN: 11.95 / MAX: 34.82)
Run 2: 13.56 (SE +/- 0.01, N = 3; MIN: 12.13 / MAX: 52.71)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: CPU BLAS - dAXPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 34.5 (SE +/- 0.22, N = 3)
Run 2: 33.4 (SE +/- 0.65, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: blazeface

ms, Fewer Is Better (NCNN 20201218)
Run 1: 1.84 (SE +/- 0.01, N = 3; MIN: 1.75 / MAX: 3.08)
Run 2: 1.90 (SE +/- 0.06, N = 3; MIN: 1.73 / MAX: 11.77)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: CPU BLAS - dCOPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 23.2 (SE +/- 0.10, N = 3)
Run 2: 22.6 (SE +/- 0.74, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 62.0 (SE +/- 0.84, N = 3)
Run 2: 63.4 (SE +/- 0.59, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 54.7 (SE +/- 0.10, N = 3)
Run 2: 53.5 (SE +/- 1.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 57.0 (SE +/- 0.12, N = 3)
Run 2: 55.8 (SE +/- 1.22, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 44.3 (SE +/- 0.42, N = 3)
Run 2: 43.5 (SE +/- 0.62, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 342
Run 2: 336
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Mixbench

Backend: OpenCL - Benchmark: Double Precision

GFLOPS, More Is Better (Mixbench 2020-06-23)
Run 1: 295.11 (SE +/- 1.86, N = 3)
Run 2: 299.65 (SE +/- 0.76, N = 3)
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

ViennaCL

Test: CPU BLAS - dGEMM-TT

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 55.3 (SE +/- 0.15, N = 3)
Run 2: 54.5 (SE +/- 0.97, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

ms, Fewer Is Better (NCNN 20201218)
Run 1: 4.85 (SE +/- 0.06, N = 3; MIN: 4.59 / MAX: 6.05)
Run 2: 4.92 (SE +/- 0.02, N = 3; MIN: 4.48 / MAX: 25.29)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: CPU BLAS - sDOT

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 141 (SE +/- 1.15, N = 3)
Run 2: 139 (SE +/- 3.53, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 53.4 (SE +/- 0.15, N = 3)
Run 2: 52.7 (SE +/- 0.70, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 94.0 (SE +/- 0.87, N = 3)
Run 2: 92.8 (SE +/- 2.27, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: alexnet

ms, Fewer Is Better (NCNN 20201218)
Run 1: 11.08 (SE +/- 0.03, N = 3; MIN: 10.26 / MAX: 26.8)
Run 2: 11.22 (SE +/- 0.07, N = 3; MIN: 10.16 / MAX: 47.07)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

ms, Fewer Is Better (NCNN 20201218)
Run 1: 4.05 (SE +/- 0.11, N = 3; MIN: 3.65 / MAX: 20.14)
Run 2: 4.10 (SE +/- 0.01, N = 3; MIN: 3.66 / MAX: 26.14)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 342 (SE +/- 1.00, N = 3)
Run 2: 338 (SE +/- 0.67, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: resnet50

ms, Fewer Is Better (NCNN 20201218)
Run 1: 24.47 (SE +/- 0.33, N = 3; MIN: 22.75 / MAX: 54.75)
Run 2: 24.75 (SE +/- 0.37, N = 3; MIN: 22.79 / MAX: 62.12)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

clpeak

OpenCL Test: Double-Precision Double

GFLOPS, More Is Better (clpeak)
Run 1: 360.90 (SE +/- 0.04, N = 3)
Run 2: 364.99 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

VkFFT

Benchmark Score, More Is Better (VkFFT 1.1.1)
Run 1: 32004 (SE +/- 238.49, N = 3)
Run 2: 32323 (SE +/- 422.26, N = 3)
(CXX) g++ options: -O3 -pthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

GFLOPS, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 3806.57 (SE +/- 10.00, N = 3)
Run 2: 3769.31 (SE +/- 22.78, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Rodinia

Test: OpenCL Particle Filter

Seconds, Fewer Is Better (Rodinia 3.1)
Run 1: 5.958 (SE +/- 0.014, N = 3)
Run 2: 6.016 (SE +/- 0.023, N = 3)
(CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

ms, Fewer Is Better (NCNN 20201218)
Run 1: 4.39 (SE +/- 0.04, N = 3; MIN: 4.12 / MAX: 5.78)
Run 2: 4.35 (SE +/- 0.05, N = 3; MIN: 3.99 / MAX: 6.3)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: OpenCL BLAS - dGEMV-N

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 222
Run 2: 220
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
Run 1: 343
Run 2: 340
SE +/- 1.50, N = 2 (shown for one run only)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NAMD CUDA

ATPase Simulation - 327,506 Atoms

days/ns, Fewer Is Better (NAMD CUDA 2.14)
Run 1: 0.13273 (SE +/- 0.00056, N = 3)
Run 2: 0.13388 (SE +/- 0.00147, N = 3)

Mixbench

Backend: OpenCL - Benchmark: Integer

GIOPS, More Is Better (Mixbench 2020-06-23)
Run 1: 11433.74 (SE +/- 3.19, N = 3)
Run 2: 11336.97 (SE +/- 53.09, N = 3)
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

NCNN

Target: Vulkan GPU - Model: googlenet

ms, Fewer Is Better (NCNN 20201218)
Run 1: 13.27 (SE +/- 0.08, N = 3; MIN: 12.01 / MAX: 35.62)
Run 2: 13.17 (SE +/- 0.27, N = 3; MIN: 11.88 / MAX: 33.16)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: CPU BLAS - dGEMV-T

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 81.3 (SE +/- 1.43, N = 3)
Run 2: 81.9 (SE +/- 0.57, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

ms, Fewer Is Better (NCNN 20201218)
Run 1: 5.58 (SE +/- 0.10, N = 3; MIN: 5.12 / MAX: 20.98)
Run 2: 5.62 (SE +/- 0.12, N = 3; MIN: 5.1 / MAX: 21.88)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

ms, Fewer Is Better (NCNN 20201218)
Run 1: 16.75 (SE +/- 0.25, N = 3; MIN: 15.59 / MAX: 33.2)
Run 2: 16.87 (SE +/- 0.32, N = 3; MIN: 15.42 / MAX: 40.88)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LeelaChessZero

Backend: OpenCL

Nodes Per Second, More Is Better (LeelaChessZero 0.26)
Run 1: 28914 (SE +/- 74.99, N = 3)
Run 2: 29117 (SE +/- 234.03, N = 3)
(CXX) g++ options: -flto -pthread

GROMACS

Water Benchmark

Ns Per Day, More Is Better (GROMACS 2020.3)
Run 1: 8.138 (SE +/- 0.024, N = 3)
Run 2: 8.083 (SE +/- 0.016, N = 3)
(CXX) g++ options: -O3 -lpthread -ldl -lrt -lm

ViennaCL

Test: CPU BLAS - dGEMV-N

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 76.5 (SE +/- 0.45, N = 3)
Run 2: 77.0 (SE +/- 0.49, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

GIOPS, More Is Better (clpeak)
Run 1: 10264.28 (SE +/- 105.22, N = 5)
Run 2: 10202.39 (SE +/- 86.71, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 334
Run 2: 332
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: resnet18

ms, Fewer Is Better (NCNN 20201218)
Run 1: 13.95 (SE +/- 0.01, N = 3; MIN: 13.09 / MAX: 42.15)
Run 2: 14.03 (SE +/- 0.07, N = 3; MIN: 12.85 / MAX: 39.19)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Seconds, Fewer Is Better (Waifu2x-NCNN Vulkan 20200818)
Run 1: 4.297 (SE +/- 0.003, N = 3)
Run 2: 4.303 (SE +/- 0.004, N = 3)
Run 3: 4.321 (SE +/- 0.001, N = 3)

clpeak

OpenCL Test: Single-Precision Float

GFLOPS, More Is Better (clpeak)
Run 1: 20099.75 (SE +/- 2.13, N = 3)
Run 2: 19991.73 (SE +/- 109.12, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

Mixbench

Backend: OpenCL - Benchmark: Single Precision

GFLOPS, More Is Better (Mixbench 2020-06-23)
Run 1: 22081.79 (SE +/- 8.94, N = 3)
Run 2: 21965.66 (SE +/- 109.50, N = 3)
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Blender

Blend File: Classroom - Compute: CUDA

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 76.10 (SE +/- 0.01, N = 3)
Run 2: 76.49 (SE +/- 0.02, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 76.77 (SE +/- 0.05, N = 3)
Run 2: 77.16 (SE +/- 0.04, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

GB/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 2120.56 (SE +/- 7.35, N = 3)
Run 2: 2131.28 (SE +/- 5.26, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

FAHBench

Ns Per Day, More Is Better (FAHBench 2.3.2)
Run 1: 267.09 (SE +/- 0.04, N = 3)
Run 2: 265.83 (SE +/- 0.14, N = 3)

Blender

Blend File: Fishy Cat - Compute: CUDA

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 54.20 (SE +/- 0.02, N = 3)
Run 2: 54.44 (SE +/- 0.04, N = 3)

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 35.70 (SE +/- 0.01, N = 3)
Run 2: 35.84 (SE +/- 0.03, N = 3)

ArrayFire

Test: Conjugate Gradient OpenCL

ms, Fewer Is Better (ArrayFire 3.7)
Run 1: 2.086 (SE +/- 0.000, N = 3)
Run 2: 2.094 (SE +/- 0.000, N = 3)
(CXX) g++ options: -rdynamic

LuxCoreRender OpenCL

Scene: DLSC

M samples/sec, More Is Better (LuxCoreRender OpenCL 2.3)
Run 1: 7.97 (SE +/- 0.00, N = 3; MIN: 7.86 / MAX: 8.17)
Run 2: 7.94 (SE +/- 0.00, N = 3; MIN: 7.82 / MAX: 8.14)

RealSR-NCNN

Scale: 4x - TAA: Yes

Seconds, Fewer Is Better (RealSR-NCNN 20200818)
Run 1: 50.06 (SE +/- 0.09, N = 3)
Run 2: 49.99 (SE +/- 0.07, N = 3)
Run 3: 50.18 (SE +/- 0.08, N = 3)

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

vpaths, More Is Better (Chaos Group V-RAY 5)
Run 1: 1342 (SE +/- 1.20, N = 3)
Run 2: 1337 (SE +/- 0.67, N = 3)

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 16.11 (SE +/- 0.04, N = 3)
Run 2: 16.17 (SE +/- 0.04, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 190.29 (SE +/- 0.01, N = 3)
Run 2: 190.98 (SE +/- 0.01, N = 3)

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

H/s, More Is Better (Hashcat 6.1.1)
Run 1: 501033 (SE +/- 1550.63, N = 3)
Run 2: 502833 (SE +/- 1197.68, N = 3)

NCNN

Target: Vulkan GPU - Model: vgg16

ms, Fewer Is Better (NCNN 20201218)
Run 1: 55.71 (SE +/- 0.18, N = 3; MIN: 51.94 / MAX: 108.04)
Run 2: 55.89 (SE +/- 0.11, N = 3; MIN: 52.71 / MAX: 91.56)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: OpenCL BLAS - sDOT

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 325
Run 2: 324
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: SHA-512

H/s, More Is Better (Hashcat 6.1.1)
Run 1: 1664800000 (SE +/- 723417.81, N = 3)
Run 2: 1669566667 (SE +/- 866666.67, N = 3)

ViennaCL

Test: OpenCL BLAS - sAXPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 358
Run 2: 357
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 365
Run 2: 364
SE +/- 0.33, N = 3 (shown for one run only)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

M samples/s, More Is Better (IndigoBench 4.4)
Run 1: 12.92 (SE +/- 0.02, N = 3)
Run 2: 12.88 (SE +/- 0.02, N = 3)

VkResample

Upscale: 2x - Precision: Double

ms, Fewer Is Better (VkResample 1.0)
Run 1: 220.45 (SE +/- 0.07, N = 3)
Run 2: 221.04 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O3 -pthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

GFLOPS, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 23117.8 (SE +/- 31.09, N = 3)
Run 2: 23179.0 (SE +/- 49.54, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

ViennaCL

Test: OpenCL BLAS - dAXPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 396
Run 2: 395
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

M samples/s, More Is Better (IndigoBench 4.4)
Run 1: 37.26 (SE +/- 0.03, N = 3)
Run 2: 37.16 (SE +/- 0.02, N = 3)

ViennaCL

Test: OpenCL BLAS - dDOT

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 397
Run 2: 396
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

VkResample

Upscale: 2x - Precision: Single

ms, Fewer Is Better (VkResample 1.0)
Run 1: 17.49 (SE +/- 0.01, N = 3)
Run 2: 17.53 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -pthread

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 465.14 (SE +/- 1.92, N = 3)
Run 2: 464.01 (SE +/- 1.26, N = 3)

cl-mem

Benchmark: Copy

GB/s, More Is Better (cl-mem 2017-01-13)
Run 1: 297.6 (SE +/- 0.17, N = 3)
Run 2: 296.9 (SE +/- 0.20, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

Seconds, Fewer Is Better (Betsy GPU Compressor 1.1 Beta)
Run 1: 4.309 (SE +/- 0.010, N = 3)
Run 2: 4.300 (SE +/- 0.019, N = 3)
(CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

Seconds, Fewer Is Better (Betsy GPU Compressor 1.1 Beta)
Run 1: 6.039 (SE +/- 0.025, N = 3)
Run 2: 6.051 (SE +/- 0.015, N = 3)
(CXX) g++ options: -O3 -O2 -lpthread -ldl

Blender

Blend File: Barbershop - Compute: CUDA

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 509.33 (SE +/- 0.43, N = 3)
Run 2: 508.38 (SE +/- 0.25, N = 3)

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 48.17 (SE +/- 0.02, N = 3)
Run 2: 48.26 (SE +/- 0.03, N = 3)

Hashcat

Benchmark: SHA1

H/s, More Is Better (Hashcat 6.1.1)
Run 1: 13120133333 (SE +/- 21817526.08, N = 3)
Run 2: 13144266667 (SE +/- 5394235.61, N = 3)

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

vrays, More Is Better (Chaos Group V-RAY 5)
Run 1: 1713
Run 2: 1710
SE +/- 2.00, N = 3 (shown for one run only)

Hashcat

Benchmark: MD5

H/s, More Is Better (Hashcat 6.1.1)
Run 1: 38778833333 (SE +/- 49139065.70, N = 3)
Run 2: 38839033333 (SE +/- 42362968.63, N = 3)

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

M samples/sec, More Is Better (LuxCoreRender OpenCL 2.3)
Run 1: 6.51 (SE +/- 0.00, N = 3; MIN: 0.27 / MAX: 7.46)
Run 2: 6.50 (SE +/- 0.01, N = 3; MIN: 0.32 / MAX: 7.45)

OctaneBench

Total Score

Score, More Is Better (OctaneBench 2020.1)
Run 1: 410.51
Run 2: 411.11

Blender

Blend File: BMW27 - Compute: CUDA

Seconds, Fewer Is Better (Blender 2.92)
Run 1: 29.07 (SE +/- 0.03, N = 3)
Run 2: 29.03 (SE +/- 0.01, N = 3)

RealSR-NCNN

Scale: 4x - TAA: No

Seconds, Fewer Is Better (RealSR-NCNN 20200818)
Run 1: 8.438 (SE +/- 0.085, N = 3)
Run 2: 8.427 (SE +/- 0.093, N = 3)
Run 3: 8.437 (SE +/- 0.096, N = 3)

FinanceBench

Benchmark: Black-Scholes OpenCL

ms, Fewer Is Better (FinanceBench 2016-07-25)
Run 1: 10.48 (SE +/- 0.01, N = 3)
Run 2: 10.50 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

GB/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 26.28 (SE +/- 0.03, N = 3)
Run 2: 26.31 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

GFLOPS, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 1134.99 (SE +/- 0.71, N = 3)
Run 2: 1133.82 (SE +/- 0.99, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

Benchmark: Read

GB/s, More Is Better (cl-mem 2017-01-13)
Run 1: 393.6 (SE +/- 0.07, N = 3)
Run 2: 393.2 (SE +/- 0.36, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

GHash/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 25.47 (SE +/- 0.02, N = 3)
Run 2: 25.49 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

MandelGPU

OpenCL Device: GPU

Samples/sec, More Is Better (MandelGPU 1.3pts1)
Run 1: 319970658.9 (SE +/- 682590.08, N = 3)
Run 2: 319688588.0 (SE +/- 1129798.58, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

cl-mem

Benchmark: Write

GB/s, More Is Better (cl-mem 2017-01-13)
Run 1: 380.2 (SE +/- 0.06, N = 3)
Run 2: 379.9 (SE +/- 0.13, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

GFLOPS, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 218.44 (SE +/- 0.27, N = 3)
Run 2: 218.61 (SE +/- 0.28, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

LuxCoreRender OpenCL

Scene: Rainbow Colors and Prism

M samples/sec, More Is Better (LuxCoreRender OpenCL 2.3)
Run 1: 19.18 (SE +/- 0.04, N = 3; MIN: 17.88 / MAX: 20.08)
Run 2: 19.19 (SE +/- 0.02, N = 3; MIN: 17.89 / MAX: 20.09)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

GB/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 325.67 (SE +/- 0.56, N = 3)
Run 2: 325.80 (SE +/- 0.46, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

GB/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 26.40 (SE +/- 0.01, N = 3)
Run 2: 26.39 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenCL Test: Global Memory Bandwidth

GBPS, More Is Better (clpeak)
Run 1: 389.57 (SE +/- 0.02, N = 3)
Run 2: 389.61 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: 7-Zip

H/s, More Is Better (Hashcat 6.1.1)
Run 1: 686733 (SE +/- 1386.04, N = 3)
Run 2: 686700 (SE +/- 1069.27, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

GB/s, More Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
Run 1: 24.67 (SE +/- 0.01, N = 3)
Run 2: 24.67 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

ViennaCL

Test: OpenCL BLAS - sCOPY

GB/s, More Is Better (ViennaCL 1.7.1)
Run 1: 293
Run 2: 293
SE +/- 0.58, N = 3 (shown for one run only)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

LuxCoreRender OpenCL

Scene: Food

M samples/sec, More Is Better (LuxCoreRender OpenCL 2.3)
Run 1: 3.39 (SE +/- 0.03, N = 3; MIN: 0.23 / MAX: 4.24)
Run 2: 3.39 (SE +/- 0.02, N = 3; MIN: 0.26 / MAX: 4.22)

RedShift Demo

Seconds, Fewer Is Better (RedShift Demo 3.0)
Run 1: 228 (SE +/- 0.88, N = 3)
Run 2: 228 (SE +/- 1.00, N = 3)

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

ms, Fewer Is Better (NCNN 20201218)
Run 1: 22.20 (SE +/- 0.01, N = 3; MIN: 21.01 / MAX: 44.66)
Run 2: 23.73 (SE +/- 1.20, N = 3; MIN: 21.14 / MAX: 142.5)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

ms, Fewer Is Better (NCNN 20201218)
Run 1: 4.22 (SE +/- 0.06, N = 3; MIN: 3.9 / MAX: 30.64)
Run 2: 4.24 (SE +/- 0.15, N = 3; MIN: 3.8 / MAX: 24.56)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.4