RTX 3070 Compute

AMD Ryzen 9 5900X 12-Core testing with a ASUS ROG CROSSHAIR VIII HERO (3402 BIOS) and NVIDIA GeForce RTX 3070 8GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104078-IB-RTX3070CO18&grw&sor&rro.

RTX 3070 ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen 9 5900X 12-Core @ 3.70GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (3402 BIOS)AMD Starship/Matisse16GB1000GB Sabrent Rocket 4.0 Plus + 2000GBNVIDIA GeForce RTX 3070 8GBNVIDIA Device 228bASUS VP28URealtek RTL8125 2.5GbE + Intel I211Ubuntu 20.045.8.0-48-generic (x86_64)GNOME Shell 3.36.7X Server 1.20.9NVIDIA 460.674.6.0OpenCL 1.2 CUDA 11.2.1621.2.155GCC 9.3.0 + CUDA 11.2ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details- GPU Compute Cores: 5888Python Details- 1: Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

RTX 3070 Computebetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestlczero: OpenCLshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetgromacs-gpu: Water Benchmarkncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50luxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: DLSCluxcorerender-cl: Foodncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mrodinia: OpenCL Particle Filterarrayfire: Conjugate Gradient OpenCLv-ray: NVIDIA RTX GPUluxcorerender-cl: Rainbow Colors and Prismv-ray: NVIDIA CUDA GPUblender: BMW27 - CUDAblender: Classroom - CUDAblender: Fishy Cat - CUDAblender: Barbershop - CUDAblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA OptiXblender: Barbershop - NVIDIA OptiXblender: Pabellon Barcelona - CUDAblender: Pabellon Barcelona - NVIDIA OptiXindigobench: OpenCL GPU - Bedroomindigobench: OpenCL GPU - Supercarfahbench: hashcat: MD5hashcat: SHA1hashcat: 7-Ziphashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSmixbench: OpenCL - Integermixbench: OpenCL - Double Precisionmixbench: OpenCL - Single Precisionnamd-cuda: ATPase Simulation - 327,506 Atomsoctanebench: Total Scoreredshift: financebench: Black-Scholes OpenCLcl-mem: Copycl-mem: Readcl-mem: Writeclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthmandelgpu: GPUviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNrealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yesvkfft: vkresample: 2x - Doublevkresample: 2x - Singlewaifu2x-ncnn: 2x - 3 - Yes1234.3096.03928914218.44024.67211134.9925.4712325.6713806.5723117.826.281426.39502120.5612.854.394.224.854.055.581.8413.278.13855.7113.9511.0824.476.517.973.3922.2014.4916.755.9582.086171319.18134229.0776.1054.20509.3316.1148.1735.70465.14190.2976.7712.91737.257267.09013877883333313120133333686733166480000050103311433.74295.1122081.790.13273410.51404322810.483297.6393.6380.210264.2820099.75360.90389.57319970658.962.094.014123.234.544.376.581.354.753.457.055.32933583253653963972223343423433428.43850.05832004220.45217.4894.2974.3006.05129117218.60524.67141133.8225.4940325.8013769.3123179.026.309726.39092131.2813.564.354.244.924.105.621.9013.178.08355.8914.0311.2224.756.507.943.3923.7315.4516.876.0162.094171019.19133729.0376.4954.44508.3816.1748.2635.84464.01190.9877.1612.88237.163265.83243883903333313144266667686700166956666750283311336.97299.6521965.660.13388411.11167522810.496296.9393.2379.910202.3919991.73364.99389.61319688588.063.492.813922.633.443.577.081.953.552.755.854.52933573243643953962203323383403368.42749.99432323221.03717.5324.3038.43750.1814.321OpenBenchmarking.org

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest120.96951.9392.90853.8784.8475SE +/- 0.010, N = 3SE +/- 0.019, N = 34.3094.3001. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest21246810SE +/- 0.015, N = 3SE +/- 0.025, N = 36.0516.0391. (CXX) g++ options: -O3 -O2 -lpthread -ldl

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCL126K12K18K24K30KSE +/- 74.99, N = 3SE +/- 234.03, N = 328914291171. (CXX) g++ options: -flto -pthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3D1250100150200250SE +/- 0.27, N = 3SE +/- 0.28, N = 3218.44218.611. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triad21612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 324.6724.671. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SP212004006008001000SE +/- 0.99, N = 3SE +/- 0.71, N = 31133.821134.991. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hash12612182430SE +/- 0.02, N = 3SE +/- 0.04, N = 325.4725.491. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Reduction1270140210280350SE +/- 0.56, N = 3SE +/- 0.46, N = 3325.67325.801. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_N218001600240032004000SE +/- 22.78, N = 3SE +/- 10.00, N = 33769.313806.571. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flops125K10K15K20K25KSE +/- 31.09, N = 3SE +/- 49.54, N = 323117.823179.01. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Download12612182430SE +/- 0.03, N = 3SE +/- 0.03, N = 326.2826.311. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readback21612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 326.3926.401. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidth125001000150020002500SE +/- 7.35, N = 3SE +/- 5.26, N = 32120.562131.281. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet213691215SE +/- 0.01, N = 3SE +/- 0.08, N = 313.5612.85MIN: 12.13 / MAX: 52.71MIN: 11.95 / MAX: 34.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2120.98781.97562.96343.95124.939SE +/- 0.04, N = 3SE +/- 0.05, N = 34.394.35MIN: 4.12 / MAX: 5.78MIN: 3.99 / MAX: 6.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3210.9541.9082.8623.8164.77SE +/- 0.15, N = 3SE +/- 0.06, N = 34.244.22MIN: 3.8 / MAX: 24.56MIN: 3.9 / MAX: 30.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v2211.1072.2143.3214.4285.535SE +/- 0.02, N = 3SE +/- 0.06, N = 34.924.85MIN: 4.48 / MAX: 25.29MIN: 4.59 / MAX: 6.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet210.92251.8452.76753.694.6125SE +/- 0.01, N = 3SE +/- 0.11, N = 34.104.05MIN: 3.66 / MAX: 26.14MIN: 3.65 / MAX: 20.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0211.26452.5293.79355.0586.3225SE +/- 0.12, N = 3SE +/- 0.10, N = 35.625.58MIN: 5.1 / MAX: 21.88MIN: 5.12 / MAX: 20.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface210.42750.8551.28251.712.1375SE +/- 0.06, N = 3SE +/- 0.01, N = 31.901.84MIN: 1.73 / MAX: 11.77MIN: 1.75 / MAX: 3.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123691215SE +/- 0.08, N = 3SE +/- 0.27, N = 313.2713.17MIN: 12.01 / MAX: 35.62MIN: 11.88 / MAX: 33.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark21246810SE +/- 0.016, N = 3SE +/- 0.024, N = 38.0838.1381. (CXX) g++ options: -O3 -lpthread -ldl -lrt -lm

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16211326395265SE +/- 0.11, N = 3SE +/- 0.18, N = 355.8955.71MIN: 52.71 / MAX: 91.56MIN: 51.94 / MAX: 108.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet182148121620SE +/- 0.07, N = 3SE +/- 0.01, N = 314.0313.95MIN: 12.85 / MAX: 39.19MIN: 13.09 / MAX: 42.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet213691215SE +/- 0.07, N = 3SE +/- 0.03, N = 311.2211.08MIN: 10.16 / MAX: 47.07MIN: 10.26 / MAX: 26.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet5021612182430SE +/- 0.37, N = 3SE +/- 0.33, N = 324.7524.47MIN: 22.79 / MAX: 62.12MIN: 22.75 / MAX: 54.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LuxCoreRender OpenCL

Scene: LuxCore Benchmark

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore Benchmark21246810SE +/- 0.01, N = 3SE +/- 0.00, N = 36.506.51MIN: 0.32 / MAX: 7.45MIN: 0.27 / MAX: 7.46

LuxCoreRender OpenCL

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSC21246810SE +/- 0.00, N = 3SE +/- 0.00, N = 37.947.97MIN: 7.82 / MAX: 8.14MIN: 7.86 / MAX: 8.17

LuxCoreRender OpenCL

Scene: Food

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: Food120.76281.52562.28843.05123.814SE +/- 0.03, N = 3SE +/- 0.02, N = 33.393.39MIN: 0.23 / MAX: 4.24MIN: 0.26 / MAX: 4.22

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny21612182430SE +/- 1.20, N = 3SE +/- 0.01, N = 323.7322.20MIN: 21.14 / MAX: 142.5MIN: 21.01 / MAX: 44.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd2148121620SE +/- 0.12, N = 3SE +/- 0.15, N = 315.4514.49MIN: 13.94 / MAX: 74.44MIN: 13.53 / MAX: 35.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m2148121620SE +/- 0.32, N = 3SE +/- 0.25, N = 316.8716.75MIN: 15.42 / MAX: 40.88MIN: 15.59 / MAX: 33.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle Filter21246810SE +/- 0.023, N = 3SE +/- 0.014, N = 36.0165.9581. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCL210.47120.94241.41361.88482.356SE +/- 0.000, N = 3SE +/- 0.000, N = 32.0942.0861. (CXX) g++ options: -rdynamic

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

OpenBenchmarking.orgvrays, More Is BetterChaos Group V-RAY 5Mode: NVIDIA RTX GPU21400800120016002000SE +/- 2.00, N = 317101713

LuxCoreRender OpenCL

Scene: Rainbow Colors and Prism

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: Rainbow Colors and Prism12510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 319.1819.19MIN: 17.88 / MAX: 20.08MIN: 17.89 / MAX: 20.09

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 5Mode: NVIDIA CUDA GPU2130060090012001500SE +/- 0.67, N = 3SE +/- 1.20, N = 313371342

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CUDA12714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 329.0729.03

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CUDA2120406080100SE +/- 0.02, N = 3SE +/- 0.01, N = 376.4976.10

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CUDA211224364860SE +/- 0.04, N = 3SE +/- 0.02, N = 354.4454.20

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CUDA12110220330440550SE +/- 0.43, N = 3SE +/- 0.25, N = 3509.33508.38

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: NVIDIA OptiX2148121620SE +/- 0.04, N = 3SE +/- 0.04, N = 316.1716.11

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: NVIDIA OptiX211122334455SE +/- 0.03, N = 3SE +/- 0.02, N = 348.2648.17

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: NVIDIA OptiX21816243240SE +/- 0.03, N = 3SE +/- 0.01, N = 335.8435.70

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: NVIDIA OptiX12100200300400500SE +/- 1.92, N = 3SE +/- 1.26, N = 3465.14464.01

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CUDA214080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3190.98190.29

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX2120406080100SE +/- 0.04, N = 3SE +/- 0.05, N = 377.1676.77

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: Bedroom213691215SE +/- 0.02, N = 3SE +/- 0.02, N = 312.8812.92

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: Supercar21918273645SE +/- 0.02, N = 3SE +/- 0.03, N = 337.1637.26

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.22160120180240300SE +/- 0.14, N = 3SE +/- 0.04, N = 3265.83267.09

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5128000M16000M24000M32000M40000MSE +/- 49139065.70, N = 3SE +/- 42362968.63, N = 33877883333338839033333

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1123000M6000M9000M12000M15000MSE +/- 21817526.08, N = 3SE +/- 5394235.61, N = 31312013333313144266667

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-Zip21150K300K450K600K750KSE +/- 1069.27, N = 3SE +/- 1386.04, N = 3686700686733

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-51212400M800M1200M1600M2000MSE +/- 723417.81, N = 3SE +/- 866666.67, N = 316648000001669566667

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTS12110K220K330K440K550KSE +/- 1550.63, N = 3SE +/- 1197.68, N = 3501033502833

Mixbench

Backend: OpenCL - Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Integer212K4K6K8K10KSE +/- 53.09, N = 3SE +/- 3.19, N = 311336.9711433.741. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Double Precision1270140210280350SE +/- 1.86, N = 3SE +/- 0.76, N = 3295.11299.651. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Single Precision215K10K15K20K25KSE +/- 109.50, N = 3SE +/- 8.94, N = 321965.6622081.791. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 Atoms210.03010.06020.09030.12040.1505SE +/- 0.00147, N = 3SE +/- 0.00056, N = 30.133880.13273

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total Score1290180270360450410.51411.11

RedShift Demo

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.02150100150200250SE +/- 1.00, N = 3SE +/- 0.88, N = 3228228

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCL213691215SE +/- 0.04, N = 3SE +/- 0.01, N = 310.5010.481. (CXX) g++ options: -O3 -march=native -fopenmp

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copy2160120180240300SE +/- 0.20, N = 3SE +/- 0.17, N = 3296.9297.61. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Read2190180270360450SE +/- 0.36, N = 3SE +/- 0.07, N = 3393.2393.61. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Write2180160240320400SE +/- 0.13, N = 3SE +/- 0.06, N = 3379.9380.21. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INT212K4K6K8K10KSE +/- 86.71, N = 3SE +/- 105.22, N = 510202.3910264.281. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Float214K8K12K16K20KSE +/- 109.12, N = 3SE +/- 2.13, N = 319991.7320099.751. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Double1280160240320400SE +/- 0.04, N = 3SE +/- 0.03, N = 3360.90364.991. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidth1280160240320400SE +/- 0.02, N = 3SE +/- 0.03, N = 3389.57389.611. (CXX) g++ options: -O3 -rdynamic -lOpenCL

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPU2170M140M210M280M350MSE +/- 1129798.58, N = 3SE +/- 682590.08, N = 3319688588.0319970658.91. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY121428425670SE +/- 0.84, N = 3SE +/- 0.59, N = 362.063.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY2120406080100SE +/- 2.27, N = 3SE +/- 0.87, N = 392.894.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT21306090120150SE +/- 3.53, N = 3SE +/- 1.15, N = 31391411. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY21612182430SE +/- 0.74, N = 3SE +/- 0.10, N = 322.623.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY21816243240SE +/- 0.65, N = 3SE +/- 0.22, N = 333.434.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT211020304050SE +/- 0.62, N = 3SE +/- 0.42, N = 343.544.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N1220406080100SE +/- 0.45, N = 3SE +/- 0.49, N = 376.577.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T1220406080100SE +/- 1.43, N = 3SE +/- 0.57, N = 381.381.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN211224364860SE +/- 1.00, N = 3SE +/- 0.10, N = 353.554.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT211224364860SE +/- 0.70, N = 3SE +/- 0.15, N = 352.753.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN211326395265SE +/- 1.22, N = 3SE +/- 0.12, N = 355.857.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT211224364860SE +/- 0.97, N = 3SE +/- 0.15, N = 354.555.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPY1260120180240300SE +/- 0.58, N = 32932931. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPY21801602403204003573581. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOT21701402102803503243251. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPY2180160240320400SE +/- 0.33, N = 33643651. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPY21901802703604503953961. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOT21901802703604503963971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-N21501001502002502202221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-T21701402102803503323341. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NN2170140210280350SE +/- 0.67, N = 3SE +/- 1.00, N = 33383421. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NT2170140210280350SE +/- 1.50, N = 23403431. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TN21701402102803503363421. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: No132246810SE +/- 0.085, N = 3SE +/- 0.096, N = 3SE +/- 0.093, N = 38.4388.4378.427

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: Yes3121122334455SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 350.1850.0649.99

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1127K14K21K28K35KSE +/- 238.49, N = 3SE +/- 422.26, N = 332004323231. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double2150100150200250SE +/- 0.07, N = 3SE +/- 0.07, N = 3221.04220.451. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single2148121620SE +/- 0.01, N = 3SE +/- 0.01, N = 317.5317.491. (CXX) g++ options: -O3 -pthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes3210.97221.94442.91663.88884.861SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 34.3214.3034.297


Phoronix Test Suite v10.8.5