20210110 ubuntu studio 20.10 performance AMD Ryzen 7 3700X 8-Core testing with a ASRock X570 Phantom Gaming-ITX/TB3 (P2.80 BIOS) and Zotac NVIDIA GeForce RTX 2060 SUPER 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101114-HA-20210110U51&gru .
20210110 ubuntu studio 20.10 performance Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Display Server Display Driver OpenCL Vulkan Compiler File-System Screen Resolution scaling_governer performance AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads) ASRock X570 Phantom Gaming-ITX/TB3 (P2.80 BIOS) AMD Starship/Matisse 32GB 500GB Western Digital WDS500G3X0C-00SJG0 + 4 x 2000GB Seagate ST2000LM015-2E81 Zotac NVIDIA GeForce RTX 2060 SUPER 8GB NVIDIA TU106 HD Audio Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.10 5.8.0-36-lowlatency (x86_64) X Server 1.20.9 modesetting 1.20.9 OpenCL 1.2 CUDA 11.2.109 1.2.155 GCC 10.2.0 + CUDA 11.0 btrfs 640x480 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8701021 - Python 3.8.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
20210110 ubuntu studio 20.10 performance cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Global Memory Bandwidth mixbench: OpenCL - Double Precision mixbench: OpenCL - Single Precision mixbench: NVIDIA CUDA - Half Precision mixbench: NVIDIA CUDA - Double Precision mixbench: NVIDIA CUDA - Single Precision viennacl: OpenCL LU Factorization clpeak: Single-Precision Float clpeak: Double-Precision Double mixbench: OpenCL - Integer mixbench: NVIDIA CUDA - Integer clpeak: Integer Compute INT hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS lczero: OpenCL fahbench: financebench: Black-Scholes OpenCL arrayfire: Conjugate Gradient OpenCL ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m blender: BMW27 - CUDA blender: Classroom - CUDA blender: Fishy Cat - CUDA blender: Barbershop - CUDA blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - CUDA blender: Pabellon Barcelona - NVIDIA OptiX scaling_governer performance 284.8 398.1 357.5 368.02 220.57 8175.98 16380.84 220.56 8224.48 76.0240 6738.04 259.49 8111.60 7329.5 6702.98 28558000000 9097633333 478467 1243000000 336267 16705 202.1038 10.513 2.074 18.55 6.03 5.09 6.65 5.05 8.00 2.27 16.42 58.82 16.95 13.46 31.12 27.20 22.57 18.65 66.95 303.94 125.22 1069.59 35.93 146.29 60.29 1789.88 560.63 197.31 OpenBenchmarking.org
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy scaling_governer performance 60 120 180 240 300 SE +/- 0.03, N = 3 284.8 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read scaling_governer performance 90 180 270 360 450 SE +/- 0.03, N = 3 398.1 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write scaling_governer performance 80 160 240 320 400 SE +/- 0.30, N = 3 357.5 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth scaling_governer performance 80 160 240 320 400 SE +/- 0.01, N = 3 368.02 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Mixbench Backend: OpenCL - Benchmark: Double Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision scaling_governer performance 50 100 150 200 250 SE +/- 0.00, N = 3 220.57 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: OpenCL - Benchmark: Single Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision scaling_governer performance 2K 4K 6K 8K 10K SE +/- 3.02, N = 3 8175.98 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Half Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision scaling_governer performance 4K 8K 12K 16K 20K SE +/- 2.02, N = 3 16380.84 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Double Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision scaling_governer performance 50 100 150 200 250 SE +/- 0.00, N = 3 220.56 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Single Precision OpenBenchmarking.org GFLOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision scaling_governer performance 2K 4K 6K 8K 10K SE +/- 0.90, N = 3 8224.48 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
ViennaCL OpenCL LU Factorization OpenBenchmarking.org GFLOPS, More Is Better ViennaCL 1.4.2 OpenCL LU Factorization scaling_governer performance 20 40 60 80 100 SE +/- 0.26, N = 3 76.02 1. (CXX) g++ options: -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float scaling_governer performance 1400 2800 4200 5600 7000 SE +/- 4.21, N = 3 6738.04 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double scaling_governer performance 60 120 180 240 300 SE +/- 0.64, N = 3 259.49 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Mixbench Backend: OpenCL - Benchmark: Integer OpenBenchmarking.org GIOPS, More Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer scaling_governer performance 2K 4K 6K 8K 10K SE +/- 19.52, N = 3 8111.60 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
Mixbench Backend: NVIDIA CUDA - Benchmark: Integer OpenBenchmarking.org GIOPS, More Is Better Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer scaling_governer performance 1600 3200 4800 6400 8000 SE +/- 0.18, N = 3 7329.5 1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT scaling_governer performance 1400 2800 4200 5600 7000 SE +/- 56.50, N = 13 6702.98 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: MD5 scaling_governer performance 6000M 12000M 18000M 24000M 30000M SE +/- 39471677.61, N = 3 28558000000
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA1 scaling_governer performance 2000M 4000M 6000M 8000M 10000M SE +/- 19168231.82, N = 3 9097633333
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: 7-Zip scaling_governer performance 100K 200K 300K 400K 500K SE +/- 352.77, N = 3 478467
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: SHA-512 scaling_governer performance 300M 600M 900M 1200M 1500M SE +/- 2300724.52, N = 3 1243000000
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS scaling_governer performance 70K 140K 210K 280K 350K SE +/- 1156.62, N = 3 336267
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.26 Backend: OpenCL scaling_governer performance 4K 8K 12K 16K 20K SE +/- 90.96, N = 3 16705 1. (CXX) g++ options: -flto -pthread
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 scaling_governer performance 40 80 120 160 200 SE +/- 0.30, N = 3 202.10
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-06-06 Benchmark: Black-Scholes OpenCL scaling_governer performance 3 6 9 12 15 SE +/- 0.00, N = 3 10.51 1. (CXX) g++ options: -O3 -lOpenCL
ArrayFire Test: Conjugate Gradient OpenCL OpenBenchmarking.org ms, Fewer Is Better ArrayFire 3.7 Test: Conjugate Gradient OpenCL scaling_governer performance 0.4667 0.9334 1.4001 1.8668 2.3335 SE +/- 0.001, N = 3 2.074 1. (CXX) g++ options: -rdynamic
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mobilenet scaling_governer performance 5 10 15 20 25 SE +/- 0.06, N = 3 18.55 MIN: 18.33 / MAX: 18.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 scaling_governer performance 2 4 6 8 10 SE +/- 0.04, N = 3 6.03 MIN: 5.9 / MAX: 7.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 scaling_governer performance 1.1453 2.2906 3.4359 4.5812 5.7265 SE +/- 0.04, N = 3 5.09 MIN: 4.99 / MAX: 5.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: shufflenet-v2 scaling_governer performance 2 4 6 8 10 SE +/- 0.01, N = 3 6.65 MIN: 6.55 / MAX: 6.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: mnasnet scaling_governer performance 1.1363 2.2726 3.4089 4.5452 5.6815 SE +/- 0.04, N = 3 5.05 MIN: 4.95 / MAX: 5.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: efficientnet-b0 scaling_governer performance 2 4 6 8 10 SE +/- 0.06, N = 3 8.00 MIN: 7.83 / MAX: 8.84 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: blazeface scaling_governer performance 0.5108 1.0216 1.5324 2.0432 2.554 SE +/- 0.01, N = 3 2.27 MIN: 2.22 / MAX: 2.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: googlenet scaling_governer performance 4 8 12 16 20 SE +/- 0.35, N = 3 16.42 MIN: 15.88 / MAX: 99.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: vgg16 scaling_governer performance 13 26 39 52 65 SE +/- 0.14, N = 3 58.82 MIN: 58.32 / MAX: 63.51 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet18 scaling_governer performance 4 8 12 16 20 SE +/- 0.17, N = 3 16.95 MIN: 16.63 / MAX: 19.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: alexnet scaling_governer performance 3 6 9 12 15 SE +/- 0.03, N = 3 13.46 MIN: 13.25 / MAX: 13.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: resnet50 scaling_governer performance 7 14 21 28 35 SE +/- 0.33, N = 3 31.12 MIN: 30.51 / MAX: 34.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: yolov4-tiny scaling_governer performance 6 12 18 24 30 SE +/- 0.40, N = 3 27.20 MIN: 26.6 / MAX: 28.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: squeezenet_ssd scaling_governer performance 5 10 15 20 25 SE +/- 0.05, N = 3 22.57 MIN: 22.3 / MAX: 22.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20201218 Target: Vulkan GPU - Model: regnety_400m scaling_governer performance 5 10 15 20 25 SE +/- 0.08, N = 3 18.65 MIN: 18.37 / MAX: 19.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender Blend File: BMW27 - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: CUDA scaling_governer performance 15 30 45 60 75 SE +/- 0.17, N = 3 66.95
Blender Blend File: Classroom - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: CUDA scaling_governer performance 70 140 210 280 350 SE +/- 0.16, N = 3 303.94
Blender Blend File: Fishy Cat - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: CUDA scaling_governer performance 30 60 90 120 150 SE +/- 0.57, N = 3 125.22
Blender Blend File: Barbershop - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: CUDA scaling_governer performance 200 400 600 800 1000 SE +/- 0.76, N = 3 1069.59
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: BMW27 - Compute: NVIDIA OptiX scaling_governer performance 8 16 24 32 40 SE +/- 3.91, N = 15 35.93
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Classroom - Compute: NVIDIA OptiX scaling_governer performance 30 60 90 120 150 SE +/- 0.40, N = 3 146.29
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Fishy Cat - Compute: NVIDIA OptiX scaling_governer performance 13 26 39 52 65 SE +/- 0.03, N = 3 60.29
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Barbershop - Compute: NVIDIA OptiX scaling_governer performance 400 800 1200 1600 2000 SE +/- 6.24, N = 3 1789.88
Blender Blend File: Pabellon Barcelona - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: CUDA scaling_governer performance 120 240 360 480 600 SE +/- 0.63, N = 3 560.63
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.90 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX scaling_governer performance 40 80 120 160 200 SE +/- 0.04, N = 3 197.31
Phoronix Test Suite v10.8.5