opencl-perf AMD Ryzen 7 2700 Eight-Core testing with a MSI X470 GAMING PRO (MS-7B79) v1.0 (1.90 BIOS) and AMD Vega 20 16GB on Ubuntu 18.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1908073-HV-OPENCLPER74 .
opencl-perf Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core AMD Ryzen 7 2700 Eight-Core @ 3.20GHz (8 Cores / 16 Threads) MSI X470 GAMING PRO (MS-7B79) v1.0 (1.90 BIOS) AMD 17h 64512MB 2000GB Samsung SSD 970 EVO Plus 2TB + 120GB OCZ VERTEX3 AMD Vega 20 16GB (1802/1001MHz) AMD Device ab20 HP E243i Realtek RTL8111/8168/8411 Ubuntu 18.04 5.0.0-23-generic (x86_64) GNOME Shell 3.28.4 X Server 1.20.4 modesetting 1.20.4 GCC 7.4.0 ext4 1920x1200 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
opencl-perf shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write rodinia: OpenCL Myocyte rodinia: OpenCL Heartwall smallpt-gpu: GPU - 1920 x 1200 - Caustic smallpt-gpu: GPU - 1920 x 1200 - Cornell smallpt-gpu: GPU - 1920 x 1200 - Caustic3 clpeak: Kernel Latency clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 6.71 2463.22 20.73 693037 7.16 7.16 444.51 312.73 737.90 686.03 129.54 2.98 1565174972 1565175108 1565175246 15.19 4487.43 13681.99 3441.42 801.57 13.20 30.88 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 2 4 6 8 10 SE +/- 0.01, N = 3 6.71 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 500 1000 1500 2000 2500 SE +/- 4.31, N = 3 2463.22 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 5 10 15 20 25 SE +/- 0.00, N = 3 20.73 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 150K 300K 450K 600K 750K SE +/- 848.57, N = 3 693037 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 2 4 6 8 10 SE +/- 0.00, N = 3 7.16 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 2 4 6 8 10 SE +/- 0.00, N = 3 7.16 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 100 200 300 400 500 SE +/- 0.75, N = 3 444.51 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 70 140 210 280 350 SE +/- 1.04, N = 3 312.73 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 160 320 480 640 800 SE +/- 3.46, N = 3 737.90 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 150 300 450 600 750 SE +/- 0.15, N = 3 686.03 1. (CC) gcc options: -O2 -flto -lOpenCL
Rodinia Test: OpenCL Myocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Myocyte AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 30 60 90 120 150 SE +/- 0.10, N = 3 129.54 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenCL Heartwall AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.01, N = 3 2.98 1. (CXX) g++ options: -O2 -lOpenCL
SmallPT GPU OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 300M 600M 900M 1200M 1500M SE +/- 25.40, N = 3 1565174972 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Cornell AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 300M 600M 900M 1200M 1500M SE +/- 25.12, N = 3 1565175108 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3 AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 300M 600M 900M 1200M 1500M SE +/- 25.40, N = 3 1565175246 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 4 8 12 16 20 SE +/- 0.16, N = 3 15.19 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 1000 2000 3000 4000 5000 SE +/- 0.49, N = 3 4487.43 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 3K 6K 9K 12K 15K SE +/- 1.49, N = 3 13681.99 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 700 1400 2100 2800 3500 SE +/- 0.17, N = 3 3441.42 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 200 400 600 800 1000 SE +/- 0.05, N = 3 801.57 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 3 6 9 12 15 SE +/- 0.18, N = 3 13.20 1. (CXX) g++ options: -O3 -rdynamic
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core 7 14 21 28 35 SE +/- 0.02, N = 3 30.88 1. (CXX) g++ options: -O3 -rdynamic
Phoronix Test Suite v10.8.4