opencl-perf

AMD Ryzen 7 2700 Eight-Core testing with a MSI X470 GAMING PRO (MS-7B79) v1.0 (1.90 BIOS) and AMD Vega 20 16GB on Ubuntu 18.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1908073-HV-OPENCLPER74.

opencl-perfProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionAMD Vega 20 - AMD Ryzen 7 2700 Eight-CoreAMD Ryzen 7 2700 Eight-Core @ 3.20GHz (8 Cores / 16 Threads)MSI X470 GAMING PRO (MS-7B79) v1.0 (1.90 BIOS)AMD 17h64512MB2000GB Samsung SSD 970 EVO Plus 2TB + 120GB OCZ VERTEX3AMD Vega 20 16GB (1802/1001MHz)AMD Device ab20HP E243iRealtek RTL8111/8168/8411Ubuntu 18.045.0.0-23-generic (x86_64)GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.4GCC 7.4.0ext41920x1200OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

opencl-perfshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writerodinia: OpenCL Myocyterodinia: OpenCL Heartwallsmallpt-gpu: GPU - 1920 x 1200 - Causticsmallpt-gpu: GPU - 1920 x 1200 - Cornellsmallpt-gpu: GPU - 1920 x 1200 - Caustic3clpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core6.712463.2220.736930377.167.16444.51312.73737.90686.03129.542.9815651749721565175108156517524615.194487.4313681.993441.42801.5713.2030.88OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core246810SE +/- 0.01, N = 36.711. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core5001000150020002500SE +/- 4.31, N = 32463.221. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core510152025SE +/- 0.00, N = 320.731. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core150K300K450K600K750KSE +/- 848.57, N = 36930371. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core246810SE +/- 0.00, N = 37.161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core246810SE +/- 0.00, N = 37.161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core100200300400500SE +/- 0.75, N = 3444.511. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core70140210280350SE +/- 1.04, N = 3312.731. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core160320480640800SE +/- 3.46, N = 3737.901. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core150300450600750SE +/- 0.15, N = 3686.031. (CC) gcc options: -O2 -flto -lOpenCL

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core306090120150SE +/- 0.10, N = 3129.541. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Heartwall

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core0.67051.3412.01152.6823.3525SE +/- 0.01, N = 32.981. (CXX) g++ options: -O2 -lOpenCL

SmallPT GPU

OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: CausticAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core300M600M900M1200M1500MSE +/- 25.40, N = 315651749721. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: CornellAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core300M600M900M1200M1500MSE +/- 25.12, N = 315651751081. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3AMD Vega 20 - AMD Ryzen 7 2700 Eight-Core300M600M900M1200M1500MSE +/- 25.40, N = 315651752461. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core48121620SE +/- 0.16, N = 315.191. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core10002000300040005000SE +/- 0.49, N = 34487.431. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core3K6K9K12K15KSE +/- 1.49, N = 313681.991. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core7001400210028003500SE +/- 0.17, N = 33441.421. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core2004006008001000SE +/- 0.05, N = 3801.571. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core3691215SE +/- 0.18, N = 313.201. (CXX) g++ options: -O3 -rdynamic

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferAMD Vega 20 - AMD Ryzen 7 2700 Eight-Core714212835SE +/- 0.02, N = 330.881. (CXX) g++ options: -O3 -rdynamic


Phoronix Test Suite v10.8.4