PlaidML OpenCL Linux GPU Benchmarks AMD Radeon + NVIDIA GeForce

PlaidML Linux benchmarks for a future article on Phoronix.com by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1901122-PTS-PLAIDMLG16&grs&rdt.

System configuration (as listed for the first run, RX Vega 56)

  Processor:          AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
  Motherboard:        ASUS ROG ZENITH EXTREME (1601 BIOS)
  Chipset:            AMD Family 17h
  Memory:             32768MB
  Disk:               Samsung SSD 970 EVO 500GB
  Audio:              Realtek ALC1220
  Monitor:            ASUS VP28U
  Network:            Intel I211 Gigabit + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad
  OS:                 Ubuntu 18.10
  Kernel:             4.19.0-041900-generic (x86_64)
  Desktop:            GNOME Shell 3.30.1
  Display Server:     X Server 1.20.1
  Display Driver:     amdgpu 18.1.0
  OpenGL:             4.5 Mesa 18.2.2 (LLVM 7.0.0)
  Compiler:           GCC 8.2.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Graphics, per run

  RX Vega 56:   AMD Radeon RX 64 8GB (1590/800MHz)
  RX Vega 64:   AMD Radeon RX 64 8GB (1630/945MHz)
  RX 580:       MSI AMD Radeon RX 470/480/570/570X/580/580X 8GB (1366/2000MHz)
  RX 590:       Sapphire AMD Radeon RX 470/480/570/570X/580/580X 8GB (1560/2100MHz)
  RTX 2080 Ti:  NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)
  TITAN RTX:    NVIDIA TITAN RTX 24GB (1350/7000MHz)
  RTX 2080:     Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)
  RTX 2070:     ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)
  RTX 2060:     Device 6GB (1365/7000MHz)
  GTX 1080:     NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)
  GTX 1060:     NVIDIA GeForce GTX 1060 6GB (1506/4006MHz)
  GTX 1070:     NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)
  GTX 1070 Ti:  Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz)
  GTX 1080 Ti:  NVIDIA GeForce GTX 1080 Ti 11GB (1480/5508MHz)
  GTX 980 Ti:   NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz)
  GTX 980:      NVIDIA GeForce GTX 980 4GB (1126/3505MHz)

Other values that change, shown at the run where the export first lists them

  RX 580:       Display Driver: modesetting 1.20.1
  RX 590:       Kernel: 4.20.0-042000-generic (x86_64)
  RTX 2080 Ti:  Display Driver: NVIDIA 415.25; OpenGL: 4.6.0; Vulkan: 1.1.84
  TITAN RTX:    Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

Graphics Details
- RX Vega 56, RX Vega 64, RX 580, RX 590: GLAMOR

Python Details
- Python 2.7.15+ + Python 3.6.7

Security Details
- RX Vega 56, RX Vega 64, RX 580: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp
- RX 590, RTX 2080 Ti, TITAN RTX, RTX 2080, RTX 2070, RTX 2060, GTX 1080, GTX 1060, GTX 1070, GTX 1070 Ti, GTX 1080 Ti, GTX 980 Ti, GTX 980: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

OpenCL Details (GPU Compute Cores)
- RTX 2080 Ti: 4352
- TITAN RTX: 4608
- RTX 2080: 2944
- RTX 2070: 2304
- RTX 2060: 1920
- GTX 1080: 2560
- GTX 1060: 1280
- GTX 1070: 1920
- GTX 1070 Ti: 2432
- GTX 1080 Ti: 3584
- GTX 980 Ti: 2816
- GTX 980: 2048

Result overview

Tests run (FP16 - Mode - Network - Device):
- No - Training - VGG16 - OpenCL
- No - Training - VGG19 - OpenCL
- No - Inference - VGG19 - OpenCL
- Yes - Inference - VGG19 - OpenCL
- No - Inference - VGG16 - OpenCL
- Yes - Inference - VGG16 - OpenCL
- No - Inference - ResNet 50 - OpenCL
- Yes - Inference - ResNet 50 - OpenCL
- No - Inference - IMDB LSTM - OpenCL
- No - Training - Mobilenet - OpenCL
- Yes - Inference - NASNet Large - OpenCL
- No - Inference - NASNet Large - OpenCL
- No - Inference - Inception V3 - OpenCL
- No - Training - Inception V3 - OpenCL
- No - Inference - DenseNet 201 - OpenCL
- No - Inference - Mobilenet - OpenCL
- Yes - Inference - DenseNet 201 - OpenCL
- Yes - Inference - Inception V3 - OpenCL
- No - Training - IMDB LSTM - OpenCL
- Yes - Inference - Mobilenet - OpenCL

Graphics cards tested: RX Vega 56, RX Vega 64, RX 580, RX 590, RTX 2080 Ti, TITAN RTX, RTX 2080, RTX 2070, RTX 2060, GTX 1080, GTX 1060, GTX 1070, GTX 1070 Ti, GTX 1080 Ti, GTX 980 Ti, GTX 980.

Results for each test are listed per graphics card in the sections that follow.

PlaidML

FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      28.78      0.02    3
RX Vega 64      30.85      0.13    3
RX 580          10.42      0.00    3
RX 590           9.52      0.01    3
RTX 2080 Ti     85.45      0.13    3
TITAN RTX       90.84      0.18    3
RTX 2080        67.59      0.26    3
RTX 2070        63.71      0.21    3
GTX 1080        50.09      0.04    3
GTX 1060        34.07      0.04    3
GTX 1070        45.97      0.06    3
GTX 1070 Ti     46.73      0.05    3
GTX 1080 Ti     66.55      0.15    3
GTX 980 Ti      40.14      0.28    3
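Each result is reported with an "SE +/-" figure and a run count N. As a minimal sketch of what such a figure could represent, the Python below computes a standard error of the mean as the sample standard deviation divided by the square root of N. That formula is an assumption: this export does not state how the Phoronix Test Suite derives the value. The function name `standard_error` and the per-run numbers are hypothetical and are not taken from the results.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run results in examples per second (illustrative only).
runs = [28.75, 28.78, 28.81]
mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} examples/s, SE +/- {se:.2f}, N = {len(runs)}")
```

A small SE relative to the mean suggests the runs agreed closely. The larger N values that appear for some results (for example N = 12) indicate that more runs were recorded for that configuration.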

PlaidML

FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      25.66      0.19    3
RX Vega 64      28.15      0.02    3
RX 580           9.67      0.00    3
RX 590           8.92      0.01    3
RTX 2080 Ti     74.75      0.09    3
TITAN RTX       79.58      0.16    3
RTX 2080        58.55      0.07    3
RTX 2070        54.76      0.12    3
GTX 1080        42.92      0.08    3
GTX 1070        39.22      0.07    3
GTX 1070 Ti     40.33      0.03    3
GTX 1080 Ti     57.60      0.07    3

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      94.41      1.43    4
RX Vega 64     107.32      0.89    3
RX 580          55.26      0.06    3
RX 590          58.31      0.34    3
RTX 2080 Ti    221.88      0.15    3
TITAN RTX      237.71      0.16    3
RTX 2080       168.53      0.12    3
RTX 2070       155.08      0.13    3
RTX 2060       124.18      0.08    3
GTX 1080       117.24      0.03    3
GTX 1060        78.13      0.01    3
GTX 1070       106.35      0.00    3
GTX 1070 Ti    109.40      0.03    3
GTX 1080 Ti    156.28      0.11    3
GTX 980 Ti     101.40      0.19    3
GTX 980         80.37      0.07    3

PlaidML

FP16: Yes - Mode: Inference - Network: VGG19 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      91.32      0.06    3
RX Vega 64     103.93      0.71    3
RX 580          51.59      0.06    3
RX 590          53.85      0.38    3
RTX 2080 Ti    207.30      0.21    3
TITAN RTX      220.89      0.04    3
RTX 2080       161.16      0.17    3
RTX 2070       148.93      0.15    3
RTX 2060       121.98      1.40    3
GTX 1080       115.90      0.07    3
GTX 1060        79.29      0.06    3
GTX 1070       105.77      0.07    3
GTX 1070 Ti    108.85      0.03    3
GTX 1080 Ti    159.02      0.10    3
GTX 980 Ti     104.74      0.18    3
GTX 980         83.19      0.01    3

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56     109.00      0.77    3
RX Vega 64     123.00      1.73    3
RX 580          64.55      0.03    3
RX 590          67.71      0.51    3
RTX 2080 Ti    258.50      0.25    3
TITAN RTX      274.89      1.45    3
RTX 2080       197.52      0.14    3
RTX 2070       183.95      0.03    3
RTX 2060       147.16      0.18    3
GTX 1080       138.78      0.09    3
GTX 1060        93.00      0.06    3
GTX 1070       125.37      0.40    3
GTX 1070 Ti    128.85      0.06    3
GTX 1080 Ti    181.25      0.45    3
GTX 980 Ti     119.20      0.14    3
GTX 980         94.84      0.06    3

PlaidML

FP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56     107.00      0.06    3
RX Vega 64     120.00      0.39    3
RX 580          58.92      0.06    3
RX 590          61.65      0.27    3
RTX 2080 Ti    236.13      1.48    3
TITAN RTX      250.30      0.14    3
RTX 2080       185.76      0.06    3
RTX 2070       172.84      0.05    3
RTX 2060       143.80      0.02    3
GTX 1080       134.68      0.05    3
GTX 1060        92.19      0.82    3
GTX 1070       122.54      0.04    3
GTX 1070 Ti    125.60      0.36    3
GTX 1080 Ti    180.71      0.26    3
GTX 980 Ti     121.68      0.07    3
GTX 980         96.82      0.00    3

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        213      1.86    3
RX Vega 64        226      4.02    3
RX 580            131      0.49    3
RX 590            139      0.64    3
RTX 2080 Ti       478      4.66    3
TITAN RTX         499      0.22    3
RTX 2080          373      0.08    3
RTX 2070          351      0.35    3
RTX 2060          293      0.35    3
GTX 1080          275      0.28    3
GTX 1060          187      0.15    3
GTX 1070          241      1.62    3
GTX 1070 Ti       249      0.51    3
GTX 1080 Ti       315      0.05    3
GTX 980 Ti        199      0.34    3
GTX 980           172      0.53    3

PlaidML

FP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        192      0.02    3
RX Vega 64        212      0.88    3
RX 580            121      0.32    3
RX 590            129      0.04    3
RTX 2080 Ti       417      0.30    3
TITAN RTX         428      0.23    3
RTX 2080          330      0.43    3
RTX 2070          311      0.17    3
RTX 2060          259      6.28   11
GTX 1080          248      0.31    3
GTX 1060          173      2.69    3
GTX 1070          221      0.06    3
GTX 1070 Ti       226      0.17    3
GTX 1080 Ti       311      0.73    3
GTX 980 Ti        199      0.23    3
GTX 980           175      0.19    3

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        240      0.14    3
RX Vega 64        240      0.02    3
RX 580            182      0.68    3
RX 590            205      1.28    3
RTX 2080 Ti       525      1.04    3
TITAN RTX         535      0.33    3
RTX 2080          418      1.05    3
RTX 2070          356      1.24    3
RTX 2060          324      0.23    3
GTX 1080          247      0.99    3
GTX 1060          175      0.26    3
GTX 1070          218      0.54    3
GTX 1070 Ti       215      0.09    3
GTX 1080 Ti       298      0.26    3
GTX 980 Ti        174      0.33    3
GTX 980           155      0.37    3

PlaidML

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      71.29      0.15    3
RX Vega 64      75.23      0.13    3
RX 580          41.64      0.03    3
RX 590          45.21      0.03    3
RTX 2080 Ti    138.11      0.32    3
TITAN RTX      142.36      0.40    3
RTX 2080       120.24      0.24    3
RTX 2070       119.84      0.13    3
RTX 2060       103.30      0.12    3
GTX 1080        94.47      0.17    3
GTX 1060        81.65      0.01    3
GTX 1070        98.45      0.04    3
GTX 1070 Ti     95.52      0.19    3
GTX 1080 Ti    113.55      0.14    3
GTX 980 Ti      83.66      0.19    3
GTX 980         78.47      0.03    3

PlaidML

FP16: Yes - Mode: Inference - Network: NASNet Large - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RTX 2080 Ti     69.70      0.89    3
TITAN RTX       72.61      0.16    3
RTX 2080        51.54      0.06    3
RTX 2070        44.34      0.09    3
RTX 2060        37.60      0.07    3
GTX 1080        37.02      0.10    3
GTX 1060        22.17      0.02    3
GTX 1070        30.20      0.03    3
GTX 1070 Ti     32.65      0.06    3
GTX 1080 Ti     47.63      0.08    3
GTX 980 Ti      26.90      0.04    3
GTX 980         23.22      0.08    3

PlaidML

FP16: No - Mode: Inference - Network: NASNet Large - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      31.08      0.02    3
RX Vega 64      35.89      0.13    3
RX 580          18.08      0.01    3
RX 590          20.01      0.01    3
RTX 2080 Ti     56.77      0.11    3
TITAN RTX       59.13      0.10    3
RTX 2080        41.57      0.06    3
RTX 2070        36.01      0.07    3
RTX 2060        29.42      0.04    3
GTX 1080        30.92      0.02    3
GTX 1060        18.27      0.01    3
GTX 1070        25.31      0.03    3
GTX 1070 Ti     26.57      0.00    3
GTX 1080 Ti     42.75      0.10    3
GTX 980 Ti      24.32      0.03    3
GTX 980         18.89      0.05    3

PlaidML

FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56     104.00      1.56    3
RX Vega 64     118.00      0.77    3
RX 580          66.19      0.95    3
RX 590          71.42      1.29    3
RTX 2080 Ti    197.56      3.98   12
TITAN RTX      211.69      0.31    3
RTX 2080       161.94      0.12    3
RTX 2070       147.39      0.61    3
RTX 2060       124.84      1.89    3
GTX 1080       124.65      1.75   12
GTX 1060        82.03      1.04    7
GTX 1070       108.50      1.68    5
GTX 1070 Ti    113.61      0.05    3
GTX 1080 Ti    158.20      0.42    3
GTX 980 Ti      96.69      1.44    5
GTX 980         81.03      1.08    3

PlaidML

FP16: No - Mode: Training - Network: Inception V3 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result
RX Vega 56      19.86
RX Vega 64      20.98
RX 580          13.15
RX 590          14.59
RTX 2080 Ti     37.67
TITAN RTX       37.41
RTX 2080        32.80
RTX 2070        31.44
GTX 1080        28.44
GTX 1060        21.58
GTX 1070        26.21
GTX 1070 Ti     26.00
GTX 1080 Ti     33.10

SE +/- values in the order the export lists them (12 values for 13 results, so they are not matched to rows; N = 3 for each): 0.01, 0.01, 0.03, 0.04, 0.14, 0.29, 0.07, 0.05, 0.10, 0.02, 0.02, 0.03

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56      95.87      0.02    3
RX Vega 64     106.44      0.03    3
RX 580          62.34      0.08    3
RX 590          67.50      0.02    3
RTX 2080 Ti    150.01      2.09    3
TITAN RTX      161.32      0.42    3
RTX 2080       118.65      0.13    3
RTX 2070       115.65      0.11    3
RTX 2060        95.88      0.04    3
GTX 1080        90.83      0.01    3
GTX 1060        62.27      0.00    3
GTX 1070        83.89      0.01    3
GTX 1070 Ti     83.55      0.04    3
GTX 1080 Ti    119.80      0.13    3
GTX 980 Ti      78.88      0.11    3
GTX 980         63.96      0.04    3

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        531      5.93    3
RX Vega 64        546     11.08   12
RX 580            390      0.73    3
RX 590            424      0.16    3
RTX 2080 Ti       962      9.03    3
TITAN RTX         988      1.87    3
RTX 2080          794      3.51    3
RTX 2070          767      2.10    3
RTX 2060          671      1.48    3
GTX 1080          621      0.30    3
GTX 1060          485      0.64    3
GTX 1070          593      2.92    3
GTX 1070 Ti       594      0.56    3
GTX 1080 Ti       687      4.02    3
GTX 980 Ti        485      2.78    3
GTX 980           458      1.99    3

PlaidML

FP16: Yes - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RTX 2080 Ti    139.00      1.79    7
TITAN RTX      148.00      2.16    3
RTX 2080       111.00      0.10    3
RTX 2070       108.00      0.17    3
RTX 2060        89.02      0.18    3
GTX 1080        85.60      0.02    3
GTX 1060        58.76      0.02    3
GTX 1070        79.08      0.06    3
GTX 1070 Ti     78.45      0.03    3
GTX 1080 Ti    114.97      0.11    3
GTX 980 Ti      76.15      0.09    3
GTX 980         61.38      0.03    3

PlaidML

FP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RTX 2080 Ti    192.00      3.75   12
TITAN RTX      203.00      0.41    3
RTX 2080       153.00      2.75   12
RTX 2070       142.00      2.27   12
RTX 2060       126.00      2.35    3
GTX 1080       130.00      0.17    3
GTX 1060        86.75      0.27    3
GTX 1070       112.31      0.16    3
GTX 1070 Ti    114.33      1.45    7
GTX 1080 Ti    160.18      0.55    3
GTX 980 Ti     104.10      1.04   12
GTX 980         89.26      0.15    3

PlaidML

FP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        164      0.44    3
RX Vega 64        169      0.49    3
RX 580            119      1.28    3
RX 590            133      1.06    3
RTX 2080 Ti       192      1.21    3
TITAN RTX         191      1.98    3
RTX 2080          175      1.03    3
RTX 2070          164      1.14    3
RTX 2060          157      0.59    3
GTX 1080          142      0.45    3
GTX 1060          119      0.30    3
GTX 1070          135      0.09    3
GTX 1070 Ti       133      0.09    3
GTX 1080 Ti       165      0.42    3
GTX 980 Ti        122      0.66    3
GTX 980           108      0.08    3

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

Examples Per Second, More Is Better

GPU            Result    SE +/-    N
RX Vega 56        480      0.30    3
RX Vega 64        481      0.64    3
RX 580            435      1.39    3
RX 590            455      0.50    3
RTX 2080 Ti       695      1.64    3
TITAN RTX         698      5.84    3
RTX 2080          612      6.97   12
RTX 2070          533      6.98    3
RTX 2060          538      7.50    6
GTX 1080          510      4.10    3
GTX 1060          426      5.36    3
GTX 1070          476      1.57    3
GTX 1070 Ti       473      8.00    4
GTX 1080 Ti       555      5.47    3
GTX 980 Ti        421      2.96    3
GTX 980           407      4.97    3


Phoronix Test Suite v10.8.4