PlaidML OpenCL Linux GPU Benchmarks AMD Radeon + NVIDIA GeForce

PlaidML Linux benchmarks for a future article on Phoronix.com by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1901122-PTS-PLAIDMLG16&sor&grr.

PlaidML OpenCL Linux GPU Benchmarks AMD Radeon + NVIDIA GeForceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionVulkanRX 580RX 590RX Vega 56RX Vega 64GTX 980GTX 980 TiGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTXAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1601 BIOS)AMD Family 17h32768MBSamsung SSD 970 EVO 500GBMSI AMD Radeon RX 470/480/570/570X/580/580X 8GB (1366/2000MHz)Realtek ALC1220ASUS VP28UIntel I211 Gigabit + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 18.104.19.0-041900-generic (x86_64)GNOME Shell 3.30.1X Server 1.20.1modesetting 1.20.14.5 Mesa 18.2.2 (LLVM 7.0.0)GCC 8.2.0ext43840x2160Sapphire AMD Radeon RX 470/480/570/570X/580/580X 8GB (1560/2100MHz)4.20.0-042000-generic (x86_64)AMD Radeon RX 64 8GB (1590/800MHz)4.19.0-041900-generic (x86_64)amdgpu 18.1.0AMD Radeon RX 64 8GB (1630/945MHz)NVIDIA GeForce GTX 980 4GB (1126/3505MHz)Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad4.20.0-042000-generic (x86_64)NVIDIA 415.254.6.01.1.84NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz)NVIDIA GeForce GTX 1060 6GB (1506/4006MHz)NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz)NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11GB (1480/5508MHz)Device 6GB (1365/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)Intel I211 Gigabit + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adNVIDIA TITAN RTX 24GB (1350/7000MHz)Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adOpenBenchmarking.orgProcessor Details- Scaling Governor: acpi-cpufreq ondemandGraphics Details- RX 580, RX 590, RX Vega 56, RX Vega 64: GLAMORPython Details- Python 2.7.15+ + Python 3.6.7Security Details- RX 580: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp- RX 590: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- RX Vega 56: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp- RX Vega 64: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp- GTX 980: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 980 Ti: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 1060: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 1070: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 1070 Ti: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 1080: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- GTX 1080 Ti: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- RTX 2060: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- RTX 2070: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- RTX 2080: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- RTX 2080 Ti: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp- TITAN RTX: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccompOpenCL Details- GTX 980: GPU Compute Cores: 2048- GTX 980 Ti: GPU Compute Cores: 2816- GTX 1060: GPU Compute Cores: 1280- GTX 1070: GPU Compute Cores: 1920- GTX 1070 Ti: GPU Compute Cores: 2432- GTX 1080: GPU Compute Cores: 2560- GTX 1080 Ti: GPU Compute Cores: 3584- RTX 2060: GPU Compute Cores: 1920- RTX 2070: GPU Compute Cores: 2304- RTX 2080: GPU Compute Cores: 2944- RTX 2080 Ti: GPU Compute Cores: 4352- TITAN RTX: GPU Compute Cores: 4608

PlaidML OpenCL Linux GPU Benchmarks AMD Radeon + NVIDIA GeForceplaidml: No - Training - Inception V3 - OpenCLplaidml: No - Training - VGG19 - OpenCLplaidml: No - Training - VGG16 - OpenCLplaidml: No - Inference - NASNer Large - OpenCLplaidml: Yes - Inference - NASNer Large - OpenCLplaidml: No - Inference - DenseNet 201 - OpenCLplaidml: Yes - Inference - DenseNet 201 - OpenCLplaidml: Yes - Inference - Inception V3 - OpenCLplaidml: No - Inference - Inception V3 - OpenCLplaidml: No - Training - Mobilenet - OpenCLplaidml: Yes - Inference - VGG19 - OpenCLplaidml: No - Inference - VGG19 - OpenCLplaidml: Yes - Inference - VGG16 - OpenCLplaidml: No - Inference - VGG16 - OpenCLplaidml: No - Training - IMDB LSTM - OpenCLplaidml: Yes - Inference - ResNet 50 - OpenCLplaidml: No - Inference - ResNet 50 - OpenCLplaidml: No - Inference - IMDB LSTM - OpenCLplaidml: Yes - Inference - Mobilenet - OpenCLplaidml: No - Inference - Mobilenet - OpenCLRX 580RX 590RX Vega 56RX Vega 64GTX 980GTX 980 TiGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX13.159.6710.4218.0862.3466.1941.6451.5955.2658.9264.5511912113118243539014.598.929.5220.0167.5071.4245.2153.8558.3161.6567.7113312913920545542419.8625.6628.7831.0895.8710471.2991.3294.4110710916419221324048053120.9828.1530.8535.89106.4411875.23103.93107.3212012316921222624048154618.8923.2263.9661.3889.2681.0378.4783.1980.3796.8294.8410817517215540745840.1424.3226.9078.8876.15104.1096.6983.66104.74101.40121.68119.2012219919917442148521.5834.0718.2722.1762.2758.7686.7582.0381.6579.2978.1392.1993.0011917318717542648526.2139.2245.9725.3130.2083.8979.08112.31108.5098.45105.77106.35122.54125.3713522124121847659326.0040.3346.7326.5732.6583.5578.45114.33113.6195.52108.85109.40125.60128.8513322624921547359428.4442.9250.0930.9237.0290.8385.60130124.6594.47115.90117.24134.68138.7814224827524751062133.1057.6066.5542.7547.63119.80114.97160.18158.20113.55159.02156.28180.71181.2516531131529855568729.4237.6095.8889.02126124.84103.30121.98124.18143.80147.1615725929332453867131.4454.7663.7136.0144.34115.65108142147.39119.84148.93155.08172.84183.9516431135135653376732.8058.5567.5941.5751.54118.65111153161.94120.24161.16168.53185.76197.5217533037341861279437.6774.7585.4556.7769.70150.01139192197.56138.11207.30221.88236.13258.5019241747852569596237.4179.5890.8459.1372.61161.32148203211.69142.36220.89237.71250.30274.89191428499535698988OpenBenchmarking.org

PlaidML

FP16: No - Mode: Training - Network: Inception V3 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Inception V3 - Device: OpenCLRTX 2080 TiTITAN RTXGTX 1080 TiRTX 2080RTX 2070GTX 1080GTX 1070GTX 1070 TiGTX 1060RX Vega 64RX Vega 56RX 590RX 580918273645SE +/- 0.14, N = 3SE +/- 0.29, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 337.6737.4133.1032.8031.4428.4426.2126.0021.5820.9819.8614.5913.15

PlaidML

FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: VGG19 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070GTX 1080GTX 1070 TiGTX 1070RX Vega 64RX Vega 56RX 580RX 59020406080100SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 379.5874.7558.5557.6054.7642.9240.3339.2228.1525.669.678.92

PlaidML

FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: VGG16 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070GTX 1080GTX 1070 TiGTX 1070GTX 980 TiGTX 1060RX Vega 64RX Vega 56RX 580RX 59020406080100SE +/- 0.18, N = 3SE +/- 0.13, N = 3SE +/- 0.26, N = 3SE +/- 0.15, N = 3SE +/- 0.21, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 390.8485.4567.5966.5563.7150.0946.7345.9740.1434.0730.8528.7810.429.52

PlaidML

FP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: NASNer Large - Device: OpenCLTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RX Vega 64RX Vega 56GTX 1080RTX 2060GTX 1070 TiGTX 1070GTX 980 TiRX 590GTX 980GTX 1060RX 5801326395265SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 359.1356.7742.7541.5736.0135.8931.0830.9229.4226.5725.3124.3220.0118.8918.2718.08

PlaidML

FP16: Yes - Mode: Inference - Network: NASNer Large - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: NASNer Large - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070RTX 2060GTX 1080GTX 1070 TiGTX 1070GTX 980 TiGTX 980GTX 10601632486480SE +/- 0.16, N = 3SE +/- 0.89, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 372.6169.7051.5447.6344.3437.6037.0232.6530.2026.9023.2222.17

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCLTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RX Vega 64RTX 2060RX Vega 56GTX 1080GTX 1070GTX 1070 TiGTX 980 TiRX 590GTX 980RX 580GTX 10604080120160200SE +/- 0.42, N = 3SE +/- 2.09, N = 3SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 3161.32150.01119.80118.65115.65106.4495.8895.8790.8383.8983.5578.8867.5063.9662.3462.27

PlaidML

FP16: Yes - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: DenseNet 201 - Device: OpenCLTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1080GTX 1070GTX 1070 TiGTX 980 TiGTX 980GTX 1060306090120150SE +/- 2.16, N = 3SE +/- 1.79, N = 7SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3148.00139.00114.97111.00108.0089.0285.6079.0878.4576.1561.3858.76

PlaidML

FP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070GTX 1080RTX 2060GTX 1070 TiGTX 1070GTX 980 TiGTX 980GTX 10604080120160200SE +/- 0.41, N = 3SE +/- 3.75, N = 12SE +/- 0.55, N = 3SE +/- 2.75, N = 12SE +/- 2.27, N = 12SE +/- 0.17, N = 3SE +/- 2.35, N = 3SE +/- 1.45, N = 7SE +/- 0.16, N = 3SE +/- 1.04, N = 12SE +/- 0.15, N = 3SE +/- 0.27, N = 3203.00192.00160.18153.00142.00130.00126.00114.33112.31104.1089.2686.75

PlaidML

FP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070RTX 2060GTX 1080RX Vega 64GTX 1070 TiGTX 1070RX Vega 56GTX 980 TiGTX 1060GTX 980RX 590RX 58050100150200250SE +/- 0.31, N = 3SE +/- 3.98, N = 12SE +/- 0.12, N = 3SE +/- 0.42, N = 3SE +/- 0.61, N = 3SE +/- 1.89, N = 3SE +/- 1.75, N = 12SE +/- 0.77, N = 3SE +/- 0.05, N = 3SE +/- 1.68, N = 5SE +/- 1.56, N = 3SE +/- 1.44, N = 5SE +/- 1.04, N = 7SE +/- 1.08, N = 3SE +/- 1.29, N = 3SE +/- 0.95, N = 3211.69197.56161.94158.20147.39124.84124.65118.00113.61108.50104.0096.6982.0381.0371.4266.19

PlaidML

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080 TiRTX 2060GTX 1070GTX 1070 TiGTX 1080GTX 980 TiGTX 1060GTX 980RX Vega 64RX Vega 56RX 590RX 580306090120150SE +/- 0.40, N = 3SE +/- 0.32, N = 3SE +/- 0.24, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.17, N = 3SE +/- 0.19, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3142.36138.11120.24119.84113.55103.3098.4595.5294.4783.6681.6578.4775.2371.2945.2141.64

PlaidML

FP16: Yes - Mode: Inference - Network: VGG19 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG19 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070RTX 2060GTX 1080GTX 1070 TiGTX 1070GTX 980 TiRX Vega 64RX Vega 56GTX 980GTX 1060RX 590RX 58050100150200250SE +/- 0.04, N = 3SE +/- 0.21, N = 3SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 0.15, N = 3SE +/- 1.40, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.71, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.38, N = 3SE +/- 0.06, N = 3220.89207.30161.16159.02148.93121.98115.90108.85105.77104.74103.9391.3283.1979.2953.8551.59

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070RTX 2060GTX 1080GTX 1070 TiRX Vega 64GTX 1070GTX 980 TiRX Vega 56GTX 980GTX 1060RX 590RX 58050100150200250SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.89, N = 3SE +/- 0.00, N = 3SE +/- 0.19, N = 3SE +/- 1.43, N = 4SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.34, N = 3SE +/- 0.06, N = 3237.71221.88168.53156.28155.08124.18117.24109.40107.32106.35101.4094.4180.3778.1358.3155.26

PlaidML

FP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2070RTX 2060GTX 1080GTX 1070 TiGTX 1070GTX 980 TiRX Vega 64RX Vega 56GTX 980GTX 1060RX 590RX 58050100150200250SE +/- 0.14, N = 3SE +/- 1.48, N = 3SE +/- 0.06, N = 3SE +/- 0.26, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.36, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.39, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.82, N = 3SE +/- 0.27, N = 3SE +/- 0.06, N = 3250.30236.13185.76180.71172.84143.80134.68125.60122.54121.68120.00107.0096.8292.1961.6558.92

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080 TiRTX 2060GTX 1080GTX 1070 TiGTX 1070RX Vega 64GTX 980 TiRX Vega 56GTX 980GTX 1060RX 590RX 58060120180240300SE +/- 1.45, N = 3SE +/- 0.25, N = 3SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.45, N = 3SE +/- 0.18, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.40, N = 3SE +/- 1.73, N = 3SE +/- 0.14, N = 3SE +/- 0.77, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.51, N = 3SE +/- 0.03, N = 3274.89258.50197.52183.95181.25147.16138.78128.85125.37123.00119.20109.0094.8493.0067.7164.55

PlaidML

FP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLRTX 2080 TiTITAN RTXRTX 2080RX Vega 64GTX 1080 TiRTX 2070RX Vega 56RTX 2060GTX 1080GTX 1070GTX 1070 TiRX 590GTX 980 TiGTX 1060RX 580GTX 9804080120160200SE +/- 1.21, N = 3SE +/- 1.98, N = 3SE +/- 1.03, N = 3SE +/- 0.49, N = 3SE +/- 0.42, N = 3SE +/- 1.14, N = 3SE +/- 0.44, N = 3SE +/- 0.59, N = 3SE +/- 0.45, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 1.06, N = 3SE +/- 0.66, N = 3SE +/- 0.30, N = 3SE +/- 1.28, N = 3SE +/- 0.08, N = 3192191175169165164164157142135133133122119119108

PlaidML

FP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080 TiRTX 2060GTX 1080GTX 1070 TiGTX 1070RX Vega 64GTX 980 TiRX Vega 56GTX 980GTX 1060RX 590RX 58090180270360450SE +/- 0.23, N = 3SE +/- 0.30, N = 3SE +/- 0.43, N = 3SE +/- 0.17, N = 3SE +/- 0.73, N = 3SE +/- 6.28, N = 11SE +/- 0.31, N = 3SE +/- 0.17, N = 3SE +/- 0.06, N = 3SE +/- 0.88, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 2.69, N = 3SE +/- 0.04, N = 3SE +/- 0.32, N = 3428417330311311259248226221212199192175173129121

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080 TiRTX 2060GTX 1080GTX 1070 TiGTX 1070RX Vega 64RX Vega 56GTX 980 TiGTX 1060GTX 980RX 590RX 580110220330440550SE +/- 0.22, N = 3SE +/- 4.66, N = 3SE +/- 0.08, N = 3SE +/- 0.35, N = 3SE +/- 0.05, N = 3SE +/- 0.35, N = 3SE +/- 0.28, N = 3SE +/- 0.51, N = 3SE +/- 1.62, N = 3SE +/- 4.02, N = 3SE +/- 1.86, N = 3SE +/- 0.34, N = 3SE +/- 0.15, N = 3SE +/- 0.53, N = 3SE +/- 0.64, N = 3SE +/- 0.49, N = 3499478373351315293275249241226213199187172139131

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070RTX 2060GTX 1080 TiGTX 1080RX Vega 64RX Vega 56GTX 1070GTX 1070 TiRX 590RX 580GTX 1060GTX 980 TiGTX 980120240360480600SE +/- 0.33, N = 3SE +/- 1.04, N = 3SE +/- 1.05, N = 3SE +/- 1.24, N = 3SE +/- 0.23, N = 3SE +/- 0.26, N = 3SE +/- 0.99, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.54, N = 3SE +/- 0.09, N = 3SE +/- 1.28, N = 3SE +/- 0.68, N = 3SE +/- 0.26, N = 3SE +/- 0.33, N = 3SE +/- 0.37, N = 3535525418356324298247240240218215205182175174155

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2060RTX 2070GTX 1080RX Vega 64RX Vega 56GTX 1070GTX 1070 TiRX 590RX 580GTX 1060GTX 980 TiGTX 980150300450600750SE +/- 5.84, N = 3SE +/- 1.64, N = 3SE +/- 6.97, N = 12SE +/- 5.47, N = 3SE +/- 7.50, N = 6SE +/- 6.98, N = 3SE +/- 4.10, N = 3SE +/- 0.64, N = 3SE +/- 0.30, N = 3SE +/- 1.57, N = 3SE +/- 8.00, N = 4SE +/- 0.50, N = 3SE +/- 1.39, N = 3SE +/- 5.36, N = 3SE +/- 2.96, N = 3SE +/- 4.97, N = 3698695612555538533510481480476473455435426421407

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080 TiRTX 2060GTX 1080GTX 1070 TiGTX 1070RX Vega 64RX Vega 56GTX 1060GTX 980 TiGTX 980RX 590RX 5802004006008001000SE +/- 1.87, N = 3SE +/- 9.03, N = 3SE +/- 3.51, N = 3SE +/- 2.10, N = 3SE +/- 4.02, N = 3SE +/- 1.48, N = 3SE +/- 0.30, N = 3SE +/- 0.56, N = 3SE +/- 2.92, N = 3SE +/- 11.08, N = 12SE +/- 5.93, N = 3SE +/- 0.64, N = 3SE +/- 2.78, N = 3SE +/- 1.99, N = 3SE +/- 0.16, N = 3SE +/- 0.73, N = 3988962794767687671621594593546531485485458424390


Phoronix Test Suite v10.8.4