NVIDIA Linux GPU Compute

NVIDIA Linux GPU computing benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1901274-SP-NVIDIACOM53
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 5 Tests
Creator Workloads 2 Tests
HPC - High Performance Computing 4 Tests
Machine Learning 3 Tests
Multi-Core 2 Tests
NVIDIA GPU Compute 9 Tests
OpenCL 7 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GTX 1060
January 27 2019
  1 Hour, 42 Minutes
GTX 1070
January 25 2019
  1 Hour, 35 Minutes
GTX 1070 Ti
January 25 2019
  1 Hour, 39 Minutes
GTX 1080
January 25 2019
  1 Hour, 24 Minutes
GTX 1080 Ti
January 25 2019
  1 Hour, 31 Minutes
RTX 2060
January 27 2019
  1 Hour, 40 Minutes
RTX 2070
January 27 2019
  1 Hour, 39 Minutes
RTX 2080
January 26 2019
  1 Hour, 38 Minutes
RTX 2080 Ti
January 26 2019
  1 Hour, 34 Minutes
TITAN RTX
January 26 2019
  1 Hour, 35 Minutes
Invert Hiding All Results Option
  1 Hour, 36 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA Linux GPU ComputeOpenBenchmarking.orgPhoronix Test SuiteIntel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads)ASUS PRIME Z390-A (0602 BIOS)Intel Cannon Lake PCH Shared SRAM16384MBSamsung SSD 970 EVO 250GB + 2000GB SABRENTNVIDIA GeForce GTX 1060 6GB (1506/4006MHz)NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz)NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11GB (1480/5508MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TITAN RTX 24GB (1350/7000MHz)Realtek ALC1220Acer B286HKIntel I219-VUbuntu 18.104.20.3-042003-generic (x86_64)GNOME Shell 3.30.1X Server 1.20.1NVIDIA 415.274.6.0OpenCL 1.2 CUDA 10.0.1321.1.84GCC 8.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionNVIDIA Linux GPU Compute BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance- GTX 1060: GPU Compute Cores: 1280- GTX 1070: GPU Compute Cores: 1920- GTX 1070 Ti: GPU Compute Cores: 2432- GTX 1080: GPU Compute Cores: 2560- GTX 1080 Ti: GPU Compute Cores: 3584- RTX 2060: GPU Compute Cores: 1920- RTX 2070: GPU Compute Cores: 2304- RTX 2080: GPU Compute Cores: 2944- RTX 2080 Ti: GPU Compute Cores: 4352- TITAN RTX: GPU Compute Cores: 4608- Python 2.7.15+ + Python 3.6.7- __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp

GTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTXResult OverviewPhoronix Test Suite100%173%246%319%391%LuxMarkLeelaChessZerocl-memSHOC Scalable HeterOgeneous ComputingclpeakPlaidMLRodiniaNAMD CUDAJuliaGPUDarktable

GTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTXPer Watt Result OverviewPhoronix Test Suite100%168%235%303%LeelaChessZeroMeta Performance Per Wattcl-memPlaidMLLuxMarkSHOC Scalable HeterOgeneous ComputingJuliaGPUP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

NVIDIA Linux GPU Computeshoc: OpenCL - Max SP Flopsoctanebench: Total Scoreluxmark: GPU - Microphoneluxmark: GPU - Hotelluxmark: GPU - Luxball HDRv-ray: CUDA GPUrodinia: OpenCL Myocyteplaidml: No - Training - Mobilenet - OpenCLplaidml: Yes - Inference - VGG16 - OpenCLclpeak: Double-Precision Doubleplaidml: Yes - Inference - Inception V3 - OpenCLplaidml: No - Inference - Inception V3 - OpenCLplaidml: No - Training - IMDB LSTM - OpenCLshoc: OpenCL - Texture Read Bandwidthplaidml: No - Inference - VGG16 - OpenCLplaidml: Yes - Inference - ResNet 50 - OpenCLlczero: OpenCLnamd-cuda: ATPase Simulation - 327,506 Atomsplaidml: No - Inference - ResNet 50 - OpenCLclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferplaidml: No - Inference - IMDB LSTM - OpenCLrodinia: OpenCL Particle Filterjuliagpu: GPUcl-mem: Readcl-mem: Copycl-mem: Writeplaidml: Yes - Inference - Mobilenet - OpenCLdarktable: Masskrug - OpenCLplaidml: No - Inference - Mobilenet - OpenCLdarktable: Boat - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - FFT SPclpeak: Integer Compute INTshoc: OpenCL - Triadclpeak: Global Memory Bandwidthshoc: OpenCL - MD5 Hashclpeak: Single-Precision Floatdarktable: Server Rack - OpenCLclpeak: Kernel LatencyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX479491.546962266212256129.6635.5486.9494.6215188.8586.1113441194.081838990.3227119411.1512.5418012.001837826261531381395524.175923.661.31297125211.931457.2842390.133.787101133997238721728793.6635.4410712622411711615644712923714300.2422825711.3312.602288.252181720622051871926643.927642.891.10453168112.1919610.4758780.123.6477091411017541671677686.7139.6410312824212011815343313123714780.2302226211.1512.552247.852215480162051871906454.017552.931.16501206812.1919711.6367760.133.929400872238021380335.6610313929713213316752714326417500.2123329211.3412.592596.502391313242292092166863.978012.721.10575239812.3022214.1979340.133.77132382111358456682168266.5636.3112519441516716718759618933423330.2011333911.2912.573304.962651628753373173367743.878832.261.03972326312.5132919.68108530.123.9973401641373447662135693.3131.98113148231134133188102415028214840.2054230911.2912.603478.802516119892962472457793.828632.220.82803670512.4327615.8766300.113.6585292041807461472953266.3932.71132179266157156198107718933317290.1974637611.2512.483827.882656820463923303138303.6810081.930.75988778012.5736518.4279680.103.64109822181976665522914672.7432.47133192343168172216112120335422590.2004440011.1312.524576.282788292383923273238933.7810451.920.811083995212.5736823.74102700.113.65165803042861791844278753.0431.96156250519211225246117327345831070.1917753711.2912.596054.363009222415444544399713.6912911.640.7514451490612.6850535.13153520.103.83173323193042697294602253.5132.72160264540215229244115828947432750.1973555611.1612.586104.393036654475654834899773.6913361.640.7415531484812.6852536.28163860.103.85OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX4K8K12K16K20KSE +/- 4.35, N = 3SE +/- 23.18, N = 3SE +/- 38.15, N = 3SE +/- 1.25, N = 3SE +/- 45.89, N = 3SE +/- 38.59, N = 3SE +/- 58.57, N = 3SE +/- 27.16, N = 3SE +/- 89.12, N = 3SE +/- 91.92, N = 3479471017340770985299400109821323816580173321. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX3K6K9K12K15KMin: 4789.76 / Avg: 4794.41 / Max: 4803.11Min: 7059.2 / Avg: 7101.2 / Max: 7139.19Min: 7301.84 / Avg: 7340.04 / Max: 7416.33Min: 7707.02 / Avg: 7709.1 / Max: 7711.34Min: 8482.7 / Avg: 8528.62 / Max: 8620.4Min: 9358.21 / Avg: 9399.5 / Max: 9476.62Min: 10923.4 / Avg: 10981.97 / Max: 11099.1Min: 13209.2 / Avg: 13237.6 / Max: 13291.9Min: 16488.7 / Avg: 16579.67 / Max: 16757.9Min: 17240.2 / Avg: 17332.17 / Max: 175161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00Total ScoreGTX 1060GTX 1070GTX 1070 TiRTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX7014021028035091.54133.00141.00164.00204.00211.00218.00304.00319.00

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1080GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX7K14K21K28K35KSE +/- 1.53, N = 3SE +/- 10.84, N = 3SE +/- 1.20, N = 3SE +/- 5.00, N = 3SE +/- 5.67, N = 3SE +/- 57.27, N = 3SE +/- 4.62, N = 3SE +/- 24.50, N = 3SE +/- 127.33, N = 3SE +/- 146.65, N = 369628722997210175135841373418074197662861730426
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1080GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX5K10K15K20K25KMin: 6959 / Avg: 6962 / Max: 6964Min: 8711 / Avg: 8722.33 / Max: 8744Min: 9970 / Avg: 9971.67 / Max: 9974Min: 10165 / Avg: 10175 / Max: 10180Min: 13573 / Avg: 13584.33 / Max: 13590Min: 13649 / Avg: 13734 / Max: 13843Min: 18066 / Avg: 18074 / Max: 18082Min: 19717 / Avg: 19766 / Max: 19791Min: 28490 / Avg: 28617.33 / Max: 28872Min: 30135 / Avg: 30425.67 / Max: 30605

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1080GTX 1070GTX 1070 TiRTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX2K4K6K8K10KSE +/- 10.90, N = 3SE +/- 3.18, N = 3SE +/- 4.33, N = 3SE +/- 0.67, N = 3SE +/- 4.51, N = 3SE +/- 29.33, N = 3SE +/- 1.00, N = 3SE +/- 12.33, N = 3SE +/- 1.45, N = 3SE +/- 6.69, N = 32662380238724167476656686147655291849729
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1080GTX 1070GTX 1070 TiRTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX2K4K6K8K10KMin: 2641 / Avg: 2661.67 / Max: 2678Min: 3796 / Avg: 3802.33 / Max: 3806Min: 3868 / Avg: 3872.33 / Max: 3881Min: 4166 / Avg: 4167.33 / Max: 4168Min: 4761 / Avg: 4766 / Max: 4775Min: 5609 / Avg: 5667.67 / Max: 5697Min: 6145 / Avg: 6147 / Max: 6148Min: 6527 / Avg: 6551.67 / Max: 6564Min: 9181 / Avg: 9183.67 / Max: 9186Min: 9718 / Avg: 9728.67 / Max: 9741

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1060GTX 1080GTX 1070 TiGTX 1070RTX 2060GTX 1080 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTX10K20K30K40K50KSE +/- 46.58, N = 3SE +/- 42.00, N = 3SE +/- 14.90, N = 3SE +/- 1.33, N = 3SE +/- 31.58, N = 3SE +/- 8.33, N = 3SE +/- 25.12, N = 3SE +/- 107.00, N = 3SE +/- 155.67, N = 3SE +/- 103.04, N = 312256138031677617287213562168229146295324278746022
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1060GTX 1080GTX 1070 TiGTX 1070RTX 2060GTX 1080 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTX8K16K24K32K40KMin: 12182 / Avg: 12256 / Max: 12342Min: 13761 / Avg: 13803 / Max: 13887Min: 16759 / Avg: 16776.33 / Max: 16806Min: 17286 / Avg: 17287.33 / Max: 17290Min: 21297 / Avg: 21356 / Max: 21405Min: 21674 / Avg: 21682.33 / Max: 21699Min: 29102 / Avg: 29145.67 / Max: 29189Min: 29424 / Avg: 29532 / Max: 29746Min: 42607 / Avg: 42787 / Max: 43097Min: 45879 / Avg: 46022 / Max: 46222

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2080GTX 1080 TiRTX 2070TITAN RTXRTX 2080 Ti306090120150SE +/- 0.04, N = 3SE +/- 2.94, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 3.00, N = 3SE +/- 0.03, N = 3129.6693.6693.3186.7172.7466.5666.3953.5153.04
OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2080GTX 1080 TiRTX 2070TITAN RTXRTX 2080 Ti20406080100Min: 129.58 / Avg: 129.66 / Max: 129.72Min: 90.61 / Avg: 93.66 / Max: 99.53Min: 93.24 / Avg: 93.31 / Max: 93.38Min: 86.69 / Avg: 86.71 / Max: 86.74Min: 72.73 / Avg: 72.74 / Max: 72.76Min: 66.51 / Avg: 66.56 / Max: 66.6Min: 66.37 / Avg: 66.39 / Max: 66.42Min: 50.49 / Avg: 53.51 / Max: 59.51Min: 53 / Avg: 53.04 / Max: 53.1

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGTX 1070 TiGTX 1080 TiGTX 1080GTX 1060GTX 1070TITAN RTXRTX 2070RTX 2080RTX 2060RTX 2080 Ti918273645SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 339.6436.3135.6635.5435.4432.7232.7132.4731.9831.961. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGTX 1070 TiGTX 1080 TiGTX 1080GTX 1060GTX 1070TITAN RTXRTX 2070RTX 2080RTX 2060RTX 2080 Ti816243240Min: 39.57 / Avg: 39.64 / Max: 39.68Min: 36.28 / Avg: 36.31 / Max: 36.35Min: 35.37 / Avg: 35.66 / Max: 36.02Min: 35.41 / Avg: 35.54 / Max: 35.66Min: 35.36 / Avg: 35.44 / Max: 35.54Min: 32.52 / Avg: 32.72 / Max: 33.1Min: 32.66 / Avg: 32.71 / Max: 32.78Min: 32.32 / Avg: 32.47 / Max: 32.54Min: 31.87 / Avg: 31.98 / Max: 32.07Min: 31.84 / Avg: 31.96 / Max: 32.091. (CXX) g++ options: -O2 -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.15, N = 3SE +/- 0.11, N = 3SE +/- 0.18, N = 3SE +/- 0.18, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 3SE +/- 0.19, N = 386.94103.00103.00107.00113.00125.00132.00133.00156.00160.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX306090120150Min: 86.89 / Avg: 86.94 / Max: 86.97Min: 102.73 / Avg: 102.78 / Max: 102.84Min: 102.56 / Avg: 102.63 / Max: 102.77Min: 107.12 / Avg: 107.4 / Max: 107.66Min: 112.47 / Avg: 112.61 / Max: 112.83Min: 125.21 / Avg: 125.45 / Max: 125.8Min: 131.66 / Avg: 131.97 / Max: 132.29Min: 132.99 / Avg: 133.25 / Max: 133.68Min: 155.76 / Avg: 155.89 / Max: 156.08Min: 159.74 / Avg: 160.07 / Max: 160.41

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX60120180240300SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 3SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.32, N = 3SE +/- 0.20, N = 3SE +/- 0.43, N = 394.62126.00128.00139.00148.00179.00192.00194.00250.00264.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX50100150200250Min: 94.58 / Avg: 94.62 / Max: 94.68Min: 126.33 / Avg: 126.49 / Max: 126.72Min: 128.34 / Avg: 128.4 / Max: 128.51Min: 138.95 / Avg: 139.09 / Max: 139.25Min: 147.98 / Avg: 148.21 / Max: 148.51Min: 178.77 / Avg: 179.11 / Max: 179.62Min: 191.55 / Avg: 192.06 / Max: 192.65Min: 194.05 / Avg: 194.4 / Max: 195.04Min: 249.48 / Avg: 249.7 / Max: 250.09Min: 263.47 / Avg: 264.04 / Max: 264.9

clpeak

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX120240360480600SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.60, N = 3SE +/- 0.24, N = 3SE +/- 0.72, N = 3SE +/- 0.02, N = 3SE +/- 0.98, N = 3SE +/- 0.14, N = 3SE +/- 1.25, N = 3SE +/- 1.29, N = 31512242312422662973434155195401. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGTX 1060GTX 1070RTX 2060GTX 1070 TiRTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX100200300400500Min: 150.56 / Avg: 150.64 / Max: 150.74Min: 223.25 / Avg: 223.76 / Max: 224.03Min: 229.66 / Avg: 230.86 / Max: 231.47Min: 241.36 / Avg: 241.64 / Max: 242.12Min: 265.58 / Avg: 266.33 / Max: 267.77Min: 297.37 / Avg: 297.4 / Max: 297.42Min: 341.37 / Avg: 342.53 / Max: 344.48Min: 414.58 / Avg: 414.85 / Max: 415.04Min: 516.26 / Avg: 518.58 / Max: 520.56Min: 537.59 / Avg: 539.61 / Max: 542.011. (CXX) g++ options: -O3 -rdynamic -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX50100150200250SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.30, N = 3SE +/- 0.07, N = 3SE +/- 0.44, N = 3SE +/- 0.18, N = 3SE +/- 0.58, N = 3SE +/- 0.40, N = 3SE +/- 0.48, N = 3SE +/- 1.30, N = 388.85117.00120.00132.00134.00157.00167.00168.00211.00215.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 88.76 / Avg: 88.85 / Max: 88.92Min: 117.31 / Avg: 117.42 / Max: 117.56Min: 119.2 / Avg: 119.54 / Max: 120.14Min: 131.77 / Avg: 131.91 / Max: 132Min: 133.38 / Avg: 133.96 / Max: 134.81Min: 157.1 / Avg: 157.33 / Max: 157.7Min: 165.9 / Avg: 166.71 / Max: 167.85Min: 167.9 / Avg: 168.35 / Max: 169.14Min: 210.59 / Avg: 211.32 / Max: 212.24Min: 213.66 / Avg: 215.41 / Max: 217.95

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX50100150200250SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.28, N = 3SE +/- 0.13, N = 3SE +/- 0.38, N = 3SE +/- 0.53, N = 386.11116.00118.00133.00133.00156.00167.00172.00225.00229.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 86.05 / Avg: 86.11 / Max: 86.17Min: 115.89 / Avg: 115.99 / Max: 116.12Min: 117.21 / Avg: 117.56 / Max: 117.8Min: 132.58 / Avg: 132.67 / Max: 132.83Min: 133.25 / Avg: 133.38 / Max: 133.48Min: 156.39 / Avg: 156.41 / Max: 156.45Min: 166.55 / Avg: 166.93 / Max: 167.49Min: 171.98 / Avg: 172.21 / Max: 172.43Min: 223.99 / Avg: 224.61 / Max: 225.31Min: 227.82 / Avg: 228.71 / Max: 229.66

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti50100150200250SE +/- 0.17, N = 3SE +/- 0.03, N = 3SE +/- 0.50, N = 3SE +/- 0.60, N = 3SE +/- 0.49, N = 3SE +/- 0.38, N = 3SE +/- 0.82, N = 3SE +/- 0.27, N = 3SE +/- 0.96, N = 3SE +/- 0.72, N = 3134153156167187188198216244246
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti4080120160200Min: 133.51 / Avg: 133.81 / Max: 134.09Min: 153.07 / Avg: 153.11 / Max: 153.17Min: 155.61 / Avg: 156.25 / Max: 157.23Min: 165.62 / Avg: 166.82 / Max: 167.47Min: 186.6 / Avg: 187.44 / Max: 188.3Min: 187.38 / Avg: 188.13 / Max: 188.58Min: 196.74 / Avg: 198.38 / Max: 199.3Min: 215.25 / Avg: 215.62 / Max: 216.16Min: 242.37 / Avg: 243.94 / Max: 245.69Min: 244.31 / Avg: 245.51 / Max: 246.8

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti30060090012001500SE +/- 1.03, N = 3SE +/- 0.80, N = 3SE +/- 0.82, N = 3SE +/- 0.28, N = 3SE +/- 1.91, N = 3SE +/- 0.24, N = 3SE +/- 1.24, N = 3SE +/- 2.94, N = 3SE +/- 3.52, N = 3SE +/- 1.02, N = 3411433447527596102410771121115811731. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti2004006008001000Min: 410.11 / Avg: 411.15 / Max: 413.22Min: 431.53 / Avg: 433.09 / Max: 434.17Min: 445.5 / Avg: 447.09 / Max: 448.19Min: 526.71 / Avg: 527.14 / Max: 527.67Min: 592.44 / Avg: 596.16 / Max: 598.8Min: 1023.47 / Avg: 1023.82 / Max: 1024.27Min: 1074.61 / Avg: 1076.98 / Max: 1078.78Min: 1115.71 / Avg: 1121.4 / Max: 1125.5Min: 1151.93 / Avg: 1158.19 / Max: 1164.11Min: 1171.44 / Avg: 1172.57 / Max: 1174.61. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 3SE +/- 0.36, N = 3SE +/- 0.72, N = 394.08129.00131.00143.00150.00189.00189.00203.00273.00289.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 94.02 / Avg: 94.08 / Max: 94.14Min: 128.87 / Avg: 128.94 / Max: 128.99Min: 130.93 / Avg: 130.95 / Max: 130.97Min: 142.61 / Avg: 142.69 / Max: 142.8Min: 150.11 / Avg: 150.43 / Max: 150.64Min: 188.71 / Avg: 188.84 / Max: 189.07Min: 189.1 / Avg: 189.39 / Max: 189.94Min: 203.19 / Avg: 203.39 / Max: 203.73Min: 272.43 / Avg: 272.89 / Max: 273.61Min: 288.45 / Avg: 289.19 / Max: 290.62

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX100200300400500SE +/- 0.17, N = 3SE +/- 0.38, N = 3SE +/- 0.16, N = 3SE +/- 0.24, N = 3SE +/- 0.26, N = 3SE +/- 0.76, N = 3SE +/- 0.34, N = 3SE +/- 0.40, N = 3SE +/- 1.03, N = 3SE +/- 1.16, N = 3183237237264282333334354458474
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX80160240320400Min: 182.32 / Avg: 182.6 / Max: 182.92Min: 236.94 / Avg: 237.38 / Max: 238.13Min: 237.19 / Avg: 237.39 / Max: 237.7Min: 263.46 / Avg: 263.87 / Max: 264.3Min: 281.23 / Avg: 281.69 / Max: 282.14Min: 331.75 / Avg: 333.11 / Max: 334.37Min: 333.56 / Avg: 334.13 / Max: 334.72Min: 353.61 / Avg: 354.02 / Max: 354.83Min: 456.67 / Avg: 458.12 / Max: 460.12Min: 472.54 / Avg: 474.44 / Max: 476.54

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060GTX 1070GTX 1070 TiRTX 2060RTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX7001400210028003500SE +/- 6.27, N = 3SE +/- 9.63, N = 3SE +/- 13.26, N = 3SE +/- 13.06, N = 3SE +/- 20.57, N = 3SE +/- 32.06, N = 3SE +/- 22.18, N = 3SE +/- 25.14, N = 3SE +/- 20.98, N = 3SE +/- 28.14, N = 38991430147814841729175022592333310732751. (CXX) g++ options: -lpthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060GTX 1070GTX 1070 TiRTX 2060RTX 2070GTX 1080RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX6001200180024003000Min: 888.27 / Avg: 898.96 / Max: 909.99Min: 1412.22 / Avg: 1430.05 / Max: 1445.29Min: 1451.36 / Avg: 1477.54 / Max: 1494.31Min: 1463.59 / Avg: 1484.44 / Max: 1508.5Min: 1689.75 / Avg: 1729.48 / Max: 1758.6Min: 1686.07 / Avg: 1749.9 / Max: 1787.12Min: 2235.88 / Avg: 2258.99 / Max: 2303.34Min: 2285.91 / Avg: 2332.72 / Max: 2372.04Min: 3066.49 / Avg: 3106.95 / Max: 3136.83Min: 3232.74 / Avg: 3275.46 / Max: 3328.551. (CXX) g++ options: -lpthread

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.13ATPase Simulation - 327,506 AtomsGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2080RTX 2070TITAN RTXRTX 2080 Ti0.07260.14520.21780.29040.363SE +/- 0.00102, N = 3SE +/- 0.00210, N = 3SE +/- 0.00129, N = 3SE +/- 0.00192, N = 3SE +/- 0.00208, N = 3SE +/- 0.00196, N = 3SE +/- 0.00116, N = 3SE +/- 0.00161, N = 3SE +/- 0.00092, N = 3SE +/- 0.00190, N = 30.322710.242280.230220.212330.205420.201130.200440.197460.197350.19177
OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.13ATPase Simulation - 327,506 AtomsGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2080RTX 2070TITAN RTXRTX 2080 Ti12345Min: 0.32 / Avg: 0.32 / Max: 0.32Min: 0.24 / Avg: 0.24 / Max: 0.25Min: 0.23 / Avg: 0.23 / Max: 0.23Min: 0.21 / Avg: 0.21 / Max: 0.22Min: 0.2 / Avg: 0.21 / Max: 0.21Min: 0.2 / Avg: 0.2 / Max: 0.21Min: 0.2 / Avg: 0.2 / Max: 0.2Min: 0.2 / Avg: 0.2 / Max: 0.2Min: 0.2 / Avg: 0.2 / Max: 0.2Min: 0.19 / Avg: 0.19 / Max: 0.2

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX120240360480600SE +/- 0.11, N = 3SE +/- 0.45, N = 3SE +/- 0.24, N = 3SE +/- 0.41, N = 3SE +/- 0.26, N = 3SE +/- 0.55, N = 3SE +/- 0.33, N = 3SE +/- 0.45, N = 3SE +/- 0.31, N = 3SE +/- 1.67, N = 3194257262292309339376400537556
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500Min: 193.36 / Avg: 193.57 / Max: 193.7Min: 256.49 / Avg: 257.03 / Max: 257.93Min: 261.35 / Avg: 261.84 / Max: 262.13Min: 291.32 / Avg: 292.02 / Max: 292.73Min: 308.5 / Avg: 309.01 / Max: 309.37Min: 338.01 / Avg: 338.9 / Max: 339.9Min: 375.3 / Avg: 375.95 / Max: 376.34Min: 398.92 / Avg: 399.51 / Max: 400.4Min: 536.36 / Avg: 536.97 / Max: 537.39Min: 553.7 / Avg: 556.33 / Max: 559.45

clpeak

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferRTX 2080GTX 1060GTX 1070 TiTITAN RTXRTX 2070GTX 1080 TiRTX 2060RTX 2080 TiGTX 1070GTX 10803691215SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.1311.1511.1511.1611.2511.2911.2911.2911.3311.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferRTX 2080GTX 1060GTX 1070 TiTITAN RTXRTX 2070GTX 1080 TiRTX 2060RTX 2080 TiGTX 1070GTX 10803691215Min: 11.03 / Avg: 11.13 / Max: 11.31Min: 11.01 / Avg: 11.15 / Max: 11.31Min: 11.06 / Avg: 11.15 / Max: 11.24Min: 11.04 / Avg: 11.16 / Max: 11.24Min: 11.18 / Avg: 11.25 / Max: 11.32Min: 11.27 / Avg: 11.29 / Max: 11.31Min: 11.24 / Avg: 11.29 / Max: 11.34Min: 11.27 / Avg: 11.29 / Max: 11.31Min: 11.32 / Avg: 11.33 / Max: 11.34Min: 11.29 / Avg: 11.34 / Max: 11.371. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRTX 2070RTX 2080GTX 1060GTX 1070 TiGTX 1080 TiTITAN RTXGTX 1080RTX 2080 TiGTX 1070RTX 20603691215SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 312.4812.5212.5412.5512.5712.5812.5912.5912.6012.601. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferRTX 2070RTX 2080GTX 1060GTX 1070 TiGTX 1080 TiTITAN RTXGTX 1080RTX 2080 TiGTX 1070RTX 206048121620Min: 12.45 / Avg: 12.48 / Max: 12.49Min: 12.47 / Avg: 12.52 / Max: 12.55Min: 12.47 / Avg: 12.54 / Max: 12.6Min: 12.54 / Avg: 12.55 / Max: 12.56Min: 12.55 / Avg: 12.57 / Max: 12.58Min: 12.55 / Avg: 12.58 / Max: 12.6Min: 12.57 / Avg: 12.59 / Max: 12.61Min: 12.58 / Avg: 12.59 / Max: 12.61Min: 12.54 / Avg: 12.6 / Max: 12.63Min: 12.59 / Avg: 12.6 / Max: 12.611. (CXX) g++ options: -O3 -rdynamic -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX130260390520650SE +/- 0.33, N = 3SE +/- 0.19, N = 3SE +/- 0.21, N = 3SE +/- 0.45, N = 3SE +/- 0.59, N = 3SE +/- 0.31, N = 3SE +/- 0.60, N = 3SE +/- 1.09, N = 3SE +/- 2.13, N = 3SE +/- 0.76, N = 3180224228259330347382457605610
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX110220330440550Min: 179.02 / Avg: 179.67 / Max: 180Min: 223.9 / Avg: 224.25 / Max: 224.53Min: 227.78 / Avg: 228.16 / Max: 228.5Min: 257.79 / Avg: 258.64 / Max: 259.34Min: 329.48 / Avg: 330.36 / Max: 331.47Min: 346.94 / Avg: 347.49 / Max: 348.01Min: 380.39 / Avg: 381.58 / Max: 382.33Min: 456.16 / Avg: 457.47 / Max: 459.63Min: 601.13 / Avg: 605.16 / Max: 608.37Min: 608.75 / Avg: 610.22 / Max: 611.33

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGTX 1060RTX 2060GTX 1070RTX 2070GTX 1070 TiGTX 1080RTX 2080GTX 1080 TiTITAN RTXRTX 2080 Ti3691215SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 312.008.808.257.887.856.506.284.964.394.361. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGTX 1060RTX 2060GTX 1070RTX 2070GTX 1070 TiGTX 1080RTX 2080GTX 1080 TiTITAN RTXRTX 2080 Ti3691215Min: 11.92 / Avg: 12 / Max: 12.13Min: 8.74 / Avg: 8.8 / Max: 8.88Min: 8.22 / Avg: 8.25 / Max: 8.26Min: 7.77 / Avg: 7.88 / Max: 7.99Min: 7.83 / Avg: 7.85 / Max: 7.88Min: 6.44 / Avg: 6.5 / Max: 6.63Min: 6.19 / Avg: 6.28 / Max: 6.39Min: 4.92 / Avg: 4.96 / Max: 5.01Min: 4.3 / Avg: 4.39 / Max: 4.55Min: 4.31 / Avg: 4.36 / Max: 4.431. (CXX) g++ options: -O2 -lOpenCL

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX70M140M210M280M350MSE +/- 513835.53, N = 3SE +/- 621591.58, N = 3SE +/- 297139.02, N = 3SE +/- 121634.53, N = 3SE +/- 815068.63, N = 3SE +/- 720633.38, N = 3SE +/- 333271.02, N = 3SE +/- 1536969.33, N = 3SE +/- 1417039.09, N = 3SE +/- 908628.37, N = 31837826262181720622215480162391313242516119892651628752656820462788292383009222413036654471. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX50M100M150M200M250MMin: 182818085.8 / Avg: 183782626.17 / Max: 184572023.8Min: 217003046.5 / Avg: 218172062.43 / Max: 219122875.7Min: 220996207.2 / Avg: 221548015.6 / Max: 222014983.7Min: 238908355.7 / Avg: 239131324.27 / Max: 239327062Min: 250000827.1 / Avg: 251611988.57 / Max: 252632347.5Min: 263757984.9 / Avg: 265162874.6 / Max: 266143978.9Min: 265132379.1 / Avg: 265682046.23 / Max: 266283389Min: 276811943.5 / Avg: 278829237.77 / Max: 281846545.4Min: 298088601.7 / Avg: 300922240.8 / Max: 302382263.1Min: 301940481.9 / Avg: 303665447.4 / Max: 305023093.81. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX120240360480600SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.52, N = 3SE +/- 1.98, N = 3SE +/- 1.81, N = 3SE +/- 0.50, N = 3SE +/- 0.38, N = 31532052052292963373923925445651. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500Min: 152.5 / Avg: 152.67 / Max: 153Min: 205.1 / Avg: 205.27 / Max: 205.4Min: 204.2 / Avg: 204.73 / Max: 205.1Min: 228.6 / Avg: 228.93 / Max: 229.2Min: 296.1 / Avg: 296.3 / Max: 296.6Min: 336.4 / Avg: 337.37 / Max: 338.2Min: 388.2 / Avg: 392.1 / Max: 394.6Min: 388.5 / Avg: 391.87 / Max: 394.7Min: 543.5 / Avg: 544 / Max: 545Min: 565 / Avg: 565.43 / Max: 566.21. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTX100200300400500SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.35, N = 3SE +/- 0.18, N = 3SE +/- 0.23, N = 3SE +/- 0.64, N = 31381871872092473173273304544831. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTX90180270360450Min: 138.2 / Avg: 138.4 / Max: 138.7Min: 186.8 / Avg: 186.87 / Max: 187Min: 186.7 / Avg: 187.23 / Max: 187.7Min: 209.3 / Avg: 209.4 / Max: 209.5Min: 246.4 / Avg: 246.67 / Max: 247Min: 317.3 / Avg: 317.47 / Max: 317.8Min: 326.3 / Avg: 326.7 / Max: 327.4Min: 330.1 / Avg: 330.37 / Max: 330.7Min: 453.3 / Avg: 453.7 / Max: 454.1Min: 481.7 / Avg: 482.77 / Max: 483.91. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1060GTX 1070 TiGTX 1070GTX 1080RTX 2060RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX110220330440550SE +/- 0.19, N = 3SE +/- 0.30, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.95, N = 3SE +/- 0.59, N = 3SE +/- 0.73, N = 3SE +/- 0.09, N = 3SE +/- 1.92, N = 3SE +/- 2.45, N = 31391901922162453133233364394891. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1060GTX 1070 TiGTX 1070GTX 1080RTX 2060RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX90180270360450Min: 138.9 / Avg: 139.13 / Max: 139.5Min: 189.2 / Avg: 189.63 / Max: 190.2Min: 191.6 / Avg: 191.67 / Max: 191.8Min: 216 / Avg: 216.17 / Max: 216.3Min: 243.5 / Avg: 245.37 / Max: 246.6Min: 312.5 / Avg: 313.4 / Max: 314.5Min: 321.7 / Avg: 322.67 / Max: 324.1Min: 335.5 / Avg: 335.67 / Max: 335.8Min: 435.5 / Avg: 439.33 / Max: 441.3Min: 483.6 / Avg: 488.5 / Max: 491.11. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000SE +/- 0.87, N = 3SE +/- 3.15, N = 3SE +/- 4.06, N = 3SE +/- 5.02, N = 3SE +/- 3.70, N = 3SE +/- 1.27, N = 3SE +/- 1.07, N = 3SE +/- 1.63, N = 3SE +/- 2.70, N = 3SE +/- 3.45, N = 3552645664686774779830893971977
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000Min: 550.45 / Avg: 552.08 / Max: 553.4Min: 641.13 / Avg: 645.34 / Max: 651.5Min: 656.32 / Avg: 664.31 / Max: 669.55Min: 675.83 / Avg: 685.87 / Max: 691.24Min: 767.34 / Avg: 774.41 / Max: 779.85Min: 777.01 / Avg: 779.22 / Max: 781.41Min: 828.43 / Avg: 830.48 / Max: 832.07Min: 890.4 / Avg: 893.16 / Max: 896.06Min: 965.79 / Avg: 971.16 / Max: 974.38Min: 971.07 / Avg: 976.99 / Max: 983.03

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Masskrug - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070GTX 1080 TiRTX 2060RTX 2080TITAN RTXRTX 2080 TiRTX 20700.93831.87662.81493.75324.6915SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.174.013.973.923.873.823.783.693.693.68
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Masskrug - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070GTX 1080 TiRTX 2060RTX 2080TITAN RTXRTX 2080 TiRTX 2070246810Min: 4.16 / Avg: 4.17 / Max: 4.19Min: 3.99 / Avg: 4.01 / Max: 4.03Min: 3.97 / Avg: 3.97 / Max: 3.98Min: 3.91 / Avg: 3.92 / Max: 3.93Min: 3.85 / Avg: 3.87 / Max: 3.89Min: 3.81 / Avg: 3.82 / Max: 3.84Min: 3.78 / Avg: 3.78 / Max: 3.79Min: 3.67 / Avg: 3.69 / Max: 3.7Min: 3.66 / Avg: 3.69 / Max: 3.7Min: 3.65 / Avg: 3.68 / Max: 3.7

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500SE +/- 3.04, N = 3SE +/- 0.31, N = 3SE +/- 1.02, N = 3SE +/- 0.61, N = 3SE +/- 1.86, N = 3SE +/- 4.51, N = 3SE +/- 1.43, N = 3SE +/- 0.31, N = 3SE +/- 1.39, N = 3SE +/- 3.01, N = 35927557648018638831008104512911336
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000Min: 586.76 / Avg: 592.47 / Max: 597.15Min: 754.57 / Avg: 754.92 / Max: 755.54Min: 761.81 / Avg: 763.83 / Max: 765.1Min: 800.38 / Avg: 801.45 / Max: 802.5Min: 859.19 / Avg: 862.88 / Max: 865.16Min: 875.86 / Avg: 882.88 / Max: 891.29Min: 1005.23 / Avg: 1007.98 / Max: 1010.04Min: 1044.43 / Avg: 1045.03 / Max: 1045.41Min: 1288.49 / Avg: 1291.1 / Max: 1293.21Min: 1329.56 / Avg: 1335.56 / Max: 1338.83

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Boat - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti0.82351.6472.47053.2944.1175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.662.932.892.722.262.221.931.921.641.64
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Boat - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1070GTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti246810Min: 3.65 / Avg: 3.66 / Max: 3.67Min: 2.92 / Avg: 2.93 / Max: 2.94Min: 2.88 / Avg: 2.89 / Max: 2.89Min: 2.71 / Avg: 2.72 / Max: 2.73Min: 2.25 / Avg: 2.26 / Max: 2.27Min: 2.22 / Avg: 2.22 / Max: 2.23Min: 1.92 / Avg: 1.93 / Max: 1.94Min: 1.92 / Avg: 1.92 / Max: 1.93Min: 1.63 / Avg: 1.64 / Max: 1.64Min: 1.64 / Avg: 1.64 / Max: 1.65

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Room - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070GTX 1080 TiRTX 2060RTX 2080RTX 2080 TiRTX 2070TITAN RTX0.29480.58960.88441.17921.474SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.311.161.101.101.030.820.810.750.750.74
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Room - Acceleration: OpenCLGTX 1060GTX 1070 TiGTX 1080GTX 1070GTX 1080 TiRTX 2060RTX 2080RTX 2080 TiRTX 2070TITAN RTX246810Min: 1.3 / Avg: 1.31 / Max: 1.31Min: 1.15 / Avg: 1.16 / Max: 1.16Min: 1.1 / Avg: 1.1 / Max: 1.11Min: 1.1 / Avg: 1.1 / Max: 1.1Min: 1.01 / Avg: 1.03 / Max: 1.06Min: 0.82 / Avg: 0.82 / Max: 0.82Min: 0.81 / Avg: 0.81 / Max: 0.81Min: 0.74 / Avg: 0.75 / Max: 0.75Min: 0.75 / Avg: 0.75 / Max: 0.75Min: 0.73 / Avg: 0.74 / Max: 0.75

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500SE +/- 2.45, N = 3SE +/- 8.74, N = 3SE +/- 1.30, N = 3SE +/- 1.43, N = 3SE +/- 10.50, N = 3SE +/- 2.56, N = 3SE +/- 46.80, N = 3SE +/- 6.71, N = 3SE +/- 11.70, N = 3SE +/- 2.67, N = 32974535015758039729881083144515531. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500Min: 291.88 / Avg: 296.79 / Max: 299.31Min: 443.6 / Avg: 453.16 / Max: 470.62Min: 498.13 / Avg: 500.59 / Max: 502.57Min: 572.34 / Avg: 574.51 / Max: 577.21Min: 782.66 / Avg: 802.85 / Max: 817.98Min: 968.63 / Avg: 972.1 / Max: 977.1Min: 936.99 / Avg: 988.41 / Max: 1081.85Min: 1069.68 / Avg: 1083.05 / Max: 1090.83Min: 1431.39 / Avg: 1444.87 / Max: 1468.18Min: 1548.6 / Avg: 1552.67 / Max: 1557.71. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti3K6K9K12K15KSE +/- 38.63, N = 3SE +/- 21.00, N = 3SE +/- 6.17, N = 3SE +/- 25.25, N = 3SE +/- 76.28, N = 3SE +/- 444.77, N = 3SE +/- 498.12, N = 3SE +/- 679.58, N = 3SE +/- 1004.15, N = 3SE +/- 517.75, N = 31252168120682398326367057780995214848149061. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti3K6K9K12K15KMin: 1174.97 / Avg: 1252.22 / Max: 1291.03Min: 1641.76 / Avg: 1680.93 / Max: 1713.66Min: 2055.54 / Avg: 2067.8 / Max: 2075.12Min: 2351.95 / Avg: 2398.48 / Max: 2438.72Min: 3114.34 / Avg: 3263.29 / Max: 3366.29Min: 5815.24 / Avg: 6704.55 / Max: 7166.29Min: 6785.83 / Avg: 7780.45 / Max: 8326.84Min: 8593.77 / Avg: 9951.71 / Max: 10680.62Min: 12874.24 / Avg: 14847.83 / Max: 16156.55Min: 13870.49 / Avg: 14905.92 / Max: 15434.021. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.9312.1912.1912.3012.4312.5112.5712.5712.6812.681. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX48121620Min: 11.92 / Avg: 11.93 / Max: 11.95Min: 12.19 / Avg: 12.19 / Max: 12.19Min: 12.19 / Avg: 12.19 / Max: 12.2Min: 12.29 / Avg: 12.3 / Max: 12.3Min: 12.43 / Avg: 12.43 / Max: 12.44Min: 12.51 / Avg: 12.51 / Max: 12.51Min: 12.56 / Avg: 12.57 / Max: 12.58Min: 12.56 / Avg: 12.57 / Max: 12.57Min: 12.68 / Avg: 12.68 / Max: 12.69Min: 12.68 / Avg: 12.68 / Max: 12.691. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX110220330440550SE +/- 0.95, N = 3SE +/- 0.02, N = 3SE +/- 0.22, N = 3SE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.68, N = 3SE +/- 2.69, N = 3SE +/- 0.37, N = 3SE +/- 0.48, N = 3SE +/- 1.70, N = 31451961972222763293653685055251. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 2070RTX 2080RTX 2080 TiTITAN RTX90180270360450Min: 143.59 / Avg: 145.49 / Max: 146.46Min: 196.27 / Avg: 196.3 / Max: 196.32Min: 196.62 / Avg: 197.05 / Max: 197.27Min: 221.94 / Avg: 222.03 / Max: 222.08Min: 275.65 / Avg: 275.99 / Max: 276.3Min: 328.16 / Avg: 329.02 / Max: 330.37Min: 359.25 / Avg: 364.64 / Max: 367.34Min: 367.26 / Avg: 367.64 / Max: 368.37Min: 504.5 / Avg: 505.01 / Max: 505.96Min: 522.05 / Avg: 525.42 / Max: 527.521. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.40, N = 3SE +/- 0.35, N = 37.2810.4711.6314.1915.8718.4219.6823.7435.1336.281. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060RTX 2070GTX 1080 TiRTX 2080RTX 2080 TiTITAN RTX816243240Min: 7.28 / Avg: 7.28 / Max: 7.28Min: 10.47 / Avg: 10.47 / Max: 10.47Min: 11.63 / Avg: 11.63 / Max: 11.63Min: 14.15 / Avg: 14.19 / Max: 14.22Min: 15.86 / Avg: 15.87 / Max: 15.88Min: 18.41 / Avg: 18.42 / Max: 18.43Min: 19.56 / Avg: 19.68 / Max: 19.78Min: 23.74 / Avg: 23.74 / Max: 23.75Min: 34.34 / Avg: 35.13 / Max: 35.58Min: 35.77 / Avg: 36.28 / Max: 36.951. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGTX 1060GTX 1070RTX 2060GTX 1070 TiGTX 1080RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX4K8K12K16K20KSE +/- 3.56, N = 3SE +/- 393.27, N = 3SE +/- 657.16, N = 3SE +/- 14.43, N = 3SE +/- 396.02, N = 3SE +/- 495.99, N = 3SE +/- 732.21, N = 3SE +/- 782.31, N = 3SE +/- 1065.18, N = 3SE +/- 688.69, N = 3423958786630677679347968102701085315352163861. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGTX 1060GTX 1070RTX 2060GTX 1070 TiGTX 1080RTX 2070RTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX3K6K9K12K15KMin: 4232.57 / Avg: 4239.17 / Max: 4244.77Min: 5091.74 / Avg: 5878.26 / Max: 6276.36Min: 5316.4 / Avg: 6630.38 / Max: 7313.07Min: 6747.48 / Avg: 6776.31 / Max: 6791.94Min: 7141.47 / Avg: 7933.51 / Max: 8329.77Min: 6980.09 / Avg: 7968.07 / Max: 8539.2Min: 8806.31 / Avg: 10270.06 / Max: 11040.34Min: 9288.09 / Avg: 10852.7 / Max: 11635.67Min: 13222.02 / Avg: 15351.89 / Max: 16456.4Min: 15021.53 / Avg: 16386.43 / Max: 17229.121. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Rack - Acceleration: OpenCLGTX 1080GTX 1070 TiGTX 1060GTX 1080 TiGTX 1070RTX 2080RTX 2060TITAN RTXRTX 2080 TiRTX 20700.02930.05860.08790.11720.1465SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.130.130.130.120.120.110.110.100.100.10
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Rack - Acceleration: OpenCLGTX 1080GTX 1070 TiGTX 1060GTX 1080 TiGTX 1070RTX 2080RTX 2060TITAN RTXRTX 2080 TiRTX 207012345Min: 0.12 / Avg: 0.13 / Max: 0.13Min: 0.13 / Avg: 0.13 / Max: 0.13Min: 0.13 / Avg: 0.13 / Max: 0.13Min: 0.12 / Avg: 0.12 / Max: 0.12Min: 0.12 / Avg: 0.12 / Max: 0.13Min: 0.11 / Avg: 0.11 / Max: 0.11Min: 0.11 / Avg: 0.11 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11

clpeak

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGTX 1080 TiGTX 1070 TiTITAN RTXRTX 2080 TiGTX 1060GTX 1080RTX 2080RTX 2060RTX 2070GTX 10700.89781.79562.69343.59124.489SE +/- 0.03, N = 3SE +/- 0.14, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 33.993.923.853.833.783.773.653.653.643.641. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGTX 1080 TiGTX 1070 TiTITAN RTXRTX 2080 TiGTX 1060GTX 1080RTX 2080RTX 2060RTX 2070GTX 1070246810Min: 3.94 / Avg: 3.99 / Max: 4.05Min: 3.75 / Avg: 3.92 / Max: 4.2Min: 3.66 / Avg: 3.85 / Max: 3.99Min: 3.77 / Avg: 3.83 / Max: 3.88Min: 3.72 / Avg: 3.78 / Max: 3.9Min: 3.71 / Avg: 3.77 / Max: 3.8Min: 3.58 / Avg: 3.65 / Max: 3.72Min: 3.52 / Avg: 3.65 / Max: 3.74Min: 3.64 / Avg: 3.64 / Max: 3.64Min: 3.56 / Avg: 3.64 / Max: 3.721. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Chaos Group V-RAY

OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 1.1.0System Power Consumption MonitorRTX 2080 TiTITAN RTXGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1070GTX 1070 TiGTX 106050100150200250Min: 77.8 / Avg: 258.11 / Max: 278.4Min: 56.9 / Avg: 255.3 / Max: 301.4Min: 51.6 / Avg: 222.32 / Max: 232.4Min: 56.9 / Avg: 193.17 / Max: 205.2Min: 47.1 / Avg: 187.28 / Max: 203.3Min: 51.4 / Avg: 157.72 / Max: 163.6Min: 44.7 / Avg: 152.69 / Max: 164.2Min: 48.1 / Avg: 131.12 / Max: 137.1Min: 43.2 / Avg: 121.48 / Max: 125.9

OpenBenchmarking.orgCelsius, Fewer Is BetterChaos Group V-RAY 1.1.0GPU Temperature MonitorRTX 2080GTX 1080 TiRTX 2080 TiTITAN RTXGTX 1070RTX 2070RTX 2060GTX 1060GTX 1070 Ti1428425670Min: 64 / Avg: 72.3 / Max: 75Min: 63 / Avg: 70.44 / Max: 73Min: 63 / Avg: 68.72 / Max: 70Min: 61 / Avg: 68.6 / Max: 71Min: 58 / Avg: 66.75 / Max: 70Min: 58 / Avg: 65.11 / Max: 67Min: 55 / Avg: 62.31 / Max: 64Min: 50 / Avg: 58.41 / Max: 60Min: 44 / Avg: 48.38 / Max: 50

OctaneBench

OpenBenchmarking.orgWatts, Fewer Is BetterOctaneBench 4.00System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1070GTX 1070 TiGTX 106060120180240300Min: 50.9 / Avg: 313.74 / Max: 347Min: 48.1 / Avg: 306.05 / Max: 333.8Min: 121.5 / Avg: 228.13 / Max: 255.9Min: 46.8 / Avg: 227.19 / Max: 265.6Min: 45.3 / Avg: 207.59 / Max: 236.4Min: 44.5 / Avg: 179.11 / Max: 208.8Min: 82.8 / Avg: 151.39 / Max: 179.2Min: 101.9 / Avg: 126.5 / Max: 138.6Min: 43.6 / Avg: 116.96 / Max: 132.2

OpenBenchmarking.orgCelsius, Fewer Is BetterOctaneBench 4.00GPU Temperature MonitorRTX 2080TITAN RTXGTX 1080 TiRTX 2080 TiRTX 2070GTX 1070RTX 2060GTX 1060GTX 1070 Ti1530456075Min: 55 / Avg: 77.23 / Max: 81Min: 49 / Avg: 73.57 / Max: 78Min: 58 / Avg: 72.86 / Max: 75Min: 48 / Avg: 72.21 / Max: 76Min: 44 / Avg: 69.09 / Max: 74Min: 54 / Avg: 67.7 / Max: 71Min: 44 / Avg: 65.39 / Max: 70Min: 45 / Avg: 57.32 / Max: 60Min: 40 / Avg: 48.77 / Max: 50

OpenBenchmarking.orgScore Per Watt, More Is BetterOctaneBench 4.00Total ScoreGTX 1060GTX 1070RTX 2060GTX 1080 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTXGTX 1070 Ti0.2520.5040.7561.0081.260.780.880.920.930.960.980.991.021.12

System Power Consumption Monitor

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1080GTX 1070GTX 1070 Ti70140210280350Min: 49.4 / Avg: 228.34 / Max: 363.1Min: 47.2 / Avg: 220.83 / Max: 367.7Min: 47.2 / Avg: 198.85 / Max: 370.3Min: 46 / Avg: 176.62 / Max: 334.5Min: 43.9 / Avg: 160.13 / Max: 317.6Min: 42.5 / Avg: 144.37 / Max: 300.1Min: 40.2 / Avg: 143.79 / Max: 318.2Min: 42.3 / Avg: 142.19 / Max: 280.5Min: 47.2 / Avg: 123.43 / Max: 255.8

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRTX 2080GTX 1080 TiTITAN RTXRTX 2080 TiGTX 1070GTX 1080RTX 2070RTX 2060GTX 1070 Ti1632486480Min: 48 / Avg: 67.06 / Max: 83Min: 39 / Avg: 66.36 / Max: 82Min: 33 / Avg: 62.56 / Max: 79Min: 43 / Avg: 61.88 / Max: 77Min: 32 / Avg: 61.66 / Max: 76Min: 31 / Avg: 60.14 / Max: 74Min: 32 / Avg: 56.83 / Max: 74Min: 37 / Avg: 55.81 / Max: 70Min: 38 / Avg: 45.88 / Max: 53

Meta Performance Per Watt

OpenBenchmarking.orgPerformance Per Watt, More Is BetterMeta Performance Per WattPerformance Per WattGTX 1060GTX 1070GTX 1080GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000416.98479.38541.32557.72583.35735.65740.20786.04839.87854.04

Rodinia

OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 2.4System Power Consumption MonitorGTX 1080 TiTITAN RTXGTX 1080RTX 2080GTX 1070 TiGTX 1060RTX 2070GTX 1070RTX 2060RTX 2080 Ti50100150200250Min: 251.2 / Avg: 257.17 / Max: 262.8Min: 51.8 / Avg: 158.3 / Max: 240.2Min: 43.1 / Avg: 152.7 / Max: 204.6Min: 47.2 / Avg: 145.53 / Max: 181.5Min: 138.9 / Avg: 142.32 / Max: 143.8Min: 104.4 / Avg: 137.7 / Max: 148Min: 44.9 / Avg: 133.58 / Max: 161.9Min: 42.6 / Avg: 133.12 / Max: 175.4Min: 43.8 / Avg: 118.92 / Max: 153.5Min: 48.3 / Avg: 118.57 / Max: 153.7

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 2.4GPU Temperature MonitorRTX 2080GTX 1070GTX 1080 TiTITAN RTXGTX 1080RTX 2080 TiGTX 1060RTX 2060RTX 2070GTX 1070 Ti1224364860Min: 57 / Avg: 58.75 / Max: 61Min: 53 / Avg: 56.4 / Max: 60Min: 51 / Avg: 54.67 / Max: 58Min: 53 / Avg: 54.33 / Max: 55Min: 47 / Avg: 53 / Max: 57Min: 52 / Avg: 52.67 / Max: 54Min: 46 / Avg: 49.29 / Max: 53Min: 43 / Avg: 48.8 / Max: 51Min: 44 / Avg: 47.5 / Max: 49Min: 38 / Avg: 42.67 / Max: 45

OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 2.4System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1080GTX 1070GTX 1070 TiGTX 1060306090120150Min: 103.8 / Avg: 163.73 / Max: 168.8Min: 94.7 / Avg: 156.48 / Max: 163Min: 139.3 / Avg: 143.18 / Max: 143.6Min: 86.1 / Avg: 134.67 / Max: 138.4Min: 77.4 / Avg: 121.31 / Max: 124.3Min: 71.3 / Avg: 117.74 / Max: 121Min: 89.5 / Avg: 112.44 / Max: 114Min: 83.2 / Avg: 107.83 / Max: 109.9Min: 79.3 / Avg: 102.88 / Max: 104.5Min: 71.2 / Avg: 95.18 / Max: 97.9

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 2.4GPU Temperature MonitorRTX 2080GTX 1080 TiTITAN RTXGTX 1070RTX 2080 TiGTX 1080RTX 2060RTX 2070GTX 1060GTX 1070 Ti1122334455Min: 55 / Avg: 56.89 / Max: 58Min: 54 / Avg: 55.14 / Max: 56Min: 52 / Avg: 53 / Max: 54Min: 51 / Avg: 52.25 / Max: 54Min: 50 / Avg: 51.68 / Max: 52Min: 50 / Avg: 50 / Max: 50Min: 47 / Avg: 47.53 / Max: 48Min: 45 / Avg: 46.16 / Max: 47Min: 43 / Avg: 44.4 / Max: 46Min: 40 / Avg: 40.18 / Max: 41

JuliaGPU

OpenBenchmarking.orgWatts, Fewer Is BetterJuliaGPU 1.2pts1System Power Consumption MonitorGTX 1080 TiTITAN RTXRTX 2080 TiRTX 2080GTX 1080GTX 1070RTX 2070GTX 1060RTX 2060GTX 1070 Ti4080120160200Min: 67.7 / Avg: 159.15 / Max: 189.8Min: 51.3 / Avg: 148.68 / Max: 204Min: 99.5 / Avg: 147.3 / Max: 199.5Min: 46.4 / Avg: 134.7 / Max: 169Min: 42.3 / Avg: 127 / Max: 157.8Min: 42.3 / Avg: 116.82 / Max: 147.4Min: 52.9 / Avg: 116.58 / Max: 157.4Min: 57.5 / Avg: 112.76 / Max: 128.1Min: 43.5 / Avg: 112.53 / Max: 153.7Min: 47.7 / Avg: 111.88 / Max: 128.2

OpenBenchmarking.orgCelsius, Fewer Is BetterJuliaGPU 1.2pts1GPU Temperature MonitorGTX 1080 TiRTX 2080GTX 1070TITAN RTXGTX 1080RTX 2080 TiRTX 2060GTX 1060RTX 2070GTX 1070 Ti1122334455Min: 52 / Avg: 54.6 / Max: 57Min: 53 / Avg: 54.5 / Max: 56Min: 53 / Avg: 54.4 / Max: 56Min: 49 / Avg: 52.5 / Max: 54Min: 51 / Avg: 52.5 / Max: 54Min: 48 / Avg: 51.25 / Max: 53Min: 43 / Avg: 47.5 / Max: 50Min: 46 / Avg: 47.2 / Max: 49Min: 46 / Avg: 47 / Max: 48Min: 38 / Avg: 40.6 / Max: 42

OpenBenchmarking.orgSamples/sec Per Watt, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1080 TiGTX 1070GTX 1080GTX 1070 TiTITAN RTXRTX 2080 TiRTX 2080RTX 2060RTX 2070500K1000K1500K2000K2500K1629857166611918675921882924198022920424782042921207000222360542279065

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorTITAN RTXRTX 2080 TiRTX 2080GTX 1080 TiRTX 2060RTX 2070GTX 1080GTX 1070GTX 1070 TiGTX 106050100150200250Min: 49.4 / Avg: 244.67 / Max: 287.4Min: 49 / Avg: 235.34 / Max: 283.1Min: 169.9 / Avg: 187.73 / Max: 213.5Min: 53.7 / Avg: 180.42 / Max: 218.2Min: 148.5 / Avg: 159.28 / Max: 177.5Min: 52.2 / Avg: 156.43 / Max: 195Min: 43.4 / Avg: 134.92 / Max: 163.8Min: 43.3 / Avg: 130.75 / Max: 158.9Min: 48.4 / Avg: 122.39 / Max: 144.2Min: 42.1 / Avg: 105.4 / Max: 135.1

OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10GPU Temperature MonitorRTX 2080TITAN RTXRTX 2080 TiGTX 1080 TiGTX 1070GTX 1080RTX 2060RTX 2070GTX 1060GTX 1070 Ti1326395265Min: 57 / Avg: 63.86 / Max: 67Min: 55 / Avg: 63.4 / Max: 66Min: 59 / Avg: 62 / Max: 65Min: 56 / Avg: 59.67 / Max: 61Min: 57 / Avg: 57.33 / Max: 58Min: 54 / Avg: 55.54 / Max: 57Min: 48 / Avg: 54.5 / Max: 56Min: 52 / Avg: 54.2 / Max: 56Min: 46 / Avg: 48.5 / Max: 49Min: 42 / Avg: 42.6 / Max: 43

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1080 TiGTX 1070GTX 1070 TiGTX 1060GTX 1080TITAN RTXRTX 2080 TiRTX 2080RTX 2060RTX 20702468103.303.423.543.903.914.734.985.976.436.88

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080GTX 1080RTX 2070RTX 2060GTX 1070GTX 1070 TiGTX 106060120180240300Min: 50.4 / Avg: 217.41 / Max: 351.9Min: 47.8 / Avg: 210.47 / Max: 341.8Min: 49.4 / Avg: 197.37 / Max: 320.7Min: 48.8 / Avg: 169.69 / Max: 284.4Min: 50.6 / Avg: 150.09 / Max: 250.1Min: 44.9 / Avg: 147.19 / Max: 246Min: 44.4 / Avg: 139.12 / Max: 218.8Min: 55.9 / Avg: 136.52 / Max: 200.6Min: 48.3 / Avg: 120.03 / Max: 172.9Min: 43 / Avg: 114.31 / Max: 162.2

OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10GPU Temperature MonitorGTX 1080 TiRTX 2080TITAN RTXGTX 1080RTX 2080 TiGTX 1070RTX 2060RTX 2070GTX 1060GTX 1070 Ti1428425670Min: 49 / Avg: 66.58 / Max: 73Min: 49 / Avg: 66.07 / Max: 74Min: 52 / Avg: 62.85 / Max: 70Min: 49 / Avg: 62.72 / Max: 68Min: 45 / Avg: 61.74 / Max: 69Min: 54 / Avg: 60.68 / Max: 66Min: 50 / Avg: 54.74 / Max: 63Min: 47 / Avg: 54.47 / Max: 62Min: 39 / Avg: 50.55 / Max: 56Min: 42 / Avg: 44.91 / Max: 50

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070RTX 2060RTX 2070GTX 1080GTX 1070 TiRTX 2080GTX 1080 TiRTX 2080 TiTITAN RTX2040608010041.9452.0252.7657.9462.6364.2264.7267.0778.7779.72

cl-mem

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorRTX 2080 TiRTX 2070RTX 2080GTX 1080 TiGTX 1070GTX 1080GTX 1070 TiRTX 2060GTX 1060TITAN RTX50100150200250Min: 136 / Avg: 198.3 / Max: 260.6Min: 115.9 / Avg: 150.6 / Max: 185.3Min: 52.6 / Avg: 141.73 / Max: 186.7Min: 49.6 / Avg: 139.67 / Max: 192.8Min: 64.6 / Avg: 132.44 / Max: 150.8Min: 43.2 / Avg: 119.3 / Max: 162.2Min: 49.3 / Avg: 113.36 / Max: 139.4Min: 43.7 / Avg: 111.03 / Max: 156.9Min: 57.7 / Avg: 108.33 / Max: 126.1Min: 54.7 / Avg: 99.5 / Max: 144.3

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorRTX 2080TITAN RTXGTX 1080 TiRTX 2080 TiGTX 1070GTX 1080RTX 2060RTX 2070GTX 1060GTX 1070 Ti1224364860Min: 62 / Avg: 62 / Max: 62Min: 60 / Avg: 61 / Max: 62Min: 60 / Avg: 60.67 / Max: 61Min: 56 / Avg: 58.33 / Max: 61Min: 55 / Avg: 57.17 / Max: 58Min: 56 / Avg: 57 / Max: 58Min: 49 / Avg: 54 / Max: 56Min: 50 / Avg: 53.67 / Max: 56Min: 48 / Avg: 51.14 / Max: 52Min: 43 / Avg: 44 / Max: 45

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080RTX 2060GTX 1080 TiRTX 20800.51981.03961.55942.07922.5991.281.411.651.762.222.272.31

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGTX 1080 TiTITAN RTXRTX 2080 TiGTX 1080RTX 2070GTX 1070RTX 2060RTX 2080GTX 1070 TiGTX 106050100150200250Min: 216.4 / Avg: 217.73 / Max: 218.8Min: 55 / Avg: 190.13 / Max: 258.3Min: 51 / Avg: 155.07 / Max: 243.7Min: 92.2 / Avg: 141.24 / Max: 162.2Min: 46.9 / Avg: 139.57 / Max: 185.9Min: 84.4 / Avg: 137.36 / Max: 151.6Min: 44.2 / Avg: 131.53 / Max: 164.4Min: 98.5 / Avg: 128.93 / Max: 169.3Min: 50 / Avg: 124.63 / Max: 140.5Min: 43.4 / Avg: 98.27 / Max: 126.2

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorGTX 1080 TiRTX 2080TITAN RTXRTX 2080 TiGTX 1070GTX 1080RTX 2060RTX 2070GTX 1060GTX 1070 Ti1326395265Min: 64 / Avg: 64 / Max: 64Min: 61 / Avg: 63.75 / Max: 65Min: 59 / Avg: 63 / Max: 65Min: 61 / Avg: 62 / Max: 63Min: 60 / Avg: 60.6 / Max: 61Min: 60 / Avg: 60.2 / Max: 61Min: 55 / Avg: 56 / Max: 57Min: 54 / Avg: 55.33 / Max: 56Min: 53 / Avg: 53 / Max: 53Min: 43 / Avg: 44.83 / Max: 46

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1070GTX 1060GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti0.63681.27361.91042.54723.1841.401.421.521.531.541.872.252.502.572.83

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGTX 1080 TiRTX 2080 TiTITAN RTXGTX 1080RTX 2080RTX 2070GTX 1070RTX 2060GTX 1070 TiGTX 106050100150200250Min: 52.6 / Avg: 161.7 / Max: 217.6Min: 57.7 / Avg: 161.45 / Max: 265.2Min: 55.2 / Avg: 156.17 / Max: 266.5Min: 102.3 / Avg: 149.02 / Max: 162.5Min: 50.4 / Avg: 148 / Max: 197.9Min: 48.6 / Avg: 145.97 / Max: 196.3Min: 46.7 / Avg: 129.34 / Max: 151Min: 45.9 / Avg: 126.65 / Max: 159.6Min: 49 / Avg: 121.24 / Max: 139.8Min: 55.5 / Avg: 114.96 / Max: 125.8

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorRTX 2080TITAN RTXGTX 1080 TiRTX 2080 TiGTX 1070GTX 1080RTX 2070RTX 2060GTX 1060GTX 1070 Ti1428425670Min: 67 / Avg: 68.67 / Max: 70Min: 66 / Avg: 68.33 / Max: 70Min: 68 / Avg: 68.33 / Max: 69Min: 66 / Avg: 67 / Max: 68Min: 64 / Avg: 64.8 / Max: 66Min: 63 / Avg: 64.17 / Max: 65Min: 60 / Avg: 62 / Max: 63Min: 56 / Avg: 58.5 / Max: 60Min: 54 / Avg: 55.88 / Max: 57Min: 47 / Avg: 47 / Max: 47

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1080GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2080RTX 2070TITAN RTX0.81451.6292.44353.2584.07251.331.541.591.692.092.342.652.693.62

LuxMark

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1080GTX 1070GTX 1070 TiGTX 106060120180240300Min: 58 / Avg: 351.6 / Max: 363.1Min: 51.4 / Avg: 335.79 / Max: 346.6Min: 48.8 / Avg: 276.45 / Max: 286Min: 52 / Avg: 263.55 / Max: 277.3Min: 47.5 / Avg: 239.56 / Max: 245.1Min: 45.7 / Avg: 206.77 / Max: 213.1Min: 43.9 / Avg: 194.98 / Max: 203.8Min: 45.5 / Avg: 193.79 / Max: 199.6Min: 48.8 / Avg: 159.08 / Max: 161.7Min: 44.6 / Avg: 149.89 / Max: 155.9

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorRTX 2080GTX 1080 TiTITAN RTXRTX 2080 TiGTX 1070RTX 2070GTX 1080RTX 2060GTX 1060GTX 1070 Ti1632486480Min: 66 / Avg: 80.62 / Max: 83Min: 67 / Avg: 79.45 / Max: 82Min: 65 / Avg: 77.45 / Max: 79Min: 63 / Avg: 75.3 / Max: 77Min: 60 / Avg: 73.85 / Max: 76Min: 63 / Avg: 72.56 / Max: 74Min: 59 / Avg: 71.48 / Max: 74Min: 55 / Avg: 68.52 / Max: 70Min: 50 / Avg: 62.77 / Max: 65Min: 47 / Avg: 52.18 / Max: 53

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1080GTX 1070GTX 1080 TiRTX 2060RTX 2080RTX 2070GTX 1070 TiRTX 2080 TiTITAN RTX71421283517.7619.5019.9820.5023.0524.8625.6626.1927.3527.67

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1080GTX 1070GTX 1070 TiGTX 106060120180240300Min: 56.5 / Avg: 333.71 / Max: 348.1Min: 51.5 / Avg: 320.8 / Max: 328.8Min: 51.8 / Avg: 234.85 / Max: 240.3Min: 49.5 / Avg: 233.26 / Max: 240.4Min: 49.6 / Avg: 218.65 / Max: 223.5Min: 45.4 / Avg: 188.59 / Max: 193.4Min: 49.8 / Avg: 167.48 / Max: 171.6Min: 58.9 / Avg: 165.75 / Max: 169.2Min: 49.7 / Avg: 142.13 / Max: 144.6Min: 43.5 / Avg: 135.49 / Max: 138.3

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorRTX 2080TITAN RTXRTX 2080 TiGTX 1080 TiRTX 2070GTX 1070RTX 2060GTX 1080GTX 1060GTX 1070 Ti1530456075Min: 66 / Avg: 76.55 / Max: 78Min: 64 / Avg: 76.18 / Max: 78Min: 68 / Avg: 73.54 / Max: 75Min: 66 / Avg: 72.64 / Max: 74Min: 59 / Avg: 68.82 / Max: 70Min: 64 / Avg: 67.32 / Max: 68Min: 56 / Avg: 65.72 / Max: 67Min: 61 / Avg: 65.19 / Max: 66Min: 52 / Avg: 57.72 / Max: 59Min: 45 / Avg: 48.75 / Max: 49

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1080GTX 1080 TiGTX 1070GTX 1070 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2040608010051.3852.0857.8460.1671.5972.8282.6684.7489.2191.17

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080 TiRTX 2080RTX 2070RTX 2060GTX 1070GTX 1080GTX 1070 TiGTX 106060120180240300Min: 50.6 / Avg: 334.61 / Max: 351.7Min: 47.7 / Avg: 320.7 / Max: 332Min: 161.6 / Avg: 242.69 / Max: 248.1Min: 48.9 / Avg: 235.08 / Max: 244.1Min: 46 / Avg: 225.28 / Max: 233.2Min: 44 / Avg: 192.74 / Max: 197.4Min: 82.6 / Avg: 175.54 / Max: 179.2Min: 89.3 / Avg: 173.79 / Max: 175.8Min: 47.6 / Avg: 146.95 / Max: 149.7Min: 111.3 / Avg: 143.3 / Max: 145.4

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorRTX 2080GTX 1080 TiTITAN RTXRTX 2080 TiRTX 2070GTX 1070GTX 1080RTX 2060GTX 1060GTX 1070 Ti1530456075Min: 48 / Avg: 75.04 / Max: 79Min: 56 / Avg: 72.55 / Max: 75Min: 44 / Avg: 71.96 / Max: 78Min: 49 / Avg: 69.88 / Max: 74Min: 47 / Avg: 67.5 / Max: 72Min: 54 / Avg: 67.07 / Max: 70Min: 49 / Avg: 64.55 / Max: 67Min: 49 / Avg: 63.65 / Max: 67Min: 43 / Avg: 58.4 / Max: 61Min: 41 / Avg: 48.91 / Max: 51

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1080GTX 1060GTX 1080 TiGTX 1070RTX 2060GTX 1070 TiRTX 2080RTX 2070RTX 2080 TiTITAN RTX30609012015079.4285.5389.3498.48110.80114.16123.99131.09133.42137.54

NAMD CUDA

OpenBenchmarking.orgWatts, Fewer Is BetterNAMD CUDA 2.13System Power Consumption MonitorGTX 1080 TiRTX 2080TITAN RTXRTX 2070RTX 2080 TiRTX 2060GTX 1070GTX 1060GTX 1070 TiGTX 108070140210280350Min: 48.2 / Avg: 283.67 / Max: 370.3Min: 173.3 / Avg: 271.87 / Max: 334.5Min: 53 / Avg: 269.25 / Max: 360.8Min: 45.7 / Avg: 247.69 / Max: 317.6Min: 47.8 / Avg: 245.81 / Max: 362.6Min: 45.9 / Avg: 231.36 / Max: 300.1Min: 81.5 / Avg: 223.92 / Max: 280.5Min: 42.7 / Avg: 204.58 / Max: 263.4Min: 49.2 / Avg: 202 / Max: 255.8Min: 92.5 / Avg: 192.92 / Max: 318.2

OpenBenchmarking.orgCelsius, Fewer Is BetterNAMD CUDA 2.13GPU Temperature MonitorGTX 1080 TiRTX 2080GTX 1070GTX 1060GTX 1080RTX 2070RTX 2060TITAN RTXRTX 2080 TiGTX 1070 Ti1428425670Min: 62 / Avg: 66.71 / Max: 70Min: 61 / Avg: 65.38 / Max: 70Min: 60 / Avg: 64 / Max: 67Min: 52 / Avg: 59.2 / Max: 63Min: 51 / Avg: 58.08 / Max: 63Min: 51 / Avg: 55.71 / Max: 60Min: 49 / Avg: 55.63 / Max: 61Min: 51 / Avg: 55.29 / Max: 58Min: 51 / Avg: 54.43 / Max: 57Min: 45 / Avg: 48.14 / Max: 50

PlaidML

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiTITAN RTXRTX 2080 TiRTX 2080RTX 2070GTX 1080RTX 2060GTX 1070GTX 1070 TiGTX 106060120180240300Min: 127.7 / Avg: 231.45 / Max: 296.9Min: 55.4 / Avg: 223.68 / Max: 345.8Min: 47.6 / Avg: 212.98 / Max: 331.1Min: 50.9 / Avg: 188.45 / Max: 269.3Min: 68.7 / Avg: 183.65 / Max: 245.5Min: 45.1 / Avg: 180.6 / Max: 225Min: 44 / Avg: 167.51 / Max: 215.1Min: 43.1 / Avg: 162.78 / Max: 197.9Min: 47.9 / Avg: 146.58 / Max: 171.7Min: 72.7 / Avg: 144.61 / Max: 170.9

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiGTX 1080RTX 2080GTX 1070GTX 1060RTX 2060TITAN RTXRTX 2070RTX 2080 TiGTX 1070 Ti1428425670Min: 61 / Avg: 65.7 / Max: 70Min: 59 / Avg: 64.36 / Max: 68Min: 56 / Avg: 63.5 / Max: 70Min: 56 / Avg: 62.15 / Max: 66Min: 52 / Avg: 58.35 / Max: 62Min: 46 / Avg: 55.73 / Max: 61Min: 50 / Avg: 55.25 / Max: 61Min: 47 / Avg: 54.45 / Max: 60Min: 49 / Avg: 54.38 / Max: 59Min: 43 / Avg: 47.69 / Max: 50

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1080GTX 1080 TiGTX 1070 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.29030.58060.87091.16121.45150.650.790.790.820.890.901.031.081.281.29

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorRTX 2080 TiGTX 1080 TiTITAN RTXGTX 1080GTX 1070GTX 1060RTX 2080GTX 1070 TiRTX 2070RTX 206050100150200250Min: 48.3 / Avg: 198.43 / Max: 287.4Min: 50.4 / Avg: 194.3 / Max: 254.7Min: 52.5 / Avg: 178.13 / Max: 298.5Min: 115.7 / Avg: 176.49 / Max: 201.1Min: 43.1 / Avg: 156.77 / Max: 185.9Min: 71.5 / Avg: 140.59 / Max: 158.7Min: 47.4 / Avg: 135.75 / Max: 228.3Min: 48.9 / Avg: 127.46 / Max: 146.6Min: 46.5 / Avg: 126.45 / Max: 198.6Min: 45.1 / Avg: 125.88 / Max: 192.6

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiGTX 1080RTX 2080GTX 1070GTX 1060TITAN RTXRTX 2070RTX 2060RTX 2080 TiGTX 1070 Ti1326395265Min: 59 / Avg: 63 / Max: 66Min: 58 / Avg: 62 / Max: 65Min: 58 / Avg: 61.5 / Max: 64Min: 55 / Avg: 60 / Max: 64Min: 52 / Avg: 56.5 / Max: 59Min: 51 / Avg: 54 / Max: 58Min: 51 / Avg: 53.25 / Max: 55Min: 46 / Avg: 51.8 / Max: 55Min: 50 / Avg: 51.33 / Max: 53Min: 43 / Avg: 45.83 / Max: 48

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1080GTX 1080 TiGTX 1070 TiRTX 2060RTX 2070RTX 2080 TiRTX 2080TITAN RTX0.77181.54362.31543.08723.8591.281.461.471.701.762.763.023.053.373.43

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorTITAN RTXGTX 1080 TiRTX 2080 TiRTX 2080RTX 2070GTX 1080GTX 1070GTX 1070 TiRTX 2060GTX 106060120180240300Min: 102.3 / Avg: 194.3 / Max: 332.7Min: 61.2 / Avg: 185.7 / Max: 280.5Min: 100.3 / Avg: 172.57 / Max: 323.3Min: 48.4 / Avg: 158.95 / Max: 257.5Min: 49.8 / Avg: 153.92 / Max: 242.3Min: 91.9 / Avg: 152.34 / Max: 225.3Min: 45.2 / Avg: 145.44 / Max: 201.2Min: 48.6 / Avg: 134.71 / Max: 168.9Min: 65 / Avg: 134.41 / Max: 208.9Min: 71.4 / Avg: 121.65 / Max: 172.6

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiGTX 1080RTX 2080GTX 1070TITAN RTXGTX 1060RTX 2070RTX 2080 TiRTX 2060GTX 1070 Ti1326395265Min: 61 / Avg: 63.11 / Max: 66Min: 60 / Avg: 62.5 / Max: 65Min: 59 / Avg: 62.38 / Max: 66Min: 56 / Avg: 59.9 / Max: 64Min: 52 / Avg: 54.86 / Max: 58Min: 50 / Avg: 54.8 / Max: 58Min: 50 / Avg: 53.44 / Max: 58Min: 51 / Avg: 53.29 / Max: 57Min: 46 / Avg: 50.11 / Max: 56Min: 44 / Avg: 46.22 / Max: 48

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1080 TiGTX 1080GTX 1070 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti0.69981.39962.09942.79923.4991.591.771.821.921.942.302.442.512.863.11

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiTITAN RTXRTX 2080 TiGTX 1080RTX 2080RTX 2070RTX 2060GTX 1070GTX 1060GTX 1070 Ti50100150200250Min: 123.8 / Avg: 205.47 / Max: 275.7Min: 63.8 / Avg: 192.88 / Max: 301.5Min: 49.3 / Avg: 185.12 / Max: 298.3Min: 91.6 / Avg: 167.52 / Max: 214.9Min: 50.1 / Avg: 167.47 / Max: 240.9Min: 47.5 / Avg: 159.79 / Max: 224.3Min: 45.2 / Avg: 158.66 / Max: 201.6Min: 85.2 / Avg: 157.11 / Max: 185.1Min: 70.6 / Avg: 141.88 / Max: 164.5Min: 50 / Avg: 141.65 / Max: 161.4

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiGTX 1080RTX 2080GTX 1070TITAN RTXRTX 2080 TiRTX 2070GTX 1060RTX 2060GTX 1070 Ti1326395265Min: 60 / Avg: 64.36 / Max: 69Min: 58 / Avg: 63.21 / Max: 68Min: 57 / Avg: 63.18 / Max: 68Min: 58 / Avg: 61.79 / Max: 65Min: 51 / Avg: 55.27 / Max: 60Min: 52 / Avg: 54.78 / Max: 59Min: 49 / Avg: 54.67 / Max: 60Min: 44 / Avg: 52.44 / Max: 59Min: 40 / Avg: 48.69 / Max: 56Min: 43 / Avg: 47.13 / Max: 50

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1080GTX 1080 TiGTX 1070 TiRTX 2060RTX 2070RTX 2080TITAN RTXRTX 2080 Ti0.27230.54460.81691.08921.36150.610.740.790.810.830.840.981.031.191.21

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorTITAN RTXRTX 2080 TiGTX 1080GTX 1080 TiRTX 2070GTX 1070RTX 2080RTX 2060GTX 1070 TiGTX 106050100150200250Min: 54 / Avg: 200.17 / Max: 274.2Min: 48.2 / Avg: 152.87 / Max: 274.6Min: 44.3 / Avg: 147.75 / Max: 195.1Min: 53.3 / Avg: 125.73 / Max: 165.2Min: 115.1 / Avg: 115.83 / Max: 116.5Min: 45.9 / Avg: 107.3 / Max: 155.1Min: 52 / Avg: 97.93 / Max: 121Min: 43.1 / Avg: 95.9 / Max: 118.8Min: 49.7 / Avg: 93.67 / Max: 125.7Min: 41.8 / Avg: 76.7 / Max: 94.4

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorRTX 2080GTX 1080 TiGTX 1080GTX 1070RTX 2080 TiTITAN RTXRTX 2070GTX 1070 TiGTX 1060RTX 20601326395265Min: 66 / Avg: 66 / Max: 66Min: 63 / Avg: 64 / Max: 65Min: 62 / Avg: 62 / Max: 62Min: 59 / Avg: 60.33 / Max: 61Min: 54 / Avg: 55.67 / Max: 58Min: 55 / Avg: 55 / Max: 55Min: 55 / Avg: 55 / Max: 55Min: 48 / Avg: 48 / Max: 48Min: 41 / Avg: 45.25 / Max: 47Min: 39 / Avg: 44.25 / Max: 48

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1080TITAN RTXGTX 1080 TiGTX 1070GTX 1060GTX 1070 TiRTX 2080 TiRTX 2070RTX 2060RTX 208036912155.426.677.027.127.728.068.458.709.0010.67

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiRTX 2080 TiTITAN RTXGTX 1080RTX 2080GTX 1070RTX 2070GTX 1060GTX 1070 TiRTX 20604080120160200Min: 50.4 / Avg: 165.05 / Max: 214.8Min: 48.6 / Avg: 159.87 / Max: 223.7Min: 57.5 / Avg: 152.07 / Max: 231.9Min: 90.8 / Avg: 146.71 / Max: 176.8Min: 46.4 / Avg: 138.03 / Max: 185.4Min: 82.2 / Avg: 136.83 / Max: 164.9Min: 47.1 / Avg: 127.27 / Max: 169.5Min: 70.1 / Avg: 124.21 / Max: 145Min: 104.8 / Avg: 123.57 / Max: 135.6Min: 44.3 / Avg: 116.14 / Max: 162.7

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiGTX 1080GTX 1070RTX 2080TITAN RTXRTX 2080 TiGTX 1060RTX 2070RTX 2060GTX 1070 Ti1224364860Min: 59 / Avg: 61.75 / Max: 63Min: 53 / Avg: 57.58 / Max: 61Min: 51 / Avg: 55.42 / Max: 59Min: 50 / Avg: 54.64 / Max: 59Min: 52 / Avg: 53.8 / Max: 56Min: 52 / Avg: 53.5 / Max: 56Min: 44 / Avg: 49.23 / Max: 54Min: 41 / Avg: 45.18 / Max: 49Min: 40 / Avg: 45 / Max: 50Min: 39 / Avg: 42.75 / Max: 45

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1080GTX 1080 TiGTX 1070 TiRTX 2080 TiRTX 2070RTX 2080TITAN RTXRTX 20600.36450.7291.09351.4581.82251.081.141.141.141.241.541.561.561.601.62

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorTITAN RTXGTX 1080RTX 2080 TiRTX 2080GTX 1070GTX 1080 TiRTX 2070RTX 2060GTX 1060GTX 1070 Ti4080120160200Min: 56 / Avg: 163.54 / Max: 245.3Min: 91.7 / Avg: 152.84 / Max: 169.4Min: 48.4 / Avg: 137.54 / Max: 235.7Min: 48 / Avg: 132.2 / Max: 198.2Min: 48.4 / Avg: 126.46 / Max: 165Min: 47.2 / Avg: 121.72 / Max: 213.3Min: 49.9 / Avg: 120.6 / Max: 185.8Min: 47.3 / Avg: 110.84 / Max: 168.5Min: 43.2 / Avg: 99.8 / Max: 140.7Min: 48.1 / Avg: 99.69 / Max: 140.6

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080RTX 2080GTX 1070GTX 1080 TiRTX 2060RTX 2080 TiTITAN RTXRTX 2070GTX 1060GTX 1070 Ti1326395265Min: 60 / Avg: 61.64 / Max: 63Min: 52 / Avg: 58.56 / Max: 66Min: 51 / Avg: 56.59 / Max: 60Min: 44 / Avg: 53.18 / Max: 61Min: 43 / Avg: 48.84 / Max: 55Min: 44 / Avg: 48.59 / Max: 53Min: 43 / Avg: 48.07 / Max: 53Min: 42 / Avg: 47.65 / Max: 53Min: 41 / Avg: 47.52 / Max: 54Min: 38 / Avg: 42.62 / Max: 46

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1080GTX 1070GTX 1060TITAN RTXRTX 2080RTX 2060GTX 1070 TiGTX 1080 TiRTX 2070RTX 2080 Ti0.25430.50860.76291.01721.27150.670.850.870.981.011.021.031.031.091.13

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorRTX 2080 TiGTX 1080 TiTITAN RTXGTX 1080RTX 2080RTX 2070RTX 2060GTX 1070GTX 1060GTX 1070 Ti60120180240300Min: 47.7 / Avg: 180.78 / Max: 312Min: 65.2 / Avg: 179.46 / Max: 287.9Min: 58.3 / Avg: 176.01 / Max: 322.5Min: 70.3 / Avg: 172.91 / Max: 216.6Min: 49.8 / Avg: 158.89 / Max: 252.8Min: 47.2 / Avg: 141.31 / Max: 234Min: 62.3 / Avg: 137.76 / Max: 208Min: 43.1 / Avg: 136.92 / Max: 191.2Min: 58.2 / Avg: 131.14 / Max: 167.2Min: 48.4 / Avg: 125.32 / Max: 169.8

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1080 TiRTX 2080GTX 1080GTX 1070GTX 1060RTX 2080 TiRTX 2060RTX 2070TITAN RTXGTX 1070 Ti1326395265Min: 53 / Avg: 60.07 / Max: 67Min: 53 / Avg: 59.79 / Max: 68Min: 52 / Avg: 58.85 / Max: 65Min: 49 / Avg: 56.06 / Max: 63Min: 43 / Avg: 51.9 / Max: 59Min: 46 / Avg: 51 / Max: 57Min: 43 / Avg: 50.75 / Max: 59Min: 42 / Avg: 48.87 / Max: 56Min: 42 / Avg: 47.33 / Max: 54Min: 41 / Avg: 45.18 / Max: 50

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1080GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2080RTX 2070RTX 2080 TiTITAN RTX0.33750.6751.01251.351.68750.720.800.921.021.081.081.211.271.381.50

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiTITAN RTXGTX 1080RTX 2080 TiRTX 2080RTX 2060GTX 1070RTX 2070GTX 1060GTX 1070 Ti50100150200250Min: 122.3 / Avg: 187.68 / Max: 270.3Min: 52.6 / Avg: 162.76 / Max: 301.7Min: 91.6 / Avg: 162.7 / Max: 211.1Min: 48.4 / Avg: 155.12 / Max: 292.5Min: 47.4 / Avg: 148.19 / Max: 240.2Min: 86.5 / Avg: 143.68 / Max: 202.9Min: 83.8 / Avg: 134.85 / Max: 189.4Min: 78.4 / Avg: 132.98 / Max: 222.4Min: 60.4 / Avg: 122.52 / Max: 164.5Min: 48.5 / Avg: 113.89 / Max: 160.8

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorRTX 2080GTX 1080 TiGTX 1080GTX 1070RTX 2080 TiGTX 1060RTX 2060RTX 2070TITAN RTXGTX 1070 Ti1326395265Min: 58 / Avg: 61.6 / Max: 65Min: 59 / Avg: 61.1 / Max: 64Min: 58 / Avg: 60 / Max: 63Min: 54 / Avg: 57.18 / Max: 61Min: 49 / Avg: 51.2 / Max: 55Min: 47 / Avg: 50.92 / Max: 55Min: 46 / Avg: 50.9 / Max: 56Min: 46 / Avg: 48.78 / Max: 53Min: 44 / Avg: 46.33 / Max: 52Min: 43 / Avg: 45.8 / Max: 48

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1080GTX 1070GTX 1080 TiRTX 2060GTX 1070 TiRTX 2080RTX 2070TITAN RTXRTX 2080 Ti0.66381.32761.99142.65523.3191.491.621.761.781.962.082.392.512.922.95

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiRTX 2080 TiTITAN RTXGTX 1080RTX 2080GTX 1070RTX 2070RTX 2060GTX 1060GTX 1070 Ti50100150200250Min: 120.7 / Avg: 202.56 / Max: 256.6Min: 48.4 / Avg: 188.73 / Max: 275.6Min: 51.5 / Avg: 168.92 / Max: 275.4Min: 89.5 / Avg: 168.75 / Max: 202.8Min: 47.5 / Avg: 155.97 / Max: 224.6Min: 81.8 / Avg: 153.87 / Max: 183Min: 45.8 / Avg: 146.81 / Max: 206.6Min: 45 / Avg: 144.95 / Max: 190.7Min: 87.5 / Avg: 132.19 / Max: 156.7Min: 56.1 / Avg: 132.15 / Max: 155

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorRTX 2080GTX 1080 TiGTX 1080GTX 1070RTX 2080 TiGTX 1060RTX 2060RTX 2070GTX 1070 TiTITAN RTX1326395265Min: 56 / Avg: 61.08 / Max: 66Min: 55 / Avg: 59.71 / Max: 66Min: 51 / Avg: 57.64 / Max: 63Min: 48 / Avg: 54.81 / Max: 60Min: 49 / Avg: 52.45 / Max: 57Min: 45 / Avg: 52 / Max: 57Min: 43 / Avg: 49.73 / Max: 56Min: 42 / Avg: 48.08 / Max: 55Min: 43 / Avg: 45.94 / Max: 49Min: 39 / Avg: 44.46 / Max: 52

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1080GTX 1080 TiGTX 1070 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.2880.5760.8641.1521.440.670.760.780.820.900.921.071.081.121.28

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1080 TiRTX 2080RTX 2060TITAN RTXGTX 1060GTX 1070 TiRTX 2080 TiRTX 2070GTX 1080GTX 10704080120160200Min: 91.6 / Avg: 149.17 / Max: 203.2Min: 56.3 / Avg: 130.87 / Max: 184.7Min: 45.5 / Avg: 121.65 / Max: 166.1Min: 52.9 / Avg: 110.8 / Max: 140.2Min: 42.1 / Avg: 107.84 / Max: 141.6Min: 48.7 / Avg: 98.18 / Max: 132.6Min: 48.4 / Avg: 94.48 / Max: 113.3Min: 46 / Avg: 93.68 / Max: 129.3Min: 43.7 / Avg: 93.48 / Max: 127.3Min: 42.8 / Avg: 91.98 / Max: 117.4

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorRTX 2080GTX 1080 TiRTX 2080 TiGTX 1060GTX 1080GTX 1070RTX 2060GTX 1070 TiRTX 2070TITAN RTX1224364860Min: 61 / Avg: 61.67 / Max: 62Min: 56 / Avg: 56.67 / Max: 57Min: 52 / Avg: 52.5 / Max: 53Min: 49 / Avg: 50 / Max: 51Min: 47 / Avg: 49.75 / Max: 51Min: 47 / Avg: 49.5 / Max: 51Min: 46 / Avg: 47.75 / Max: 50Min: 46 / Avg: 47 / Max: 48Min: 45 / Avg: 46.75 / Max: 48Min: 41 / Avg: 41.5 / Max: 42

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1080 TiRTX 2060GTX 1070 TiRTX 2080GTX 1070GTX 1080TITAN RTXRTX 2070RTX 2080 Ti36912155.125.196.416.576.827.227.348.828.8710.28

LeelaChessZero

OpenBenchmarking.orgWatts, Fewer Is BetterLeelaChessZero 0.20.1System Power Consumption MonitorGTX 1080 TiRTX 2080RTX 2080 TiRTX 2060RTX 2070GTX 1070GTX 1070 TiGTX 1060TITAN RTXGTX 108060120180240300Min: 96.9 / Avg: 265.17 / Max: 323.1Min: 88.3 / Avg: 222.99 / Max: 288.3Min: 48.2 / Avg: 205.76 / Max: 349.4Min: 69.4 / Avg: 196.01 / Max: 230Min: 75.1 / Avg: 186.61 / Max: 251.7Min: 42.6 / Avg: 184.04 / Max: 213.3Min: 80 / Avg: 156.39 / Max: 189.7Min: 41.9 / Avg: 150.59 / Max: 180Min: 97.7 / Avg: 148.16 / Max: 357.7Min: 40.2 / Avg: 148.14 / Max: 249.5

OpenBenchmarking.orgCelsius, Fewer Is BetterLeelaChessZero 0.20.1GPU Temperature MonitorRTX 2080RTX 2080 TiGTX 1060GTX 1080 TiRTX 2060GTX 1070 TiGTX 1070RTX 2070GTX 1080TITAN RTX1428425670Min: 57 / Avg: 66.29 / Max: 73Min: 50 / Avg: 55.6 / Max: 60Min: 38 / Avg: 51.69 / Max: 59Min: 41 / Avg: 51.43 / Max: 61Min: 39 / Avg: 50.33 / Max: 58Min: 46 / Avg: 50.13 / Max: 53Min: 35 / Avg: 46.2 / Max: 55Min: 33 / Avg: 44.86 / Max: 52Min: 32 / Avg: 40.8 / Max: 58Min: 34 / Avg: 36.91 / Max: 47

OpenBenchmarking.orgNodes Per Second Per Watt, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060RTX 2060GTX 1070GTX 1080 TiRTX 2070GTX 1070 TiRTX 2080GTX 1080RTX 2080 TiTITAN RTX5101520255.977.577.778.809.279.4510.1311.8115.1022.11

117 Results Shown

SHOC Scalable HeterOgeneous Computing
OctaneBench
LuxMark:
  GPU - Microphone
  GPU - Hotel
  GPU - Luxball HDR
Chaos Group V-RAY
Rodinia
PlaidML:
  No - Training - Mobilenet - OpenCL
  Yes - Inference - VGG16 - OpenCL
clpeak
PlaidML:
  Yes - Inference - Inception V3 - OpenCL
  No - Inference - Inception V3 - OpenCL
  No - Training - IMDB LSTM - OpenCL
SHOC Scalable HeterOgeneous Computing
PlaidML:
  No - Inference - VGG16 - OpenCL
  Yes - Inference - ResNet 50 - OpenCL
LeelaChessZero
NAMD CUDA
PlaidML
clpeak:
  Transfer Bandwidth enqueueReadBuffer
  Transfer Bandwidth enqueueWriteBuffer
PlaidML
Rodinia
JuliaGPU
cl-mem:
  Read
  Copy
  Write
PlaidML
Darktable
PlaidML
Darktable:
  Boat - OpenCL
  Server Room - OpenCL
SHOC Scalable HeterOgeneous Computing
clpeak
SHOC Scalable HeterOgeneous Computing
clpeak
SHOC Scalable HeterOgeneous Computing
clpeak
Darktable
clpeak
Chaos Group V-RAY:
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  Total Score
  Phoronix Test Suite System Monitoring
  Phoronix Test Suite System Monitoring
  Performance Per Watt
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL - Texture Read Bandwidth
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL - Max SP Flops
  System Power Consumption Monitor
  GPU Temp Monitor
  Copy
  System Power Consumption Monitor
  GPU Temp Monitor
  Write
  System Power Consumption Monitor
  GPU Temp Monitor
  Read
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Hotel
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Microphone
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Luxball HDR
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - VGG16 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - IMDB LSTM - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - ResNet 50 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - Inception V3 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Training - IMDB LSTM - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Training - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - VGG16 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - ResNet 50 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - Inception V3 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL