NVIDIA Linux GPU Compute

NVIDIA Linux GPU computing benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1901274-SP-NVIDIACOM53
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 5 Tests
Creator Workloads 2 Tests
HPC - High Performance Computing 4 Tests
Machine Learning 3 Tests
Multi-Core 2 Tests
NVIDIA GPU Compute 9 Tests
OpenCL 7 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GTX 1060
January 27 2019
  1 Hour, 42 Minutes
GTX 1070
January 25 2019
  1 Hour, 35 Minutes
GTX 1070 Ti
January 25 2019
  1 Hour, 39 Minutes
GTX 1080
January 25 2019
  1 Hour, 24 Minutes
GTX 1080 Ti
January 25 2019
  1 Hour, 31 Minutes
RTX 2060
January 27 2019
  1 Hour, 40 Minutes
RTX 2070
January 27 2019
  1 Hour, 39 Minutes
RTX 2080
January 26 2019
  1 Hour, 38 Minutes
RTX 2080 Ti
January 26 2019
  1 Hour, 34 Minutes
TITAN RTX
January 26 2019
  1 Hour, 35 Minutes
Invert Hiding All Results Option
  1 Hour, 36 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA Linux GPU ComputeOpenBenchmarking.orgPhoronix Test SuiteIntel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads)ASUS PRIME Z390-A (0602 BIOS)Intel Cannon Lake PCH Shared SRAM16384MBSamsung SSD 970 EVO 250GB + 2000GB SABRENTNVIDIA GeForce GTX 1060 6GB (1506/4006MHz)NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz)NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11GB (1480/5508MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TITAN RTX 24GB (1350/7000MHz)Realtek ALC1220Acer B286HKIntel I219-VUbuntu 18.104.20.3-042003-generic (x86_64)GNOME Shell 3.30.1X Server 1.20.1NVIDIA 415.274.6.0OpenCL 1.2 CUDA 10.0.1321.1.84GCC 8.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionNVIDIA Linux GPU Compute BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance- GTX 1060: GPU Compute Cores: 1280- GTX 1070: GPU Compute Cores: 1920- GTX 1070 Ti: GPU Compute Cores: 2432- GTX 1080: GPU Compute Cores: 2560- GTX 1080 Ti: GPU Compute Cores: 3584- RTX 2060: GPU Compute Cores: 1920- RTX 2070: GPU Compute Cores: 2304- RTX 2080: GPU Compute Cores: 2944- RTX 2080 Ti: GPU Compute Cores: 4352- TITAN RTX: GPU Compute Cores: 4608- Python 2.7.15+ + Python 3.6.7- __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp

GTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTXResult OverviewPhoronix Test Suite100%173%246%319%391%LuxMarkLeelaChessZerocl-memSHOC Scalable HeterOgeneous ComputingclpeakPlaidMLRodiniaNAMD CUDAJuliaGPUDarktable

GTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTXPer Watt Result OverviewPhoronix Test Suite100%168%235%303%LeelaChessZeroMeta Performance Per Wattcl-memPlaidMLLuxMarkSHOC Scalable HeterOgeneous ComputingJuliaGPUP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

NVIDIA Linux GPU Computeshoc: OpenCL - Max SP Flopsoctanebench: Total Scoreluxmark: GPU - Microphoneluxmark: GPU - Hotelluxmark: GPU - Luxball HDRv-ray: CUDA GPUrodinia: OpenCL Myocyteplaidml: No - Training - Mobilenet - OpenCLplaidml: Yes - Inference - VGG16 - OpenCLclpeak: Double-Precision Doubleplaidml: Yes - Inference - Inception V3 - OpenCLplaidml: No - Inference - Inception V3 - OpenCLplaidml: No - Training - IMDB LSTM - OpenCLshoc: OpenCL - Texture Read Bandwidthplaidml: No - Inference - VGG16 - OpenCLplaidml: Yes - Inference - ResNet 50 - OpenCLlczero: OpenCLnamd-cuda: ATPase Simulation - 327,506 Atomsplaidml: No - Inference - ResNet 50 - OpenCLclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferplaidml: No - Inference - IMDB LSTM - OpenCLrodinia: OpenCL Particle Filterjuliagpu: GPUcl-mem: Readcl-mem: Copycl-mem: Writeplaidml: Yes - Inference - Mobilenet - OpenCLdarktable: Masskrug - OpenCLplaidml: No - Inference - Mobilenet - OpenCLdarktable: Boat - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - FFT SPclpeak: Integer Compute INTshoc: OpenCL - Triadclpeak: Global Memory Bandwidthshoc: OpenCL - MD5 Hashclpeak: Single-Precision Floatdarktable: Server Rack - OpenCLclpeak: Kernel LatencyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX479491.546962266212256129.6635.5486.9494.6215188.8586.1113441194.081838990.3227119411.1512.5418012.001837826261531381395524.175923.661.31297125211.931457.2842390.133.787101133997238721728793.6635.4410712622411711615644712923714300.2422825711.3312.602288.252181720622051871926643.927642.891.10453168112.1919610.4758780.123.6477091411017541671677686.7139.6410312824212011815343313123714780.2302226211.1512.552247.852215480162051871906454.017552.931.16501206812.1919711.6367760.133.929400872238021380335.6610313929713213316752714326417500.2123329211.3412.592596.502391313242292092166863.978012.721.10575239812.3022214.1979340.133.77132382111358456682168266.5636.3112519441516716718759618933423330.2011333911.2912.573304.962651628753373173367743.878832.261.03972326312.5132919.68108530.123.9973401641373447662135693.3131.98113148231134133188102415028214840.2054230911.2912.603478.802516119892962472457793.828632.220.82803670512.4327615.8766300.113.6585292041807461472953266.3932.71132179266157156198107718933317290.1974637611.2512.483827.882656820463923303138303.6810081.930.75988778012.5736518.4279680.103.64109822181976665522914672.7432.47133192343168172216112120335422590.2004440011.1312.524576.282788292383923273238933.7810451.920.811083995212.5736823.74102700.113.65165803042861791844278753.0431.96156250519211225246117327345831070.1917753711.2912.596054.363009222415444544399713.6912911.640.7514451490612.6850535.13153520.103.83173323193042697294602253.5132.72160264540215229244115828947432750.1973555611.1612.586104.393036654475654834899773.6913361.640.7415531484812.6852536.28163860.103.85OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4K8K12K16K20KSE +/- 4.35, N = 3SE +/- 23.18, N = 3SE +/- 1.25, N = 3SE +/- 38.59, N = 3SE +/- 27.16, N = 3SE +/- 38.15, N = 3SE +/- 45.89, N = 3SE +/- 58.57, N = 3SE +/- 89.12, N = 3SE +/- 91.92, N = 3479471017709940013238734085291098216580173321. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3K6K9K12K15KMin: 4789.76 / Avg: 4794.41 / Max: 4803.11Min: 7059.2 / Avg: 7101.2 / Max: 7139.19Min: 7707.02 / Avg: 7709.1 / Max: 7711.34Min: 9358.21 / Avg: 9399.5 / Max: 9476.62Min: 13209.2 / Avg: 13237.6 / Max: 13291.9Min: 7301.84 / Avg: 7340.04 / Max: 7416.33Min: 8482.7 / Avg: 8528.62 / Max: 8620.4Min: 10923.4 / Avg: 10981.97 / Max: 11099.1Min: 16488.7 / Avg: 16579.67 / Max: 16757.9Min: 17240.2 / Avg: 17332.17 / Max: 175161. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00Total ScoreGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX7014021028035091.54133.00141.00211.00164.00204.00218.00304.00319.00

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX7K14K21K28K35KSE +/- 1.53, N = 3SE +/- 1.20, N = 3SE +/- 5.00, N = 3SE +/- 10.84, N = 3SE +/- 5.67, N = 3SE +/- 57.27, N = 3SE +/- 4.62, N = 3SE +/- 24.50, N = 3SE +/- 127.33, N = 3SE +/- 146.65, N = 369629972101758722135841373418074197662861730426
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX5K10K15K20K25KMin: 6959 / Avg: 6962 / Max: 6964Min: 9970 / Avg: 9971.67 / Max: 9974Min: 10165 / Avg: 10175 / Max: 10180Min: 8711 / Avg: 8722.33 / Max: 8744Min: 13573 / Avg: 13584.33 / Max: 13590Min: 13649 / Avg: 13734 / Max: 13843Min: 18066 / Avg: 18074 / Max: 18082Min: 19717 / Avg: 19766 / Max: 19791Min: 28490 / Avg: 28617.33 / Max: 28872Min: 30135 / Avg: 30425.67 / Max: 30605

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2K4K6K8K10KSE +/- 10.90, N = 3SE +/- 4.33, N = 3SE +/- 0.67, N = 3SE +/- 3.18, N = 3SE +/- 29.33, N = 3SE +/- 4.51, N = 3SE +/- 1.00, N = 3SE +/- 12.33, N = 3SE +/- 1.45, N = 3SE +/- 6.69, N = 32662387241673802566847666147655291849729
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2K4K6K8K10KMin: 2641 / Avg: 2661.67 / Max: 2678Min: 3868 / Avg: 3872.33 / Max: 3881Min: 4166 / Avg: 4167.33 / Max: 4168Min: 3796 / Avg: 3802.33 / Max: 3806Min: 5609 / Avg: 5667.67 / Max: 5697Min: 4761 / Avg: 4766 / Max: 4775Min: 6145 / Avg: 6147 / Max: 6148Min: 6527 / Avg: 6551.67 / Max: 6564Min: 9181 / Avg: 9183.67 / Max: 9186Min: 9718 / Avg: 9728.67 / Max: 9741

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX10K20K30K40K50KSE +/- 46.58, N = 3SE +/- 1.33, N = 3SE +/- 14.90, N = 3SE +/- 42.00, N = 3SE +/- 8.33, N = 3SE +/- 31.58, N = 3SE +/- 107.00, N = 3SE +/- 25.12, N = 3SE +/- 155.67, N = 3SE +/- 103.04, N = 312256172871677613803216822135629532291464278746022
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX8K16K24K32K40KMin: 12182 / Avg: 12256 / Max: 12342Min: 17286 / Avg: 17287.33 / Max: 17290Min: 16759 / Avg: 16776.33 / Max: 16806Min: 13761 / Avg: 13803 / Max: 13887Min: 21674 / Avg: 21682.33 / Max: 21699Min: 21297 / Avg: 21356 / Max: 21405Min: 29424 / Avg: 29532 / Max: 29746Min: 29102 / Avg: 29145.67 / Max: 29189Min: 42607 / Avg: 42787 / Max: 43097Min: 45879 / Avg: 46022 / Max: 46222

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX306090120150SE +/- 0.04, N = 3SE +/- 2.94, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 3.00, N = 3129.6693.6686.7166.5693.3166.3972.7453.0453.51
OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX20406080100Min: 129.58 / Avg: 129.66 / Max: 129.72Min: 90.61 / Avg: 93.66 / Max: 99.53Min: 86.69 / Avg: 86.71 / Max: 86.74Min: 66.51 / Avg: 66.56 / Max: 66.6Min: 93.24 / Avg: 93.31 / Max: 93.38Min: 66.37 / Avg: 66.39 / Max: 66.42Min: 72.73 / Avg: 72.74 / Max: 72.76Min: 53 / Avg: 53.04 / Max: 53.1Min: 50.49 / Avg: 53.51 / Max: 59.51

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX918273645SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 335.5435.4439.6435.6636.3131.9832.7132.4731.9632.721. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX816243240Min: 35.41 / Avg: 35.54 / Max: 35.66Min: 35.36 / Avg: 35.44 / Max: 35.54Min: 39.57 / Avg: 39.64 / Max: 39.68Min: 35.37 / Avg: 35.66 / Max: 36.02Min: 36.28 / Avg: 36.31 / Max: 36.35Min: 31.87 / Avg: 31.98 / Max: 32.07Min: 32.66 / Avg: 32.71 / Max: 32.78Min: 32.32 / Avg: 32.47 / Max: 32.54Min: 31.84 / Avg: 31.96 / Max: 32.09Min: 32.52 / Avg: 32.72 / Max: 33.11. (CXX) g++ options: -O2 -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.11, N = 3SE +/- 0.18, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 3SE +/- 0.19, N = 386.94107.00103.00103.00125.00113.00132.00133.00156.00160.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX306090120150Min: 86.89 / Avg: 86.94 / Max: 86.97Min: 107.12 / Avg: 107.4 / Max: 107.66Min: 102.73 / Avg: 102.78 / Max: 102.84Min: 102.56 / Avg: 102.63 / Max: 102.77Min: 125.21 / Avg: 125.45 / Max: 125.8Min: 112.47 / Avg: 112.61 / Max: 112.83Min: 131.66 / Avg: 131.97 / Max: 132.29Min: 132.99 / Avg: 133.25 / Max: 133.68Min: 155.76 / Avg: 155.89 / Max: 156.08Min: 159.74 / Avg: 160.07 / Max: 160.41

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.32, N = 3SE +/- 0.15, N = 3SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.20, N = 3SE +/- 0.43, N = 394.62126.00128.00139.00194.00148.00179.00192.00250.00264.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 94.58 / Avg: 94.62 / Max: 94.68Min: 126.33 / Avg: 126.49 / Max: 126.72Min: 128.34 / Avg: 128.4 / Max: 128.51Min: 138.95 / Avg: 139.09 / Max: 139.25Min: 194.05 / Avg: 194.4 / Max: 195.04Min: 147.98 / Avg: 148.21 / Max: 148.51Min: 178.77 / Avg: 179.11 / Max: 179.62Min: 191.55 / Avg: 192.06 / Max: 192.65Min: 249.48 / Avg: 249.7 / Max: 250.09Min: 263.47 / Avg: 264.04 / Max: 264.9

clpeak

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX120240360480600SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.24, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.60, N = 3SE +/- 0.72, N = 3SE +/- 0.98, N = 3SE +/- 1.25, N = 3SE +/- 1.29, N = 31512242422974152312663435195401. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500Min: 150.56 / Avg: 150.64 / Max: 150.74Min: 223.25 / Avg: 223.76 / Max: 224.03Min: 241.36 / Avg: 241.64 / Max: 242.12Min: 297.37 / Avg: 297.4 / Max: 297.42Min: 414.58 / Avg: 414.85 / Max: 415.04Min: 229.66 / Avg: 230.86 / Max: 231.47Min: 265.58 / Avg: 266.33 / Max: 267.77Min: 341.37 / Avg: 342.53 / Max: 344.48Min: 516.26 / Avg: 518.58 / Max: 520.56Min: 537.59 / Avg: 539.61 / Max: 542.011. (CXX) g++ options: -O3 -rdynamic -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.30, N = 3SE +/- 0.07, N = 3SE +/- 0.58, N = 3SE +/- 0.44, N = 3SE +/- 0.18, N = 3SE +/- 0.40, N = 3SE +/- 0.48, N = 3SE +/- 1.30, N = 388.85117.00120.00132.00167.00134.00157.00168.00211.00215.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 88.76 / Avg: 88.85 / Max: 88.92Min: 117.31 / Avg: 117.42 / Max: 117.56Min: 119.2 / Avg: 119.54 / Max: 120.14Min: 131.77 / Avg: 131.91 / Max: 132Min: 165.9 / Avg: 166.71 / Max: 167.85Min: 133.38 / Avg: 133.96 / Max: 134.81Min: 157.1 / Avg: 157.33 / Max: 157.7Min: 167.9 / Avg: 168.35 / Max: 169.14Min: 210.59 / Avg: 211.32 / Max: 212.24Min: 213.66 / Avg: 215.41 / Max: 217.95

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.38, N = 3SE +/- 0.53, N = 386.11116.00118.00133.00167.00133.00156.00172.00225.00229.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 86.05 / Avg: 86.11 / Max: 86.17Min: 115.89 / Avg: 115.99 / Max: 116.12Min: 117.21 / Avg: 117.56 / Max: 117.8Min: 132.58 / Avg: 132.67 / Max: 132.83Min: 166.55 / Avg: 166.93 / Max: 167.49Min: 133.25 / Avg: 133.38 / Max: 133.48Min: 156.39 / Avg: 156.41 / Max: 156.45Min: 171.98 / Avg: 172.21 / Max: 172.43Min: 223.99 / Avg: 224.61 / Max: 225.31Min: 227.82 / Avg: 228.71 / Max: 229.66

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250SE +/- 0.17, N = 3SE +/- 0.50, N = 3SE +/- 0.03, N = 3SE +/- 0.60, N = 3SE +/- 0.49, N = 3SE +/- 0.38, N = 3SE +/- 0.82, N = 3SE +/- 0.27, N = 3SE +/- 0.72, N = 3SE +/- 0.96, N = 3134156153167187188198216246244
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 133.51 / Avg: 133.81 / Max: 134.09Min: 155.61 / Avg: 156.25 / Max: 157.23Min: 153.07 / Avg: 153.11 / Max: 153.17Min: 165.62 / Avg: 166.82 / Max: 167.47Min: 186.6 / Avg: 187.44 / Max: 188.3Min: 187.38 / Avg: 188.13 / Max: 188.58Min: 196.74 / Avg: 198.38 / Max: 199.3Min: 215.25 / Avg: 215.62 / Max: 216.16Min: 244.31 / Avg: 245.51 / Max: 246.8Min: 242.37 / Avg: 243.94 / Max: 245.69

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500SE +/- 1.03, N = 3SE +/- 0.82, N = 3SE +/- 0.80, N = 3SE +/- 0.28, N = 3SE +/- 1.91, N = 3SE +/- 0.24, N = 3SE +/- 1.24, N = 3SE +/- 2.94, N = 3SE +/- 1.02, N = 3SE +/- 3.52, N = 3411447433527596102410771121117311581. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000Min: 410.11 / Avg: 411.15 / Max: 413.22Min: 445.5 / Avg: 447.09 / Max: 448.19Min: 431.53 / Avg: 433.09 / Max: 434.17Min: 526.71 / Avg: 527.14 / Max: 527.67Min: 592.44 / Avg: 596.16 / Max: 598.8Min: 1023.47 / Avg: 1023.82 / Max: 1024.27Min: 1074.61 / Avg: 1076.98 / Max: 1078.78Min: 1115.71 / Avg: 1121.4 / Max: 1125.5Min: 1171.44 / Avg: 1172.57 / Max: 1174.6Min: 1151.93 / Avg: 1158.19 / Max: 1164.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.16, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 3SE +/- 0.36, N = 3SE +/- 0.72, N = 394.08129.00131.00143.00189.00150.00189.00203.00273.00289.00
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 94.02 / Avg: 94.08 / Max: 94.14Min: 128.87 / Avg: 128.94 / Max: 128.99Min: 130.93 / Avg: 130.95 / Max: 130.97Min: 142.61 / Avg: 142.69 / Max: 142.8Min: 188.71 / Avg: 188.84 / Max: 189.07Min: 150.11 / Avg: 150.43 / Max: 150.64Min: 189.1 / Avg: 189.39 / Max: 189.94Min: 203.19 / Avg: 203.39 / Max: 203.73Min: 272.43 / Avg: 272.89 / Max: 273.61Min: 288.45 / Avg: 289.19 / Max: 290.62

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500SE +/- 0.17, N = 3SE +/- 0.38, N = 3SE +/- 0.16, N = 3SE +/- 0.24, N = 3SE +/- 0.34, N = 3SE +/- 0.26, N = 3SE +/- 0.76, N = 3SE +/- 0.40, N = 3SE +/- 1.03, N = 3SE +/- 1.16, N = 3183237237264334282333354458474
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX80160240320400Min: 182.32 / Avg: 182.6 / Max: 182.92Min: 236.94 / Avg: 237.38 / Max: 238.13Min: 237.19 / Avg: 237.39 / Max: 237.7Min: 263.46 / Avg: 263.87 / Max: 264.3Min: 333.56 / Avg: 334.13 / Max: 334.72Min: 281.23 / Avg: 281.69 / Max: 282.14Min: 331.75 / Avg: 333.11 / Max: 334.37Min: 353.61 / Avg: 354.02 / Max: 354.83Min: 456.67 / Avg: 458.12 / Max: 460.12Min: 472.54 / Avg: 474.44 / Max: 476.54

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX7001400210028003500SE +/- 6.27, N = 3SE +/- 9.63, N = 3SE +/- 13.26, N = 3SE +/- 32.06, N = 3SE +/- 25.14, N = 3SE +/- 13.06, N = 3SE +/- 20.57, N = 3SE +/- 22.18, N = 3SE +/- 20.98, N = 3SE +/- 28.14, N = 38991430147817502333148417292259310732751. (CXX) g++ options: -lpthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX6001200180024003000Min: 888.27 / Avg: 898.96 / Max: 909.99Min: 1412.22 / Avg: 1430.05 / Max: 1445.29Min: 1451.36 / Avg: 1477.54 / Max: 1494.31Min: 1686.07 / Avg: 1749.9 / Max: 1787.12Min: 2285.91 / Avg: 2332.72 / Max: 2372.04Min: 1463.59 / Avg: 1484.44 / Max: 1508.5Min: 1689.75 / Avg: 1729.48 / Max: 1758.6Min: 2235.88 / Avg: 2258.99 / Max: 2303.34Min: 3066.49 / Avg: 3106.95 / Max: 3136.83Min: 3232.74 / Avg: 3275.46 / Max: 3328.551. (CXX) g++ options: -lpthread

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.13ATPase Simulation - 327,506 AtomsGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.07260.14520.21780.29040.363SE +/- 0.00102, N = 3SE +/- 0.00210, N = 3SE +/- 0.00129, N = 3SE +/- 0.00192, N = 3SE +/- 0.00196, N = 3SE +/- 0.00208, N = 3SE +/- 0.00161, N = 3SE +/- 0.00116, N = 3SE +/- 0.00190, N = 3SE +/- 0.00092, N = 30.322710.242280.230220.212330.201130.205420.197460.200440.191770.19735
OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.13ATPase Simulation - 327,506 AtomsGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX12345Min: 0.32 / Avg: 0.32 / Max: 0.32Min: 0.24 / Avg: 0.24 / Max: 0.25Min: 0.23 / Avg: 0.23 / Max: 0.23Min: 0.21 / Avg: 0.21 / Max: 0.22Min: 0.2 / Avg: 0.2 / Max: 0.21Min: 0.2 / Avg: 0.21 / Max: 0.21Min: 0.2 / Avg: 0.2 / Max: 0.2Min: 0.2 / Avg: 0.2 / Max: 0.2Min: 0.19 / Avg: 0.19 / Max: 0.2Min: 0.2 / Avg: 0.2 / Max: 0.2

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX120240360480600SE +/- 0.11, N = 3SE +/- 0.45, N = 3SE +/- 0.24, N = 3SE +/- 0.41, N = 3SE +/- 0.55, N = 3SE +/- 0.26, N = 3SE +/- 0.33, N = 3SE +/- 0.45, N = 3SE +/- 0.31, N = 3SE +/- 1.67, N = 3194257262292339309376400537556
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500Min: 193.36 / Avg: 193.57 / Max: 193.7Min: 256.49 / Avg: 257.03 / Max: 257.93Min: 261.35 / Avg: 261.84 / Max: 262.13Min: 291.32 / Avg: 292.02 / Max: 292.73Min: 338.01 / Avg: 338.9 / Max: 339.9Min: 308.5 / Avg: 309.01 / Max: 309.37Min: 375.3 / Avg: 375.95 / Max: 376.34Min: 398.92 / Avg: 399.51 / Max: 400.4Min: 536.36 / Avg: 536.97 / Max: 537.39Min: 553.7 / Avg: 556.33 / Max: 559.45

clpeak

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 311.1511.3311.1511.3411.2911.2911.2511.1311.2911.161. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215Min: 11.01 / Avg: 11.15 / Max: 11.31Min: 11.32 / Avg: 11.33 / Max: 11.34Min: 11.06 / Avg: 11.15 / Max: 11.24Min: 11.29 / Avg: 11.34 / Max: 11.37Min: 11.27 / Avg: 11.29 / Max: 11.31Min: 11.24 / Avg: 11.29 / Max: 11.34Min: 11.18 / Avg: 11.25 / Max: 11.32Min: 11.03 / Avg: 11.13 / Max: 11.31Min: 11.27 / Avg: 11.29 / Max: 11.31Min: 11.04 / Avg: 11.16 / Max: 11.241. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.5412.6012.5512.5912.5712.6012.4812.5212.5912.581. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX48121620Min: 12.47 / Avg: 12.54 / Max: 12.6Min: 12.54 / Avg: 12.6 / Max: 12.63Min: 12.54 / Avg: 12.55 / Max: 12.56Min: 12.57 / Avg: 12.59 / Max: 12.61Min: 12.55 / Avg: 12.57 / Max: 12.58Min: 12.59 / Avg: 12.6 / Max: 12.61Min: 12.45 / Avg: 12.48 / Max: 12.49Min: 12.47 / Avg: 12.52 / Max: 12.55Min: 12.58 / Avg: 12.59 / Max: 12.61Min: 12.55 / Avg: 12.58 / Max: 12.61. (CXX) g++ options: -O3 -rdynamic -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX130260390520650SE +/- 0.33, N = 3SE +/- 0.21, N = 3SE +/- 0.19, N = 3SE +/- 0.45, N = 3SE +/- 0.59, N = 3SE +/- 0.31, N = 3SE +/- 0.60, N = 3SE +/- 1.09, N = 3SE +/- 2.13, N = 3SE +/- 0.76, N = 3180228224259330347382457605610
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX110220330440550Min: 179.02 / Avg: 179.67 / Max: 180Min: 227.78 / Avg: 228.16 / Max: 228.5Min: 223.9 / Avg: 224.25 / Max: 224.53Min: 257.79 / Avg: 258.64 / Max: 259.34Min: 329.48 / Avg: 330.36 / Max: 331.47Min: 346.94 / Avg: 347.49 / Max: 348.01Min: 380.39 / Avg: 381.58 / Max: 382.33Min: 456.16 / Avg: 457.47 / Max: 459.63Min: 601.13 / Avg: 605.16 / Max: 608.37Min: 608.75 / Avg: 610.22 / Max: 611.33

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 312.008.257.856.504.968.807.886.284.364.391. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215Min: 11.92 / Avg: 12 / Max: 12.13Min: 8.22 / Avg: 8.25 / Max: 8.26Min: 7.83 / Avg: 7.85 / Max: 7.88Min: 6.44 / Avg: 6.5 / Max: 6.63Min: 4.92 / Avg: 4.96 / Max: 5.01Min: 8.74 / Avg: 8.8 / Max: 8.88Min: 7.77 / Avg: 7.88 / Max: 7.99Min: 6.19 / Avg: 6.28 / Max: 6.39Min: 4.31 / Avg: 4.36 / Max: 4.43Min: 4.3 / Avg: 4.39 / Max: 4.551. (CXX) g++ options: -O2 -lOpenCL

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX70M140M210M280M350MSE +/- 513835.53, N = 3SE +/- 621591.58, N = 3SE +/- 297139.02, N = 3SE +/- 121634.53, N = 3SE +/- 720633.38, N = 3SE +/- 815068.63, N = 3SE +/- 333271.02, N = 3SE +/- 1536969.33, N = 3SE +/- 1417039.09, N = 3SE +/- 908628.37, N = 31837826262181720622215480162391313242651628752516119892656820462788292383009222413036654471. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50M100M150M200M250MMin: 182818085.8 / Avg: 183782626.17 / Max: 184572023.8Min: 217003046.5 / Avg: 218172062.43 / Max: 219122875.7Min: 220996207.2 / Avg: 221548015.6 / Max: 222014983.7Min: 238908355.7 / Avg: 239131324.27 / Max: 239327062Min: 263757984.9 / Avg: 265162874.6 / Max: 266143978.9Min: 250000827.1 / Avg: 251611988.57 / Max: 252632347.5Min: 265132379.1 / Avg: 265682046.23 / Max: 266283389Min: 276811943.5 / Avg: 278829237.77 / Max: 281846545.4Min: 298088601.7 / Avg: 300922240.8 / Max: 302382263.1Min: 301940481.9 / Avg: 303665447.4 / Max: 305023093.81. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX120240360480600SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.27, N = 3SE +/- 0.18, N = 3SE +/- 0.52, N = 3SE +/- 0.15, N = 3SE +/- 1.98, N = 3SE +/- 1.81, N = 3SE +/- 0.50, N = 3SE +/- 0.38, N = 31532052052293372963923925445651. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500Min: 152.5 / Avg: 152.67 / Max: 153Min: 205.1 / Avg: 205.27 / Max: 205.4Min: 204.2 / Avg: 204.73 / Max: 205.1Min: 228.6 / Avg: 228.93 / Max: 229.2Min: 336.4 / Avg: 337.37 / Max: 338.2Min: 296.1 / Avg: 296.3 / Max: 296.6Min: 388.2 / Avg: 392.1 / Max: 394.6Min: 388.5 / Avg: 391.87 / Max: 394.7Min: 543.5 / Avg: 544 / Max: 545Min: 565 / Avg: 565.43 / Max: 566.21. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX100200300400500SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.18, N = 3SE +/- 0.35, N = 3SE +/- 0.23, N = 3SE +/- 0.64, N = 31381871872093172473303274544831. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX90180270360450Min: 138.2 / Avg: 138.4 / Max: 138.7Min: 186.8 / Avg: 186.87 / Max: 187Min: 186.7 / Avg: 187.23 / Max: 187.7Min: 209.3 / Avg: 209.4 / Max: 209.5Min: 317.3 / Avg: 317.47 / Max: 317.8Min: 246.4 / Avg: 246.67 / Max: 247Min: 330.1 / Avg: 330.37 / Max: 330.7Min: 326.3 / Avg: 326.7 / Max: 327.4Min: 453.3 / Avg: 453.7 / Max: 454.1Min: 481.7 / Avg: 482.77 / Max: 483.91. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX110220330440550SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.30, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.95, N = 3SE +/- 0.59, N = 3SE +/- 0.73, N = 3SE +/- 1.92, N = 3SE +/- 2.45, N = 31391921902163362453133234394891. (CC) gcc options: -O2 -flto -lOpenCL
OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX90180270360450Min: 138.9 / Avg: 139.13 / Max: 139.5Min: 191.6 / Avg: 191.67 / Max: 191.8Min: 189.2 / Avg: 189.63 / Max: 190.2Min: 216 / Avg: 216.17 / Max: 216.3Min: 335.5 / Avg: 335.67 / Max: 335.8Min: 243.5 / Avg: 245.37 / Max: 246.6Min: 312.5 / Avg: 313.4 / Max: 314.5Min: 321.7 / Avg: 322.67 / Max: 324.1Min: 435.5 / Avg: 439.33 / Max: 441.3Min: 483.6 / Avg: 488.5 / Max: 491.11. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000SE +/- 0.87, N = 3SE +/- 4.06, N = 3SE +/- 3.15, N = 3SE +/- 5.02, N = 3SE +/- 3.70, N = 3SE +/- 1.27, N = 3SE +/- 1.07, N = 3SE +/- 1.63, N = 3SE +/- 2.70, N = 3SE +/- 3.45, N = 3552664645686774779830893971977
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000Min: 550.45 / Avg: 552.08 / Max: 553.4Min: 656.32 / Avg: 664.31 / Max: 669.55Min: 641.13 / Avg: 645.34 / Max: 651.5Min: 675.83 / Avg: 685.87 / Max: 691.24Min: 767.34 / Avg: 774.41 / Max: 779.85Min: 777.01 / Avg: 779.22 / Max: 781.41Min: 828.43 / Avg: 830.48 / Max: 832.07Min: 890.4 / Avg: 893.16 / Max: 896.06Min: 965.79 / Avg: 971.16 / Max: 974.38Min: 971.07 / Avg: 976.99 / Max: 983.03

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Masskrug - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.93831.87662.81493.75324.6915SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.173.924.013.973.873.823.683.783.693.69
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Masskrug - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX246810Min: 4.16 / Avg: 4.17 / Max: 4.19Min: 3.91 / Avg: 3.92 / Max: 3.93Min: 3.99 / Avg: 4.01 / Max: 4.03Min: 3.97 / Avg: 3.97 / Max: 3.98Min: 3.85 / Avg: 3.87 / Max: 3.89Min: 3.81 / Avg: 3.82 / Max: 3.84Min: 3.65 / Avg: 3.68 / Max: 3.7Min: 3.78 / Avg: 3.78 / Max: 3.79Min: 3.66 / Avg: 3.69 / Max: 3.7Min: 3.67 / Avg: 3.69 / Max: 3.7

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500SE +/- 3.04, N = 3SE +/- 1.02, N = 3SE +/- 0.31, N = 3SE +/- 0.61, N = 3SE +/- 4.51, N = 3SE +/- 1.86, N = 3SE +/- 1.43, N = 3SE +/- 0.31, N = 3SE +/- 1.39, N = 3SE +/- 3.01, N = 35927647558018838631008104512911336
OpenBenchmarking.orgExamples Per Second, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000Min: 586.76 / Avg: 592.47 / Max: 597.15Min: 761.81 / Avg: 763.83 / Max: 765.1Min: 754.57 / Avg: 754.92 / Max: 755.54Min: 800.38 / Avg: 801.45 / Max: 802.5Min: 875.86 / Avg: 882.88 / Max: 891.29Min: 859.19 / Avg: 862.88 / Max: 865.16Min: 1005.23 / Avg: 1007.98 / Max: 1010.04Min: 1044.43 / Avg: 1045.03 / Max: 1045.41Min: 1288.49 / Avg: 1291.1 / Max: 1293.21Min: 1329.56 / Avg: 1335.56 / Max: 1338.83

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Boat - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.82351.6472.47053.2944.1175SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.662.892.932.722.262.221.931.921.641.64
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Boat - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX246810Min: 3.65 / Avg: 3.66 / Max: 3.67Min: 2.88 / Avg: 2.89 / Max: 2.89Min: 2.92 / Avg: 2.93 / Max: 2.94Min: 2.71 / Avg: 2.72 / Max: 2.73Min: 2.25 / Avg: 2.26 / Max: 2.27Min: 2.22 / Avg: 2.22 / Max: 2.23Min: 1.92 / Avg: 1.93 / Max: 1.94Min: 1.92 / Avg: 1.92 / Max: 1.93Min: 1.64 / Avg: 1.64 / Max: 1.65Min: 1.63 / Avg: 1.64 / Max: 1.64

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Room - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.29480.58960.88441.17921.474SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.311.101.161.101.030.820.750.810.750.74
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Room - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX246810Min: 1.3 / Avg: 1.31 / Max: 1.31Min: 1.1 / Avg: 1.1 / Max: 1.1Min: 1.15 / Avg: 1.16 / Max: 1.16Min: 1.1 / Avg: 1.1 / Max: 1.11Min: 1.01 / Avg: 1.03 / Max: 1.06Min: 0.82 / Avg: 0.82 / Max: 0.82Min: 0.75 / Avg: 0.75 / Max: 0.75Min: 0.81 / Avg: 0.81 / Max: 0.81Min: 0.74 / Avg: 0.75 / Max: 0.75Min: 0.73 / Avg: 0.74 / Max: 0.75

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500SE +/- 2.45, N = 3SE +/- 8.74, N = 3SE +/- 1.30, N = 3SE +/- 1.43, N = 3SE +/- 2.56, N = 3SE +/- 10.50, N = 3SE +/- 46.80, N = 3SE +/- 6.71, N = 3SE +/- 11.70, N = 3SE +/- 2.67, N = 32974535015759728039881083144515531. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX30060090012001500Min: 291.88 / Avg: 296.79 / Max: 299.31Min: 443.6 / Avg: 453.16 / Max: 470.62Min: 498.13 / Avg: 500.59 / Max: 502.57Min: 572.34 / Avg: 574.51 / Max: 577.21Min: 968.63 / Avg: 972.1 / Max: 977.1Min: 782.66 / Avg: 802.85 / Max: 817.98Min: 936.99 / Avg: 988.41 / Max: 1081.85Min: 1069.68 / Avg: 1083.05 / Max: 1090.83Min: 1431.39 / Avg: 1444.87 / Max: 1468.18Min: 1548.6 / Avg: 1552.67 / Max: 1557.71. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3K6K9K12K15KSE +/- 38.63, N = 3SE +/- 21.00, N = 3SE +/- 6.17, N = 3SE +/- 25.25, N = 3SE +/- 76.28, N = 3SE +/- 444.77, N = 3SE +/- 498.12, N = 3SE +/- 679.58, N = 3SE +/- 517.75, N = 3SE +/- 1004.15, N = 31252168120682398326367057780995214906148481. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3K6K9K12K15KMin: 1174.97 / Avg: 1252.22 / Max: 1291.03Min: 1641.76 / Avg: 1680.93 / Max: 1713.66Min: 2055.54 / Avg: 2067.8 / Max: 2075.12Min: 2351.95 / Avg: 2398.48 / Max: 2438.72Min: 3114.34 / Avg: 3263.29 / Max: 3366.29Min: 5815.24 / Avg: 6704.55 / Max: 7166.29Min: 6785.83 / Avg: 7780.45 / Max: 8326.84Min: 8593.77 / Avg: 9951.71 / Max: 10680.62Min: 13870.49 / Avg: 14905.92 / Max: 15434.02Min: 12874.24 / Avg: 14847.83 / Max: 16156.551. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.9312.1912.1912.3012.5112.4312.5712.5712.6812.681. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX48121620Min: 11.92 / Avg: 11.93 / Max: 11.95Min: 12.19 / Avg: 12.19 / Max: 12.19Min: 12.19 / Avg: 12.19 / Max: 12.2Min: 12.29 / Avg: 12.3 / Max: 12.3Min: 12.51 / Avg: 12.51 / Max: 12.51Min: 12.43 / Avg: 12.43 / Max: 12.44Min: 12.56 / Avg: 12.57 / Max: 12.58Min: 12.56 / Avg: 12.57 / Max: 12.57Min: 12.68 / Avg: 12.68 / Max: 12.69Min: 12.68 / Avg: 12.68 / Max: 12.691. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX110220330440550SE +/- 0.95, N = 3SE +/- 0.02, N = 3SE +/- 0.22, N = 3SE +/- 0.05, N = 3SE +/- 0.68, N = 3SE +/- 0.19, N = 3SE +/- 2.69, N = 3SE +/- 0.37, N = 3SE +/- 0.48, N = 3SE +/- 1.70, N = 31451961972223292763653685055251. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX90180270360450Min: 143.59 / Avg: 145.49 / Max: 146.46Min: 196.27 / Avg: 196.3 / Max: 196.32Min: 196.62 / Avg: 197.05 / Max: 197.27Min: 221.94 / Avg: 222.03 / Max: 222.08Min: 328.16 / Avg: 329.02 / Max: 330.37Min: 275.65 / Avg: 275.99 / Max: 276.3Min: 359.25 / Avg: 364.64 / Max: 367.34Min: 367.26 / Avg: 367.64 / Max: 368.37Min: 504.5 / Avg: 505.01 / Max: 505.96Min: 522.05 / Avg: 525.42 / Max: 527.521. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.40, N = 3SE +/- 0.35, N = 37.2810.4711.6314.1919.6815.8718.4223.7435.1336.281. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX816243240Min: 7.28 / Avg: 7.28 / Max: 7.28Min: 10.47 / Avg: 10.47 / Max: 10.47Min: 11.63 / Avg: 11.63 / Max: 11.63Min: 14.15 / Avg: 14.19 / Max: 14.22Min: 19.56 / Avg: 19.68 / Max: 19.78Min: 15.86 / Avg: 15.87 / Max: 15.88Min: 18.41 / Avg: 18.42 / Max: 18.43Min: 23.74 / Avg: 23.74 / Max: 23.75Min: 34.34 / Avg: 35.13 / Max: 35.58Min: 35.77 / Avg: 36.28 / Max: 36.951. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4K8K12K16K20KSE +/- 3.56, N = 3SE +/- 393.27, N = 3SE +/- 14.43, N = 3SE +/- 396.02, N = 3SE +/- 782.31, N = 3SE +/- 657.16, N = 3SE +/- 495.99, N = 3SE +/- 732.21, N = 3SE +/- 1065.18, N = 3SE +/- 688.69, N = 3423958786776793410853663079681027015352163861. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX3K6K9K12K15KMin: 4232.57 / Avg: 4239.17 / Max: 4244.77Min: 5091.74 / Avg: 5878.26 / Max: 6276.36Min: 6747.48 / Avg: 6776.31 / Max: 6791.94Min: 7141.47 / Avg: 7933.51 / Max: 8329.77Min: 9288.09 / Avg: 10852.7 / Max: 11635.67Min: 5316.4 / Avg: 6630.38 / Max: 7313.07Min: 6980.09 / Avg: 7968.07 / Max: 8539.2Min: 8806.31 / Avg: 10270.06 / Max: 11040.34Min: 13222.02 / Avg: 15351.89 / Max: 16456.4Min: 15021.53 / Avg: 16386.43 / Max: 17229.121. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Rack - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.02930.05860.08790.11720.1465SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.130.120.130.130.120.110.100.110.100.10
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.4Test: Server Rack - Acceleration: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX12345Min: 0.13 / Avg: 0.13 / Max: 0.13Min: 0.12 / Avg: 0.12 / Max: 0.13Min: 0.13 / Avg: 0.13 / Max: 0.13Min: 0.12 / Avg: 0.13 / Max: 0.13Min: 0.12 / Avg: 0.12 / Max: 0.12Min: 0.11 / Avg: 0.11 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11Min: 0.11 / Avg: 0.11 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11Min: 0.1 / Avg: 0.1 / Max: 0.11

clpeak

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.89781.79562.69343.59124.489SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 33.783.643.923.773.993.653.643.653.833.851. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX246810Min: 3.72 / Avg: 3.78 / Max: 3.9Min: 3.56 / Avg: 3.64 / Max: 3.72Min: 3.75 / Avg: 3.92 / Max: 4.2Min: 3.71 / Avg: 3.77 / Max: 3.8Min: 3.94 / Avg: 3.99 / Max: 4.05Min: 3.52 / Avg: 3.65 / Max: 3.74Min: 3.64 / Avg: 3.64 / Max: 3.64Min: 3.58 / Avg: 3.65 / Max: 3.72Min: 3.77 / Avg: 3.83 / Max: 3.88Min: 3.66 / Avg: 3.85 / Max: 3.991. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Chaos Group V-RAY

OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 1.1.0System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 43.2 / Avg: 121.48 / Max: 125.9Min: 44.7 / Avg: 152.69 / Max: 164.2Min: 48.1 / Avg: 131.12 / Max: 137.1Min: 51.6 / Avg: 222.32 / Max: 232.4Min: 51.4 / Avg: 157.72 / Max: 163.6Min: 47.1 / Avg: 187.28 / Max: 203.3Min: 56.9 / Avg: 193.17 / Max: 205.2Min: 77.8 / Avg: 258.11 / Max: 278.4Min: 56.9 / Avg: 255.3 / Max: 301.4

OpenBenchmarking.orgCelsius, Fewer Is BetterChaos Group V-RAY 1.1.0GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 50 / Avg: 58.41 / Max: 60Min: 58 / Avg: 66.75 / Max: 70Min: 44 / Avg: 48.38 / Max: 50Min: 63 / Avg: 70.44 / Max: 73Min: 55 / Avg: 62.31 / Max: 64Min: 58 / Avg: 65.11 / Max: 67Min: 64 / Avg: 72.3 / Max: 75Min: 63 / Avg: 68.72 / Max: 70Min: 61 / Avg: 68.6 / Max: 71

OctaneBench

OpenBenchmarking.orgWatts, Fewer Is BetterOctaneBench 4.00System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 43.6 / Avg: 116.96 / Max: 132.2Min: 82.8 / Avg: 151.39 / Max: 179.2Min: 101.9 / Avg: 126.5 / Max: 138.6Min: 121.5 / Avg: 228.13 / Max: 255.9Min: 44.5 / Avg: 179.11 / Max: 208.8Min: 45.3 / Avg: 207.59 / Max: 236.4Min: 46.8 / Avg: 227.19 / Max: 265.6Min: 48.1 / Avg: 306.05 / Max: 333.8Min: 50.9 / Avg: 313.74 / Max: 347

OpenBenchmarking.orgCelsius, Fewer Is BetterOctaneBench 4.00GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1530456075Min: 45 / Avg: 57.32 / Max: 60Min: 54 / Avg: 67.7 / Max: 71Min: 40 / Avg: 48.77 / Max: 50Min: 58 / Avg: 72.86 / Max: 75Min: 44 / Avg: 65.39 / Max: 70Min: 44 / Avg: 69.09 / Max: 74Min: 55 / Avg: 77.23 / Max: 81Min: 48 / Avg: 72.21 / Max: 76Min: 49 / Avg: 73.57 / Max: 78

OpenBenchmarking.orgScore Per Watt, More Is BetterOctaneBench 4.00Total ScoreGTX 1060GTX 1070GTX 1070 TiGTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.2520.5040.7561.0081.260.780.881.120.930.920.980.960.991.02

System Power Consumption Monitor

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringGTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX70140210280350Min: 42.3 / Avg: 142.19 / Max: 280.5Min: 47.2 / Avg: 123.43 / Max: 255.8Min: 40.2 / Avg: 143.79 / Max: 318.2Min: 47.2 / Avg: 198.85 / Max: 370.3Min: 42.5 / Avg: 144.37 / Max: 300.1Min: 43.9 / Avg: 160.13 / Max: 317.6Min: 46 / Avg: 176.62 / Max: 334.5Min: 47.2 / Avg: 220.83 / Max: 367.7Min: 49.4 / Avg: 228.34 / Max: 363.1

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringGTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1632486480Min: 32 / Avg: 61.66 / Max: 76Min: 38 / Avg: 45.88 / Max: 53Min: 31 / Avg: 60.14 / Max: 74Min: 39 / Avg: 66.36 / Max: 82Min: 37 / Avg: 55.81 / Max: 70Min: 32 / Avg: 56.83 / Max: 74Min: 48 / Avg: 67.06 / Max: 83Min: 43 / Avg: 61.88 / Max: 77Min: 33 / Avg: 62.56 / Max: 79

Meta Performance Per Watt

OpenBenchmarking.orgPerformance Per Watt, More Is BetterMeta Performance Per WattPerformance Per WattGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2004006008001000416.98479.38557.72541.32583.35735.65740.20786.04839.87854.04

Rodinia

OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 2.4System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 104.4 / Avg: 137.7 / Max: 148Min: 42.6 / Avg: 133.12 / Max: 175.4Min: 138.9 / Avg: 142.32 / Max: 143.8Min: 43.1 / Avg: 152.7 / Max: 204.6Min: 251.2 / Avg: 257.17 / Max: 262.8Min: 43.8 / Avg: 118.92 / Max: 153.5Min: 44.9 / Avg: 133.58 / Max: 161.9Min: 47.2 / Avg: 145.53 / Max: 181.5Min: 48.3 / Avg: 118.57 / Max: 153.7Min: 51.8 / Avg: 158.3 / Max: 240.2

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 2.4GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1224364860Min: 46 / Avg: 49.29 / Max: 53Min: 53 / Avg: 56.4 / Max: 60Min: 38 / Avg: 42.67 / Max: 45Min: 47 / Avg: 53 / Max: 57Min: 51 / Avg: 54.67 / Max: 58Min: 43 / Avg: 48.8 / Max: 51Min: 44 / Avg: 47.5 / Max: 49Min: 57 / Avg: 58.75 / Max: 61Min: 52 / Avg: 52.67 / Max: 54Min: 53 / Avg: 54.33 / Max: 55

OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 2.4System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX306090120150Min: 71.2 / Avg: 95.18 / Max: 97.9Min: 83.2 / Avg: 107.83 / Max: 109.9Min: 79.3 / Avg: 102.88 / Max: 104.5Min: 89.5 / Avg: 112.44 / Max: 114Min: 139.3 / Avg: 143.18 / Max: 143.6Min: 71.3 / Avg: 117.74 / Max: 121Min: 77.4 / Avg: 121.31 / Max: 124.3Min: 86.1 / Avg: 134.67 / Max: 138.4Min: 94.7 / Avg: 156.48 / Max: 163Min: 103.8 / Avg: 163.73 / Max: 168.8

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 2.4GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1122334455Min: 43 / Avg: 44.4 / Max: 46Min: 51 / Avg: 52.25 / Max: 54Min: 40 / Avg: 40.18 / Max: 41Min: 50 / Avg: 50 / Max: 50Min: 54 / Avg: 55.14 / Max: 56Min: 47 / Avg: 47.53 / Max: 48Min: 45 / Avg: 46.16 / Max: 47Min: 55 / Avg: 56.89 / Max: 58Min: 50 / Avg: 51.68 / Max: 52Min: 52 / Avg: 53 / Max: 54

JuliaGPU

OpenBenchmarking.orgWatts, Fewer Is BetterJuliaGPU 1.2pts1System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 57.5 / Avg: 112.76 / Max: 128.1Min: 42.3 / Avg: 116.82 / Max: 147.4Min: 47.7 / Avg: 111.88 / Max: 128.2Min: 42.3 / Avg: 127 / Max: 157.8Min: 67.7 / Avg: 159.15 / Max: 189.8Min: 43.5 / Avg: 112.53 / Max: 153.7Min: 52.9 / Avg: 116.58 / Max: 157.4Min: 46.4 / Avg: 134.7 / Max: 169Min: 99.5 / Avg: 147.3 / Max: 199.5Min: 51.3 / Avg: 148.68 / Max: 204

OpenBenchmarking.orgCelsius, Fewer Is BetterJuliaGPU 1.2pts1GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1122334455Min: 46 / Avg: 47.2 / Max: 49Min: 53 / Avg: 54.4 / Max: 56Min: 38 / Avg: 40.6 / Max: 42Min: 51 / Avg: 52.5 / Max: 54Min: 52 / Avg: 54.6 / Max: 57Min: 43 / Avg: 47.5 / Max: 50Min: 46 / Avg: 47 / Max: 48Min: 53 / Avg: 54.5 / Max: 56Min: 48 / Avg: 51.25 / Max: 53Min: 49 / Avg: 52.5 / Max: 54

OpenBenchmarking.orgSamples/sec Per Watt, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX500K1000K1500K2000K2500K1629857186759219802291882924166611922360542279065207000220429212042478

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 42.1 / Avg: 105.4 / Max: 135.1Min: 43.3 / Avg: 130.75 / Max: 158.9Min: 48.4 / Avg: 122.39 / Max: 144.2Min: 43.4 / Avg: 134.92 / Max: 163.8Min: 53.7 / Avg: 180.42 / Max: 218.2Min: 148.5 / Avg: 159.28 / Max: 177.5Min: 52.2 / Avg: 156.43 / Max: 195Min: 169.9 / Avg: 187.73 / Max: 213.5Min: 49 / Avg: 235.34 / Max: 283.1Min: 49.4 / Avg: 244.67 / Max: 287.4

OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 46 / Avg: 48.5 / Max: 49Min: 57 / Avg: 57.33 / Max: 58Min: 42 / Avg: 42.6 / Max: 43Min: 54 / Avg: 55.54 / Max: 57Min: 56 / Avg: 59.67 / Max: 61Min: 48 / Avg: 54.5 / Max: 56Min: 52 / Avg: 54.2 / Max: 56Min: 57 / Avg: 63.86 / Max: 67Min: 59 / Avg: 62 / Max: 65Min: 55 / Avg: 63.4 / Max: 66

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2468103.903.423.543.913.306.436.885.974.984.73

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 43 / Avg: 114.31 / Max: 162.2Min: 55.9 / Avg: 136.52 / Max: 200.6Min: 48.3 / Avg: 120.03 / Max: 172.9Min: 50.6 / Avg: 150.09 / Max: 250.1Min: 49.4 / Avg: 197.37 / Max: 320.7Min: 44.4 / Avg: 139.12 / Max: 218.8Min: 44.9 / Avg: 147.19 / Max: 246Min: 48.8 / Avg: 169.69 / Max: 284.4Min: 47.8 / Avg: 210.47 / Max: 341.8Min: 50.4 / Avg: 217.41 / Max: 351.9

OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 39 / Avg: 50.55 / Max: 56Min: 54 / Avg: 60.68 / Max: 66Min: 42 / Avg: 44.91 / Max: 50Min: 49 / Avg: 62.72 / Max: 68Min: 49 / Avg: 66.58 / Max: 73Min: 50 / Avg: 54.74 / Max: 63Min: 47 / Avg: 54.47 / Max: 62Min: 49 / Avg: 66.07 / Max: 74Min: 45 / Avg: 61.74 / Max: 69Min: 52 / Avg: 62.85 / Max: 70

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2040608010041.9452.0264.2262.6367.0752.7657.9464.7278.7779.72

cl-mem

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 57.7 / Avg: 108.33 / Max: 126.1Min: 64.6 / Avg: 132.44 / Max: 150.8Min: 49.3 / Avg: 113.36 / Max: 139.4Min: 43.2 / Avg: 119.3 / Max: 162.2Min: 49.6 / Avg: 139.67 / Max: 192.8Min: 43.7 / Avg: 111.03 / Max: 156.9Min: 115.9 / Avg: 150.6 / Max: 185.3Min: 52.6 / Avg: 141.73 / Max: 186.7Min: 136 / Avg: 198.3 / Max: 260.6Min: 54.7 / Avg: 99.5 / Max: 144.3

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1224364860Min: 48 / Avg: 51.14 / Max: 52Min: 55 / Avg: 57.17 / Max: 58Min: 43 / Avg: 44 / Max: 45Min: 56 / Avg: 57 / Max: 58Min: 60 / Avg: 60.67 / Max: 61Min: 49 / Avg: 54 / Max: 56Min: 50 / Avg: 53.67 / Max: 56Min: 62 / Avg: 62 / Max: 62Min: 56 / Avg: 58.33 / Max: 61Min: 60 / Avg: 61 / Max: 62

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: CopyGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 20800.51981.03961.55942.07922.5991.281.411.651.762.272.222.31

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 43.4 / Avg: 98.27 / Max: 126.2Min: 84.4 / Avg: 137.36 / Max: 151.6Min: 50 / Avg: 124.63 / Max: 140.5Min: 92.2 / Avg: 141.24 / Max: 162.2Min: 216.4 / Avg: 217.73 / Max: 218.8Min: 44.2 / Avg: 131.53 / Max: 164.4Min: 46.9 / Avg: 139.57 / Max: 185.9Min: 98.5 / Avg: 128.93 / Max: 169.3Min: 51 / Avg: 155.07 / Max: 243.7Min: 55 / Avg: 190.13 / Max: 258.3

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 53 / Avg: 53 / Max: 53Min: 60 / Avg: 60.6 / Max: 61Min: 43 / Avg: 44.83 / Max: 46Min: 60 / Avg: 60.2 / Max: 61Min: 64 / Avg: 64 / Max: 64Min: 55 / Avg: 56 / Max: 57Min: 54 / Avg: 55.33 / Max: 56Min: 61 / Avg: 63.75 / Max: 65Min: 61 / Avg: 62 / Max: 63Min: 59 / Avg: 63 / Max: 65

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: WriteGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.63681.27361.91042.54723.1841.421.401.521.531.541.872.252.502.832.57

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 55.5 / Avg: 114.96 / Max: 125.8Min: 46.7 / Avg: 129.34 / Max: 151Min: 49 / Avg: 121.24 / Max: 139.8Min: 102.3 / Avg: 149.02 / Max: 162.5Min: 52.6 / Avg: 161.7 / Max: 217.6Min: 45.9 / Avg: 126.65 / Max: 159.6Min: 48.6 / Avg: 145.97 / Max: 196.3Min: 50.4 / Avg: 148 / Max: 197.9Min: 57.7 / Avg: 161.45 / Max: 265.2Min: 55.2 / Avg: 156.17 / Max: 266.5

OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 54 / Avg: 55.88 / Max: 57Min: 64 / Avg: 64.8 / Max: 66Min: 47 / Avg: 47 / Max: 47Min: 63 / Avg: 64.17 / Max: 65Min: 68 / Avg: 68.33 / Max: 69Min: 56 / Avg: 58.5 / Max: 60Min: 60 / Avg: 62 / Max: 63Min: 67 / Avg: 68.67 / Max: 70Min: 66 / Avg: 67 / Max: 68Min: 66 / Avg: 68.33 / Max: 70

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: ReadGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080TITAN RTX0.81451.6292.44353.2584.07251.331.591.691.542.092.342.692.653.62

LuxMark

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 44.6 / Avg: 149.89 / Max: 155.9Min: 45.5 / Avg: 193.79 / Max: 199.6Min: 48.8 / Avg: 159.08 / Max: 161.7Min: 43.9 / Avg: 194.98 / Max: 203.8Min: 48.8 / Avg: 276.45 / Max: 286Min: 45.7 / Avg: 206.77 / Max: 213.1Min: 47.5 / Avg: 239.56 / Max: 245.1Min: 52 / Avg: 263.55 / Max: 277.3Min: 51.4 / Avg: 335.79 / Max: 346.6Min: 58 / Avg: 351.6 / Max: 363.1

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1632486480Min: 50 / Avg: 62.77 / Max: 65Min: 60 / Avg: 73.85 / Max: 76Min: 47 / Avg: 52.18 / Max: 53Min: 59 / Avg: 71.48 / Max: 74Min: 67 / Avg: 79.45 / Max: 82Min: 55 / Avg: 68.52 / Max: 70Min: 63 / Avg: 72.56 / Max: 74Min: 66 / Avg: 80.62 / Max: 83Min: 63 / Avg: 75.3 / Max: 77Min: 65 / Avg: 77.45 / Max: 79

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX71421283517.7619.9826.1919.5020.5023.0525.6624.8627.3527.67

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 43.5 / Avg: 135.49 / Max: 138.3Min: 58.9 / Avg: 165.75 / Max: 169.2Min: 49.7 / Avg: 142.13 / Max: 144.6Min: 49.8 / Avg: 167.48 / Max: 171.6Min: 51.8 / Avg: 234.85 / Max: 240.3Min: 45.4 / Avg: 188.59 / Max: 193.4Min: 49.6 / Avg: 218.65 / Max: 223.5Min: 49.5 / Avg: 233.26 / Max: 240.4Min: 51.5 / Avg: 320.8 / Max: 328.8Min: 56.5 / Avg: 333.71 / Max: 348.1

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1530456075Min: 52 / Avg: 57.72 / Max: 59Min: 64 / Avg: 67.32 / Max: 68Min: 45 / Avg: 48.75 / Max: 49Min: 61 / Avg: 65.19 / Max: 66Min: 66 / Avg: 72.64 / Max: 74Min: 56 / Avg: 65.72 / Max: 67Min: 59 / Avg: 68.82 / Max: 70Min: 66 / Avg: 76.55 / Max: 78Min: 68 / Avg: 73.54 / Max: 75Min: 64 / Avg: 76.18 / Max: 78

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX2040608010051.3860.1671.5952.0857.8472.8282.6684.7489.2191.17

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.1System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 111.3 / Avg: 143.3 / Max: 145.4Min: 82.6 / Avg: 175.54 / Max: 179.2Min: 47.6 / Avg: 146.95 / Max: 149.7Min: 89.3 / Avg: 173.79 / Max: 175.8Min: 161.6 / Avg: 242.69 / Max: 248.1Min: 44 / Avg: 192.74 / Max: 197.4Min: 46 / Avg: 225.28 / Max: 233.2Min: 48.9 / Avg: 235.08 / Max: 244.1Min: 47.7 / Avg: 320.7 / Max: 332Min: 50.6 / Avg: 334.61 / Max: 351.7

OpenBenchmarking.orgCelsius, Fewer Is BetterLuxMark 3.1GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1530456075Min: 43 / Avg: 58.4 / Max: 61Min: 54 / Avg: 67.07 / Max: 70Min: 41 / Avg: 48.91 / Max: 51Min: 49 / Avg: 64.55 / Max: 67Min: 56 / Avg: 72.55 / Max: 75Min: 49 / Avg: 63.65 / Max: 67Min: 47 / Avg: 67.5 / Max: 72Min: 48 / Avg: 75.04 / Max: 79Min: 49 / Avg: 69.88 / Max: 74Min: 44 / Avg: 71.96 / Max: 78

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX30609012015085.5398.48114.1679.4289.34110.80131.09123.99133.42137.54

NAMD CUDA

OpenBenchmarking.orgWatts, Fewer Is BetterNAMD CUDA 2.13System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX70140210280350Min: 42.7 / Avg: 204.58 / Max: 263.4Min: 81.5 / Avg: 223.92 / Max: 280.5Min: 49.2 / Avg: 202 / Max: 255.8Min: 92.5 / Avg: 192.92 / Max: 318.2Min: 48.2 / Avg: 283.67 / Max: 370.3Min: 45.9 / Avg: 231.36 / Max: 300.1Min: 45.7 / Avg: 247.69 / Max: 317.6Min: 173.3 / Avg: 271.87 / Max: 334.5Min: 47.8 / Avg: 245.81 / Max: 362.6Min: 53 / Avg: 269.25 / Max: 360.8

OpenBenchmarking.orgCelsius, Fewer Is BetterNAMD CUDA 2.13GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 52 / Avg: 59.2 / Max: 63Min: 60 / Avg: 64 / Max: 67Min: 45 / Avg: 48.14 / Max: 50Min: 51 / Avg: 58.08 / Max: 63Min: 62 / Avg: 66.71 / Max: 70Min: 49 / Avg: 55.63 / Max: 61Min: 51 / Avg: 55.71 / Max: 60Min: 61 / Avg: 65.38 / Max: 70Min: 51 / Avg: 54.43 / Max: 57Min: 51 / Avg: 55.29 / Max: 58

PlaidML

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 72.7 / Avg: 144.61 / Max: 170.9Min: 43.1 / Avg: 162.78 / Max: 197.9Min: 47.9 / Avg: 146.58 / Max: 171.7Min: 45.1 / Avg: 180.6 / Max: 225Min: 127.7 / Avg: 231.45 / Max: 296.9Min: 44 / Avg: 167.51 / Max: 215.1Min: 68.7 / Avg: 183.65 / Max: 245.5Min: 50.9 / Avg: 188.45 / Max: 269.3Min: 47.6 / Avg: 212.98 / Max: 331.1Min: 55.4 / Avg: 223.68 / Max: 345.8

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 52 / Avg: 58.35 / Max: 62Min: 56 / Avg: 62.15 / Max: 66Min: 43 / Avg: 47.69 / Max: 50Min: 59 / Avg: 64.36 / Max: 68Min: 61 / Avg: 65.7 / Max: 70Min: 46 / Avg: 55.73 / Max: 61Min: 47 / Avg: 54.45 / Max: 60Min: 56 / Avg: 63.5 / Max: 70Min: 49 / Avg: 54.38 / Max: 59Min: 50 / Avg: 55.25 / Max: 61

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.29030.58060.87091.16121.45150.650.790.890.790.820.901.031.081.281.29

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 71.5 / Avg: 140.59 / Max: 158.7Min: 43.1 / Avg: 156.77 / Max: 185.9Min: 48.9 / Avg: 127.46 / Max: 146.6Min: 115.7 / Avg: 176.49 / Max: 201.1Min: 50.4 / Avg: 194.3 / Max: 254.7Min: 45.1 / Avg: 125.88 / Max: 192.6Min: 46.5 / Avg: 126.45 / Max: 198.6Min: 47.4 / Avg: 135.75 / Max: 228.3Min: 48.3 / Avg: 198.43 / Max: 287.4Min: 52.5 / Avg: 178.13 / Max: 298.5

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 52 / Avg: 56.5 / Max: 59Min: 55 / Avg: 60 / Max: 64Min: 43 / Avg: 45.83 / Max: 48Min: 58 / Avg: 62 / Max: 65Min: 59 / Avg: 63 / Max: 66Min: 46 / Avg: 51.8 / Max: 55Min: 51 / Avg: 53.25 / Max: 55Min: 58 / Avg: 61.5 / Max: 64Min: 50 / Avg: 51.33 / Max: 53Min: 51 / Avg: 54 / Max: 58

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.77181.54362.31543.08723.8591.281.461.761.471.702.763.023.373.053.43

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 71.4 / Avg: 121.65 / Max: 172.6Min: 45.2 / Avg: 145.44 / Max: 201.2Min: 48.6 / Avg: 134.71 / Max: 168.9Min: 91.9 / Avg: 152.34 / Max: 225.3Min: 61.2 / Avg: 185.7 / Max: 280.5Min: 65 / Avg: 134.41 / Max: 208.9Min: 49.8 / Avg: 153.92 / Max: 242.3Min: 48.4 / Avg: 158.95 / Max: 257.5Min: 100.3 / Avg: 172.57 / Max: 323.3Min: 102.3 / Avg: 194.3 / Max: 332.7

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 50 / Avg: 54.8 / Max: 58Min: 56 / Avg: 59.9 / Max: 64Min: 44 / Avg: 46.22 / Max: 48Min: 60 / Avg: 62.5 / Max: 65Min: 61 / Avg: 63.11 / Max: 66Min: 46 / Avg: 50.11 / Max: 56Min: 50 / Avg: 53.44 / Max: 58Min: 59 / Avg: 62.38 / Max: 66Min: 51 / Avg: 53.29 / Max: 57Min: 52 / Avg: 54.86 / Max: 58

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.69981.39962.09942.79923.4991.591.771.941.921.822.302.442.513.112.86

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 70.6 / Avg: 141.88 / Max: 164.5Min: 85.2 / Avg: 157.11 / Max: 185.1Min: 50 / Avg: 141.65 / Max: 161.4Min: 91.6 / Avg: 167.52 / Max: 214.9Min: 123.8 / Avg: 205.47 / Max: 275.7Min: 45.2 / Avg: 158.66 / Max: 201.6Min: 47.5 / Avg: 159.79 / Max: 224.3Min: 50.1 / Avg: 167.47 / Max: 240.9Min: 49.3 / Avg: 185.12 / Max: 298.3Min: 63.8 / Avg: 192.88 / Max: 301.5

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 44 / Avg: 52.44 / Max: 59Min: 58 / Avg: 61.79 / Max: 65Min: 43 / Avg: 47.13 / Max: 50Min: 58 / Avg: 63.21 / Max: 68Min: 60 / Avg: 64.36 / Max: 69Min: 40 / Avg: 48.69 / Max: 56Min: 49 / Avg: 54.67 / Max: 60Min: 57 / Avg: 63.18 / Max: 68Min: 52 / Avg: 54.78 / Max: 59Min: 51 / Avg: 55.27 / Max: 60

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.27230.54460.81691.08921.36150.610.740.830.790.810.840.981.031.211.19

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 41.8 / Avg: 76.7 / Max: 94.4Min: 45.9 / Avg: 107.3 / Max: 155.1Min: 49.7 / Avg: 93.67 / Max: 125.7Min: 44.3 / Avg: 147.75 / Max: 195.1Min: 53.3 / Avg: 125.73 / Max: 165.2Min: 43.1 / Avg: 95.9 / Max: 118.8Min: 115.1 / Avg: 115.83 / Max: 116.5Min: 52 / Avg: 97.93 / Max: 121Min: 48.2 / Avg: 152.87 / Max: 274.6Min: 54 / Avg: 200.17 / Max: 274.2

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 41 / Avg: 45.25 / Max: 47Min: 59 / Avg: 60.33 / Max: 61Min: 48 / Avg: 48 / Max: 48Min: 62 / Avg: 62 / Max: 62Min: 63 / Avg: 64 / Max: 65Min: 39 / Avg: 44.25 / Max: 48Min: 55 / Avg: 55 / Max: 55Min: 66 / Avg: 66 / Max: 66Min: 54 / Avg: 55.67 / Max: 58Min: 55 / Avg: 55 / Max: 55

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX36912157.727.128.065.427.029.008.7010.678.456.67

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 70.1 / Avg: 124.21 / Max: 145Min: 82.2 / Avg: 136.83 / Max: 164.9Min: 104.8 / Avg: 123.57 / Max: 135.6Min: 90.8 / Avg: 146.71 / Max: 176.8Min: 50.4 / Avg: 165.05 / Max: 214.8Min: 44.3 / Avg: 116.14 / Max: 162.7Min: 47.1 / Avg: 127.27 / Max: 169.5Min: 46.4 / Avg: 138.03 / Max: 185.4Min: 48.6 / Avg: 159.87 / Max: 223.7Min: 57.5 / Avg: 152.07 / Max: 231.9

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1224364860Min: 44 / Avg: 49.23 / Max: 54Min: 51 / Avg: 55.42 / Max: 59Min: 39 / Avg: 42.75 / Max: 45Min: 53 / Avg: 57.58 / Max: 61Min: 59 / Avg: 61.75 / Max: 63Min: 40 / Avg: 45 / Max: 50Min: 41 / Avg: 45.18 / Max: 49Min: 50 / Avg: 54.64 / Max: 59Min: 52 / Avg: 53.5 / Max: 56Min: 52 / Avg: 53.8 / Max: 56

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.36450.7291.09351.4581.82251.081.141.241.141.141.621.561.561.541.60

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 43.2 / Avg: 99.8 / Max: 140.7Min: 48.4 / Avg: 126.46 / Max: 165Min: 48.1 / Avg: 99.69 / Max: 140.6Min: 91.7 / Avg: 152.84 / Max: 169.4Min: 47.2 / Avg: 121.72 / Max: 213.3Min: 47.3 / Avg: 110.84 / Max: 168.5Min: 49.9 / Avg: 120.6 / Max: 185.8Min: 48 / Avg: 132.2 / Max: 198.2Min: 48.4 / Avg: 137.54 / Max: 235.7Min: 56 / Avg: 163.54 / Max: 245.3

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 41 / Avg: 47.52 / Max: 54Min: 51 / Avg: 56.59 / Max: 60Min: 38 / Avg: 42.62 / Max: 46Min: 60 / Avg: 61.64 / Max: 63Min: 44 / Avg: 53.18 / Max: 61Min: 43 / Avg: 48.84 / Max: 55Min: 42 / Avg: 47.65 / Max: 53Min: 52 / Avg: 58.56 / Max: 66Min: 44 / Avg: 48.59 / Max: 53Min: 43 / Avg: 48.07 / Max: 53

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.25430.50860.76291.01721.27150.870.851.030.671.031.021.091.011.130.98

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 58.2 / Avg: 131.14 / Max: 167.2Min: 43.1 / Avg: 136.92 / Max: 191.2Min: 48.4 / Avg: 125.32 / Max: 169.8Min: 70.3 / Avg: 172.91 / Max: 216.6Min: 65.2 / Avg: 179.46 / Max: 287.9Min: 62.3 / Avg: 137.76 / Max: 208Min: 47.2 / Avg: 141.31 / Max: 234Min: 49.8 / Avg: 158.89 / Max: 252.8Min: 47.7 / Avg: 180.78 / Max: 312Min: 58.3 / Avg: 176.01 / Max: 322.5

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 43 / Avg: 51.9 / Max: 59Min: 49 / Avg: 56.06 / Max: 63Min: 41 / Avg: 45.18 / Max: 50Min: 52 / Avg: 58.85 / Max: 65Min: 53 / Avg: 60.07 / Max: 67Min: 43 / Avg: 50.75 / Max: 59Min: 42 / Avg: 48.87 / Max: 56Min: 53 / Avg: 59.79 / Max: 68Min: 46 / Avg: 51 / Max: 57Min: 42 / Avg: 47.33 / Max: 54

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.33750.6751.01251.351.68750.720.921.020.801.081.081.271.211.381.50

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 60.4 / Avg: 122.52 / Max: 164.5Min: 83.8 / Avg: 134.85 / Max: 189.4Min: 48.5 / Avg: 113.89 / Max: 160.8Min: 91.6 / Avg: 162.7 / Max: 211.1Min: 122.3 / Avg: 187.68 / Max: 270.3Min: 86.5 / Avg: 143.68 / Max: 202.9Min: 78.4 / Avg: 132.98 / Max: 222.4Min: 47.4 / Avg: 148.19 / Max: 240.2Min: 48.4 / Avg: 155.12 / Max: 292.5Min: 52.6 / Avg: 162.76 / Max: 301.7

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 47 / Avg: 50.92 / Max: 55Min: 54 / Avg: 57.18 / Max: 61Min: 43 / Avg: 45.8 / Max: 48Min: 58 / Avg: 60 / Max: 63Min: 59 / Avg: 61.1 / Max: 64Min: 46 / Avg: 50.9 / Max: 56Min: 46 / Avg: 48.78 / Max: 53Min: 58 / Avg: 61.6 / Max: 65Min: 49 / Avg: 51.2 / Max: 55Min: 44 / Avg: 46.33 / Max: 52

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: ResNet 50 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.66381.32761.99142.65523.3191.491.762.081.621.781.962.512.392.952.92

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX50100150200250Min: 87.5 / Avg: 132.19 / Max: 156.7Min: 81.8 / Avg: 153.87 / Max: 183Min: 56.1 / Avg: 132.15 / Max: 155Min: 89.5 / Avg: 168.75 / Max: 202.8Min: 120.7 / Avg: 202.56 / Max: 256.6Min: 45 / Avg: 144.95 / Max: 190.7Min: 45.8 / Avg: 146.81 / Max: 206.6Min: 47.5 / Avg: 155.97 / Max: 224.6Min: 48.4 / Avg: 188.73 / Max: 275.6Min: 51.5 / Avg: 168.92 / Max: 275.4

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1326395265Min: 45 / Avg: 52 / Max: 57Min: 48 / Avg: 54.81 / Max: 60Min: 43 / Avg: 45.94 / Max: 49Min: 51 / Avg: 57.64 / Max: 63Min: 55 / Avg: 59.71 / Max: 66Min: 43 / Avg: 49.73 / Max: 56Min: 42 / Avg: 48.08 / Max: 55Min: 56 / Avg: 61.08 / Max: 66Min: 49 / Avg: 52.45 / Max: 57Min: 39 / Avg: 44.46 / Max: 52

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Inception V3 - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX0.2880.5760.8641.1521.440.670.760.900.780.820.921.071.081.121.28

OpenBenchmarking.orgWatts, Fewer Is BetterPlaidMLSystem Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX4080120160200Min: 42.1 / Avg: 107.84 / Max: 141.6Min: 42.8 / Avg: 91.98 / Max: 117.4Min: 48.7 / Avg: 98.18 / Max: 132.6Min: 43.7 / Avg: 93.48 / Max: 127.3Min: 91.6 / Avg: 149.17 / Max: 203.2Min: 45.5 / Avg: 121.65 / Max: 166.1Min: 46 / Avg: 93.68 / Max: 129.3Min: 56.3 / Avg: 130.87 / Max: 184.7Min: 48.4 / Avg: 94.48 / Max: 113.3Min: 52.9 / Avg: 110.8 / Max: 140.2

OpenBenchmarking.orgCelsius, Fewer Is BetterPlaidMLGPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1224364860Min: 49 / Avg: 50 / Max: 51Min: 47 / Avg: 49.5 / Max: 51Min: 46 / Avg: 47 / Max: 48Min: 47 / Avg: 49.75 / Max: 51Min: 56 / Avg: 56.67 / Max: 57Min: 46 / Avg: 47.75 / Max: 50Min: 45 / Avg: 46.75 / Max: 48Min: 61 / Avg: 61.67 / Max: 62Min: 52 / Avg: 52.5 / Max: 53Min: 41 / Avg: 41.5 / Max: 42

OpenBenchmarking.orgExamples Per Second Per Watt, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX36912155.127.226.577.345.196.418.876.8210.288.82

LeelaChessZero

OpenBenchmarking.orgWatts, Fewer Is BetterLeelaChessZero 0.20.1System Power Consumption MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX60120180240300Min: 41.9 / Avg: 150.59 / Max: 180Min: 42.6 / Avg: 184.04 / Max: 213.3Min: 80 / Avg: 156.39 / Max: 189.7Min: 40.2 / Avg: 148.14 / Max: 249.5Min: 96.9 / Avg: 265.17 / Max: 323.1Min: 69.4 / Avg: 196.01 / Max: 230Min: 75.1 / Avg: 186.61 / Max: 251.7Min: 88.3 / Avg: 222.99 / Max: 288.3Min: 48.2 / Avg: 205.76 / Max: 349.4Min: 97.7 / Avg: 148.16 / Max: 357.7

OpenBenchmarking.orgCelsius, Fewer Is BetterLeelaChessZero 0.20.1GPU Temperature MonitorGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX1428425670Min: 38 / Avg: 51.69 / Max: 59Min: 35 / Avg: 46.2 / Max: 55Min: 46 / Avg: 50.13 / Max: 53Min: 32 / Avg: 40.8 / Max: 58Min: 41 / Avg: 51.43 / Max: 61Min: 39 / Avg: 50.33 / Max: 58Min: 33 / Avg: 44.86 / Max: 52Min: 57 / Avg: 66.29 / Max: 73Min: 50 / Avg: 55.6 / Max: 60Min: 34 / Avg: 36.91 / Max: 47

OpenBenchmarking.orgNodes Per Second Per Watt, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLGTX 1060GTX 1070GTX 1070 TiGTX 1080GTX 1080 TiRTX 2060RTX 2070RTX 2080RTX 2080 TiTITAN RTX5101520255.977.779.4511.818.807.579.2710.1315.1022.11

117 Results Shown

SHOC Scalable HeterOgeneous Computing
OctaneBench
LuxMark:
  GPU - Microphone
  GPU - Hotel
  GPU - Luxball HDR
Chaos Group V-RAY
Rodinia
PlaidML:
  No - Training - Mobilenet - OpenCL
  Yes - Inference - VGG16 - OpenCL
clpeak
PlaidML:
  Yes - Inference - Inception V3 - OpenCL
  No - Inference - Inception V3 - OpenCL
  No - Training - IMDB LSTM - OpenCL
SHOC Scalable HeterOgeneous Computing
PlaidML:
  No - Inference - VGG16 - OpenCL
  Yes - Inference - ResNet 50 - OpenCL
LeelaChessZero
NAMD CUDA
PlaidML
clpeak:
  Transfer Bandwidth enqueueReadBuffer
  Transfer Bandwidth enqueueWriteBuffer
PlaidML
Rodinia
JuliaGPU
cl-mem:
  Read
  Copy
  Write
PlaidML
Darktable
PlaidML
Darktable:
  Boat - OpenCL
  Server Room - OpenCL
SHOC Scalable HeterOgeneous Computing
clpeak
SHOC Scalable HeterOgeneous Computing
clpeak
SHOC Scalable HeterOgeneous Computing
clpeak
Darktable
clpeak
Chaos Group V-RAY:
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  Total Score
  Phoronix Test Suite System Monitoring
  Phoronix Test Suite System Monitoring
  Performance Per Watt
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL - Texture Read Bandwidth
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL - Max SP Flops
  System Power Consumption Monitor
  GPU Temp Monitor
  Copy
  System Power Consumption Monitor
  GPU Temp Monitor
  Write
  System Power Consumption Monitor
  GPU Temp Monitor
  Read
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Hotel
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Microphone
  System Power Consumption Monitor
  GPU Temp Monitor
  GPU - Luxball HDR
  System Power Consumption Monitor
  GPU Temp Monitor
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - VGG16 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - IMDB LSTM - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - ResNet 50 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - Inception V3 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Inference - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Training - IMDB LSTM - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  No - Training - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - VGG16 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - ResNet 50 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - Inception V3 - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  Yes - Inference - Mobilenet - OpenCL
  System Power Consumption Monitor
  GPU Temp Monitor
  OpenCL