OpenCL January 2018 Linux Radeon ROCm NVIDIA

Tests by Michael Larabekl for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1801186-PTS-OPENCLJA59
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 4 Tests
OpenCL 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 1060
January 18 2018
 
GeForce GTX 1070
January 17 2018
 
GeForce GTX 1070 Ti
January 17 2018
 
GeForce GTX 1080
January 17 2018
 
GeForce GTX 1080 Ti
January 17 2018
 
GeForce GTX 680
January 18 2018
 
GeForce GTX 780 Ti
January 18 2018
 
GeForce GTX 960
January 18 2018
 
GeForce GTX 970
January 17 2018
 
GeForce GTX 980 Ti
January 18 2018
 
Radeon R9 285
January 18 2018
 
Radeon R9 290
January 18 2018
 
Radeon R9 Fury
January 18 2018
 
Radeon RX 580
January 18 2018
 
Radeon RX Vega 56
January 18 2018
 
Radeon RX Vega 64
January 18 2018
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL January 2018 Linux Radeon ROCm NVIDIAOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (0606 BIOS)Intel Device 3ec216384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1607/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)XFX AMD Radeon R9 200 2048MBXFX AMD Radeon R9 200 4096MBSapphire AMD Radeon 4096MBMSI AMD Radeon RX 580 8192MBAMD Radeon RX Vega 8192MBRealtek ALC1220DELL P2415QIntel ConnectionUbuntu 17.104.15.0-999-generic (x86_64) 201801144.13.0-25-generic (x86_64)GNOME Shell 3.26.2NVIDIA 390.12modesetting 1.19.54.5.04.5 Mesa 17.4.0-devel- padoka PPA (LLVM 7.0.0)GCC 7.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelsDesktopDisplay DriversOpenGLsCompilerFile-SystemScreen ResolutionOpenCL January 2018 Linux Radeon ROCm NVIDIA BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980 Ti: GPU Compute Cores: 2816

GeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64Result OverviewPhoronix Test Suite100%211%322%433%544%LuxMarkcl-memcl-memcl-memGPU - Luxball HDRWriteCopyRead

OpenCL January 2018 Linux Radeon ROCm NVIDIAfahbench: luxmark: GPU - Luxball HDRluxmark: GPU - Microphoneluxmark: GPU - Hotelmandelgpu: GPUjuliagpu: GPUdarktable: Server Room - OpenCLdarktable: Masskrug - OpenCLdarktable: Boat - OpenCLcl-mem: Writecl-mem: Readcl-mem: Copyshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 980 TiRadeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6499.461140755352107115144009.93121183382.371.354.163.91145.57154.47140.80400.124803.557.37402.871577377362939168963001.60154016267.601.144.023.12198.57206.53188.53449.347126.6510.73556.11148.851524779232901207141915.30171841227.501.154.023.02196.40206.60188.30501.389057.2013.83566.821222765422986218025195.33178980675.831.154.012.94222229.90211.33519.769441.7614.42660.25190.1218898102943730289975392.23204310038.971.063.972.45343.70340.23319.37592.1113280.7720.14988.2041.144595256974748480538.0056403401.004.597.329.44150.13135.73121.50246.642.26271.4673.6096225243131377073285.6384426813.375.197.717.18257.70272.70239.20285.844948.604.68441.4760.4859243263125966365537.1085979128.7011.6820.2616.5375.5781.9072.30283.622959.584.49223.4587.52105046061199895506772.47112700768.904.867.504.67134.90144.67126.80296.384362.556.54408.53111.101475585492342131281594.13139018187.931.544.183.44244.40266.80219351.126216.249.34718.318988725136.07110.80126.071090.993281.794.18381.6715845119164943031.83198849223.60209.13116.80187.43224551654102371755.73208047859.87385.73111.90209.80251.107145.189.11841.8114712140879323817.30191808662.23181.63145.70184.10213.026263.258.00562.88241921591155441242.77241899430.33318.83151.90205.20373.3010681.5013.68952.4925012174577802.60254843524.60372.23160.87223.77426.7912794.8716.081103.23OpenBenchmarking.org

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2GeForce GTX 680GeForce GTX 960GeForce GTX 780 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1070 TiGeForce GTX 1080 Ti4080120160200SE +/- 0.01, N = 3SE +/- 0.25, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.60, N = 3SE +/- 0.17, N = 341.1460.4873.6087.5299.46111.10148.85190.12

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 680GeForce GTX 960Radeon R9 285GeForce GTX 780 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 1080Radeon RX 580GeForce GTX 980 TiGeForce GTX 1070 TiGeForce GTX 1070Radeon R9 290GeForce GTX 1080 TiRadeon R9 FuryRadeon RX Vega 56Radeon RX Vega 645K10K15K20K25KSE +/- 16.17, N = 3SE +/- 21.33, N = 3SE +/- 1.33, N = 3SE +/- 36.83, N = 3SE +/- 19.10, N = 3SE +/- 2.33, N = 3SE +/- 3.18, N = 3SE +/- 0.67, N = 3SE +/- 53.12, N = 3SE +/- 1.15, N = 3SE +/- 61.00, N = 3SE +/- 0.88, N = 3SE +/- 90.23, N = 3SE +/- 123.17, N = 3SE +/- 0.58, N = 3SE +/- 373.21, N = 34595592489889622105041140712227147121475515247157731584518898224552419225012

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 680GeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1080GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 980 TiGeForce GTX 1080 Ti2K4K6K8K10KSE +/- 2.00, N = 3SE +/- 7.13, N = 3SE +/- 6.06, N = 3SE +/- 4.73, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 3SE +/- 3.51, N = 3SE +/- 0.67, N = 3SE +/- 2.31, N = 3SE +/- 2.19, N = 325693263524355356061654277367923854910294

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelRadeon R9 285GeForce GTX 680Radeon R9 290GeForce GTX 960GeForce GTX 780 TiRadeon RX 580Radeon RX Vega 56Radeon R9 FuryGeForce GTX 970GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1070 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti8001600240032004000SE +/- 1.00, N = 3SE +/- 3.67, N = 3SE +/- 1.86, N = 3SE +/- 17.23, N = 3SE +/- 0.67, N = 3SE +/- 3.33, N = 3SE +/- 4.33, N = 3SE +/- 4.00, N = 3SE +/- 12.55, N = 3SE +/- 0.58, N = 3SE +/- 3.00, N = 3SE +/- 13.32, N = 3SE +/- 26.84, N = 3SE +/- 9.84, N = 3SE +/- 1.15, N = 37257471191125913131408159116541998210723422901293929863730

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUGeForce GTX 680Radeon R9 290GeForce GTX 960GeForce GTX 780 TiRadeon RX 580GeForce GTX 970Radeon R9 FuryGeForce GTX 1060GeForce GTX 980 TiRadeon RX Vega 56GeForce GTX 1070Radeon RX Vega 64GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60M120M180M240M300MSE +/- 25083.83, N = 3SE +/- 83691.30, N = 3SE +/- 121075.12, N = 3SE +/- 258135.31, N = 3SE +/- 115711.88, N = 3SE +/- 431906.66, N = 3SE +/- 120193.62, N = 3SE +/- 503535.67, N = 3SE +/- 510200.05, N = 3SE +/- 794414.93, N = 3SE +/- 1151240.82, N = 3SE +/- 741807.19, N = 3SE +/- 1459753.08, N = 3SE +/- 1913943.64, N = 3SE +/- 3984585.32, N = 348480538.0064943031.8366365537.1077073285.6379323817.3095506772.47102371755.73115144009.93131281594.13155441242.77168963001.60174577802.60207141915.30218025195.33289975392.231. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 780 TiGeForce GTX 960GeForce GTX 970GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080Radeon RX 580Radeon R9 290GeForce GTX 1080 TiRadeon R9 FuryRadeon RX Vega 56Radeon RX Vega 6450M100M150M200M250MSE +/- 188888.98, N = 3SE +/- 303705.53, N = 3SE +/- 85942.56, N = 3SE +/- 625651.47, N = 3SE +/- 196419.50, N = 3SE +/- 1085388.09, N = 3SE +/- 613736.81, N = 3SE +/- 1031171.70, N = 3SE +/- 235273.88, N = 3SE +/- 577899.40, N = 3SE +/- 517038.05, N = 3SE +/- 843495.42, N = 3SE +/- 639092.02, N = 3SE +/- 621968.09, N = 3SE +/- 415881.11, N = 356403401.0084426813.3785979128.70112700768.90121183382.37139018187.93154016267.60171841227.50178980675.83191808662.23198849223.60204310038.97208047859.87241899430.33254843524.601. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Darktable

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Server Room - Acceleration: OpenCLGeForce GTX 960GeForce GTX 780 TiGeForce GTX 970GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1080GeForce GTX 1070 TiGeForce GTX 1070GeForce GTX 1080 Ti3691215SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.685.194.864.591.541.351.151.151.141.06

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Masskrug - Acceleration: OpenCLGeForce GTX 960GeForce GTX 780 TiGeForce GTX 970GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti510152025SE +/- 0.30, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 320.267.717.507.324.184.164.024.024.013.97

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Boat - Acceleration: OpenCLGeForce GTX 960GeForce GTX 680GeForce GTX 780 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti48121620SE +/- 0.16, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316.539.447.184.673.913.443.123.022.942.45

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 960GeForce GTX 970Radeon R9 285GeForce GTX 1060GeForce GTX 680Radeon RX 580GeForce GTX 1070 TiGeForce GTX 1070Radeon R9 290GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 780 TiRadeon RX Vega 56GeForce GTX 1080 TiRadeon RX Vega 64Radeon R9 Fury80160240320400SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.58, N = 3SE +/- 0.06, N = 3SE +/- 1.25, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.22, N = 375.57134.90136.07145.57150.13181.63196.40198.57209.13222.00244.40257.70318.83343.70372.23385.731. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 960Radeon R9 285Radeon R9 FuryRadeon R9 290GeForce GTX 680GeForce GTX 970Radeon RX 580Radeon RX Vega 56GeForce GTX 1060Radeon RX Vega 64GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 980 TiGeForce GTX 780 TiGeForce GTX 1080 Ti70140210280350SE +/- 0.40, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 2.10, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.27, N = 381.90110.80111.90116.80135.73144.67145.70151.90154.47160.87206.53206.60229.90266.80272.70340.231. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 960GeForce GTX 680Radeon R9 285GeForce GTX 970GeForce GTX 1060Radeon RX 580Radeon R9 290GeForce GTX 1070 TiGeForce GTX 1070Radeon RX Vega 56Radeon R9 FuryGeForce GTX 1080GeForce GTX 980 TiRadeon RX Vega 64GeForce GTX 780 TiGeForce GTX 1080 Ti70140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.45, N = 3SE +/- 0.43, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.45, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.50, N = 3SE +/- 0.12, N = 372.30121.50126.07126.80140.80184.10187.43188.30188.53205.20209.80211.33219.00223.77239.20319.371. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX 580GeForce GTX 680Radeon R9 FuryGeForce GTX 960GeForce GTX 780 TiGeForce GTX 970GeForce GTX 980 TiRadeon RX Vega 56GeForce GTX 1060Radeon RX Vega 64GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiRadeon R9 2852004006008001000SE +/- 0.03, N = 3SE +/- 4.62, N = 3SE +/- 0.99, N = 3SE +/- 1.64, N = 3SE +/- 1.27, N = 3SE +/- 1.19, N = 3SE +/- 1.13, N = 3SE +/- 1.23, N = 3SE +/- 2.04, N = 3SE +/- 2.39, N = 3SE +/- 1.87, N = 3SE +/- 0.48, N = 3SE +/- 1.52, N = 3SE +/- 0.55, N = 3SE +/- 0.02, N = 3213.02246.64251.10283.62285.84296.38351.12373.30400.12426.79449.34501.38519.76592.111090.991. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 960Radeon R9 285GeForce GTX 970GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 980 TiRadeon RX 580GeForce GTX 1070Radeon R9 FuryGeForce GTX 1070 TiGeForce GTX 1080Radeon RX Vega 56Radeon RX Vega 64GeForce GTX 1080 Ti3K6K9K12K15KSE +/- 6.86, N = 3SE +/- 0.04, N = 3SE +/- 1.35, N = 3SE +/- 25.86, N = 3SE +/- 19.88, N = 3SE +/- 12.88, N = 3SE +/- 0.08, N = 3SE +/- 31.44, N = 3SE +/- 0.10, N = 3SE +/- 3.79, N = 3SE +/- 47.55, N = 3SE +/- 72.77, N = 3SE +/- 48.47, N = 3SE +/- 67.22, N = 32959.583281.794362.554803.554948.606216.246263.257126.657145.189057.209441.7610681.5012794.8713280.771. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 680Radeon R9 285GeForce GTX 960GeForce GTX 780 TiGeForce GTX 970GeForce GTX 1060Radeon RX 580Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1070Radeon RX Vega 56GeForce GTX 1070 TiGeForce GTX 1080Radeon RX Vega 64GeForce GTX 1080 Ti510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 32.264.184.494.686.547.378.009.119.3410.7313.6813.8314.4216.0820.141. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 960GeForce GTX 680Radeon R9 285GeForce GTX 1060GeForce GTX 970GeForce GTX 780 TiGeForce GTX 1070Radeon RX 580GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 980 TiRadeon R9 FuryRadeon RX Vega 56GeForce GTX 1080 TiRadeon RX Vega 642004006008001000SE +/- 1.16, N = 3SE +/- 3.93, N = 5SE +/- 0.07, N = 3SE +/- 2.48, N = 3SE +/- 0.66, N = 3SE +/- 7.87, N = 3SE +/- 3.52, N = 3SE +/- 3.87, N = 3SE +/- 0.30, N = 3SE +/- 1.31, N = 3SE +/- 12.20, N = 6SE +/- 0.11, N = 3SE +/- 0.36, N = 3SE +/- 0.82, N = 3SE +/- 1.19, N = 3223.45271.46381.67402.87408.53441.47556.11562.88566.82660.25718.31841.81952.49988.201103.231. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi