OpenCL January 2018 Linux Radeon ROCm NVIDIA

Tests by Michael Larabekl for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1801186-PTS-OPENCLJA59
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

NVIDIA GPU Compute 4 Tests
OpenCL 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 1060
January 18 2018
 
GeForce GTX 1070
January 17 2018
 
GeForce GTX 1070 Ti
January 17 2018
 
GeForce GTX 1080
January 17 2018
 
GeForce GTX 1080 Ti
January 17 2018
 
GeForce GTX 680
January 18 2018
 
GeForce GTX 780 Ti
January 18 2018
 
GeForce GTX 960
January 18 2018
 
GeForce GTX 970
January 17 2018
 
GeForce GTX 980 Ti
January 18 2018
 
Radeon R9 285
January 18 2018
 
Radeon R9 290
January 18 2018
 
Radeon R9 Fury
January 18 2018
 
Radeon RX 580
January 18 2018
 
Radeon RX Vega 56
January 18 2018
 
Radeon RX Vega 64
January 18 2018
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL January 2018 Linux Radeon ROCm NVIDIAOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-A (0606 BIOS)Intel Device 3ec216384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1607/4006MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)MSI AMD Radeon RX 580 8192MBXFX AMD Radeon R9 200 2048MBXFX AMD Radeon R9 200 4096MBSapphire AMD Radeon 4096MBAMD Radeon RX Vega 8192MBRealtek ALC1220DELL P2415QIntel ConnectionUbuntu 17.104.15.0-999-generic (x86_64) 201801144.13.0-25-generic (x86_64)GNOME Shell 3.26.2NVIDIA 390.12modesetting 1.19.54.5.04.5 Mesa 17.4.0-devel- padoka PPA (LLVM 7.0.0)GCC 7.2.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelsDesktopDisplay DriversOpenGLsCompilerFile-SystemScreen ResolutionOpenCL January 2018 Linux Radeon ROCm NVIDIA BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 680: GPU Compute Cores: 1536

GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1070 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 960GeForce GTX 980 TiGeForce GTX 680Radeon RX 580Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX Vega 64Radeon RX Vega 56Result OverviewPhoronix Test Suite100%211%322%433%544%LuxMarkcl-memcl-memcl-memGPU - Luxball HDRWriteCopyRead

OpenCL January 2018 Linux Radeon ROCm NVIDIAfahbench: luxmark: GPU - Luxball HDRluxmark: GPU - Microphoneluxmark: GPU - Hotelmandelgpu: GPUjuliagpu: GPUdarktable: Server Room - OpenCLdarktable: Masskrug - OpenCLdarktable: Boat - OpenCLcl-mem: Writecl-mem: Readcl-mem: Copyshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 1070 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 960GeForce GTX 980 TiGeForce GTX 680Radeon RX 580Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX Vega 64Radeon RX Vega 561577377362939168963001.60154016267.601.144.023.12198.57206.53188.53449.347126.6510.73556.111222765422986218025195.33178980675.831.154.012.94222229.90211.33519.769441.7614.42660.25190.1218898102943730289975392.23204310038.971.063.972.45343.70340.23319.37592.1113280.7720.14988.20148.851524779232901207141915.30171841227.501.154.023.02196.40206.60188.30501.389057.2013.83566.8287.52105046061199895506772.47112700768.904.867.504.67134.90144.67126.80296.384362.556.54408.5399.461140755352107115144009.93121183382.371.354.163.91145.57154.47140.80400.124803.557.37402.8773.6096225243131377073285.6384426813.375.197.717.18257.70272.70239.20285.844948.604.68441.4760.4859243263125966365537.1085979128.7011.6820.2616.5375.5781.9072.30283.622959.584.49223.45111.101475585492342131281594.13139018187.931.544.183.44244.40266.80219351.126216.249.34718.3141.144595256974748480538.0056403401.004.597.329.44150.13135.73121.50246.642.26271.4614712140879323817.30191808662.23181.63145.70184.10213.026263.258.00562.888988725136.07110.80126.071090.993281.794.18381.6715845119164943031.83198849223.60209.13116.80187.43224551654102371755.73208047859.87385.73111.90209.80251.107145.189.11841.8125012174577802.60254843524.60372.23160.87223.77426.7912794.8716.081103.23241921591155441242.77241899430.33318.83151.90205.20373.3010681.5013.68952.49OpenBenchmarking.org

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 Ti4080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.25, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.60, N = 3SE +/- 0.17, N = 341.14111.1060.4873.6099.4687.52148.85190.12

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10705K10K15K20K25KSE +/- 0.58, N = 3SE +/- 373.21, N = 3SE +/- 123.17, N = 3SE +/- 0.88, N = 3SE +/- 1.33, N = 3SE +/- 0.67, N = 3SE +/- 16.17, N = 3SE +/- 53.12, N = 3SE +/- 21.33, N = 3SE +/- 36.83, N = 3SE +/- 2.33, N = 3SE +/- 19.10, N = 3SE +/- 1.15, N = 3SE +/- 90.23, N = 3SE +/- 3.18, N = 3SE +/- 61.00, N = 32419225012224551584589881471245951475559249622114071050415247188981222715773

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10702K4K6K8K10KSE +/- 2.00, N = 3SE +/- 2.31, N = 3SE +/- 7.13, N = 3SE +/- 6.06, N = 3SE +/- 4.73, N = 3SE +/- 1.53, N = 3SE +/- 0.67, N = 3SE +/- 2.19, N = 3SE +/- 0.88, N = 3SE +/- 3.51, N = 325698549326352435535606179231029465427736

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelRadeon RX Vega 56Radeon R9 FuryRadeon R9 290Radeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10708001600240032004000SE +/- 4.33, N = 3SE +/- 4.00, N = 3SE +/- 1.86, N = 3SE +/- 1.00, N = 3SE +/- 3.33, N = 3SE +/- 3.67, N = 3SE +/- 3.00, N = 3SE +/- 17.23, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 12.55, N = 3SE +/- 13.32, N = 3SE +/- 1.15, N = 3SE +/- 9.84, N = 3SE +/- 26.84, N = 31591165411917251408747234212591313210719982901373029862939

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107060M120M180M240M300MSE +/- 794414.93, N = 3SE +/- 741807.19, N = 3SE +/- 120193.62, N = 3SE +/- 83691.30, N = 3SE +/- 115711.88, N = 3SE +/- 25083.83, N = 3SE +/- 510200.05, N = 3SE +/- 121075.12, N = 3SE +/- 258135.31, N = 3SE +/- 503535.67, N = 3SE +/- 431906.66, N = 3SE +/- 1459753.08, N = 3SE +/- 3984585.32, N = 3SE +/- 1913943.64, N = 3SE +/- 1151240.82, N = 3155441242.77174577802.60102371755.7364943031.8379323817.3048480538.00131281594.1366365537.1077073285.63115144009.9395506772.47207141915.30289975392.23218025195.33168963001.601. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPURadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107050M100M150M200M250MSE +/- 621968.09, N = 3SE +/- 415881.11, N = 3SE +/- 639092.02, N = 3SE +/- 517038.05, N = 3SE +/- 577899.40, N = 3SE +/- 188888.98, N = 3SE +/- 1085388.09, N = 3SE +/- 85942.56, N = 3SE +/- 303705.53, N = 3SE +/- 196419.50, N = 3SE +/- 625651.47, N = 3SE +/- 1031171.70, N = 3SE +/- 843495.42, N = 3SE +/- 235273.88, N = 3SE +/- 613736.81, N = 3241899430.33254843524.60208047859.87198849223.60191808662.2356403401.00139018187.9385979128.7084426813.37121183382.37112700768.90171841227.50204310038.97178980675.83154016267.601. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Darktable

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Server Room - Acceleration: OpenCLGeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10703691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.591.5411.685.191.354.861.151.061.151.14

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Masskrug - Acceleration: OpenCLGeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.30, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 37.324.1820.267.714.167.504.023.974.014.02

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Boat - Acceleration: OpenCLGeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107048121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.443.4416.537.183.914.673.022.452.943.12

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107080160240320400SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.22, N = 3SE +/- 0.58, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 1.25, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3318.83372.23385.73209.13136.07181.63150.13244.4075.57257.70145.57134.90196.40343.70222.00198.571. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107070140210280350SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 3SE +/- 2.10, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.40, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3151.90160.87111.90116.80110.80145.70135.73266.8081.90272.70154.47144.67206.60340.23229.90206.531. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 290Radeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 107070140210280350SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.45, N = 3SE +/- 0.43, N = 3SE +/- 0.03, N = 3SE +/- 0.45, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.50, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3205.20223.77209.80187.43126.07184.10121.50219.0072.30239.20140.80126.80188.30319.37211.33188.531. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10702004006008001000SE +/- 1.23, N = 3SE +/- 2.39, N = 3SE +/- 0.99, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 4.62, N = 3SE +/- 1.13, N = 3SE +/- 1.64, N = 3SE +/- 1.27, N = 3SE +/- 2.04, N = 3SE +/- 1.19, N = 3SE +/- 0.48, N = 3SE +/- 0.55, N = 3SE +/- 1.52, N = 3SE +/- 1.87, N = 3373.30426.79251.101090.99213.02246.64351.12283.62285.84400.12296.38501.38592.11519.76449.341. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 285Radeon RX 580GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10703K6K9K12K15KSE +/- 72.77, N = 3SE +/- 48.47, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 12.88, N = 3SE +/- 6.86, N = 3SE +/- 19.88, N = 3SE +/- 25.86, N = 3SE +/- 1.35, N = 3SE +/- 3.79, N = 3SE +/- 67.22, N = 3SE +/- 47.55, N = 3SE +/- 31.44, N = 310681.5012794.877145.183281.796263.256216.242959.584948.604803.554362.559057.2013280.779441.767126.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313.6816.089.114.188.002.269.344.494.687.376.5413.8320.1414.4210.731. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPRadeon RX Vega 56Radeon RX Vega 64Radeon R9 FuryRadeon R9 285Radeon RX 580GeForce GTX 680GeForce GTX 980 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 970GeForce GTX 1070 TiGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 10702004006008001000SE +/- 0.36, N = 3SE +/- 1.19, N = 3SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 3.87, N = 3SE +/- 3.93, N = 5SE +/- 12.20, N = 6SE +/- 1.16, N = 3SE +/- 7.87, N = 3SE +/- 2.48, N = 3SE +/- 0.66, N = 3SE +/- 0.30, N = 3SE +/- 0.82, N = 3SE +/- 1.31, N = 3SE +/- 3.52, N = 3952.491103.23841.81381.67562.88271.46718.31223.45441.47402.87408.53566.82988.20660.25556.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi