OpenCL CUDA GPGPU GeForce GTX 1070

Intel Xeon E3-1280 v5 testing with a MSI C236A WORKSTATION (MS-7998) v1.0 and Device 8187MB on Ubuntu 16.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1606134-HA-OPENCLCUD74
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
NVIDIA GeForce GTX 1070
June 13 2016
 
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL CUDA GPGPU GeForce GTX 1070OpenBenchmarking.orgPhoronix Test SuiteIntel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MBSamsung SSD 950 PRO 256GBDevice 8187MB (1505/4006MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-22-generic (x86_64)Unity 7.4.0NVIDIA 367.184.5.01.0.8GCC 5.3.1 20160413 + CUDA 8.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCL CUDA GPGPU GeForce GTX 1070 BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - Scaling Governor: intel_pstate powersave- GPU Compute Cores: 1920- GPU Compute Cores: 1920.

OpenCL CUDA GPGPU GeForce GTX 1070financebench: Monte-Carlo OpenCLfinancebench: Black-Scholes OpenCLmixbench: Integermixbench: Double Precisionmixbench: Single Precisionshoc: CUDA - Triadshoc: CUDA - FFT SPshoc: OpenCL - Triadshoc: CUDA - MD5 Hashshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: CUDA - Max SP Flopsshoc: OpenCL - Max SP Flopsshoc: CUDA - Bus Speed Downloadshoc: CUDA - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: CUDA - Texture Read Bandwidthshoc: OpenCL - Texture Read Bandwidthcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerocaffe: CUDAjuliagpu: GPUsmallpt-gpu: GPU - Complexsmallpt-gpu: GPU - Cornellsmallpt-gpu: GPU - Caustic3luxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRNVIDIA GeForce GTX 1070597.229.572016.00227.456585.7514.76368.5611.738.45289.518.377096.497064.7112.4713.2112.4713.22503.81446.4639.0418.0118.6637.3837.1011704.03142650486.331465855842146585596014658560863028730315716OpenBenchmarking.org

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Monte-Carlo OpenCLNVIDIA GeForce GTX 1070130260390520650SE +/- 0.25, N = 3597.221. (CXX) g++ options: -O3 -lOpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLNVIDIA GeForce GTX 10703691215SE +/- 0.01, N = 39.571. (CXX) g++ options: -O3 -lOpenCL

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2016-06-06Benchmark: IntegerNVIDIA GeForce GTX 1070400800120016002000SE +/- 26.07, N = 32016.001. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Double PrecisionNVIDIA GeForce GTX 107050100150200250SE +/- 0.28, N = 3227.451. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Single PrecisionNVIDIA GeForce GTX 107014002800420056007000SE +/- 2.45, N = 36585.751. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: TriadNVIDIA GeForce GTX 107048121620SE +/- 0.01, N = 314.761. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPNVIDIA GeForce GTX 107080160240320400SE +/- 1.19, N = 3368.561. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadNVIDIA GeForce GTX 10703691215SE +/- 0.01, N = 311.731. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashNVIDIA GeForce GTX 1070246810SE +/- 0.00, N = 38.451. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPNVIDIA GeForce GTX 107060120180240300SE +/- 0.55, N = 3289.511. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashNVIDIA GeForce GTX 1070246810SE +/- 0.00, N = 38.371. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Max SP FlopsNVIDIA GeForce GTX 107015003000450060007500SE +/- 52.99, N = 37096.491. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsNVIDIA GeForce GTX 107015003000450060007500SE +/- 1.82, N = 37064.711. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Bus Speed DownloadNVIDIA GeForce GTX 10703691215SE +/- 0.00, N = 312.471. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Bus Speed ReadbackNVIDIA GeForce GTX 10703691215SE +/- 0.00, N = 313.211. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadNVIDIA GeForce GTX 10703691215SE +/- 0.01, N = 312.471. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackNVIDIA GeForce GTX 10703691215SE +/- 0.00, N = 313.221. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthNVIDIA GeForce GTX 1070110220330440550SE +/- 0.81, N = 3503.811. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthNVIDIA GeForce GTX 1070100200300400500SE +/- 0.42, N = 3446.461. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

CUDA Mini-Nbody

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalNVIDIA GeForce GTX 1070918273645SE +/- 0.05, N = 339.04

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingNVIDIA GeForce GTX 107048121620SE +/- 0.01, N = 318.01

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingNVIDIA GeForce GTX 1070510152025SE +/- 0.00, N = 318.66

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutNVIDIA GeForce GTX 1070918273645SE +/- 0.03, N = 337.38

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroNVIDIA GeForce GTX 1070918273645SE +/- 0.05, N = 337.10

Caffe AlexNet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CUDANVIDIA GeForce GTX 10703K6K9K12K15KSE +/- 7.21, N = 311704.031. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUNVIDIA GeForce GTX 107030M60M90M120M150MSE +/- 166070.45, N = 3142650486.331. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: ComplexNVIDIA GeForce GTX 1070300M600M900M1200M1500MSE +/- 19.34, N = 314658558421. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: CornellNVIDIA GeForce GTX 1070300M600M900M1200M1500MSE +/- 21.94, N = 314658559601. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic3NVIDIA GeForce GTX 1070300M600M900M1200M1500MSE +/- 23.09, N = 314658560861. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelNVIDIA GeForce GTX 10706001200180024003000SE +/- 1.20, N = 33028

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneNVIDIA GeForce GTX 107016003200480064008000SE +/- 41.33, N = 37303

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRNVIDIA GeForce GTX 10703K6K9K12K15KSE +/- 3.00, N = 315716