NVIDIA CUDA cuDNN Caffe vs. Intel CPUs On Linux Run

Comparison of an NVIDIA CUDA Caffe build against a CPU-only Caffe build, using both the AlexNet and GoogLeNet models. Tests by Michael Larabel. All tests ran on the same system, apart from swapping out the CPU for the other CPU-only runs.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1608296-LO-CUDA200NV70

Runs in this result file:
GeForce GTX TITAN X: August 29 2016
GeForce GTX 1080: August 29 2016
GeForce GTX 950: August 29 2016

System details (OpenBenchmarking.org, Phoronix Test Suite):

Processor: Intel Xeon E5-2609 v4 @ 1.70GHz (8 Cores)
Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0
Chipset: Intel Xeon E7 v4/Xeon
Memory: 16384MB
Disk: 3 x 120GB TOSHIBA-TR150
Graphics: NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)
Graphics: NVIDIA GeForce GTX 1080 8192MB (1604/5005MHz)
Graphics: eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)
Audio: Realtek ALC1150
Network: Intel Connection
OS: Ubuntu 16.04
Kernel: 4.4.0-34-generic (x86_64)
Desktop: Unity 7.4.0
Display Server: X Server 1.18.3
Display Driver: NVIDIA 367.35
OpenGL: 4.5.0
Vulkan: 1.0.8
Compiler: GCC 5.4.0 20160609 + CUDA 8.0
File-System: ext4
Screen Resolution: 2560x1440

System Logs
- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- Scaling Governor: intel_pstate performance

Result overview (Milli-Seconds, Fewer Is Better; "-" means no result for that run):

System                caffe: 200 - AlexNet   caffe: CPU AlexNet   caffe: 200 - Googlenet   caffe: CPU Googlenet
GeForce GTX TITAN X   2297.70                -                    6410.67                  -
GeForce GTX 1080      1990.28                -                    5391.08                  -
GeForce GTX 950       6141.34                -                    14154.97                 -
Xeon E5-2687W v3      427198                 427198               825606                   825606
Core i7-5960X         424764                 424764               814653                   814653
Xeon E5-2609 v4       831629                 831629               1603103                  1603103
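To make the gap between the GPU and CPU runs easier to read, the averages in the result overview can be turned into speedup ratios. The following is a minimal sketch, not part of the original result file: the dictionary names and the helper functions `speedup_over` and `fastest` are hypothetical, and the numbers are copied from the "200 - AlexNet" and "200 - Googlenet" columns.

```python
# Average times in milliseconds (fewer is better), copied from the result overview.
# The variable and function names below are illustrative, not from the test suite.
alexnet_ms = {
    "GeForce GTX TITAN X": 2297.70,
    "GeForce GTX 1080": 1990.28,
    "GeForce GTX 950": 6141.34,
    "Xeon E5-2687W v3": 427198.0,
    "Core i7-5960X": 424764.0,
    "Xeon E5-2609 v4": 831629.0,
}
googlenet_ms = {
    "GeForce GTX TITAN X": 6410.67,
    "GeForce GTX 1080": 5391.08,
    "GeForce GTX 950": 14154.97,
    "Xeon E5-2687W v3": 825606.0,
    "Core i7-5960X": 814653.0,
    "Xeon E5-2609 v4": 1603103.0,
}


def speedup_over(results, baseline):
    """Return how many times faster each run is than the baseline run."""
    base = results[baseline]
    return {name: base / ms for name, ms in results.items()}


def fastest(results):
    """Return the name of the run with the lowest time."""
    return min(results, key=results.get)


for label, results in (("AlexNet", alexnet_ms), ("Googlenet", googlenet_ms)):
    ratios = speedup_over(results, "Core i7-5960X")
    print(f"{label}: fastest run is {fastest(results)}")
    for name, ratio in sorted(ratios.items(), key=lambda kv: -kv[1]):
        print(f"  {name:<22} {ratio:8.1f}x vs. Core i7-5960X")
```

The Core i7-5960X is used as the baseline only because it is the fastest of the three CPU runs; any other run name can be passed instead.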

Caffe AlexNet

Caffe AlexNet 2016-06-11
Iterations: 200 - Build: AlexNet
OpenBenchmarking.org, Milli-Seconds, Fewer Is Better

System                Avg         SE +/-    Min       Max       N
Xeon E5-2687W v3      427198.33   561.17    426177    428112    3
Core i7-5960X         424764.00   627.11    423611    425768    3
Xeon E5-2609 v4       831629.33   4812.82   826649    841253    3
GeForce GTX TITAN X   2297.70     4.23      2292.88   2306.14   3
GeForce GTX 1080      1990.28     3.77      1984.57   1997.40   3
GeForce GTX 950       6141.34     7.74      6130.07   6156.17   3

1. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
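With N = 3, the minimum, average, and maximum are enough to recover the middle sample, so the reported SE can be cross-checked. The sketch below assumes that SE is the sample standard deviation divided by the square root of N; that formula is an assumption about how the figure is computed, although it does match the GeForce GTX TITAN X and GeForce GTX 1080 values above to within rounding. The function name `standard_error_from_summary` is hypothetical.

```python
import math


def standard_error_from_summary(minimum, average, maximum):
    """Estimate the standard error of three samples from min/avg/max.

    Valid only for N = 3: the middle sample follows from the average.
    Assumes SE = sample standard deviation / sqrt(N).
    """
    n = 3
    middle = n * average - minimum - maximum
    samples = [minimum, middle, maximum]
    mean = sum(samples) / n
    # Sample variance uses the n - 1 denominator.
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance / n)


# Min / Avg / Max for two of the GPU runs (Iterations: 200 - Build: AlexNet).
titan_x = standard_error_from_summary(2292.88, 2297.70, 2306.14)
gtx_1080 = standard_error_from_summary(1984.57, 1990.28, 1997.40)
print(f"GeForce GTX TITAN X: SE ~ {titan_x:.2f} (reported 4.23)")
print(f"GeForce GTX 1080:    SE ~ {gtx_1080:.2f} (reported 3.77)")
```

Because the published averages are rounded to two decimals, the reconstructed value can differ from the reported one in the last digit.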

Caffe AlexNet 2016-06-11
Build: CPU AlexNet
OpenBenchmarking.org, Milli-Seconds, Fewer Is Better

System                Avg         SE +/-    Min       Max       N
Xeon E5-2687W v3      427198.33   561.17    426177    428112    3
Core i7-5960X         424764.00   627.11    423611    425768    3
Xeon E5-2609 v4       831629.33   4812.82   826649    841253    3

Caffe AlexNet 2016-06-11
Iterations: 200 - Build: Googlenet
OpenBenchmarking.org, Milli-Seconds, Fewer Is Better

System                Avg          SE +/-    Min        Max        N
Xeon E5-2687W v3      825605.67    1923.36   822970     829350     3
Core i7-5960X         814653.33    737.95    813323     815872     3
Xeon E5-2609 v4       1603103.33   2962.73   1598950    1608840    3
GeForce GTX TITAN X   6410.67      17.08     6387.00    6443.84    3
GeForce GTX 1080      5391.08      3.61      5384.33    5396.68    3
GeForce GTX 950       14154.97     6.83      14145.70   14168.30   3

Caffe AlexNet 2016-06-11
Build: CPU Googlenet
OpenBenchmarking.org, Milli-Seconds, Fewer Is Better

System                Avg          SE +/-    Min        Max        N
Xeon E5-2687W v3      825605.67    1923.36   822970     829350     3
Core i7-5960X         814653.33    737.95    813323     815872     3
Xeon E5-2609 v4       1603103.33   2962.73   1598950    1608840    3