NVIDIA CUDA cuDNN Caffe vs. Intel CPUs On Linux Run

NVIDIA CUDA Caffe build versus CPU-only Caffe build with both AlexNet and GoogleNet. Tests by Michael Larabel and all tests on same system sans switching out CPUs for other CPU-only runs.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1610067-STYL-160829608
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX TITAN X
August 29 2016
 
GeForce GTX 1080
August 29 2016
 
GeForce GTX 950
August 29 2016
 
GeForce GTX 1080 Zotac
October 06 2016
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA CUDA cuDNN Caffe vs. Intel CPUs On Linux RunProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 ZotacIntel Xeon E5-2609 v4 @ 1.70GHz (8 Cores)MSI X99A WORKSTATION (MS-7A54) v1.0Intel Xeon E7 v4/Xeon16384MB3 x 120GB TOSHIBA-TR150NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-34-generic (x86_64)Unity 7.4.0X Server 1.18.3NVIDIA 367.354.5.01.0.8GCC 5.4.0 20160609 + CUDA 8.0ext42560x1440NVIDIA GeForce GTX 1080 8192MB (1604/5005MHz)eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)Intel Core i7-6700 @ 4.00GHz (8 Cores)Gigabyte H170-Gaming 3Intel Sky Lake63488MB1000GB Seagate ST1000DM003-1SB1 + Samsung SSD 950 PRO 256GBZotac NVIDIA Device 1b80Qualcomm Atheros Killer E220x Gigabit4.4.0-38-generic (x86_64)800x600OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- GeForce GTX TITAN X: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 950: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Zotac: Scaling Governor: intel_pstate powersave

NVIDIA CUDA cuDNN Caffe vs. Intel CPUs On Linux Runcaffe: 200 - AlexNetcaffe: CPU AlexNetcaffe: 200 - Googlenetcaffe: CPU GooglenetGeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 ZotacXeon E5-2687W v3Core i7-5960XXeon E5-2609 v42297.706410.671990.285391.086141.3414154.971793.673575074507.8870747942719842719882560682560642476442476481465381465383162983162916031031603103OpenBenchmarking.org

Caffe AlexNet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Iterations: 200 - Build: AlexNetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 Zotac200K400K600K800K1000KSE +/- 561.17, N = 3SE +/- 627.11, N = 3SE +/- 4812.82, N = 3SE +/- 4.23, N = 3SE +/- 3.77, N = 3SE +/- 7.74, N = 3SE +/- 0.66, N = 3427198.00424764.00831629.002297.701990.286141.341793.671. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Iterations: 200 - Build: AlexNetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 Zotac140K280K420K560K700KMin: 426177 / Avg: 427198.33 / Max: 428112Min: 423611 / Avg: 424764 / Max: 425768Min: 826649 / Avg: 831629.33 / Max: 841253Min: 2292.88 / Avg: 2297.7 / Max: 2306.14Min: 1984.57 / Avg: 1990.28 / Max: 1997.4Min: 6130.07 / Avg: 6141.34 / Max: 6156.17Min: 1792.7 / Avg: 1793.67 / Max: 1794.941. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU AlexNetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX 1080 Zotac200K400K600K800K1000KSE +/- 561.17, N = 3SE +/- 627.11, N = 3SE +/- 4812.82, N = 3SE +/- 235.33, N = 34271984247648316293575071. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU AlexNetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX 1080 Zotac140K280K420K560K700KMin: 426177 / Avg: 427198.33 / Max: 428112Min: 423611 / Avg: 424764 / Max: 425768Min: 826649 / Avg: 831629.33 / Max: 841253Min: 357039 / Avg: 357507.33 / Max: 3577821. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Iterations: 200 - Build: GooglenetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 Zotac300K600K900K1200K1500KSE +/- 1923.36, N = 3SE +/- 737.95, N = 3SE +/- 2962.73, N = 3SE +/- 17.08, N = 3SE +/- 3.61, N = 3SE +/- 6.83, N = 3SE +/- 5.49, N = 3825606.00814653.001603103.006410.675391.0814154.974507.881. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Iterations: 200 - Build: GooglenetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX TITAN XGeForce GTX 1080GeForce GTX 950GeForce GTX 1080 Zotac300K600K900K1200K1500KMin: 822970 / Avg: 825605.67 / Max: 829350Min: 813323 / Avg: 814653.33 / Max: 815872Min: 1598950 / Avg: 1603103.33 / Max: 1608840Min: 6387 / Avg: 6410.67 / Max: 6443.84Min: 5384.33 / Avg: 5391.08 / Max: 5396.68Min: 14145.7 / Avg: 14154.97 / Max: 14168.3Min: 4499.08 / Avg: 4507.88 / Max: 4517.981. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU GooglenetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX 1080 Zotac300K600K900K1200K1500KSE +/- 1923.36, N = 3SE +/- 737.95, N = 3SE +/- 2962.73, N = 3SE +/- 363.75, N = 382560681465316031037074791. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU GooglenetXeon E5-2687W v3Core i7-5960XXeon E5-2609 v4GeForce GTX 1080 Zotac300K600K900K1200K1500KMin: 822970 / Avg: 825605.67 / Max: 829350Min: 813323 / Avg: 814653.33 / Max: 815872Min: 1598950 / Avg: 1603103.33 / Max: 1608840Min: 706845 / Avg: 707478.67 / Max: 7081051. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas