Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark caffe.

Project Site

caffe.berkeleyvision.org

Test Created

14 November 2015

Last Updated

26 September 2020

Test Maintainer

Michael Larabel

Test Type

System

Average Install Time

24 Seconds

Average Run Time

2 Minutes, 1 Second

Test Dependencies

C/C++ Compiler Toolchain + CMake + Python + BLAS (Basic Linear Algebra Sub-Routine) + C++ Boost + Linear Algebra Pack + Snappy Compression + GFlags + OpenCV + HDF5

Accolades

150k+ Downloads

Supported Platforms

* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 2 February 2025.

Revision History

pts/caffe-1.5.0 [View Source] Sat, 26 Sep 2020 21:35:45 GMT
Overhaul Caffe test profile with latest Git snapshot, switch to CMake build system, clean up test options, etc.

pts/caffe-1.4.0 [View Source] Sat, 29 Dec 2018 11:15:41 GMT
Update Caffe to latest Git snapshot to hopefully workaround build problems on newer distros.

pts/caffe-1.3.3 [View Source] Sun, 01 Apr 2018 18:50:19 GMT
Basic fix for OpenCV 3.4.

pts/caffe-1.3.2 [View Source] Wed, 04 Jan 2017 11:07:36 GMT
Fix for OpenCV 3.2.

pts/caffe-1.3.1 [View Source] Wed, 28 Dec 2016 20:36:42 GMT
Don't show title string of "Caffe AlexNet" but "Caffe" with recent test profile versions supporting more than just AlexNet.

pts/caffe-1.3.0 [View Source] Wed, 28 Dec 2016 20:34:27 GMT
Update to latest Git snapshot to fix OpenCV compatibility.

pts/caffe-1.2.0 [View Source] Mon, 15 Aug 2016 16:11:16 GMT
Add Googlenet support, decrease CPU only iteration count.

pts/caffe-1.1.1 [View Source] Sun, 12 Jun 2016 18:32:44 GMT
Add OpenCV and OpenBLAS support.

pts/caffe-1.1.0 [View Source] Sat, 11 Jun 2016 19:32:55 GMT
Update

pts/caffe-1.0.0 [View Source] Sat, 14 Nov 2015 15:29:45 GMT
Initial commit of Caffe deep learning framework and with this benchmark using the AlexNet model for benchmarking.

Suites Using This Test

Machine Learning

HPC - High Performance Computing

NVIDIA GPU Compute

Performance Metrics

Analyze Test Configuration:

Caffe 2020-02-13

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.org metrics for this test profile configuration based on 679 public results since 26 September 2020 with the latest data as of 6 September 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Percentile Rank

# Compatible Public Results

Milli-Seconds (Average)

AMD Ryzen 7 5800X3D 8-Core

97th

61026 ^{+/- 486}

Intel Core i5-8400

97th

62602 ^{+/- 247}

Intel Core i5-9400F

95th

64228 ^{+/- 734}

Intel Core i9-11900K

95th

64307 ^{+/- 2674}

Intel Core i5-12400

95th

64710 ^{+/- 67}

AMD EPYC 7763 64-Core

93rd

67800 ^{+/- 662}

Intel Core i3-8100

92nd

69547 ^{+/- 56}

AMD Ryzen 7 5800X 8-Core

91st

70178 ^{+/- 1779}

AMD Ryzen 5 5600X 6-Core

88th

73630 ^{+/- 669}

Intel Core i5-7600K

87th

74271 ^{+/- 70}

AMD Ryzen 9 5950X 16-Core

85th

75620 ^{+/- 5949}

AMD Ryzen 3 3300X 4-Core

85th

75950 ^{+/- 410}

AMD Ryzen 9 5900X 12-Core

85th

76348 ^{+/- 3287}

AMD EPYC 7373X 16-Core

82nd

79773 ^{+/- 399}

Intel Xeon E-2288G

82nd

79908 ^{+/- 403}

Intel Core i5-4670

81st

80617 ^{+/- 1285}

Intel Core i9-10980XE

79th

81625 ^{+/- 11296}

Intel Core i9-9900KS

78th

81752 ^{+/- 84}

Mid-Tier

75th

> 87602

Intel Core i7-9700TE

75th

88529 ^{+/- 1791}

AMD EPYC 75F3 32-Core

75th

89252 ^{+/- 3692}

Intel Core i9-9900K

74th

89443 ^{+/- 1333}

Intel Core i5-10600K

74th

89564 ^{+/- 81}

2 x AMD EPYC 75F3 32-Core

73rd

90389 ^{+/- 8876}

AMD Ryzen 3 1300X

72nd

93847 ^{+/- 217}

Intel Core i7-7900X

71st

96087 ^{+/- 128}

Intel Xeon Gold 5217

70th

97335

AMD Ryzen 7 3800XT 8-Core

69th

100406 ^{+/- 5139}

AMD Ryzen 5 3600XT 6-Core

69th

101069 ^{+/- 677}

Intel Xeon Silver 4215R

68th

101598

AMD Ryzen 7 3700X 8-Core

67th

103831 ^{+/- 1771}

AMD Ryzen 9 3950X 16-Core

66th

104489 ^{+/- 1237}

Intel Core i7-7700K

66th

105259 ^{+/- 2634}

AMD Ryzen 7 4700U

66th

105463

2 x AMD EPYC 7713 64-Core

64th

106754 ^{+/- 3969}

Intel Xeon Gold 6226R

64th

107095

AMD Ryzen 9 3900XT 12-Core

63rd

107223 ^{+/- 4085}

Intel Core i7-8086K

62nd

107769 ^{+/- 480}

AMD Ryzen 9 3900X 12-Core

62nd

108091 ^{+/- 813}

2 x AMD EPYC 7763 64-Core

61st

108205 ^{+/- 787}

Intel Core i7-8700K

61st

108482 ^{+/- 201}

AMD Ryzen Threadripper 3990X 64-Core

58th

112044 ^{+/- 301}

Intel Xeon Silver 4214R

58th

112281

AMD Ryzen 5 4500U

57th

112603 ^{+/- 604}

Intel Xeon Platinum 8280

56th

114228 ^{+/- 1992}

Intel Core i7-4790K

56th

114458 ^{+/- 524}

Intel Xeon Gold 6258R

56th

114831 ^{+/- 794}

Intel Xeon Gold 5218

55th

116644 ^{+/- 8097}

Intel Xeon Gold 5220R

55th

116913 ^{+/- 876}

Intel Xeon E3-1275 v6

54th

117435 ^{+/- 102}

AMD Ryzen 7 1800X Eight-Core

54th

117632 ^{+/- 405}

Intel Xeon E5-1680 v3

53rd

118247 ^{+/- 227}

Intel Xeon E5-2609 v4

51st

119687 ^{+/- 404}

AMD Ryzen 7 2700X Eight-Core

51st

120153 ^{+/- 202}

2 x AMD EPYC 7373X 16-Core

51st

120202 ^{+/- 1993}

Median

50th

120383

Intel Core i9-7980XE

50th

120562 ^{+/- 155}

Intel Core i7-7740K

48th

121645 ^{+/- 146}

Intel Core i3-10100

46th

122846 ^{+/- 5556}

AMD Ryzen 7 2700 Eight-Core

46th

122885 ^{+/- 956}

AMD EPYC 7742 64-Core

46th

123132 ^{+/- 2195}

AMD EPYC 7662 64-Core

44th

125387 ^{+/- 2866}

AMD EPYC 7702 64-Core

44th

126334 ^{+/- 2828}

Intel Xeon Silver 4216

44th

126960 ^{+/- 8050}

AMD Ryzen Threadripper 3960X 24-Core

44th

127408 ^{+/- 173}

Intel Core i5-3470

42nd

128586 ^{+/- 802}

Intel Core i7-1065G7

40th

129971 ^{+/- 285}

2 x Intel Xeon Gold 5220R

39th

130377 ^{+/- 1800}

Intel Xeon E3-1280 v5

39th

130901 ^{+/- 185}

AMD Ryzen Threadripper 3970X 32-Core

38th

132553 ^{+/- 514}

Intel Xeon E3-1245 v5

37th

133822 ^{+/- 48}

Intel Xeon E5-2687W v3

36th

134387 ^{+/- 694}

Intel Xeon E3-1260L v5

35th

137302 ^{+/- 134}

AMD EPYC 7F32 8-Core

33rd

139688 ^{+/- 1120}

AMD EPYC 7F52 16-Core

32nd

142703 ^{+/- 4586}

Intel Core i7-9750H

32nd

142973 ^{+/- 424}

Intel Core i7-4960X

31st

143622 ^{+/- 167}

AMD EPYC 7262 8-Core

30th

148683

AMD EPYC 7302P 16-Core

28th

149988 ^{+/- 3124}

AMD Ryzen Threadripper 2950X 16-Core

27th

150575 ^{+/- 162}

AMD EPYC 7F72 24-Core

26th

151039 ^{+/- 1095}

Low-Tier

25th

> 151327

AMD EPYC 7232P 8-Core

24th

151623 ^{+/- 299}

AMD EPYC 7402P 24-Core

23rd

152268 ^{+/- 2920}

AMD EPYC 7252 8-Core

23rd

152406

AMD EPYC 7552 48-Core

23rd

152465 ^{+/- 20877}

AMD EPYC 7282 16-Core

22nd

153144 ^{+/- 3125}

AMD EPYC 7272 12-Core

19th

157338 ^{+/- 3321}

Intel Core i7-6700HQ

17th

158865 ^{+/- 409}

AMD EPYC 7542 32-Core

17th

160293 ^{+/- 3276}

Intel Core i7-1165G7

17th

161676 ^{+/- 9590}

AMD EPYC 7642 48-Core

17th

162384 ^{+/- 12424}

AMD EPYC 7502 32-Core

16th

163216

2 x AMD EPYC 7742 64-Core

15th

165029 ^{+/- 8986}

AMD EPYC 7502P 32-Core

15th

166472 ^{+/- 3205}

AMD EPYC 7352 24-Core

14th

169060

AMD EPYC 7452 32-Core

13th

170209

AMD Ryzen Threadripper 2970WX 24-Core

10th

173691 ^{+/- 242}

AMD EPYC 7532 32-Core

9th

177498 ^{+/- 5017}

AMD A10-7850K APU

8th

193058 ^{+/- 6247}

AMD Ryzen 3 3200U

7th

203286 ^{+/- 3004}

2 x AMD EPYC 7F72 24-Core

7th

204085 ^{+/- 1020}

Intel Core i7-8565U

6th

209788 ^{+/- 545}

AMD EPYC 7601 32-Core

5th

213239

2 x AMD EPYC 7601 32-Core

4th

214117 ^{+/- 1527}

2 x AMD EPYC 7F52 16-Core

3rd

224315 ^{+/- 647}

Intel Core i7-2700K

2nd

244159 ^{+/- 8571}

AMD EPYC 7551 32-Core

2nd

248715 ^{+/- 4600}

Detailed Performance Overview

Based on OpenBenchmarking.org data, the selected test / test configuration (Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 200) has an average run-time of 8 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.3%.

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set

Support

Instructions Detected

SSE2 (SSE2)

Used by default on supported hardware.

PUNPCKLQDQ MOVDQA MOVDQU CVTSS2SD MOVD ADDSD DIVSD CVTTSD2SI MOVUPD CVTPS2PD CVTPD2PS CVTSD2SS PSHUFD XORPD SHUFPD SUBSD MULSD CVTSI2SD MOVAPD UCOMISD UNPCKLPD CVTDQ2PS COMISD CVTDQ2PD SQRTSD ANDPD ANDNPD CMPNLESD ORPD DIVPD MULPD MINSD MINPD MAXPD MAXSD CMPLTPD ADDPD CMPLTSD MOVHPD SUBPD MOVLPD UNPCKHPD PMULUDQ PSRLDQ

Advanced Vector Extensions (AVX)

Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

VZEROUPPER VINSERTF128 VEXTRACTF128 VPERM2F128 VPERMILPS VPERMILPD VBROADCASTSS VBROADCASTSD VMASKMOVPS

Advanced Vector Extensions 2 (AVX2)

Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

VPERM2I128 VPERMD VPERMPD VPBROADCASTQ VPBROADCASTD VPERMQ VGATHERQPS VEXTRACTI128 VPMASKMOVD VINSERTI128 VPGATHERDD VPBROADCASTW

FMA (FMA)

VFMADD132SS VFMADD132SD VFMSUB213PS VFMSUB132SS VFMSUB213PD VFMSUB132SD VFNMADD213SD VFNMADD213SS VFMADD231SS VFNMADD231SS VFMADD213SS VFNMADD132SS VFMADD231SD VFNMADD132SD VFMADD213SD VFMADD132PS VFMADD132PD VFNMADD132PD VFNMADD213PD VFNMADD132PS VFNMADD213PS VFMSUB231SD VFNMADD231SD VFMADD231PD

Advanced Vector Extensions 512 (AVX512)

Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).

(ZMM REGISTER USE)

The test / benchmark does honor compiler flag changes.

Last automated analysis: 17 January 2022

This test profile binary relies on the shared libraries libcaffe.so.1.0.0, libglog.so.0, libgflags.so.2.2, libprotobuf.so.23, libc.so.6, libm.so.6, liblmdb.so.0, libopenblas.so.0, libunwind.so.8, libpthread.so.0, libz.so.1, libcrypto.so.3, libcurl.so.4, libsz.so.2, libgfortran.so.5, liblzma.so.5, libnghttp2.so.14, libidn2.so.0, librtmp.so.1, libssh.so.4, libpsl.so.5, libssl.so.3, libldap-2.5.so.0, liblber-2.5.so.0, libzstd.so.1, libbrotlidec.so.1, libaec.so.0, libquadmath.so.0, libunistring.so.2, libgnutls.so.30, libhogweed.so.6, libnettle.so.8, libgmp.so.10, libkrb5.so.3, libk5crypto.so.3, libkrb5support.so.0, libsasl2.so.2, libbrotlicommon.so.1, libp11-kit.so.0, libtasn1.so.6, libkeyutils.so.1, libresolv.so.2, libffi.so.8.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture

Kernel Identifier

Verified On

Intel / AMD x86 64-bit

x86_64

(Many Processors)

IBM POWER (PowerPC) 64-bit

ppc64le

POWER9 44-Core

ARMv8 64-bit

aarch64

ARMv8 Cortex-A72 6-Core, ARMv8 Neoverse-V1, HiSilicon TSV110

hurricane-server 1 System - 322 Benchmark Results	AMD Eng Sample 100-000000897-03 - Supermicro Super Server H13SSL-N v2.00 - AMD Device 14a4 Ubuntu 24.04 - 6.8.0-50-generic - GNOME Shell 46.0
hurricane-server 1 System - 322 Benchmark Results
RTX 4070 SUPER 6 Systems - 199 Benchmark Results	Intel Core Ultra 9 285K - MSI MEG Z890 UNIFY-X - Intel Device ae7f Ubuntu 24.10 - 6.12.1-061201-generic - GNOME Shell 47.0
RTX 4070 SUPER 5 Systems - 211 Benchmark Results	Intel Core Ultra 9 285K - MSI MEG Z890 UNIFY-X - Intel Device ae7f Ubuntu 24.10 - 6.12.1-061201-generic - GNOME Shell 47.0
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
phoronix-machine-learning.txt 1 System - 323 Benchmark Results	AMD Ryzen Threadripper 7960X 24-Cores - Gigabyte TRX50 AERO D - AMD Device 14a4 Ubuntu 24.04 - 6.8.0-48-generic - GNOME Shell 46.0
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
test Featured Processor Comparison	2 x Intel Xeon E5-2680 v4
102424machinelearningtest 1 System - 342 Benchmark Results	Intel Core i9-12900K - ASUS PRIME Z790-V AX - Intel Raptor Lake-S PCH Ubuntu 24.04 - 6.8.0-47-generic - GNOME Shell 46.0
test_002 1 System - 113 Benchmark Results	AMD Ryzen 9 7900X3D 12-Core - ASUS ProArt B650-CREATOR - AMD Device 14d8 Ubuntu 24.04 - 6.8.0-47-generic - GNOME Shell 46.0
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results

Renoir vs. Icelake Benchmarks 2 Systems - 535 Benchmark Results	AMD Ryzen 5 4500U - LENOVO LNVNB161216 - AMD Renoir Root Complex Ubuntu 20.04 - 5.9.0-050900rc7daily20201002-generic - GNOME Shell 3.36.4
sys76-kudu-ML 1 System - 115 Benchmark Results	AMD Ryzen 9 5900HX - System76 Kudu - AMD Renoir Pop 21.10 - 5.15.15-76051515-generic - GNOME Shell 40.5
Ryzen 9 5900X / Ryzen 9 5950X Linux Performance 11 Systems - 217 Benchmark Results	AMD Ryzen 9 5900X 12-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4
Fedora 32 vs. Fedora 33 Beta Benchmarks 3 Systems - 174 Benchmark Results	Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH Fedora 32 - 5.8.11-200.fc32.x86_64 - GNOME Shell 3.36.6
Ryzen 5 5600X Linux Performance 12 Systems - 229 Benchmark Results	AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4
GM-1000, Karbon 700, CompuLab Airtop PCs 4 Systems - 513 Benchmark Results	Intel Xeon E-2288G - Compulab SBC-ATCFL v1.2 - Intel Cannon Lake PCH Ubuntu 20.10 - 5.8.0-26-generic - GNOME Shell 3.38.1
Core i9 10900K - Ubuntu 20.04 LTS vs. Ubuntu 20.10 2 Systems - 403 Benchmark Results	Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH Ubuntu 20.10 - 5.8.0-22-generic - GNOME Shell 3.38.0
Ryzen 7 5800X Linux Performance 11 Systems - 229 Benchmark Results	AMD Ryzen 9 3950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship Ubuntu 20.04 - 5.9.0-050900-generic - GNOME Shell 3.36.4
AMD EPYC 7003 Linux Benchmarks 26 Systems - 438 Benchmark Results	Intel Xeon Platinum 8280 - GIGABYTE MD61-SC2-00 v01000100 - Intel Sky Lake-E DMI3 Registers Ubuntu 20.04 - 5.11.0-051100rc6daily20210201-generic - GNOME Shell 3.36.4
TR 3960X WK 3 Systems - 46 Benchmark Results	AMD Ryzen Threadripper 3960X 24-Core - MSI Creator TRX40 - AMD Starship Ubuntu 20.04 - 5.9.0-rc5-14sep-patch - GNOME Shell 3.36.4
POWER9 44c 176t 2021 4 Systems - 210 Benchmark Results	POWER9 - PowerNV T2P9D01 REV 1.01 - 64GB Ubuntu 20.10 - 5.9.10-050910-generic - X Server
7700K Intel 2020 3 Systems - 202 Benchmark Results	Intel Core i7-7700K - MSI Z270-A PRO - Intel Xeon E3-1200 v6 Ubuntu 20.04 - 5.9.0-050900rc8daily20201011-generic - GNOME Shell 3.36.4
Ryzen 9 3900XT Ubuntu 20.04 LTS vs. Ubuntu 20.10 3 Systems - 406 Benchmark Results	Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH Ubuntu 20.04 - 5.4.0-48-generic - GNOME Shell 3.36.4
core-i9-10900k-fedora 2 Systems - 221 Benchmark Results	Intel Core i9-10900K - Gigabyte Z490 AORUS MASTER - Intel Comet Lake PCH Fedora 32 - 5.8.11-200.fc32.x86_64 - GNOME Shell 3.36.6
Initial Intel Xeon Platinum 8380 2P Benchmarks 5 Systems - 259 Benchmark Results	2 x AMD EPYC 75F3 32-Core - AMD DAYTONA_X - AMD Starship Ubuntu 20.04 - 5.11.0-051100rc6daily20210201-generic - GNOME Shell 3.36.4

Caffe

Project Site

Test Created

Last Updated

Test Maintainer

Test Type

Average Install Time

Average Run Time

Test Dependencies

Accolades

Supported Platforms

Revision History

Suites Using This Test

Performance Metrics

Caffe 2020-02-13

Model: AlexNet - Acceleration: CPU - Iterations: 200

Does It Scale Well With Increasing Cores?

Notable Instruction Set Usage

Tested CPU Architectures

Recent Test Results

Most Popular Test Results