Caffe Update AMD Ryzen Threadripper 3970X 32-Core testing with a ASUS ROG ZENITH II EXTREME (0702 BIOS) and NVIDIA TITAN RTX 24GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2009276-PTS-CAFFEUPD69&rdt&grr .
Caffe Update Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 1 2 AMD Ryzen Threadripper 3970X 32-Core @ 3.70GHz (32 Cores / 64 Threads) ASUS ROG ZENITH II EXTREME (0702 BIOS) AMD Starship/Matisse 64GB 1000GB Corsair Force MP600 NVIDIA TITAN RTX 24GB (1350/7000MHz) NVIDIA TU102 HD Audio ASUS MG28U Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-48-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 450.36.06 4.6.0 OpenCL 1.2 CUDA 11.0.185 1.2.133 GCC 9.3.0 + CUDA 11.0 ext4 3840x2160 NVIDIA TITAN RTX 24GB (1020/810MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025 Python Details - Python 3.8.2 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Caffe Update caffe: GoogleNet - CPU - 1000 caffe: AlexNet - CPU - 1000 caffe: GoogleNet - CPU - 200 caffe: GoogleNet - CPU - 100 caffe: AlexNet - CPU - 200 caffe: AlexNet - CPU - 100 caffe: GoogleNet - NVIDIA CUDA - 1000 caffe: AlexNet - NVIDIA CUDA - 1000 caffe: GoogleNet - NVIDIA CUDA - 200 caffe: GoogleNet - NVIDIA CUDA - 100 caffe: AlexNet - NVIDIA CUDA - 200 caffe: AlexNet - NVIDIA CUDA - 100 1 2 1777493 667195 355281 175859 132563 66259 31109.2 9460.33 6281.84 3157.90 1932.36 1000.518 1769330 665298 353956 176413 132034 65444 31162.4 9464.17 6250.80 3130.40 1930.00 994.427 OpenBenchmarking.org
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 1 2 400K 800K 1200K 1600K 2000K SE +/- 2402.17, N = 3 SE +/- 2456.48, N = 3 1777493 1769330 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 1 2 140K 280K 420K 560K 700K SE +/- 850.74, N = 3 SE +/- 574.16, N = 3 667195 665298 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 1 2 80K 160K 240K 320K 400K SE +/- 661.91, N = 3 SE +/- 639.54, N = 3 355281 353956 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 1 2 40K 80K 120K 160K 200K SE +/- 669.06, N = 3 SE +/- 494.28, N = 3 175859 176413 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 1 2 30K 60K 90K 120K 150K SE +/- 246.42, N = 3 SE +/- 235.96, N = 3 132563 132034 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: CPU - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 1 2 14K 28K 42K 56K 70K SE +/- 273.03, N = 3 SE +/- 206.01, N = 3 66259 65444 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 1 2 7K 14K 21K 28K 35K SE +/- 61.82, N = 3 SE +/- 68.26, N = 3 31109.2 31162.4 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 1 2 2K 4K 6K 8K 10K SE +/- 13.99, N = 3 SE +/- 2.76, N = 3 9460.33 9464.17 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 1 2 1300 2600 3900 5200 6500 SE +/- 10.99, N = 3 SE +/- 19.72, N = 3 6281.84 6250.80 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 2 700 1400 2100 2800 3500 SE +/- 0.31, N = 3 SE +/- 7.11, N = 3 3157.90 3130.40 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 1 2 400 800 1200 1600 2000 SE +/- 2.59, N = 3 SE +/- 8.43, N = 3 1932.36 1930.00 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Caffe Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 OpenBenchmarking.org Milli-Seconds, Fewer Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 1 2 200 400 600 800 1000 SE +/- 10.23, N = 3 SE +/- 10.28, N = 3 1000.52 994.43 1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
Phoronix Test Suite v10.8.4