pi one ARMv8 Cortex-A72 testing with a BCM2835 Raspberry Pi 400 Rev 1.0 and V3D 4.2 4GB on Debian 10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2103178-HA-PIONE661279 1 Processor: ARMv8 Cortex-A72 @ 1.80GHz (4 Cores), Motherboard: BCM2835 Raspberry Pi 400 Rev 1.0, Memory: 4096MB, Disk: 32GB GB1QT, Graphics: V3D 4.2 4GB, Monitor: DELL P2210H
OS: Debian 10, Kernel: 5.4.51-v8+ (aarch64), Desktop: LXDE 0.10.0, Display Server: X Server 1.20.4, OpenGL: 2.1 Mesa 19.3.2, Compiler: GCC 8.3.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: snd_bcm2835.enable_compat_alsa=0 snd_bcm2835.enable_hdmi=1 snd_bcm2835.enable_headphones=1Compiler Notes: --build=aarch64-linux-gnu --disable-libphobos --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -vProcessor Notes: Scaling Governor: cpufreq-dt ondemandSecurity Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable + srbds: Not affected + tsx_async_abort: Not affected
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 40 80 120 160 200 SE +/- 0.63, N = 3 181.42 MIN: 179.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 30 60 90 120 150 SE +/- 0.21, N = 3 141.98 MIN: 139.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 100 200 300 400 500 SE +/- 2.89, N = 3 482.05 MIN: 462.21 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 160 320 480 640 800 SE +/- 1.79, N = 3 759.06 MIN: 749.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 110 220 330 440 550 SE +/- 0.62, N = 3 506.45 MIN: 503.2 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 800 1600 2400 3200 4000 SE +/- 23.68, N = 3 3779.35 MIN: 3676.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 100 200 300 400 500 SE +/- 2.63, N = 3 441.81 MIN: 435.47 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 200 400 600 800 1000 SE +/- 7.65, N = 3 1003.98 MIN: 984.99 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 200 400 600 800 1000 SE +/- 11.03, N = 3 907.74 MIN: 874.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 140 280 420 560 700 SE +/- 3.46, N = 3 632.59 MIN: 614.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 50K 100K 150K 200K 250K SE +/- 436.52, N = 3 237960 MIN: 235788 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 30K 60K 90K 120K 150K SE +/- 1513.69, N = 3 125895 MIN: 123533 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 50K 100K 150K 200K 250K SE +/- 3008.48, N = 9 238247 MIN: 231947 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 30K 60K 90K 120K 150K SE +/- 1296.47, N = 5 130575 MIN: 127671 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 20 40 60 80 100 SE +/- 0.20, N = 3 80.71 MIN: 79.64 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 50K 100K 150K 200K 250K SE +/- 3796.29, N = 9 242197 MIN: 231788 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 30K 60K 90K 120K 150K SE +/- 1352.35, N = 9 129988 MIN: 123578 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 30 60 90 120 150 SE +/- 1.57, N = 15 153.40 MIN: 145.19 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread -ldl
OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU 1 1500 3000 4500 6000 7500 SE +/- 0.76, N = 3 7139.32 1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
1 Processor: ARMv8 Cortex-A72 @ 1.80GHz (4 Cores), Motherboard: BCM2835 Raspberry Pi 400 Rev 1.0, Memory: 4096MB, Disk: 32GB GB1QT, Graphics: V3D 4.2 4GB, Monitor: DELL P2210H
OS: Debian 10, Kernel: 5.4.51-v8+ (aarch64), Desktop: LXDE 0.10.0, Display Server: X Server 1.20.4, OpenGL: 2.1 Mesa 19.3.2, Compiler: GCC 8.3.0, File-System: ext4, Screen Resolution: 1920x1080
Kernel Notes: snd_bcm2835.enable_compat_alsa=0 snd_bcm2835.enable_hdmi=1 snd_bcm2835.enable_headphones=1Compiler Notes: --build=aarch64-linux-gnu --disable-libphobos --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -vProcessor Notes: Scaling Governor: cpufreq-dt ondemandSecurity Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 15 March 2021 07:06 by user root.