Xeon Broadwell Intel Xeon E5-2609 v4 testing with a MSI X99A RAIDER (MS-7885) v5.0 (P.50 BIOS) and eVGA NVIDIA NV117 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2006186-NE-XEONBROAD23 Xeon E5-2609 v4 Processor: Intel Xeon E5-2609 v4 @ 1.70GHz (8 Cores), Motherboard: MSI X99A RAIDER (MS-7885) v5.0 (P.50 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 256GB CORSAIR FORCE LX, Graphics: eVGA NVIDIA NV117 1GB, Audio: Realtek ALC892, Monitor: G237HL, Network: Intel I218-V
OS: Ubuntu 20.04, Kernel: 5.4.0-37-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.3 Mesa 20.0.4, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xb000038Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT disabled
Xeon Broadwell OpenBenchmarking.org Phoronix Test Suite Intel Xeon E5-2609 v4 @ 1.70GHz (8 Cores) MSI X99A RAIDER (MS-7885) v5.0 (P.50 BIOS) Intel Xeon E7 v4/Xeon 16GB 256GB CORSAIR FORCE LX eVGA NVIDIA NV117 1GB Realtek ALC892 G237HL Intel I218-V Ubuntu 20.04 5.4.0-37-generic (x86_64) GNOME Shell 3.36.2 X Server 1.20.8 modesetting 1.20.8 4.3 Mesa 20.0.4 GCC 9.3.0 ext4 1920x1080 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Xeon Broadwell Benchmarks System Logs - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xb000038 - itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT disabled
Xeon Broadwell ethr: TCP - Latency - 1 ethr: TCP - Latency - 2 ethr: TCP - Latency - 4 ethr: TCP - Latency - 8 ethr: TCP - Latency - 16 ethr: TCP - Latency - 32 ethr: TCP - Latency - 64 ethr: TCP - Bandwidth - 2 ethr: TCP - Bandwidth - 4 ethr: TCP - Bandwidth - 8 ethr: UDP - Bandwidth - 2 ethr: UDP - Bandwidth - 4 ethr: UDP - Bandwidth - 8 ethr: HTTP - Bandwidth - 1 ethr: HTTP - Bandwidth - 2 ethr: HTTP - Bandwidth - 4 ethr: HTTP - Bandwidth - 8 ethr: TCP - Bandwidth - 16 ethr: TCP - Bandwidth - 32 ethr: TCP - Bandwidth - 64 ethr: UDP - Bandwidth - 16 ethr: UDP - Bandwidth - 32 ethr: UDP - Bandwidth - 64 ethr: HTTP - Bandwidth - 16 ethr: HTTP - Bandwidth - 32 ethr: HTTP - Bandwidth - 64 ethr: TCP - Connections/s - 1 ethr: TCP - Connections/s - 2 ethr: TCP - Connections/s - 4 ethr: TCP - Connections/s - 8 ethr: TCP - Connections/s - 16 ethr: TCP - Connections/s - 32 ethr: TCP - Connections/s - 64 wireguard: onednn: IP Batch 1D - f32 - CPU onednn: IP Batch All - f32 - CPU onednn: IP Batch 1D - u8s8f32 - CPU onednn: IP Batch All - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Deconvolution Batch deconv_1d - f32 - CPU onednn: Deconvolution Batch deconv_3d - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU Xeon E5-2609 v4 29.53 29.43 29.71 29.91 29.86 29.89 29.75 26962.222222222 47778.000000000 65346.222222222 19027.555555556 40341.555555556 71781.333333333 512.25 1008.19 1881.93 2902.98 69351.777777778 71865.333333333 73444.666666667 75410.444444444 84064.888888889 84412.888888889 3126.66 2898.60 2372.28 84261 9543 18367 52520 16127 17996 19531 397.369 8.59081 104.330 6.12258 80.2738 14.1299 11.1215 14.3799 14.6183 15.8017 11.9129 548.687 169.541 4.11893 7.15319 OpenBenchmarking.org
Ethr Ethr is a cross-platform Golang-written network performance measurement tool developed by Microsoft that is capable of testing multiple protocols and different measurements. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 1 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.03, N = 3 29.53 MIN: 25.26 / MAX: 33.56
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 2 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.05, N = 3 29.43 MIN: 25.27 / MAX: 33.58
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 4 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.09, N = 3 29.71 MIN: 25.35 / MAX: 33.1
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 8 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.04, N = 3 29.91 MIN: 25.59 / MAX: 33.79
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 16 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.13, N = 3 29.86 MIN: 25.34 / MAX: 33.21
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 32 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.04, N = 3 29.89 MIN: 25.38 / MAX: 33.57
OpenBenchmarking.org Microseconds, Fewer Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Latency - Threads: 64 Xeon E5-2609 v4 7 14 21 28 35 SE +/- 0.12, N = 3 29.75 MIN: 25.38 / MAX: 33.55
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 2 Xeon E5-2609 v4 6K 12K 18K 24K 30K SE +/- 12.08, N = 3 26962.22 MIN: 26650 / MAX: 27240
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 4 Xeon E5-2609 v4 10K 20K 30K 40K 50K SE +/- 234.93, N = 3 47778.00 MIN: 44490 / MAX: 50630
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 8 Xeon E5-2609 v4 14K 28K 42K 56K 70K SE +/- 54.46, N = 3 65346.22 MIN: 63620 / MAX: 68140
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 2 Xeon E5-2609 v4 4K 8K 12K 16K 20K SE +/- 60.78, N = 3 19027.56 MIN: 18890 / MAX: 19180
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 4 Xeon E5-2609 v4 9K 18K 27K 36K 45K SE +/- 81.14, N = 3 40341.56 MIN: 39980 / MAX: 40860
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 8 Xeon E5-2609 v4 15K 30K 45K 60K 75K SE +/- 315.32, N = 3 71781.33 MIN: 68280 / MAX: 74350
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 1 Xeon E5-2609 v4 110 220 330 440 550 SE +/- 0.65, N = 3 512.25 MIN: 507.01 / MAX: 515.97
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 2 Xeon E5-2609 v4 200 400 600 800 1000 SE +/- 3.91, N = 4 1008.19 MAX: 1020
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 4 Xeon E5-2609 v4 400 800 1200 1600 2000 SE +/- 0.98, N = 3 1881.93 MIN: 1860 / MAX: 1910
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 8 Xeon E5-2609 v4 600 1200 1800 2400 3000 SE +/- 4.61, N = 3 2902.98 MIN: 2810 / MAX: 2960
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 16 Xeon E5-2609 v4 15K 30K 45K 60K 75K SE +/- 525.72, N = 3 69351.78 MIN: 60380 / MAX: 73420
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 32 Xeon E5-2609 v4 15K 30K 45K 60K 75K SE +/- 41.85, N = 3 71865.33 MIN: 68230 / MAX: 75780
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Bandwidth - Threads: 64 Xeon E5-2609 v4 16K 32K 48K 64K 80K SE +/- 117.17, N = 3 73444.67 MIN: 68190 / MAX: 78550
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 16 Xeon E5-2609 v4 16K 32K 48K 64K 80K SE +/- 155.06, N = 3 75410.44 MIN: 68550 / MAX: 78460
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 32 Xeon E5-2609 v4 20K 40K 60K 80K 100K SE +/- 259.83, N = 3 84064.89 MIN: 77410 / MAX: 88990
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: UDP - Test: Bandwidth - Threads: 64 Xeon E5-2609 v4 20K 40K 60K 80K 100K SE +/- 137.88, N = 3 84412.89 MIN: 72680 / MAX: 94570
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 16 Xeon E5-2609 v4 700 1400 2100 2800 3500 SE +/- 10.73, N = 3 3126.66 MIN: 3010 / MAX: 3190
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 32 Xeon E5-2609 v4 600 1200 1800 2400 3000 SE +/- 1.83, N = 3 2898.60 MIN: 2850 / MAX: 2940
OpenBenchmarking.org Mbits/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: HTTP - Test: Bandwidth - Threads: 64 Xeon E5-2609 v4 500 1000 1500 2000 2500 SE +/- 0.35, N = 3 2372.28 MIN: 2330 / MAX: 2410
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 1 Xeon E5-2609 v4 20K 40K 60K 80K 100K SE +/- 79430.83, N = 12 84261
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 2 Xeon E5-2609 v4 2K 4K 6K 8K 10K SE +/- 78.81, N = 3 9543
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 4 Xeon E5-2609 v4 4K 8K 12K 16K 20K SE +/- 193.42, N = 3 18367
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 8 Xeon E5-2609 v4 11K 22K 33K 44K 55K SE +/- 796.07, N = 3 52520
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 16 Xeon E5-2609 v4 3K 6K 9K 12K 15K SE +/- 4252.32, N = 15 16127
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 32 Xeon E5-2609 v4 4K 8K 12K 16K 20K SE +/- 5521.93, N = 12 17996
OpenBenchmarking.org Connections/sec, More Is Better Ethr 2019-01-02 Server Address: localhost - Protocol: TCP - Test: Connections/s - Threads: 64 Xeon E5-2609 v4 4K 8K 12K 16K 20K SE +/- 4138.28, N = 15 19531
WireGuard + Linux Networking Stack Stress Test This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but does require root permissions to run. The way it works is it creates three namespaces. ns0 has a loopback device. ns1 and ns2 each have wireguard devices. Those two wireguard devices send traffic through the loopback device of ns0. The end result of this is that tests wind up testing encryption and decryption at the same time -- a pretty CPU and scheduler-heavy workflow. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better WireGuard + Linux Networking Stack Stress Test Xeon E5-2609 v4 90 180 270 360 450 SE +/- 3.37, N = 3 397.37
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 2 4 6 8 10 SE +/- 0.04107, N = 3 8.59081 MIN: 8.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 20 40 60 80 100 SE +/- 0.02, N = 3 104.33 MIN: 103.89 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 2 4 6 8 10 SE +/- 0.03805, N = 3 6.12258 MIN: 6.02 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 20 40 60 80 100 SE +/- 0.10, N = 3 80.27 MIN: 80.04 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 4 8 12 16 20 SE +/- 0.05, N = 3 14.13 MIN: 13.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 3 6 9 12 15 SE +/- 0.04, N = 3 11.12 MIN: 11.02 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 4 8 12 16 20 SE +/- 0.07, N = 3 14.38 MIN: 14.11 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 4 8 12 16 20 SE +/- 0.09, N = 3 14.62 MIN: 14.47 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 4 8 12 16 20 SE +/- 0.03, N = 3 15.80 MIN: 15.68 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 3 6 9 12 15 SE +/- 0.05, N = 3 11.91 MIN: 11.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 120 240 360 480 600 SE +/- 0.40, N = 3 548.69 MIN: 546.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 40 80 120 160 200 SE +/- 5.61, N = 12 169.54 MIN: 142.48 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Xeon E5-2609 v4 0.9268 1.8536 2.7804 3.7072 4.634 SE +/- 0.00512, N = 3 4.11893 MIN: 4.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Xeon E5-2609 v4 2 4 6 8 10 SE +/- 0.02407, N = 3 7.15319 MIN: 7.07 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Xeon E5-2609 v4 Processor: Intel Xeon E5-2609 v4 @ 1.70GHz (8 Cores), Motherboard: MSI X99A RAIDER (MS-7885) v5.0 (P.50 BIOS), Chipset: Intel Xeon E7 v4/Xeon, Memory: 16GB, Disk: 256GB CORSAIR FORCE LX, Graphics: eVGA NVIDIA NV117 1GB, Audio: Realtek ALC892, Monitor: G237HL, Network: Intel I218-V
OS: Ubuntu 20.04, Kernel: 5.4.0-37-generic (x86_64), Desktop: GNOME Shell 3.36.2, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.3 Mesa 20.0.4, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xb000038Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT disabled
Testing initiated at 17 June 2020 21:48 by user phoronix.