Xeon E5 December Intel Xeon E5-2687W v3 testing with a MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS) and NVIDIA GeForce GTX 770 on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012165-HA-XEONE5DEC90&sro&grr .
Xeon E5 December Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution 1 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 Intel Xeon E5-2687W v3 @ 3.50GHz (10 Cores / 20 Threads) MSI X99S SLI PLUS (MS-7885) v1.0 (1.E0 BIOS) Intel Xeon E7 v3/Xeon 32GB 80GB INTEL SSDSCKGW08 NVIDIA GeForce GTX 770 Realtek ALC892 LG Ultra HD Intel I218-V Ubuntu 20.04 5.9.0-050900rc7daily20200928-generic (x86_64) 20200927 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 GCC 9.3.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - 1: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - 3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x43 Java Details - 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu120.04) Python Details - Python 3.8.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Xeon E5 December lammps: 20k Atoms numpy: asmfish: 1024 Hash Memory, 26 Depth hmmer: Pfam Database Search kvazaar: Bosphorus 4K - Slow kvazaar: Bosphorus 4K - Medium leveldb: Fill Sync espeak: Text-To-Speech Synthesis node-web-tooling: onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU leveldb: Rand Delete leveldb: Seq Fill leveldb: Seq Fill onednn: Recurrent Neural Network Inference - u8s8f32 - CPU compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed onednn: Recurrent Neural Network Inference - f32 - CPU rav1e: 1 onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed rav1e: 5 stockfish: Total Time simdjson: Kostya openvino: Person Detection 0106 FP16 - CPU openvino: Person Detection 0106 FP16 - CPU openvino: Person Detection 0106 FP32 - CPU openvino: Person Detection 0106 FP32 - CPU openvino: Face Detection 0106 FP16 - CPU openvino: Face Detection 0106 FP16 - CPU openvino: Face Detection 0106 FP32 - CPU openvino: Face Detection 0106 FP32 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP32 - CPU openvino: Age Gender Recognition Retail 0013 FP32 - CPU simdjson: LargeRand rav1e: 6 simdjson: PartialTweets simdjson: DistinctUserID kvazaar: Bosphorus 4K - Very Fast rav1e: 10 kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 1080p - Medium compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed crafty: Elapsed Time kvazaar: Bosphorus 4K - Ultra Fast coremark: CoreMark Size 666 - Iterations Per Second onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU leveldb: Seek Rand leveldb: Rand Read kvazaar: Bosphorus 1080p - Very Fast lammps: Rhodopsin Protein onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU leveldb: Rand Fill leveldb: Rand Fill leveldb: Overwrite leveldb: Overwrite leveldb: Hot Read mafft: Multiple Sequence Alignment - LSU RNA onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: IP Shapes 3D - f32 - CPU kvazaar: Bosphorus 1080p - Ultra Fast onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU ffte: N=256, 3D Complex FFT Routine onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU 1 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 4.130 258.49 23870550 170.268 3.90 4.00 53733.124 39.865 4016.51 4008.44 3996.99 170.380 170.669 13.0 2163.14 6155.2 37.65 2193.66 0.255 2199.30 6155.4 38.34 0.773 16194329 0.54 0.36 1.033 0.61 0.62 11.27 2.344 15.62 16.05 6320.0 5159.88 6421461 20.19 311796.455844 6.38245 5.78276 18.254 14.639 37.03 4.164 4.08635 2.83340 147.874 14.9 146.918 15.1 14.752 12.330 3.35443 3.03087 2.47526 7.12287 68.91 13.7561 12.7230 33878.011131313 5.08034 8.40089 4.098 257.43 24692278 170.188 3.92 4.01 45150.962 44.335 8.71 4052.51 4030.05 4038.02 170.523 168.772 13.1 2215.38 6216.0 37.84 2209.41 0.256 2209.58 6213.4 38.53 0.775 16334659 0.54 3534.49 1.41 3532.92 1.41 2417.41 2.06 2408.66 2.07 0.86 5696.85 0.86 5701.10 0.36 1.036 0.59 0.61 11.29 2.286 15.64 16.21 6405.8 5211.28 6387692 19.95 313060.150460 6.64935 5.81747 18.308 14.569 37.04 4.940 4.07562 2.83445 147.600 15.0 147.615 15.0 14.510 12.703 3.36230 2.99043 2.46414 7.30202 69.65 13.7930 12.7267 33707.028098983 5.08821 8.39159 4.165 258.45 24386621 170.349 3.91 4.00 45853.467 44.718 8.80 4043.92 4044.72 4040.19 170.781 169.465 13.0 2213.16 6182.9 37.97 2212.35 0.255 2212.31 6123.2 38.29 0.770 15967956 0.54 3529.79 1.41 3515.69 1.42 2419.72 2.06 2412.80 2.07 0.87 5685.65 0.87 5684.56 0.36 1.027 0.60 0.60 11.26 2.281 15.57 16.24 6308.5 5085.80 6401963 20.10 308373.331448 6.49144 5.78249 18.534 14.544 36.47 4.375 4.08504 2.83458 147.058 15.0 148.173 14.9 14.632 12.493 3.35854 2.95549 2.46727 7.38898 69.07 13.8025 12.7103 34103.488281708 5.07451 8.39554 OpenBenchmarking.org
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.9371 1.8742 2.8113 3.7484 4.6855 SE +/- 0.014, N = 3 SE +/- 0.066, N = 3 SE +/- 0.046, N = 3 4.130 4.165 4.098 1. (CXX) g++ options: -O3 -pthread -lm
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 60 120 180 240 300 SE +/- 0.52, N = 3 SE +/- 1.35, N = 3 SE +/- 0.35, N = 3 258.49 258.45 257.43
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 5M 10M 15M 20M 25M SE +/- 249790.13, N = 3 SE +/- 179287.50, N = 3 SE +/- 194591.53, N = 3 23870550 24386621 24692278
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 40 80 120 160 200 SE +/- 0.21, N = 3 SE +/- 0.35, N = 3 SE +/- 0.15, N = 3 170.27 170.35 170.19 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Slow 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.882 1.764 2.646 3.528 4.41 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.90 3.91 3.92 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.9023 1.8046 2.7069 3.6092 4.5115 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.00 4.00 4.01 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
LevelDB Benchmark: Fill Sync OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Fill Sync 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 12K 24K 36K 48K 60K SE +/- 1171.62, N = 15 SE +/- 325.36, N = 3 SE +/- 81.33, N = 3 53733.12 45853.47 45150.96 1. (CXX) g++ options: -O3 -lsnappy -lpthread
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 10 20 30 40 50 SE +/- 0.30, N = 4 SE +/- 0.08, N = 4 SE +/- 0.38, N = 17 39.87 44.72 44.34 1. (CC) gcc options: -O2 -std=c99
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 8.80 8.71 1. Nodejs
v10.19.0
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 900 1800 2700 3600 4500 SE +/- 1.04, N = 3 SE +/- 0.35, N = 3 SE +/- 14.06, N = 3 4016.51 4043.92 4052.51 MIN: 4011.07 MIN: 4036.97 MIN: 4031.4 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 900 1800 2700 3600 4500 SE +/- 4.27, N = 3 SE +/- 0.65, N = 3 SE +/- 11.20, N = 3 4008.44 4044.72 4030.05 MIN: 3998.3 MIN: 4038.4 MIN: 3975.86 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 900 1800 2700 3600 4500 SE +/- 40.61, N = 3 SE +/- 6.70, N = 3 SE +/- 2.66, N = 3 3996.99 4040.19 4038.02 MIN: 3891.91 MIN: 4020.21 MIN: 4027.78 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
LevelDB Benchmark: Random Delete OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Delete 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.28, N = 3 SE +/- 0.53, N = 3 170.38 170.78 170.52 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Sequential Fill OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Sequential Fill 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 40 80 120 160 200 SE +/- 0.87, N = 3 SE +/- 0.55, N = 3 SE +/- 0.55, N = 3 170.67 169.47 168.77 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Sequential Fill OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Sequential Fill 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 13.0 13.0 13.1 1. (CXX) g++ options: -O3 -lsnappy -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 500 1000 1500 2000 2500 SE +/- 8.88, N = 3 SE +/- 0.57, N = 3 SE +/- 4.81, N = 3 2163.14 2213.16 2215.38 MIN: 2150.81 MIN: 2208.14 MIN: 2203.55 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1300 2600 3900 5200 6500 SE +/- 14.07, N = 3 SE +/- 37.98, N = 3 SE +/- 1.13, N = 3 6155.2 6182.9 6216.0 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.19, N = 3 SE +/- 0.00, N = 3 37.65 37.97 37.84 1. (CC) gcc options: -O3
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 500 1000 1500 2000 2500 SE +/- 6.16, N = 3 SE +/- 1.56, N = 3 SE +/- 1.64, N = 3 2193.66 2212.35 2209.41 MIN: 2184.62 MIN: 2206.84 MIN: 2204.28 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 1 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.0576 0.1152 0.1728 0.2304 0.288 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.255 0.255 0.256
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 500 1000 1500 2000 2500 SE +/- 1.10, N = 3 SE +/- 1.30, N = 3 SE +/- 3.10, N = 3 2199.30 2212.31 2209.58 MIN: 2195.17 MIN: 2208.61 MIN: 2202.4 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1300 2600 3900 5200 6500 SE +/- 1.39, N = 3 SE +/- 2.00, N = 3 SE +/- 0.38, N = 3 6155.4 6123.2 6213.4 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.00, N = 3 SE +/- 0.13, N = 3 38.34 38.29 38.53 1. (CC) gcc options: -O3
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 5 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1744 0.3488 0.5232 0.6976 0.872 SE +/- 0.003, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.773 0.770 0.775
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 12 Total Time 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3M 6M 9M 12M 15M SE +/- 122294.66, N = 3 SE +/- 129748.65, N = 3 SE +/- 200611.67, N = 3 16194329 15967956 16334659 1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1215 0.243 0.3645 0.486 0.6075 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.54 0.54 0.54 1. (CXX) g++ options: -O3 -pthread
OpenVINO Model: Person Detection 0106 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 800 1600 2400 3200 4000 SE +/- 2.52, N = 3 SE +/- 8.59, N = 3 3529.79 3534.49
OpenVINO Model: Person Detection 0106 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.3173 0.6346 0.9519 1.2692 1.5865 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.41 1.41
OpenVINO Model: Person Detection 0106 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 800 1600 2400 3200 4000 SE +/- 12.33, N = 3 SE +/- 4.96, N = 3 3515.69 3532.92
OpenVINO Model: Person Detection 0106 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Person Detection 0106 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.3195 0.639 0.9585 1.278 1.5975 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 1.42 1.41
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 500 1000 1500 2000 2500 SE +/- 2.60, N = 3 SE +/- 0.67, N = 3 2419.72 2417.41
OpenVINO Model: Face Detection 0106 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.4635 0.927 1.3905 1.854 2.3175 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.06 2.06
OpenVINO Model: Face Detection 0106 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 500 1000 1500 2000 2500 SE +/- 2.08, N = 3 SE +/- 4.51, N = 3 2412.80 2408.66
OpenVINO Model: Face Detection 0106 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Face Detection 0106 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.4658 0.9316 1.3974 1.8632 2.329 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 2.07 2.07
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1958 0.3916 0.5874 0.7832 0.979 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.87 0.86
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1200 2400 3600 4800 6000 SE +/- 11.02, N = 3 SE +/- 10.38, N = 3 5685.65 5696.85
OpenVINO Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1958 0.3916 0.5874 0.7832 0.979 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.87 0.86
OpenVINO Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2021.1 Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1200 2400 3600 4800 6000 SE +/- 1.02, N = 3 SE +/- 10.33, N = 3 5684.56 5701.10
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.081 0.162 0.243 0.324 0.405 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.36 0.36 0.36 1. (CXX) g++ options: -O3 -pthread
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 6 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.2331 0.4662 0.6993 0.9324 1.1655 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 1.033 1.027 1.036
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1373 0.2746 0.4119 0.5492 0.6865 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 0.61 0.60 0.59 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.1395 0.279 0.4185 0.558 0.6975 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 0.62 0.60 0.61 1. (CXX) g++ options: -O3 -pthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 11.27 11.26 11.29 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Alpha Speed: 10 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.5274 1.0548 1.5822 2.1096 2.637 SE +/- 0.005, N = 3 SE +/- 0.024, N = 3 SE +/- 0.015, N = 3 2.344 2.281 2.286
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Slow 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 15.62 15.57 15.64 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Medium 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 16.05 16.24 16.21 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Decompression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1400 2800 4200 5600 7000 SE +/- 7.98, N = 3 SE +/- 0.70, N = 3 SE +/- 0.87, N = 3 6320.0 6308.5 6405.8 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 1 - Compression Speed 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1100 2200 3300 4400 5500 SE +/- 2.09, N = 3 SE +/- 7.05, N = 3 SE +/- 0.14, N = 3 5159.88 5085.80 5211.28 1. (CC) gcc options: -O3
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1.4M 2.8M 4.2M 5.6M 7M SE +/- 37926.20, N = 3 SE +/- 20532.86, N = 3 SE +/- 13921.29, N = 3 6421461 6401963 6387692 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 20.19 20.10 19.95 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 70K 140K 210K 280K 350K SE +/- 930.87, N = 3 SE +/- 2252.84, N = 3 SE +/- 230.17, N = 3 311796.46 308373.33 313060.15 1. (CC) gcc options: -O2 -lrt" -lrt
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 2 4 6 8 10 SE +/- 0.00469, N = 3 SE +/- 0.01871, N = 3 SE +/- 0.07104, N = 3 6.38245 6.49144 6.64935 MIN: 6.31 MIN: 6.39 MIN: 6.43 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1.3089 2.6178 3.9267 5.2356 6.5445 SE +/- 0.01265, N = 3 SE +/- 0.01269, N = 3 SE +/- 0.02108, N = 3 5.78276 5.78249 5.81747 MIN: 5.72 MIN: 5.72 MIN: 5.72 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
LevelDB Benchmark: Seek Random OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Seek Random 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 5 10 15 20 25 SE +/- 0.18, N = 3 SE +/- 0.12, N = 3 SE +/- 0.15, N = 3 18.25 18.53 18.31 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Random Read OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Read 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 SE +/- 0.21, N = 4 14.64 14.54 14.57 1. (CXX) g++ options: -O3 -lsnappy -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Very Fast 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 9 18 27 36 45 SE +/- 0.16, N = 3 SE +/- 0.16, N = 3 SE +/- 0.10, N = 3 37.03 36.47 37.04 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1.1115 2.223 3.3345 4.446 5.5575 SE +/- 0.062, N = 3 SE +/- 0.127, N = 15 SE +/- 0.153, N = 12 4.164 4.375 4.940 1. (CXX) g++ options: -O3 -pthread -lm
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.9194 1.8388 2.7582 3.6776 4.597 SE +/- 0.01028, N = 3 SE +/- 0.02680, N = 3 SE +/- 0.00631, N = 3 4.08635 4.08504 4.07562 MIN: 4 MIN: 3.99 MIN: 4.02 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.6378 1.2756 1.9134 2.5512 3.189 SE +/- 0.00400, N = 3 SE +/- 0.00087, N = 3 SE +/- 0.00330, N = 3 2.83340 2.83458 2.83445 MIN: 2.8 MIN: 2.8 MIN: 2.8 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
LevelDB Benchmark: Random Fill OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Random Fill 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 30 60 90 120 150 SE +/- 0.62, N = 3 SE +/- 0.47, N = 3 SE +/- 0.46, N = 3 147.87 147.06 147.60 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Random Fill OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Random Fill 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 14.9 15.0 15.0 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Overwrite OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Overwrite 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 30 60 90 120 150 SE +/- 1.77, N = 3 SE +/- 0.49, N = 3 SE +/- 0.82, N = 3 146.92 148.17 147.62 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Overwrite OpenBenchmarking.org MB/s, More Is Better LevelDB 1.22 Benchmark: Overwrite 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.19, N = 3 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 15.1 14.9 15.0 1. (CXX) g++ options: -O3 -lsnappy -lpthread
LevelDB Benchmark: Hot Read OpenBenchmarking.org Microseconds Per Op, Fewer Is Better LevelDB 1.22 Benchmark: Hot Read 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 14.75 14.63 14.51 1. (CXX) g++ options: -O3 -lsnappy -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 12.33 12.49 12.70 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.7565 1.513 2.2695 3.026 3.7825 SE +/- 0.00152, N = 3 SE +/- 0.00289, N = 3 SE +/- 0.00207, N = 3 3.35443 3.35854 3.36230 MIN: 3.29 MIN: 3.27 MIN: 3.3 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.6819 1.3638 2.0457 2.7276 3.4095 SE +/- 0.01888, N = 3 SE +/- 0.01799, N = 3 SE +/- 0.00465, N = 3 3.03087 2.95549 2.99043 MIN: 2.9 MIN: 2.85 MIN: 2.93 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 0.5569 1.1138 1.6707 2.2276 2.7845 SE +/- 0.00204, N = 3 SE +/- 0.00334, N = 3 SE +/- 0.00225, N = 3 2.47526 2.46727 2.46414 MIN: 2.44 MIN: 2.44 MIN: 2.43 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 2 4 6 8 10 SE +/- 0.00161, N = 3 SE +/- 0.00412, N = 3 SE +/- 0.00231, N = 3 7.12287 7.38898 7.30202 MIN: 7.08 MIN: 7.35 MIN: 7.26 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 16 32 48 64 80 SE +/- 0.32, N = 3 SE +/- 0.46, N = 3 SE +/- 0.05, N = 3 68.91 69.07 69.65 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 4 8 12 16 20 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 13.76 13.80 13.79 MIN: 13.67 MIN: 13.71 MIN: 13.69 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 12.72 12.71 12.73 MIN: 12.64 MIN: 12.64 MIN: 12.56 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 7K 14K 21K 28K 35K SE +/- 29.49, N = 3 SE +/- 45.08, N = 3 SE +/- 10.36, N = 3 33878.01 34103.49 33707.03 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 1.1448 2.2896 3.4344 4.5792 5.724 SE +/- 0.00848, N = 3 SE +/- 0.00764, N = 3 SE +/- 0.00368, N = 3 5.08034 5.07451 5.08821 MIN: 5.05 MIN: 5.04 MIN: 5.06 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 3 INTEL SSDSCKGW08 - Intel Xeon E5-2687W v3 2 4 6 8 10 SE +/- 0.01669, N = 3 SE +/- 0.01369, N = 3 SE +/- 0.01424, N = 3 8.40089 8.39554 8.39159 MIN: 8.33 MIN: 8.32 MIN: 8.33 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Phoronix Test Suite v10.8.4