Tests Xeon Silver
Intel Xeon Silver 4216 testing with a TYAN S7100AG2NR (V4.02 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2006296-NE-TESTSXEON20
Intel Xeon Silver 4216
Processor: Intel Xeon Silver 4216 @ 3.20GHz (16 Cores / 32 Threads), Motherboard: TYAN S7100AG2NR (V4.02 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 24GB, Disk: 240GB Force MP500, Graphics: ASPEED, Audio: Realtek ALC892, Monitor: VE228, Network: 2 x Intel I350
OS: Ubuntu 20.04, Kernel: 5.4.0-39-generic (x86_64), Desktop: GNOME Shell 3.36.1, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c
Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
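The Security Notes string above is a list of "name: status" pairs joined by " + ". The names appear to correspond to the entries Linux exposes under /sys/devices/system/cpu/vulnerabilities. As a minimal sketch, the string can be turned into a lookup table in Python; the function name parse_security_notes is hypothetical and is not part of the Phoronix Test Suite.

```python
def parse_security_notes(notes):
    """Split a ' + '-separated 'name: status' string into a dict.

    Only the first ': ' separates name from status, so statuses that
    themselves contain a colon (e.g. 'KVM: Mitigation of ...') stay intact.
    """
    parsed = {}
    for item in notes.split(" + "):
        name, _, status = item.partition(": ")
        parsed[name.strip()] = status.strip()
    return parsed


# Security Notes string copied from this result file.
NOTES = (
    "itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + "
    "mds: Not affected + meltdown: Not affected + "
    "spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + "
    "spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + "
    "spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + "
    "srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled"
)

security = parse_security_notes(NOTES)
not_affected = sorted(k for k, v in security.items() if v == "Not affected")
print(not_affected)
```

Listing the "Not affected" entries separately makes it easier to see which mitigations were actually active during the run and could influence the results.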
Tests Xeon Silver: result overview
Test                                                                   Intel Xeon Silver 4216
avifenc: 10                                                            7.020
avifenc: 8                                                             7.250
avifenc: 2                                                             60.282
avifenc: 0                                                             102.703
openvkl: vklBenchmarkUnstructuredVolume                                1288377.5930599
openvkl: vklBenchmarkStructuredVolume                                  45415364.702703
openvkl: vklBenchmarkVdbVolume                                         12813527.636364
openvkl: vklBenchmark                                                  146.25
aom-av1: Speed 8 Realtime                                              22.02
aom-av1: Speed 6 Two-Pass                                              2.56
aom-av1: Speed 6 Realtime                                              10.08
aom-av1: Speed 4 Two-Pass                                              1.65
aom-av1: Speed 0 Two-Pass                                              0.22
onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU  3.50313
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU       0.730665
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU           1.55221
onednn: Deconvolution Batch deconv_3d - bf16bf16bf16 - CPU             21.6767
onednn: Deconvolution Batch deconv_1d - bf16bf16bf16 - CPU             17.5632
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU             16.3043
onednn: Recurrent Neural Network Inference - f32 - CPU                 63.6049
onednn: Recurrent Neural Network Training - f32 - CPU                  310.108
onednn: Deconvolution Batch deconv_3d - u8s8f32 - CPU                  1.88546
onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPU                  1.26891
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU                  6.22350
onednn: Deconvolution Batch deconv_3d - f32 - CPU                      7.58591
onednn: Deconvolution Batch deconv_1d - f32 - CPU                      5.01111
onednn: Convolution Batch Shapes Auto - f32 - CPU                      6.57813
onednn: IP Batch All - bf16bf16bf16 - CPU                              128.702
onednn: IP Batch 1D - bf16bf16bf16 - CPU                               10.6243
onednn: IP Batch All - u8s8f32 - CPU                                   14.3881
onednn: IP Batch 1D - u8s8f32 - CPU                                    1.03818
onednn: IP Batch All - f32 - CPU                                       55.8505
onednn: IP Batch 1D - f32 - CPU                                        4.67122
fftw: Float + SSE - 2D FFT Size 4096                                   14744
fftw: Float + SSE - 2D FFT Size 2048                                   16509
fftw: Float + SSE - 2D FFT Size 1024                                   19445
fftw: Float + SSE - 2D FFT Size 512                                    19220
fftw: Float + SSE - 2D FFT Size 256                                    18618
fftw: Float + SSE - 2D FFT Size 128                                    22926
fftw: Float + SSE - 1D FFT Size 512                                    32015
fftw: Float + SSE - 2D FFT Size 32                                     33602
fftw: Float + SSE - 1D FFT Size 32                                     11995
fftw: Stock - 2D FFT Size 4096                                         4280.7
fftw: Stock - 2D FFT Size 2048                                         4644.0
fftw: Stock - 2D FFT Size 1024                                         5228.1
fftw: Stock - 1D FFT Size 4096                                         5797.4
fftw: Stock - 1D FFT Size 2048                                         5752.0
fftw: Stock - 1D FFT Size 1024                                         6164.0
fftw: Stock - 2D FFT Size 512                                          5425.5
fftw: Stock - 2D FFT Size 256                                          5456.6
fftw: Stock - 2D FFT Size 128                                          6069.2
fftw: Stock - 1D FFT Size 512                                          6046.9
fftw: Stock - 1D FFT Size 256                                          5611.0
fftw: Stock - 1D FFT Size 128                                          5477.4
fftw: Stock - 2D FFT Size 64                                           5783.9
fftw: Stock - 2D FFT Size 32                                           6736.4
fftw: Stock - 1D FFT Size 64                                           6084.6
fftw: Float + SSE - 1D FFT Size 4096                                   28711
fftw: Float + SSE - 1D FFT Size 2048                                   27930
fftw: Float + SSE - 1D FFT Size 1024                                   32733
fftw: Float + SSE - 1D FFT Size 256                                    23980
fftw: Float + SSE - 1D FFT Size 128                                    16690.7
fftw: Float + SSE - 2D FFT Size 64                                     25551
fftw: Float + SSE - 1D FFT Size 64                                     13623.6
fftw: Stock - 1D FFT Size 32                                           5851.0
OpenVKL
OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenVKL 0.9 (Items / Sec, More Is Better), Intel Xeon Silver 4216:
Benchmark: vklBenchmarkUnstructuredVolume: 1288377.59 (SE +/- 873.76, N = 3; MIN: 16642 / MAX: 5030297)
Benchmark: vklBenchmarkStructuredVolume: 45415364.70 (SE +/- 378168.99, N = 3; MIN: 814675 / MAX: 351021888)
Benchmark: vklBenchmarkVdbVolume: 12813527.64 (SE +/- 76648.57, N = 3; MIN: 857914 / MAX: 72746784)
Benchmark: vklBenchmark: 146.25 (SE +/- 1.03, N = 3; MIN: 1 / MAX: 582)
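Each result in this file is reported with "SE +/-" and "N", which read as the standard error of the mean over N runs. The sketch below assumes the conventional definition (sample standard deviation divided by the square root of N); whether the Phoronix Test Suite computes it in exactly this way is an assumption, and the per-run values are hypothetical because the raw runs are not included in this report.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


def relative_se_percent(se, mean):
    """Express a standard error as a percentage of the reported mean."""
    return 100.0 * se / mean


# Hypothetical per-run values, for illustration only.
hypothetical_runs = [145.0, 146.0, 148.0]
print(standard_error(hypothetical_runs))

# SE and mean taken from the vklBenchmarkUnstructuredVolume result above.
print(relative_se_percent(873.76, 1288377.59))
```

Expressing the standard error relative to the mean makes results with very different magnitudes (Items / Sec in the millions versus single-digit frames per second) comparable in terms of run-to-run noise.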
AOM AV1 2.0 (Frames Per Second, More Is Better), Intel Xeon Silver 4216:
Encoder Mode: Speed 6 Two-Pass: 2.56 (SE +/- 0.00, N = 3)
Encoder Mode: Speed 6 Realtime: 10.08 (SE +/- 0.08, N = 3)
Encoder Mode: Speed 4 Two-Pass: 1.65 (SE +/- 0.01, N = 3)
Encoder Mode: Speed 0 Two-Pass: 0.22 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN
This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
oneDNN 1.5 (ms, Fewer Is Better), Engine: CPU, Intel Xeon Silver 4216:
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16: 3.50313 (SE +/- 0.00045, N = 3; MIN: 3.42)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32: 0.730665 (SE +/- 0.001714, N = 3; MIN: 0.71)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32: 1.55221 (SE +/- 0.00174, N = 3; MIN: 1.51)
Harness: Deconvolution Batch deconv_3d - Data Type: bf16bf16bf16: 21.68 (SE +/- 0.01, N = 3; MIN: 21.58)
Harness: Deconvolution Batch deconv_1d - Data Type: bf16bf16bf16: 17.56 (SE +/- 0.00, N = 3; MIN: 17.37)
Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16: 16.30 (SE +/- 0.00, N = 3; MIN: 16.27)
Harness: Recurrent Neural Network Inference - Data Type: f32: 63.60 (SE +/- 0.22, N = 3; MIN: 60.34)
Harness: Recurrent Neural Network Training - Data Type: f32: 310.11 (SE +/- 0.37, N = 3; MIN: 308.17)
Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32: 1.88546 (SE +/- 0.00023, N = 3; MIN: 1.88)
Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32: 1.26891 (SE +/- 0.00024, N = 3; MIN: 1.26)
Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32: 6.22350 (SE +/- 0.00377, N = 3; MIN: 6.16)
Harness: Deconvolution Batch deconv_3d - Data Type: f32: 7.58591 (SE +/- 0.01229, N = 3; MIN: 7.56)
Harness: Deconvolution Batch deconv_1d - Data Type: f32: 5.01111 (SE +/- 0.00127, N = 3; MIN: 4.96)
Harness: Convolution Batch Shapes Auto - Data Type: f32: 6.57813 (SE +/- 0.00333, N = 3; MIN: 6.52)
Harness: IP Batch All - Data Type: bf16bf16bf16: 128.70 (SE +/- 0.31, N = 3; MIN: 125.23)
Harness: IP Batch 1D - Data Type: bf16bf16bf16: 10.62 (SE +/- 0.00, N = 3; MIN: 10.03)
Harness: IP Batch All - Data Type: u8s8f32: 14.39 (SE +/- 0.02, N = 3; MIN: 13.49)
Harness: IP Batch 1D - Data Type: u8s8f32: 1.03818 (SE +/- 0.00042, N = 3; MIN: 1)
Harness: IP Batch All - Data Type: f32: 55.85 (SE +/- 0.08, N = 3; MIN: 54.35)
Harness: IP Batch 1D - Data Type: f32: 4.67122 (SE +/- 0.00290, N = 3; MIN: 4.5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
FFTW FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.
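For orientation, the sketch below shows what a DFT computes and how an Mflops figure can be derived from a timing. FFTW's own benchmark methodology defines a nominal "mflops" for complex transforms as 5 N log2(N) divided by the time in microseconds; that this test profile reports exactly that figure is an assumption. The direct O(N^2) transform here is only a reference definition, not the algorithm FFTW uses, and the function names are hypothetical.

```python
import cmath
import math


def naive_dft(x):
    """Direct O(N^2) evaluation of the DFT: X[k] = sum_j x[j] * exp(-2*pi*i*j*k/N)."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def nominal_mflops(n, seconds):
    """Nominal Mflops for a complex transform of n points: 5 * n * log2(n) / microseconds."""
    microseconds = seconds * 1e6
    return 5 * n * math.log2(n) / microseconds


# A unit impulse transforms to a constant spectrum of ones.
print(naive_dft([1, 0, 0, 0]))

# Hypothetical timing, for illustration only: a 1024-point transform in 10 microseconds.
print(nominal_mflops(1024, 10e-6))
```

Because the flop count is a fixed nominal value per size, a higher Mflops figure simply means a shorter time for that transform size.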
FFTW 3.3.6 (Mflops, More Is Better), Intel Xeon Silver 4216:
Build: Float + SSE - Size: 2D FFT Size 4096: 14744 (SE +/- 174.00, N = 6)
Build: Float + SSE - Size: 2D FFT Size 2048: 16509 (SE +/- 276.22, N = 3)
Build: Float + SSE - Size: 2D FFT Size 1024: 19445 (SE +/- 230.38, N = 3)
Build: Float + SSE - Size: 2D FFT Size 512: 19220 (SE +/- 86.81, N = 3)
Build: Float + SSE - Size: 2D FFT Size 256: 18618 (SE +/- 202.03, N = 3)
Build: Float + SSE - Size: 2D FFT Size 128: 22926 (SE +/- 246.94, N = 3)
Build: Float + SSE - Size: 1D FFT Size 512: 32015 (SE +/- 357.56, N = 3)
Build: Float + SSE - Size: 2D FFT Size 32: 33602 (SE +/- 229.29, N = 3)
Build: Float + SSE - Size: 1D FFT Size 32: 11995 (SE +/- 172.45, N = 4)
Build: Stock - Size: 2D FFT Size 4096: 4280.7 (SE +/- 5.27, N = 3)
Build: Stock - Size: 2D FFT Size 2048: 4644.0 (SE +/- 16.86, N = 3)
Build: Stock - Size: 2D FFT Size 1024: 5228.1 (SE +/- 17.96, N = 3)
Build: Stock - Size: 1D FFT Size 4096: 5797.4 (SE +/- 13.98, N = 3)
Build: Stock - Size: 1D FFT Size 2048: 5752.0 (SE +/- 55.13, N = 15)
Build: Stock - Size: 1D FFT Size 1024: 6164.0 (SE +/- 4.05, N = 3)
Build: Stock - Size: 2D FFT Size 512: 5425.5 (SE +/- 7.55, N = 3)
Build: Stock - Size: 2D FFT Size 256: 5456.6 (SE +/- 10.66, N = 3)
Build: Stock - Size: 2D FFT Size 128: 6069.2 (SE +/- 33.80, N = 3)
Build: Stock - Size: 1D FFT Size 512: 6046.9 (SE +/- 53.77, N = 11)
Build: Stock - Size: 1D FFT Size 256: 5611.0 (SE +/- 51.54, N = 10)
Build: Stock - Size: 1D FFT Size 128: 5477.4 (SE +/- 19.25, N = 3)
Build: Stock - Size: 2D FFT Size 64: 5783.9 (SE +/- 79.38, N = 3)
Build: Stock - Size: 2D FFT Size 32: 6736.4 (SE +/- 13.54, N = 3)
Build: Stock - Size: 1D FFT Size 64: 6084.6 (SE +/- 51.50, N = 15)
Build: Float + SSE - Size: 1D FFT Size 4096: 28711 (SE +/- 820.36, N = 15)
Build: Float + SSE - Size: 1D FFT Size 2048: 27930 (SE +/- 704.23, N = 15)
Build: Float + SSE - Size: 1D FFT Size 1024: 32733 (SE +/- 1329.39, N = 12)
Build: Float + SSE - Size: 1D FFT Size 256: 23980 (SE +/- 1145.68, N = 12)
Build: Float + SSE - Size: 1D FFT Size 128: 16690.7 (SE +/- 703.18, N = 13)
Build: Float + SSE - Size: 2D FFT Size 64: 25551 (SE +/- 783.39, N = 15)
Build: Float + SSE - Size: 1D FFT Size 64: 13623.6 (SE +/- 653.81, N = 15)
Build: Stock - Size: 1D FFT Size 32: 5851.0 (SE +/- 100.50, N = 13)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
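The FFTW results pair a "Stock" build with a "Float + SSE" build at the same transform sizes, so a per-size ratio is a convenient way to read them. The sketch below uses the 2D Mflops values reported in this file; the helper name speedups is hypothetical. Note that the two builds likely differ in floating-point precision as well as in SIMD use, so the ratio should not be read as the effect of SSE alone.

```python
# 2D FFT results (Mflops) copied from this result file, keyed by FFT size.
STOCK_2D = {
    4096: 4280.7, 2048: 4644.0, 1024: 5228.1, 512: 5425.5,
    256: 5456.6, 128: 6069.2, 64: 5783.9, 32: 6736.4,
}
FLOAT_SSE_2D = {
    4096: 14744, 2048: 16509, 1024: 19445, 512: 19220,
    256: 18618, 128: 22926, 64: 25551, 32: 33602,
}


def speedups(fast, base):
    """Ratio fast/base for every size present in both result sets."""
    return {size: fast[size] / base[size] for size in sorted(base) if size in fast}


for size, ratio in speedups(FLOAT_SSE_2D, STOCK_2D).items():
    print(f"2D FFT Size {size}: {ratio:.2f}x")
```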
Testing initiated at 29 June 2020 07:30 by user phoronix.