2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe 93GB on Ubuntu 20.04 via the Phoronix Test Suite.
2 x Intel Xeon Gold 5220R @ 3.90GHz Processor: 2 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads), Motherboard: TYAN S7106 (V2.01.B40 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 94GB, Disk: 500GB Samsung SSD 860, Graphics: llvmpipe 93GB, Monitor: VE228, Network: 2 x Intel I210 + 2 x QLogic cLOM8214 1/10GbE
OS: Ubuntu 20.04, Kernel: 5.8.0-rc2-phx-fgkaslr (x86_64) 20200624, Desktop: GNOME Shell 3.36.1, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5002f01Python Notes: Python 3.8.2Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Tungsten Renderer Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic 2 x Intel Xeon Gold 5220R @ 3.90GHz 4 8 12 16 20 SE +/- 0.05, N = 3 13.82 1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic 2 x Intel Xeon Gold 5220R @ 3.90GHz 6 12 18 24 30 SE +/- 0.09, N = 3 24.26 1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair 2 x Intel Xeon Gold 5220R @ 3.90GHz 3 6 9 12 15 SE +/- 0.03, N = 3 11.18 1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.8744 1.7488 2.6232 3.4976 4.372 SE +/- 0.022, N = 3 3.886 1. (CXX) g++ options: -fopenmp -O3
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel 2 x Intel Xeon Gold 5220R @ 3.90GHz 6 12 18 24 30 SE +/- 0.29, N = 15 25.03 1. (CC) gcc options: -lm -lpthread -O3
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume 2 x Intel Xeon Gold 5220R @ 3.90GHz 400K 800K 1200K 1600K 2000K SE +/- 6250.70, N = 3 1906651.23 MIN: 18421 / MAX: 6439910
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume 2 x Intel Xeon Gold 5220R @ 3.90GHz 14M 28M 42M 56M 70M SE +/- 511505.23, N = 3 66535879.41 MIN: 1260326 / MAX: 611334792
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume 2 x Intel Xeon Gold 5220R @ 3.90GHz 5M 10M 15M 20M 25M SE +/- 52715.13, N = 3 23268510.22 MIN: 1047360 / MAX: 165094920
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark 2 x Intel Xeon Gold 5220R @ 3.90GHz 70 140 210 280 350 SE +/- 0.80, N = 3 337.48 MIN: 1 / MAX: 1399
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2019-12-17 H.264 Video Encoding 2 x Intel Xeon Gold 5220R @ 3.90GHz 30 60 90 120 150 SE +/- 1.09, N = 3 115.43 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 2 x Intel Xeon Gold 5220R @ 3.90GHz 1.2915 2.583 3.8745 5.166 6.4575 SE +/- 0.02, N = 3 5.74 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 50 100 150 200 250 SE +/- 0.87, N = 3 223.63 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 60 120 180 240 300 SE +/- 1.48, N = 3 276.85 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: VMAF Optimized - Input: Bosphorus 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 60 120 180 240 300 SE +/- 3.58, N = 3 273.77 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.4.1 1080p 8-bit YUV To HEVC Video Encode 2 x Intel Xeon Gold 5220R @ 3.90GHz 30 60 90 120 150 SE +/- 0.64, N = 3 119.59 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 8 - Input: 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 11 22 33 44 55 SE +/- 0.12, N = 3 47.41 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 4 - Input: 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 1.1765 2.353 3.5295 4.706 5.8825 SE +/- 0.021, N = 3 5.229 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8 Encoder Mode: Enc Mode 0 - Input: 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.0261 0.0522 0.0783 0.1044 0.1305 SE +/- 0.001, N = 3 0.116 1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x Intel Xeon Gold 5220R @ 3.90GHz 10 20 30 40 50 SE +/- 0.34, N = 3 44.63 MIN: 41.81 / MAX: 45.65
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon Obj 2 x Intel Xeon Gold 5220R @ 3.90GHz 8 16 24 32 40 SE +/- 0.22, N = 3 33.34 MIN: 31.97 / MAX: 34.16
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Asian Dragon 2 x Intel Xeon Gold 5220R @ 3.90GHz 8 16 24 32 40 SE +/- 0.46, N = 3 36.20 MIN: 34.74 / MAX: 37.68
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer ISPC - Model: Crown 2 x Intel Xeon Gold 5220R @ 3.90GHz 8 16 24 32 40 SE +/- 0.42, N = 3 34.09 MIN: 32.31 / MAX: 35.4
OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.9.0 Binary: Pathtracer - Model: Crown 2 x Intel Xeon Gold 5220R @ 3.90GHz 7 14 21 28 35 SE +/- 0.06, N = 3 31.56 MIN: 31.07 / MAX: 32.29
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Two-Pass 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.6683 1.3366 2.0049 2.6732 3.3415 SE +/- 0.00, N = 3 2.97 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 6 Realtime 2 x Intel Xeon Gold 5220R @ 3.90GHz 3 6 9 12 15 SE +/- 0.14, N = 4 10.59 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 4 Two-Pass 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.4365 0.873 1.3095 1.746 2.1825 SE +/- 0.00, N = 3 1.94 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 2.0 Encoder Mode: Speed 0 Two-Pass 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.0608 0.1216 0.1824 0.2432 0.304 SE +/- 0.00, N = 3 0.27 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
TTSIOD 3D Renderer A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping 2 x Intel Xeon Gold 5220R @ 3.90GHz 200 400 600 800 1000 SE +/- 0.13, N = 3 792.85 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
OSPray Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: Path Tracer 2 x Intel Xeon Gold 5220R @ 3.90GHz 110 220 330 440 550 500 MIN: 166.67
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: Path Tracer 2 x Intel Xeon Gold 5220R @ 3.90GHz 3 6 9 12 15 SE +/- 0.00, N = 12 10.42 MIN: 8.7 / MAX: 10.64
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: Magnetic Reconnection - Renderer: SciVis 2 x Intel Xeon Gold 5220R @ 3.90GHz 11 22 33 44 55 SE +/- 0.00, N = 12 47.62 MIN: 17.54
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: Path Tracer 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.8955 1.791 2.6865 3.582 4.4775 SE +/- 0.00, N = 12 3.98 MIN: 3.61 / MAX: 4.02
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: NASA Streamlines - Renderer: SciVis 2 x Intel Xeon Gold 5220R @ 3.90GHz 12 24 36 48 60 SE +/- 0.00, N = 15 55.56 MIN: 24.39
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.891 1.782 2.673 3.564 4.455 SE +/- 0.01, N = 3 3.96 MIN: 3.5 / MAX: 4
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: XFrog Forest - Renderer: SciVis 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.02, N = 3 7.18 MIN: 6.45 / MAX: 7.25
OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis 2 x Intel Xeon Gold 5220R @ 3.90GHz 10 20 30 40 50 SE +/- 0.00, N = 12 43.48 MIN: 27.03 / MAX: 45.45
OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Summer Nature 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 70 140 210 280 350 SE +/- 1.21, N = 3 339.44 MIN: 199.09 / MAX: 375.82 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Summer Nature 4K 2 x Intel Xeon Gold 5220R @ 3.90GHz 40 80 120 160 200 SE +/- 0.34, N = 3 182.63 MIN: 112.4 / MAX: 196.56 1. (CC) gcc options: -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Chimera 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 70 140 210 280 350 SE +/- 1.32, N = 3 336.75 MIN: 224.54 / MAX: 435.93 1. (CC) gcc options: -pthread
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.3268 0.6536 0.9804 1.3072 1.634 SE +/- 0.00110, N = 3 1.45226 MIN: 1.42 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.0659 0.1318 0.1977 0.2636 0.3295 SE +/- 0.003726, N = 4 0.292860 MIN: 0.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.1177 0.2354 0.3531 0.4708 0.5885 SE +/- 0.000963, N = 3 0.522925 MIN: 0.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 3 6 9 12 15 SE +/- 0.00565, N = 3 9.46999 MIN: 9.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.00936, N = 3 7.38633 MIN: 7.21 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.02077, N = 3 6.39092 MIN: 6.3 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 20 40 60 80 100 SE +/- 0.55, N = 3 79.98 MIN: 77.87 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 50 100 150 200 250 SE +/- 3.14, N = 3 224.04 MIN: 212.22 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.1552 0.3104 0.4656 0.6208 0.776 SE +/- 0.002933, N = 3 0.689628 MIN: 0.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.1236 0.2472 0.3708 0.4944 0.618 SE +/- 0.001009, N = 3 0.549423 MIN: 0.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.09175, N = 3 6.91968 MIN: 2.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.607 1.214 1.821 2.428 3.035 SE +/- 0.00291, N = 3 2.69760 MIN: 2.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.4411 0.8822 1.3233 1.7644 2.2055 SE +/- 0.00249, N = 3 1.96034 MIN: 1.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.00938, N = 3 7.43431 MIN: 7.36 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 12 24 36 48 60 SE +/- 0.02, N = 3 51.30 MIN: 50.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: bf16bf16bf16 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 1.2773 2.5546 3.8319 5.1092 6.3865 SE +/- 0.00516, N = 3 5.67705 MIN: 5.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.00440, N = 3 6.70798 MIN: 6.54 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.4054 0.8108 1.2162 1.6216 2.027 SE +/- 0.00714, N = 3 1.80177 MIN: 1.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch 1D - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 0.3954 0.7908 1.1862 1.5816 1.977 SE +/- 0.00257, N = 3 1.75753 MIN: 1.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space 2 x Intel Xeon Gold 5220R @ 3.90GHz 200 400 600 800 1000 SE +/- 10.27, N = 5 821 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian 2 x Intel Xeon Gold 5220R @ 3.90GHz 90 180 270 360 450 SE +/- 5.90, N = 3 410 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing 2 x Intel Xeon Gold 5220R @ 3.90GHz 300 600 900 1200 1500 SE +/- 12.10, N = 3 1204 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced 2 x Intel Xeon Gold 5220R @ 3.90GHz 120 240 360 480 600 SE +/- 0.58, N = 3 539 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen 2 x Intel Xeon Gold 5220R @ 3.90GHz 50 100 150 200 250 SE +/- 0.88, N = 3 222 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate 2 x Intel Xeon Gold 5220R @ 3.90GHz 140 280 420 560 700 SE +/- 3.18, N = 3 658 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl 2 x Intel Xeon Gold 5220R @ 3.90GHz 200 400 600 800 1000 SE +/- 6.49, N = 3 966 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 1080p 2 x Intel Xeon Gold 5220R @ 3.90GHz 8 16 24 32 40 SE +/- 0.20, N = 3 36.19 1. (CXX) g++ options: -O3 -lpthread
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.036, N = 3 7.292 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte 2 x Intel Xeon Gold 5220R @ 3.90GHz 20 40 60 80 100 SE +/- 0.31, N = 3 74.79 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D 2 x Intel Xeon Gold 5220R @ 3.90GHz 30 60 90 120 150 SE +/- 0.05, N = 3 114.87 1. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD 2 x Intel Xeon Gold 5220R @ 3.90GHz 20 40 60 80 100 SE +/- 0.32, N = 3 90.09 1. (CXX) g++ options: -O2 -lOpenCL
YafaRay YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene 2 x Intel Xeon Gold 5220R @ 3.90GHz 30 60 90 120 150 SE +/- 10.50, N = 12 126.81 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
Tungsten Renderer Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential 2 x Intel Xeon Gold 5220R @ 3.90GHz 2 4 6 8 10 SE +/- 0.14801, N = 15 6.11221 1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
POV-Ray This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time 2 x Intel Xeon Gold 5220R @ 3.90GHz 9 18 27 36 45 SE +/- 7.92, N = 12 41.59 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 1.5 Harness: IP Batch All - Data Type: f32 - Engine: CPU 2 x Intel Xeon Gold 5220R @ 3.90GHz 10 20 30 40 50 SE +/- 13.57, N = 15 45.70 MIN: 25.55 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster 2 x Intel Xeon Gold 5220R @ 3.90GHz 3 6 9 12 15 SE +/- 0.30, N = 12 12.03 1. (CXX) g++ options: -O2 -lOpenCL
2 x Intel Xeon Gold 5220R @ 3.90GHz Processor: 2 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads), Motherboard: TYAN S7106 (V2.01.B40 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 94GB, Disk: 500GB Samsung SSD 860, Graphics: llvmpipe 93GB, Monitor: VE228, Network: 2 x Intel I210 + 2 x QLogic cLOM8214 1/10GbE
OS: Ubuntu 20.04, Kernel: 5.8.0-rc2-phx-fgkaslr (x86_64) 20200624, Desktop: GNOME Shell 3.36.1, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 3.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits), Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5002f01Python Notes: Python 3.8.2Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 11 July 2020 09:48 by user phoronix.