Xeon Gold

Intel Xeon Gold 5218 testing with a Supermicro X11SPL-F v1.02 (3.1 BIOS) and llvmpipe 188GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2002201-VE-XEONGOLD541&grr.

Xeon GoldProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionXeon Gold 5218Intel Xeon Gold 5218 @ 3.90GHz (16 Cores / 32 Threads)Supermicro X11SPL-F v1.02 (3.1 BIOS)Intel Sky Lake-E DMI3 Registers188GB3841GB Micron_9300_MTFDHAL3T8TDPllvmpipe 188GBVE2282 x Intel I210Ubuntu 19.105.5.0-050500-generic (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.53.3 Mesa 19.2.8 (LLVM 9.0 256 bits)GCC 9.2.1 20191008ext41920x1080OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c- Python 2.7.17 + Python 3.7.5- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Mitigation of TSX disabled

Xeon Goldblender: Pabellon Barcelona - OpenCLblender: Fishy Cat - OpenCLbuild-gcc: Time To Compileblender: Barbershop - OpenCLmkl-dnn: Convolution Batch conv_all - bf16bf16bf16mkl-dnn: Convolution Batch conv_all - u8s8f32mkl-dnn: Convolution Batch conv_all - f32blender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyblender: Classroom - OpenCLblender: BMW27 - OpenCLblender: Classroom - CPU-Onlymkl-dnn: Deconvolution Batch deconv_all - bf16bf16bf16mkl-dnn: Deconvolution Batch deconv_all - f32fftw: Float + SSE - 2D FFT Size 4096povray: Trace Timefftw: Stock - 2D FFT Size 4096ospray: XFrog Forest - Path Tracerradiance: Serialblender: Fishy Cat - CPU-Onlyappleseed: Emilyospray: NASA Streamlines - Path Tracerospray: San Miguel - SciVisblender: BMW27 - CPU-Onlyospray: San Miguel - Path Tracerbuild-gdb: Time To Compilemkl-dnn: Convolution Batch conv_googlenet_v3 - bf16bf16bf16mkl-dnn: Convolution Batch conv_googlenet_v3 - u8s8f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32appleseed: Disney Materialappleseed: Material Testerbuild-llvm: Time To Compilefftw: Float + SSE - 2D FFT Size 2048build2: Time To Compilemkl-dnn: Convolution Batch conv_3d - u8s8f32fftw: Stock - 2D FFT Size 2048radiance: SMP Parallelospray: XFrog Forest - SciVisv-ray: CPUindigobench: Bedroomluxcorerender: DLSCluxcorerender: Rainbow Colors and Prismindigobench: Supercarembree: Pathtracer - Asian Dragon Objbuild-linux-kernel: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelbuild-php: Time To Compileembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer - Crownospray: NASA Streamlines - SciVisbuild-ffmpeg: Time To Compilemkl-dnn: IP Batch All - f32mkl-dnn: IP Batch All - bf16bf16bf16mkl-dnn: IP Batch All - u8s8f32mkl-dnn: Deconvolution Batch deconv_3d - u8s8f32embree: Pathtracer ISPC - Crownospray: Magnetic Reconnection - SciVistungsten: Non-Exponentialembree: Pathtracer - Asian Dragonmkl-dnn: Convolution Batch conv_3d - bf16bf16bf16mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Recurrent Neural Network Training - f32aobench: 2048 x 2048 - Total Timeembree: Pathtracer ISPC - Asian Dragonttsiod-renderer: Phong Rendering With Soft-Shadow Mappingbuild-mplayer: Time To Compilefftw: Float + SSE - 1D FFT Size 2048tungsten: Water Causticbuild-imagemagick: Time To Compilefftw: Float + SSE - 1D FFT Size 512build-apache: Time To Compilemkl-dnn: Convolution Batch conv_alexnet - bf16bf16bf16tungsten: Hairmkl-dnn: Deconvolution Batch deconv_1d - bf16bf16bf16mkl-dnn: Deconvolution Batch deconv_1d - u8s8f32mkl-dnn: Deconvolution Batch deconv_1d - f32fftw: Stock - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 1024mkl-dnn: Convolution Batch conv_alexnet - u8s8f32fftw: Stock - 2D FFT Size 128mkl-dnn: Convolution Batch conv_alexnet - f32fftw: Float + SSE - 1D FFT Size 128rays1bench: Large Scenefftw: Float + SSE - 1D FFT Size 256mkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch 1D - bf16bf16bf16mkl-dnn: IP Batch 1D - u8s8f32fftw: Stock - 2D FFT Size 32fftw: Float + SSE - 2D FFT Size 32fftw: Stock - 2D FFT Size 1024tungsten: Volumetric Causticfftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 512smallpt: Global Illumination Renderer; 128 Samplesfftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 1D FFT Size 4096oidn: Memorialospray: Magnetic Reconnection - Path Tracerfftw: Stock - 1D FFT Size 4096fftw: Float + SSE - 1D FFT Size 1024fftw: Stock - 2D FFT Size 512fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 1024fftw: Float + SSE - 2D FFT Size 64fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 64fftw: Float + SSE - 1D FFT Size 64fftw: Stock - 1D FFT Size 512mkl-dnn: Deconvolution Batch deconv_3d - bf16bf16bf16fftw: Float + SSE - 1D FFT Size 32mkl-dnn: Deconvolution Batch deconv_3d - f32fftw: Stock - 1D FFT Size 128fftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 64Xeon Gold 52181412.201236.44959.663724.5410330.35845.322395.11588.12530.65474.07462.52431.596792.391721.201205752.3024224.61.67757.569229.16323.2364874.3718.87149.151.75136.688488.92035.2905133.100181.761631179.068868293.7381508492.40911821.64837.3239.6063.01170491.81868754.6690196083919.364.30913.788159.28457.51056.67416.639212.354023.2647.5719.5197312.93744.883677304.3714.358321.748.2776615.192140.575613.0199241.11236.25719.2595469.84030.0184588326.173827.4393923526.1811888.9022.370016.84190.9930714.075465960.81931682.25657023.2301.6131911856.59277695.285028.997650.9256508037.0349286198.89.5004920850192988.702234334484014.83333.337056.4491966205.57180.17572.9300145241.57472.2159527493.621.3683137275.855995355.87700.75527.5OpenBenchmarking.org

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: OpenCLXeon Gold 521830060090012001500SE +/- 2.80, N = 31412.20

Blender

Blend File: Fishy Cat - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: OpenCLXeon Gold 521830060090012001500SE +/- 0.96, N = 31236.44

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 8.2Time To CompileXeon Gold 52182004006008001000SE +/- 0.94, N = 3959.66

Blender

Blend File: Barbershop - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: OpenCLXeon Gold 5218160320480640800SE +/- 2.80, N = 3724.54

MKL-DNN DNNL

Harness: Convolution Batch conv_all - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_all - Data Type: bf16bf16bf16Xeon Gold 52182K4K6K8K10KSE +/- 2.47, N = 310330.3MIN: 102671. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_all - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_all - Data Type: u8s8f32Xeon Gold 521813002600390052006500SE +/- 4.39, N = 35845.32MIN: 5816.91. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_all - Data Type: f32Xeon Gold 52185001000150020002500SE +/- 2.58, N = 32395.11MIN: 2361.651. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CPU-OnlyXeon Gold 5218130260390520650SE +/- 0.22, N = 3588.12

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CPU-OnlyXeon Gold 5218110220330440550SE +/- 0.33, N = 3530.65

Blender

Blend File: Classroom - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: OpenCLXeon Gold 5218100200300400500SE +/- 0.99, N = 3474.07

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: OpenCLXeon Gold 5218100200300400500SE +/- 1.89, N = 3462.52

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CPU-OnlyXeon Gold 521890180270360450SE +/- 0.95, N = 3431.59

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_all - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_all - Data Type: bf16bf16bf16Xeon Gold 521815003000450060007500SE +/- 3.47, N = 36792.39MIN: 6719.391. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_all - Data Type: f32Xeon Gold 5218400800120016002000SE +/- 0.75, N = 31721.20MIN: 1693.331. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Xeon Gold 52183K6K9K12K15KSE +/- 32.95, N = 3120571. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeXeon Gold 52181224364860SE +/- 1.09, N = 1552.301. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Xeon Gold 52189001800270036004500SE +/- 6.76, N = 34224.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

OSPray

Demo: XFrog Forest - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerXeon Gold 52180.37580.75161.12741.50321.879SE +/- 0.00, N = 61.67MIN: 1.61 / MAX: 1.68

Radiance Benchmark

Test: Serial

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SerialXeon Gold 5218160320480640800757.57

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CPU-OnlyXeon Gold 521850100150200250SE +/- 0.07, N = 3229.16

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilyXeon Gold 521870140210280350323.24

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerXeon Gold 52180.98331.96662.94993.93324.9165SE +/- 0.00, N = 124.37MIN: 4.08 / MAX: 4.44

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisXeon Gold 5218510152025SE +/- 0.00, N = 1218.87MIN: 15.38 / MAX: 19.23

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CPU-OnlyXeon Gold 5218306090120150SE +/- 0.12, N = 3149.15

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerXeon Gold 52180.39380.78761.18141.57521.969SE +/- 0.00, N = 31.75MIN: 1.7

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileXeon Gold 5218306090120150SE +/- 0.25, N = 3136.69

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: bf16bf16bf16Xeon Gold 5218110220330440550SE +/- 0.32, N = 3488.92MIN: 483.011. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: u8s8f32Xeon Gold 5218816243240SE +/- 0.03, N = 335.29MIN: 34.291. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32Xeon Gold 5218306090120150SE +/- 0.09, N = 3133.10MIN: 130.31. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

Appleseed

Scene: Disney Material

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialXeon Gold 52184080120160200181.76

Appleseed

Scene: Material Tester

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterXeon Gold 52184080120160200179.07

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileXeon Gold 521860120180240300293.74

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048Xeon Gold 52183K6K9K12K15KSE +/- 124.29, N = 3150841. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.12Time To CompileXeon Gold 521820406080100SE +/- 0.50, N = 392.41

MKL-DNN DNNL

Harness: Convolution Batch conv_3d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_3d - Data Type: u8s8f32Xeon Gold 52183K6K9K12K15KSE +/- 6.41, N = 311821.6MIN: 11788.61. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048Xeon Gold 521810002000300040005000SE +/- 28.29, N = 34837.31. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Radiance Benchmark

Test: SMP Parallel

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SMP ParallelXeon Gold 521850100150200250239.61

OSPray

Demo: XFrog Forest - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisXeon Gold 52180.67731.35462.03192.70923.3865SE +/- 0.00, N = 33.01MIN: 2.81 / MAX: 3.02

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.07Mode: CPUXeon Gold 52184K8K12K16K20KSE +/- 143.66, N = 317049

IndigoBench

Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: BedroomXeon Gold 52180.40910.81821.22731.63642.0455SE +/- 0.002, N = 31.818

LuxCoreRender

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.2Scene: DLSCXeon Gold 521815K30K45K60K75KSE +/- 0.01, N = 368754.67MIN: 1.78 / MAX: 68754.69

LuxCoreRender

Scene: Rainbow Colors and Prism

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.2Scene: Rainbow Colors and PrismXeon Gold 52188001600240032004000SE +/- 0.00, N = 33919.36MIN: 1.78

IndigoBench

Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: SupercarXeon Gold 52180.96951.9392.90853.8784.8475SE +/- 0.007, N = 34.309

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer - Model: Asian Dragon ObjXeon Gold 521848121620SE +/- 0.02, N = 313.79MIN: 13.7 / MAX: 13.9

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileXeon Gold 52181326395265SE +/- 1.01, N = 359.28

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelXeon Gold 52181326395265SE +/- 0.03, N = 357.511. (CC) gcc options: -lm -lpthread -O3

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To CompileXeon Gold 52181326395265SE +/- 0.03, N = 356.67

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer ISPC - Model: Asian Dragon ObjXeon Gold 521848121620SE +/- 0.01, N = 316.64MIN: 16.53 / MAX: 16.8

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer - Model: CrownXeon Gold 52183691215SE +/- 0.01, N = 312.35MIN: 12.26 / MAX: 12.5

OSPray

Demo: NASA Streamlines - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisXeon Gold 5218612182430SE +/- 0.00, N = 1223.26MIN: 20 / MAX: 23.81

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To CompileXeon Gold 52181122334455SE +/- 0.02, N = 347.57

MKL-DNN DNNL

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch All - Data Type: f32Xeon Gold 52183691215SE +/- 0.02708, N = 39.51973MIN: 9.061. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch All - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch All - Data Type: bf16bf16bf16Xeon Gold 52183691215SE +/- 0.02, N = 312.94MIN: 9.641. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch All - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch All - Data Type: u8s8f32Xeon Gold 52181.09882.19763.29644.39525.494SE +/- 0.01863, N = 34.88367MIN: 4.671. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32Xeon Gold 521816003200480064008000SE +/- 5.83, N = 37304.37MIN: 7288.581. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer ISPC - Model: CrownXeon Gold 521848121620SE +/- 0.01, N = 314.36MIN: 14.21 / MAX: 14.55

OSPray

Demo: Magnetic Reconnection - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisXeon Gold 5218510152025SE +/- 0.00, N = 1221.74MIN: 18.52 / MAX: 22.22

Tungsten Renderer

Scene: Non-Exponential

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialXeon Gold 5218246810SE +/- 0.10378, N = 158.277661. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer - Model: Asian DragonXeon Gold 521848121620SE +/- 0.02, N = 315.19MIN: 15.11 / MAX: 15.32

MKL-DNN DNNL

Harness: Convolution Batch conv_3d - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_3d - Data Type: bf16bf16bf16Xeon Gold 5218918273645SE +/- 0.11, N = 340.58MIN: 39.811. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_3d - Data Type: f32Xeon Gold 52183691215SE +/- 0.01, N = 313.02MIN: 12.611. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Recurrent Neural Network Training - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Recurrent Neural Network Training - Data Type: f32Xeon Gold 521850100150200250SE +/- 0.79, N = 3241.11MIN: 233.621. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeXeon Gold 5218816243240SE +/- 0.01, N = 336.261. (CC) gcc options: -lm -O3

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.6.1Binary: Pathtracer ISPC - Model: Asian DragonXeon Gold 5218510152025SE +/- 0.01, N = 319.26MIN: 19.15 / MAX: 19.41

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingXeon Gold 5218100200300400500SE +/- 0.95, N = 3469.841. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.4Time To CompileXeon Gold 5218714212835SE +/- 0.05, N = 330.02

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048Xeon Gold 521810K20K30K40K50KSE +/- 604.43, N = 13458831. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Tungsten Renderer

Scene: Water Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticXeon Gold 5218612182430SE +/- 0.14, N = 326.171. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileXeon Gold 5218612182430SE +/- 0.08, N = 327.44

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 512Xeon Gold 52188K16K24K32K40KSE +/- 359.64, N = 15392351. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To CompileXeon Gold 5218612182430SE +/- 0.12, N = 326.18

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: bf16bf16bf16Xeon Gold 5218400800120016002000SE +/- 2.46, N = 31888.90MIN: 1880.021. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairXeon Gold 5218510152025SE +/- 0.02, N = 322.371. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: bf16bf16bf16Xeon Gold 521848121620SE +/- 0.03, N = 316.84MIN: 16.471. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32Xeon Gold 52180.22340.44680.67020.89361.117SE +/- 0.001821, N = 30.993071MIN: 0.961. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: f32Xeon Gold 52180.9171.8342.7513.6684.585SE +/- 0.01401, N = 34.07546MIN: 3.941. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 256Xeon Gold 521813002600390052006500SE +/- 160.35, N = 155960.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024Xeon Gold 52184K8K12K16K20KSE +/- 168.47, N = 3193161. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: u8s8f32Xeon Gold 521820406080100SE +/- 0.08, N = 382.26MIN: 80.661. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 128Xeon Gold 521815003000450060007500SE +/- 164.04, N = 157023.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: f32Xeon Gold 521870140210280350SE +/- 0.52, N = 3301.61MIN: 297.341. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 128Xeon Gold 52184K8K12K16K20KSE +/- 148.93, N = 14191181. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

rays1bench

Large Scene

OpenBenchmarking.orgmrays/s, More Is Betterrays1bench 2020-01-09Large SceneXeon Gold 52181326395265SE +/- 0.13, N = 356.59

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 256Xeon Gold 52186K12K18K24K30KSE +/- 362.39, N = 15277691. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: f32Xeon Gold 52181.18912.37823.56734.75645.9455SE +/- 0.04179, N = 35.28502MIN: 4.821. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: bf16bf16bf16Xeon Gold 52183691215SE +/- 0.02626, N = 38.99765MIN: 8.641. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: u8s8f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: u8s8f32Xeon Gold 52180.20830.41660.62490.83321.0415SE +/- 0.006102, N = 30.925650MIN: 0.871. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Xeon Gold 52182K4K6K8K10KSE +/- 390.57, N = 158037.01. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Xeon Gold 52187K14K21K28K35KSE +/- 764.03, N = 15349281. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024Xeon Gold 521813002600390052006500SE +/- 18.19, N = 36198.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Tungsten Renderer

Scene: Volumetric Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticXeon Gold 52183691215SE +/- 0.05813, N = 39.500491. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -ljpeg -lpthread -ldl

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 256Xeon Gold 52184K8K12K16K20KSE +/- 261.39, N = 5208501. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 512Xeon Gold 52184K8K12K16K20KSE +/- 138.66, N = 3192981. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesXeon Gold 5218246810SE +/- 0.018, N = 38.7021. (CXX) g++ options: -fopenmp -O3

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 128Xeon Gold 52185K10K15K20K25KSE +/- 252.28, N = 7234331. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Xeon Gold 521810K20K30K40K50KSE +/- 323.08, N = 3448401. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.0.0Scene: MemorialXeon Gold 521848121620SE +/- 0.09, N = 314.83

OSPray

Demo: Magnetic Reconnection - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerXeon Gold 521870140210280350SE +/- 0.00, N = 12333.33MIN: 200

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Xeon Gold 521815003000450060007500SE +/- 36.04, N = 37056.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024Xeon Gold 521811K22K33K44K55KSE +/- 332.27, N = 3491961. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 512Xeon Gold 521813002600390052006500SE +/- 12.16, N = 36205.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048Xeon Gold 521815003000450060007500SE +/- 24.29, N = 37180.11. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024Xeon Gold 521816003200480064008000SE +/- 35.32, N = 37572.91. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 64Xeon Gold 52186K12K18K24K30KSE +/- 364.33, N = 3300141. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 256Xeon Gold 521811002200330044005500SE +/- 30.00, N = 35241.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 64Xeon Gold 521816003200480064008000SE +/- 11.65, N = 37472.21. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 64Xeon Gold 52183K6K9K12K15KSE +/- 203.68, N = 3159521. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 512Xeon Gold 521816003200480064008000SE +/- 7.05, N = 37493.61. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_3d - Data Type: bf16bf16bf16

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_3d - Data Type: bf16bf16bf16Xeon Gold 5218510152025SE +/- 0.03, N = 321.37MIN: 21.241. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Xeon Gold 52183K6K9K12K15KSE +/- 161.53, N = 3137271. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_3d - Data Type: f32Xeon Gold 52181.31762.63523.95285.27046.588SE +/- 0.01377, N = 35.85599MIN: 5.761. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 128Xeon Gold 521811002200330044005500SE +/- 72.39, N = 35355.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Xeon Gold 521817003400510068008500SE +/- 91.10, N = 37700.71. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 64Xeon Gold 521812002400360048006000SE +/- 14.41, N = 35527.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm


Phoronix Test Suite v10.8.5