multicore-40c-shared

Multicore 40C Shared

HTML result view exported from: https://openbenchmarking.org/result/2106090-IB-MULTICORE60&grr.

multicore-40c-shared

Processor: 2 x Intel Xeon (Skylake IBRS) (40 Cores)
Motherboard: OpenStack Foundation Nova v21.0.0 (1.13.0-1ubuntu1.1 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 16384 MB + 8192 MB RAM
Disk: 148GB
Graphics: Cirrus Logic GD 5446
Network: Red Hat Virtio device
OS: Ubuntu 20.04
Kernel: 5.4.0-74-generic (x86_64)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1024x768
System Layer: KVM

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- CPU Microcode: 0x1
- OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)
- Python 3.8.5
- itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion; VMX: flush not necessary SMT disabled + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown

Results overview table for multicore-40c-shared (one row per test; the individual results are listed per test below).

OpenVKL

Benchmark: vklBenchmarkUnstructuredVolume

OpenVKL 0.9 (Items / Sec, More Is Better)
multicore-40c-shared: 1864738 (SE +/- 13759.58, N = 3; MIN: 13506 / MAX: 6225496)
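Each result in this export is reported as a mean with a standard error (SE) over N runs. As a rough aid for comparing run-to-run noise across tests with very different units, the sketch below expresses SE as a fraction of the mean and estimates the sample standard deviation. It assumes the conventional definition SE = SD / sqrt(N); the export does not state how the SE was computed, so treat the SD figure as an approximation. The function names are illustrative and not part of any benchmarking tool.

```python
import math


def relative_standard_error(mean: float, se: float) -> float:
    """Return the standard error as a fraction of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return se / abs(mean)


def approx_std_dev(se: float, n: int) -> float:
    """Estimate the sample standard deviation, assuming SE = SD / sqrt(N)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return se * math.sqrt(n)


# (mean, SE, N) copied from two results in this export.
results = {
    "OpenVKL vklBenchmarkUnstructuredVolume": (1864738, 13759.58, 3),
    "MariaDB, Clients: 32": (502, 16.55, 9),
}

for name, (mean, se, n) in results.items():
    rse = relative_standard_error(mean, se)
    sd = approx_std_dev(se, n)
    print(f"{name}: relative SE = {rse:.2%}, approx. SD = {sd:.2f}")
```

A larger relative SE suggests noisier runs, which is worth keeping in mind on a shared virtualized host such as this one.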

MariaDB

Clients: 32

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 502 (SE +/- 16.55, N = 9)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Timed LLVM Compilation

Build System: Unix Makefiles

Timed LLVM Compilation 12.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 416.04 (SE +/- 4.36, N = 9)

MariaDB

Clients: 64

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 436 (SE +/- 10.75, N = 6)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Timed GCC Compilation

Time To Compile

Timed GCC Compilation 9.3.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 970.37 (SE +/- 4.31, N = 3)

MariaDB

Clients: 4

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 738 (SE +/- 37.34, N = 9)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
multicore-40c-shared: 12.51 (SE +/- 0.21, N = 12)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

MariaDB

Clients: 256

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 292 (SE +/- 2.02, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 512

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 294 (SE +/- 3.03, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 16

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 654 (SE +/- 26.25, N = 6)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 128

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 329 (SE +/- 3.78, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

OpenVKL

Benchmark: vklBenchmark

OpenVKL 0.9 (Items / Sec, More Is Better)
multicore-40c-shared: 304 (SE +/- 0.88, N = 3; MIN: 1 / MAX: 1221)

YafaRay

Total Time For Sample Scene

YafaRay 3.4.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 154.37 (SE +/- 2.25, N = 12)
1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

MariaDB

Clients: 1

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 804 (SE +/- 61.04, N = 6)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Apache Cassandra

Test: Reads

Apache Cassandra 3.11.4 (Op/s, More Is Better)
multicore-40c-shared: 78329 (SE +/- 3611.34, N = 14)

libgav1

Video Input: Summer Nature 4K

libgav1 0.16.3 (FPS, More Is Better)
multicore-40c-shared: 31.21 (SE +/- 0.27, N = 15)
1. (CXX) g++ options: -O3 -lpthread -lrt

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
multicore-40c-shared: 16.47 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -O3 -pthread -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2021.2 (Ns Per Day, More Is Better)
multicore-40c-shared: 2.351 (SE +/- 0.091, N = 15)
1. (CXX) g++ options: -O3 -pthread

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 12.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 352.14 (SE +/- 1.14, N = 3)

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 (Mpix/sec, More Is Better)
multicore-40c-shared: 6130.95 (SE +/- 41.10, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 (Mpix/sec, More Is Better)
multicore-40c-shared: 4685.51 (SE +/- 32.18, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 9.55 (SE +/- 0.08, N = 15)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 2.92 (Seconds, Fewer Is Better)
multicore-40c-shared: 297.33 (SE +/- 1.41, N = 3)

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 604 (SE +/- 4.14, N = 15)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

MariaDB

Clients: 8

MariaDB 10.5.2 (Queries Per Second, More Is Better)
multicore-40c-shared: 796 (SE +/- 3.87, N = 3)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Radiance Benchmark

Test: Serial

Radiance Benchmark 5.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 820.39

POV-Ray

Trace Time

POV-Ray 3.7.0.7 (Seconds, Fewer Is Better)
multicore-40c-shared: 65.16 (SE +/- 6.77, N = 12)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

LuxCoreRender 2.5 (M samples/sec, More Is Better)
multicore-40c-shared: 2.30 (SE +/- 0.09, N = 12; MIN: 0.59 / MAX: 3.3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 (Seconds, Fewer Is Better)
multicore-40c-shared: 238.95 (SE +/- 1.51, N = 3)

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 3.11 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 2.92 (Seconds, Fewer Is Better)
multicore-40c-shared: 232.35 (SE +/- 1.39, N = 3)

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 32677.55 (SE +/- 632.95, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

libgav1

Video Input: Chimera 1080p 10-bit

libgav1 0.16.3 (FPS, More Is Better)
multicore-40c-shared: 40.12 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O3 -lpthread -lrt

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20 (Seconds, Fewer Is Better)
multicore-40c-shared: 50.35 (SE +/- 0.35, N = 12)

Appleseed

Scene: Material Tester

Appleseed 2.0 Beta (Seconds, Fewer Is Better)
multicore-40c-shared: 300.64

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 2.92 (Seconds, Fewer Is Better)
multicore-40c-shared: 191.14 (SE +/- 1.09, N = 3)

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 452 (SE +/- 3.77, N = 9)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
multicore-40c-shared: 33.33 (SE +/- 0.38, N = 15)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

VP9 libvpx Encoding 1.10.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 3.78 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 0.8.7 (Frames Per Second, More Is Better)
multicore-40c-shared: 15.67 (SE +/- 0.23, N = 12)
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

LuxCoreRender 2.5 (M samples/sec, More Is Better)
multicore-40c-shared: 5.35 (SE +/- 0.36, N = 15; MIN: 2.74 / MAX: 7.63)

Appleseed

Scene: Emily

Appleseed 2.0 Beta (Seconds, Fewer Is Better)
multicore-40c-shared: 233.46

asmFish

1024 Hash Memory, 26 Depth

asmFish 2018-07-23 (Nodes/second, More Is Better)
multicore-40c-shared: 54575960 (SE +/- 403040.15, N = 3)

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
multicore-40c-shared: 53451711 (SE +/- 423071.20, N = 15)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 5.42 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 3951.07 (SE +/- 127.03, N = 12)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 142.37 (SE +/- 0.52, N = 3)

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 0.15 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 5.14 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 76891.25 (SE +/- 714.10, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Intel MPI Benchmarks 2019.3 (Average usec, Fewer Is Better)
multicore-40c-shared: 531.09 (SE +/- 1.98, N = 3; MIN: 1.76 / MAX: 8112.14)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

Intel MPI Benchmarks 2019.3 (Average Mbytes/sec, More Is Better)
multicore-40c-shared: 3436.39 (SE +/- 17.40, N = 3; MIN: 2.17 / MAX: 18404.23)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

ASKAP

Test: tConvolve MT - Degridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
multicore-40c-shared: 3210.73 (SE +/- 19.87, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
multicore-40c-shared: 2986.91 (SE +/- 21.51, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

SVT-AV1 0.8.7 (Frames Per Second, More Is Better)
multicore-40c-shared: 1.392 (SE +/- 0.010, N = 3)
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Rodinia

Test: OpenMP HotSpot3D

Rodinia 3.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 113.57 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

OSPray

Demo: San Miguel - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 2.60 (SE +/- 0.00, N = 3; MIN: 2.5 / MAX: 2.62)

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 110.10 (SE +/- 0.17, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

libgav1

Video Input: Chimera 1080p

libgav1 0.16.3 (FPS, More Is Better)
multicore-40c-shared: 81.89 (SE +/- 0.34, N = 3)
1. (CXX) g++ options: -O3 -lpthread -lrt

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 2.92 (Seconds, Fewer Is Better)
multicore-40c-shared: 104.28 (SE +/- 1.11, N = 3)

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.5 (M samples/sec, More Is Better)
multicore-40c-shared: 3.42 (SE +/- 0.03, N = 5; MIN: 3.18 / MAX: 3.78)

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9 (Seconds, Fewer Is Better)
multicore-40c-shared: 101.61 (SE +/- 0.23, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
multicore-40c-shared: 94.93 (SE +/- 0.49, N = 3)

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

Intel MPI Benchmarks 2019.3 (Average usec, Fewer Is Better)
multicore-40c-shared: 248.25 (SE +/- 2.35, N = 3; MIN: 1.17 / MAX: 4065.6)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

Intel MPI Benchmarks 2019.3 (Average Mbytes/sec, More Is Better)
multicore-40c-shared: 2697.00 (SE +/- 22.33, N = 3; MIN: 1.52 / MAX: 12350.67)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
multicore-40c-shared: 40229.04 (SE +/- 5.45, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Apache Cassandra

Test: Writes

Apache Cassandra 3.11.4 (Op/s, More Is Better)
multicore-40c-shared: 108101 (SE +/- 606.98, N = 3)

Timed Wasmer Compilation

Time To Compile

Timed Wasmer Compilation 1.0.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 85.85 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

VP9 libvpx Encoding 1.10.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 7.45 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Radiance Benchmark

Test: SMP Parallel

Radiance Benchmark 5.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 249.57

Build2

Time To Compile

Build2 0.13 (Seconds, Fewer Is Better)
multicore-40c-shared: 81.08 (SE +/- 0.26, N = 3)

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 822 (SE +/- 9.60, N = 4)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

ebizzy

ebizzy 0.3 (Records/s, More Is Better)
multicore-40c-shared: 1616339 (SE +/- 38070.71, N = 12)
1. (CC) gcc options: -pthread -lpthread -O3 -march=native

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1017.17 (SE +/- 4.34, N = 3; MIN: 1003.2)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1017.74 (SE +/- 6.05, N = 3; MIN: 998.86)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1032.10 (SE +/- 4.92, N = 3; MIN: 1015.67)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 31.63 (SE +/- 0.26, N = 12)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OSPray

Demo: XFrog Forest - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 2.97 (SE +/- 0.03, N = 3; MIN: 2.85 / MAX: 3.03)

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 613.01 (SE +/- 0.91, N = 3; MIN: 606.88)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 606.54 (SE +/- 3.32, N = 3; MIN: 595.57)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 609.21 (SE +/- 4.47, N = 3; MIN: 593.64)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenVKL

Benchmark: vklBenchmarkVdbVolume

OpenVKL 0.9 (Items / Sec, More Is Better)
multicore-40c-shared: 23780483 (SE +/- 82579.12, N = 3; MIN: 866256 / MAX: 131388624)

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 1080p

VP9 libvpx Encoding 1.10.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 8.58 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Timed GDB GNU Debugger Compilation

Time To Compile

Timed GDB GNU Debugger Compilation 10.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 69.22 (SE +/- 0.18, N = 3)

OpenVINO

Model: Person Detection 0106 FP16 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 2095.45 (SE +/- 7.43, N = 3)

OpenVINO

Model: Person Detection 0106 FP16 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 4.70 (SE +/- 0.01, N = 3)

OpenVINO

Model: Person Detection 0106 FP32 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 2101.62 (SE +/- 4.46, N = 3)

OpenVINO

Model: Person Detection 0106 FP32 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 4.72 (SE +/- 0.00, N = 3)

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 68.81 (SE +/- 0.45, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 2.92 (Seconds, Fewer Is Better)
multicore-40c-shared: 66.87 (SE +/- 0.23, N = 3)

OpenVINO

Model: Face Detection 0106 FP16 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 1117.85 (SE +/- 2.18, N = 3)

OpenVINO

Model: Face Detection 0106 FP16 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 8.89 (SE +/- 0.02, N = 3)

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 1652.9 (SE +/- 1.47, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 25.0 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

OpenVINO

Model: Face Detection 0106 FP32 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 1102.37 (SE +/- 8.58, N = 3)

OpenVINO

Model: Face Detection 0106 FP32 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 9.12 (SE +/- 0.10, N = 3)

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

Intel Open Image Denoise 1.4.0 (Images / Sec, More Is Better)
multicore-40c-shared: 0.45 (SE +/- 0.00, N = 3)

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.5 (M samples/sec, More Is Better)
multicore-40c-shared: 5.37 (SE +/- 0.02, N = 3; MIN: 4.95 / MAX: 5.47)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.5 (M samples/sec, More Is Better)
multicore-40c-shared: 2.46 (SE +/- 0.02, N = 3; MIN: 0.81 / MAX: 2.92)

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 14.29 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 0.471047 (SE +/- 0.011059, N = 15; MIN: 0.37)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 0.425939 (SE +/- 0.004188, N = 15; MIN: 0.39)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Appleseed

Scene: Disney Material

Appleseed 2.0 Beta (Seconds, Fewer Is Better)
multicore-40c-shared: 91.02

John The Ripper

Test: MD5

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
multicore-40c-shared: 4318333 (SE +/- 8647.41, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 0.55 (SE +/- 0.00, N = 3)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 17007.09 (SE +/- 36.69, N = 3)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU

OpenVINO 2021.1 (ms, Fewer Is Better)
multicore-40c-shared: 0.55 (SE +/- 0.00, N = 3)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU

OpenVINO 2021.1 (FPS, More Is Better)
multicore-40c-shared: 17046.67 (SE +/- 87.29, N = 3)

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 1787 (SE +/- 5.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 229 (SE +/- 1.45, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 514 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
multicore-40c-shared: 331 (SE +/- 3.67, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 10.05 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Intel MPI Benchmarks

Test: IMB-P2P PingPong

Intel MPI Benchmarks 2019.3 (Average Msg/sec, More Is Better)
multicore-40c-shared: 10137234 (SE +/- 70926.46, N = 3; MIN: 4304 / MAX: 27600565)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.4.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 59.29 (SE +/- 0.34, N = 3)

rav1e

Speed: 5

rav1e 0.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 1.014 (SE +/- 0.004, N = 3)

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 18.39 (SE +/- 0.19, N = 5)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

rav1e

Speed: 1

rav1e 0.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 0.358 (SE +/- 0.001, N = 3)

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
multicore-40c-shared: 0.67854 (SE +/- 0.00038, N = 3)

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 0.39 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenVKL

Benchmark: vklBenchmarkStructuredVolume

OpenVKL 0.9 (Items / Sec, More Is Better)
multicore-40c-shared: 93888931 (SE +/- 371048.93, N = 3; MIN: 1022989 / MAX: 995673960)

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 12.67 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Rust Mandelbrot

Time To Complete Serial/Parallel Mandelbrot

Rust Mandelbrot (Seconds, Fewer Is Better)
multicore-40c-shared: 50.44 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 57.63 (SE +/- 0.42, N = 14)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 20.27 (SE +/- 0.14, N = 3; MIN: 19.85 / MAX: 20.77)

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 20.75 (SE +/- 0.13, N = 3; MIN: 20.45 / MAX: 21.38)

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 40434.00 (SE +/- 332.50, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Rodinia

Test: OpenMP Leukocyte

Rodinia 3.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 46.46 (SE +/- 0.62, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

SVT-AV1 0.8.7 (Frames Per Second, More Is Better)
multicore-40c-shared: 60.55 (SE +/- 0.82, N = 12)
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02 (MIPS, More Is Better)
multicore-40c-shared: 118535 (SE +/- 271.54, N = 3)
1. (CXX) g++ options: -pipe -lpthread

rav1e

Speed: 6

rav1e 0.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 1.352 (SE +/- 0.006, N = 3)

OSPray

Demo: XFrog Forest - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 5.30 (SE +/- 0.05, N = 3; MIN: 5.18 / MAX: 5.46)

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 2004.9 (SE +/- 2.08, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 1394.5 (SE +/- 4.21, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 2002.3 (SE +/- 4.62, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 1676.0 (SE +/- 10.37, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 1652.2 (SE +/- 7.49, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 54.9 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 2034.8 (SE +/- 9.87, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 614.9 (SE +/- 1.34, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 2079.1 (SE +/- 6.18, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
multicore-40c-shared: 600.5 (SE +/- 2.42, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 1080p

VP9 libvpx Encoding 1.10.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 15.45 (SE +/- 0.17, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11

AOBench

Size: 2048 x 2048 - Total Time

AOBench (Seconds, Fewer Is Better)
multicore-40c-shared: 39.08 (SE +/- 0.29, N = 3)
1. (CC) gcc options: -lm -O3

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 38.82 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native

libgav1

Video Input: Summer Nature 1080p

libgav1 0.16.3 (FPS, More Is Better)
multicore-40c-shared: 94.11 (SE +/- 0.57, N = 3)
1. (CXX) g++ options: -O3 -lpthread -lrt

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 38.35 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

OSPray

Demo: San Miguel - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 31.25 (SE +/- 0.00, N = 3; MIN: 30.3 / MAX: 32.26)

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 38.10 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 79136.06 (SE +/- 323.57, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

SVT-AV1 0.8.7 (Frames Per Second, More Is Better)
multicore-40c-shared: 4.532 (SE +/- 0.030, N = 3)
1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
multicore-40c-shared: 6.511884 (SE +/- 0.031042, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.4 (Seconds, Fewer Is Better)
multicore-40c-shared: 35.23 (SE +/- 0.11, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 18.85 (SE +/- 0.16, N = 3; MIN: 18.37 / MAX: 19.55)

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

Intel Open Image Denoise 1.4.0 (Images / Sec, More Is Better)
multicore-40c-shared: 0.93 (SE +/- 0.01, N = 3)

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160

Intel Open Image Denoise 1.4.0 (Images / Sec, More Is Better)
multicore-40c-shared: 0.93 (SE +/- 0.01, N = 3)

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 7.25 (SE +/- 0.00, N = 3; MIN: 6.94 / MAX: 7.35)

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 6.515 (SE +/- 0.165, N = 15)
1. (CXX) g++ options: -fopenmp -O3

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 18.62 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

rav1e

Speed: 10

rav1e 0.4 (Frames Per Second, More Is Better)
multicore-40c-shared: 2.847 (SE +/- 0.003, N = 3)

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 19.87 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Tungsten Renderer

Scene: Water Caustic

Tungsten Renderer 0.2.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 27.37 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

John The Ripper

Test: Blowfish

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
multicore-40c-shared: 42439 (SE +/- 180.20, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Embree

Binary: Pathtracer - Model: Crown

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 21.43 (SE +/- 0.00, N = 3; MIN: 21.21 / MAX: 21.74)

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.3b (FPS, More Is Better)
multicore-40c-shared: 527.78 (SE +/- 3.12, N = 3)
1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 1367.79 (SE +/- 18.23, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Embree

Binary: Pathtracer - Model: Asian Dragon

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 22.50 (SE +/- 0.19, N = 3; MIN: 21.94 / MAX: 23)

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 3.13 (Frames Per Second, More Is Better)
multicore-40c-shared: 22.76 (SE +/- 0.12, N = 3; MIN: 22.34 / MAX: 23.27)

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.9.0 (FPS, More Is Better)
multicore-40c-shared: 404.76 (SE +/- 0.35, N = 3; MIN: 315.36 / MAX: 564.58)
1. (CC) gcc options: -pthread -lm

ASKAP

Test: Hogbom Clean OpenMP

ASKAP 1.0 (Iterations Per Second, More Is Better)
multicore-40c-shared: 536.88 (SE +/- 7.48, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Swet

Average

Swet 1.5.16 (Operations Per Second, More Is Better)
multicore-40c-shared: 548810782 (SE +/- 6011171.95, N = 5)
1. (CC) gcc options: -lm -lpthread -lcurses -lrt

Rust Prime Benchmark

Prime Number Test To 200,000,000

Rust Prime Benchmark (Seconds, Fewer Is Better)
multicore-40c-shared: 5.549 (SE +/- 0.055, N = 15)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 27.03 (SE +/- 0.19, N = 3)

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 22.47 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.41 (Seconds, Fewer Is Better)
multicore-40c-shared: 26.56 (SE +/- 0.03, N = 3)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 24.71 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 40309.83 (SE +/- 558.29, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 28.58 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
multicore-40c-shared: 821728.75 (SE +/- 1802.97, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

ASKAP

Test: tConvolve OpenMP - Degridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
multicore-40c-shared: 8068.36 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
multicore-40c-shared: 6946.85 (SE +/- 59.89, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 33.20 (SE +/- 0.37, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 7.45023 (SE +/- 0.04336, N = 3; MIN: 7.29)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 5.53268 (SE +/- 0.00262, N = 3; MIN: 3.63)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 0.759291 (SE +/- 0.000566, N = 3; MIN: 0.73)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Timed MPlayer Compilation

Time To Compile

Timed MPlayer Compilation 1.4 (Seconds, Fewer Is Better)
multicore-40c-shared: 20.46 (SE +/- 0.09, N = 3)

dav1d

Video Input: Chimera 1080p

dav1d 0.9.0 (FPS, More Is Better)
multicore-40c-shared: 558.50 (SE +/- 1.34, N = 3; MIN: 443.97 / MAX: 743.27)
1. (CC) gcc options: -pthread -lm

x264

H.264 Video Encoding

x264 2019-12-17 (Frames Per Second, More Is Better)
multicore-40c-shared: 140.51 (SE +/- 6.78, N = 12)
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 32.95 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Intel MPI Benchmarks

Test: IMB-MPI1 PingPong

Intel MPI Benchmarks 2019.3 (Average Mbytes/sec, More Is Better)
multicore-40c-shared: 3670.34 (SE +/- 48.82, N = 3; MIN: 2.91 / MAX: 13415.29)
1. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 17.78 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Xsbench

Xsbench 2017-07-06 (Lookups/s, More Is Better)
multicore-40c-shared: 5094417 (SE +/- 68387.30, N = 3)
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Sysbench

Test: RAM / Memory

Sysbench 1.0.20 (MiB/sec, More Is Better)
multicore-40c-shared: 5972.12 (SE +/- 8.78, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 35.08 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Tungsten Renderer

Scene: Volumetric Caustic

Tungsten Renderer 0.2.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 16.49 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

dav1d

Video Input: Summer Nature 4K

dav1d 0.9.0 (FPS, More Is Better)
multicore-40c-shared: 230.35 (SE +/- 1.41, N = 3; MIN: 118.79 / MAX: 245.58)
1. (CC) gcc options: -pthread -lm

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 15.77 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1.44653 (SE +/- 0.01770, N = 3; MIN: 1.37)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 4.46188 (SE +/- 0.01067, N = 3; MIN: 4.3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 0.860158 (SE +/- 0.003828, N = 3; MIN: 0.82)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
multicore-40c-shared: 14.53 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

ArrayFire

Test: BLAS CPU

ArrayFire 3.7 (GFLOPS, More Is Better)
multicore-40c-shared: 3787.24 (SE +/- 5.17, N = 3)
1. (CXX) g++ options: -rdynamic

Tungsten Renderer

Scene: Hair

Tungsten Renderer 0.2.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 13.93 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
multicore-40c-shared: 296.97 (SE +/- 18.82, N = 12)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
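
The SVT-VP9 VMAF Optimized result above has a standard error that is large relative to its mean, and it was reported with N = 12 where most results in this section use N = 3. As a rough aid for reading such figures, the sketch below computes the relative standard error (SE divided by the mean, as a percentage) for a few results copied from this section. The function name `relative_se` and the 5% threshold are illustrative assumptions and are not part of the Phoronix Test Suite output.

```python
# Relative standard error (SE / mean) for a few results from this section.
# relative_se and the 5% threshold are illustrative assumptions only.


def relative_se(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return abs(se / mean) * 100.0


# (label, mean, standard error) copied from the results in this section.
results = [
    ("SVT-VP9 VMAF Optimized 1080p", 296.97, 18.82),
    ("Kvazaar Bosphorus 4K Ultra Fast", 35.08, 0.02),
    ("NAS Parallel Benchmarks FT.C", 40867.99, 470.96),
]

NOISY_THRESHOLD_PCT = 5.0  # assumed cut-off, chosen for illustration

for label, mean, se in results:
    pct = relative_se(mean, se)
    flag = "noisy" if pct > NOISY_THRESHOLD_PCT else "ok"
    print(f"{label}: {pct:.2f}% ({flag})")
```

With these inputs only the SVT-VP9 VMAF Optimized run exceeds the assumed threshold, so its mean is probably best treated as less stable than the others.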

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1.24498 (SE +/- 0.00658, N = 3; MIN: 1.21)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NeatBench

Acceleration: All

NeatBench 5 (FPS, More Is Better)
multicore-40c-shared: 28.9 (SE +/- 0.23, N = 3)

NeatBench

Acceleration: CPU

NeatBench 5 (FPS, More Is Better)
multicore-40c-shared: 28.7 (SE +/- 0.37, N = 3)

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 40867.99 (SE +/- 470.96, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 10.03 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 3.90947 (SE +/- 0.00812, N = 3; MIN: 3.65)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1 (Seconds, Fewer Is Better)
multicore-40c-shared: 9.547 (SE +/- 0.010, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Tungsten Renderer

Scene: Non-Exponential

Tungsten Renderer 0.2.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 9.35835 (SE +/- 0.05258, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 68.24 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 4.43945 (SE +/- 0.00773, N = 3; MIN: 4.26)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 16758.30 (SE +/- 108.53, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1.07902 (SE +/- 0.00144, N = 3; MIN: 1.03)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Parallel BZIP2 Compression

256MB File Compression

Parallel BZIP2 Compression 1.1.12 (Seconds, Fewer Is Better)
multicore-40c-shared: 2.128 (SE +/- 0.034, N = 13)
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

rays1bench

Large Scene

rays1bench 2020-01-09 (mrays/s, More Is Better)
multicore-40c-shared: 126.80 (SE +/- 0.57, N = 3)

OSPray

Demo: NASA Streamlines - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 40 (MIN: 32.26)

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.1 (Frames Per Second, More Is Better)
multicore-40c-shared: 74.45 (SE +/- 0.49, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 69.70 (SE +/- 0.42, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Primesieve

1e12 Prime Number Generation

Primesieve 7.4 (Seconds, Fewer Is Better)
multicore-40c-shared: 8.368 (SE +/- 0.026, N = 3)
1. (CXX) g++ options: -O3 -lpthread

FFmpeg

H.264 HD To NTSC DV

FFmpeg 4.0.2 (Seconds, Fewer Is Better)
multicore-40c-shared: 7.728 (SE +/- 0.031, N = 3)
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lasound -pthread -lva -lbz2 -llzma -lva-drm -lva-x11 -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 7.286 (SE +/- 0.033, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

OSPray

Demo: Magnetic Reconnection - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 34.48 (SE +/- 0.00, N = 3; MIN: 33.33 / MAX: 35.71)

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 3.84481 (SE +/- 0.00622, N = 3; MIN: 3.68)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 3.74933 (SE +/- 0.01542, N = 3; MIN: 3.6)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 5.03433 (SE +/- 0.00177, N = 3; MIN: 5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

N-Queens

Elapsed Time

N-Queens 1.0 (Seconds, Fewer Is Better)
multicore-40c-shared: 6.202 (SE +/- 0.001, N = 3)
1. (CC) gcc options: -static -fopenmp -O3 -march=native

dav1d

Video Input: Summer Nature 1080p

dav1d 0.9.0 (FPS, More Is Better)
multicore-40c-shared: 567.52 (SE +/- 2.20, N = 3; MIN: 325.87 / MAX: 619.74)
1. (CC) gcc options: -pthread -lm

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 114.09 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
multicore-40c-shared: 4097.83 (SE +/- 43.38, N = 5)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 219.86 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
multicore-40c-shared: 249.12 (SE +/- 0.60, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 6.87810 (SE +/- 0.00230, N = 3; MIN: 6.78)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1.75413 (SE +/- 0.00302, N = 3; MIN: 1.67)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
multicore-40c-shared: 1.02688 (SE +/- 0.00262, N = 3; MIN: 0.99)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
multicore-40c-shared: 313.44 (SE +/- 1.55, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
multicore-40c-shared: 383.96 (SE +/- 0.97, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
multicore-40c-shared: 16.39 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -pthread -lm

OSPray

Demo: Magnetic Reconnection - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
multicore-40c-shared: 333.33 (SE +/- 0.00, N = 3; MIN: 250 / MAX: 500)


Phoronix Test Suite v10.8.4