AWSU20-Multicore

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2011223-SP-AWSU20MUL47

Result Identifier: AWSU20-Multicore
Date Run: November 20 2020
Test Duration: 1 Day, 21 Hours, 33 Minutes

AWSU20-Multicore system details (OpenBenchmarking.org, Phoronix Test Suite)

Processor: Intel Xeon Platinum 8275CL (4 Cores / 8 Threads)
Motherboard: Amazon EC2 c5.2xlarge (1.0 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 16384MB
Disk: 215GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Ubuntu 20.04
Kernel: 5.4.0-1029-aws (x86_64)
Compiler: GCC 9.3.0
File-System: ext4
System Layer: KVM

System Logs

- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Python 2.7.18 + Python 3.8.5
- KPTI + usercopy/swapgs barriers and __user pointer sanitization + Full generic retpoline STIBP: disabled RSB filling + PTE Inversion

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: 20k Atoms
ns/day, More Is Better
AWSU20-Multicore: 2.92 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread -lm

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds, Fewer Is Better
AWSU20-Multicore: 2291.11 (SE +/- 11.58, N = 3)

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: OpenCL
Seconds, Fewer Is Better
AWSU20-Multicore: 2243.32 (SE +/- 9.32, N = 3)

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: CUDA
Seconds, Fewer Is Better
AWSU20-Multicore: 2231.00 (SE +/- 6.31, N = 3)

Blender 2.90 - Blend File: Barbershop - Compute: NVIDIA OptiX
Seconds, Fewer Is Better
AWSU20-Multicore: 2017.50 (SE +/- 10.04, N = 3)

Blender 2.90 - Blend File: Barbershop - Compute: OpenCL
Seconds, Fewer Is Better
AWSU20-Multicore: 1832.74 (SE +/- 1.97, N = 3)

Blender 2.90 - Blend File: Barbershop - Compute: CUDA
Seconds, Fewer Is Better
AWSU20-Multicore: 1825.38 (SE +/- 4.11, N = 3)

Blender 2.90 - Blend File: Barbershop - Compute: CPU-Only
Seconds, Fewer Is Better
AWSU20-Multicore: 1783.98 (SE +/- 2.48, N = 3)
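
The Barbershop figures above can be put side by side with a little arithmetic. The sketch below is only an illustration: the helper names (relative_se_percent, ratio_to_baseline) are made up for this example, and "relative standard error" is simply the reported SE divided by the reported result, a derived figure that the result page itself does not print.

```python
# Barbershop results copied from the Blender 2.90 entries above:
# (reported result in seconds, reported "SE +/-" value). Fewer seconds is better.
barbershop = {
    "NVIDIA OptiX": (2017.50, 10.04),
    "OpenCL": (1832.74, 1.97),
    "CUDA": (1825.38, 4.11),
    "CPU-Only": (1783.98, 2.48),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the reported result."""
    return 100.0 * se / mean


def ratio_to_baseline(results, baseline="CPU-Only"):
    """Run time of each configuration divided by the baseline run time."""
    base_mean = results[baseline][0]
    return {name: mean / base_mean for name, (mean, _se) in results.items()}


if __name__ == "__main__":
    ratios = ratio_to_baseline(barbershop)
    for name, (mean, se) in barbershop.items():
        print(
            f"{name:12s} {mean:8.2f} s  "
            f"relative SE {relative_se_percent(mean, se):.2f}%  "
            f"{ratios[name]:.3f}x CPU-Only time"
        )
```

A ratio above 1.0 means that configuration took longer than the CPU-Only run of the same scene.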

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC). Learn more via the OpenBenchmarking.org test page.

Timed GCC Compilation 9.3.0 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 1695.06 (SE +/- 2.70, N = 3)

Blender

Blender 2.90 - Blend File: Pabellon Barcelona - Compute: CPU-Only
Seconds, Fewer Is Better
AWSU20-Multicore: 1606.30 (SE +/- 9.02, N = 3)

Blender 2.90 - Blend File: BMW27 - Compute: OpenCL
Seconds, Fewer Is Better
AWSU20-Multicore: 512.38 (SE +/- 5.16, N = 9)

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 0.9 - Benchmark: vklBenchmarkUnstructuredVolume
Items / Sec, More Is Better
AWSU20-Multicore: 1573006.48 (SE +/- 1948.30, N = 3)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 10.0 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 1431.94 (SE +/- 3.67, N = 3)

Blender

Blender 2.90 - Blend File: Classroom - Compute: NVIDIA OptiX
Seconds, Fewer Is Better
AWSU20-Multicore: 1413.86 (SE +/- 15.91, N = 3)

Blender 2.90 - Blend File: Classroom - Compute: CUDA
Seconds, Fewer Is Better
AWSU20-Multicore: 1384.21 (SE +/- 2.47, N = 3)

Blender 2.90 - Blend File: Classroom - Compute: OpenCL
Seconds, Fewer Is Better
AWSU20-Multicore: 1378.84 (SE +/- 3.38, N = 3)

Blender 2.90 - Blend File: Classroom - Compute: CPU-Only
Seconds, Fewer Is Better
AWSU20-Multicore: 1348.10 (SE +/- 0.91, N = 3)

Blender 2.90 - Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds, Fewer Is Better
AWSU20-Multicore: 1204.72 (SE +/- 18.94, N = 3)

Blender 2.90 - Blend File: Fishy Cat - Compute: CUDA
Seconds, Fewer Is Better
AWSU20-Multicore: 1111.33 (SE +/- 17.36, N = 3)

Blender 2.90 - Blend File: Fishy Cat - Compute: OpenCL
Seconds, Fewer Is Better
AWSU20-Multicore: 1093.68 (SE +/- 16.07, N = 3)

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1 - Test: sedovbig
Hydro Cycle Time - Seconds, Fewer Is Better
AWSU20-Multicore: 878.61 (SE +/- 0.21, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant 1.0.1 - Test: leblancbig
Hydro Cycle Time - Seconds, Fewer Is Better
AWSU20-Multicore: 826.96 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Appleseed

Appleseed is an open-source, physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

Appleseed 2.0 Beta - Scene: Emily
Seconds, Fewer Is Better
AWSU20-Multicore: 974.74

Blender

Blender 2.90 - Blend File: Fishy Cat - Compute: CPU-Only
Seconds, Fewer Is Better
AWSU20-Multicore: 591.89 (SE +/- 0.15, N = 3)

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

libgav1 2019-10-05 - Video Input: Chimera 1080p 10-bit
FPS, More Is Better
AWSU20-Multicore: 16.00 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP LavaMD
Seconds, Fewer Is Better
AWSU20-Multicore: 568.63 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Blender

Blender 2.90 - Blend File: BMW27 - Compute: CUDA
Seconds, Fewer Is Better
AWSU20-Multicore: 525.65 (SE +/- 4.45, N = 3)

Blender 2.90 - Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds, Fewer Is Better
AWSU20-Multicore: 518.47 (SE +/- 5.92, N = 3)

Blender 2.90 - Blend File: BMW27 - Compute: CPU-Only
Seconds, Fewer Is Better
AWSU20-Multicore: 430.87 (SE +/- 1.06, N = 3)

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5 - Demo: San Miguel - Renderer: Path Tracer
FPS, More Is Better
AWSU20-Multicore: 0.54 (SE +/- 0.00, N = 3)
MIN: 0.52 / MAX: 0.55

OSPray 1.8.5 - Demo: XFrog Forest - Renderer: Path Tracer
FPS, More Is Better
AWSU20-Multicore: 0.55 (SE +/- 0.00, N = 3)
MIN: 0.53

YafaRay

YafaRay is an open-source, physically based Monte Carlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

YafaRay 3.4.1 - Total Time For Sample Scene
Seconds, Fewer Is Better
AWSU20-Multicore: 374.48 (SE +/- 0.94, N = 3)
1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

Appleseed

Appleseed 2.0 Beta - Scene: Disney Material
Seconds, Fewer Is Better
AWSU20-Multicore: 556.82

Appleseed 2.0 Beta - Scene: Material Tester
Seconds, Fewer Is Better
AWSU20-Multicore: 539.90

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Slow
Frames Per Second, More Is Better
AWSU20-Multicore: 1.86 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Medium
Frames Per Second, More Is Better
AWSU20-Multicore: 1.91 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 - Encoder Mode: Enc Mode 0 - Input: 1080p
Frames Per Second, More Is Better
AWSU20-Multicore: 0.06 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3 - Water Benchmark
Ns Per Day, More Is Better
AWSU20-Multicore: 0.57 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj
Frames Per Second, More Is Better
AWSU20-Multicore: 5.48 (SE +/- 0.06, N = 7)
MIN: 5.17 / MAX: 5.65

Radiance Benchmark

This is a benchmark of NREL Radiance, a synthetic imaging system that is open-source and developed by the Lawrence Berkeley National Laboratory in California. Learn more via the OpenBenchmarking.org test page.

Radiance Benchmark 5.0 - Test: Serial
Seconds, Fewer Is Better
AWSU20-Multicore: 818.63

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D
Total Mop/s, More Is Better
AWSU20-Multicore: 527.15 (SE +/- 0.68, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

libgav1

libgav1 2019-10-05 - Video Input: Summer Nature 4K
FPS, More Is Better
AWSU20-Multicore: 14.66 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lpthread

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth
Nodes/second, More Is Better
AWSU20-Multicore: 11694467 (SE +/- 120390.98, N = 3)

Rodinia

Rodinia 3.1 - Test: OpenMP Leukocyte
Seconds, Fewer Is Better
AWSU20-Multicore: 248.93 (SE +/- 0.49, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

libgav1

libgav1 2019-10-05 - Video Input: Chimera 1080p
FPS, More Is Better
AWSU20-Multicore: 37.11 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -lpthread

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.12 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 236.40 (SE +/- 1.99, N = 3)

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

m-queens 1.2 - Time To Solve
Seconds, Fewer Is Better
AWSU20-Multicore: 225.79 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native
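
For readers unfamiliar with the N-queens problem that this test solves, here is a minimal single-threaded sketch in Python. It is not the m-queens implementation (which is C++ with OpenMP, per the compiler options above); the function name count_n_queens and the bitmask backtracking approach are choices made for this illustration only.

```python
def count_n_queens(n):
    """Count placements of n non-attacking queens on an n x n board.

    Illustrative bitmask backtracking: one queen per row, with bit sets
    tracking the columns and the two diagonal directions already attacked.
    """
    full = (1 << n) - 1  # n low bits set: one bit per column

    def place(row, cols, diag_left, diag_right):
        if row == n:
            return 1  # every row holds a queen: one complete solution
        total = 0
        free = ~(cols | diag_left | diag_right) & full
        while free:
            bit = free & -free  # lowest free column in this row
            free -= bit
            total += place(
                row + 1,
                cols | bit,
                ((diag_left | bit) << 1) & full,  # diagonals shift one column per row
                (diag_right | bit) >> 1,
            )
        return total

    return place(0, 0, 0, 0)


if __name__ == "__main__":
    for size in (4, 6, 8):
        print(size, count_n_queens(size))
```

The search space grows very quickly with board size, which is why solvers of this kind are used as multi-threaded CPU benchmarks.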

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better
AWSU20-Multicore: 4.53251 (SE +/- 0.00338, N = 3)
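
NAMD reports days/ns (fewer is better), whereas the LAMMPS and GROMACS entries in this result file report ns/day (more is better). The two units are reciprocals, as the small sketch below shows; the function name is hypothetical, and the converted figure is derived here rather than reported by the page. The three tests simulate different systems, so the converted value is not a like-for-like comparison with the LAMMPS or GROMACS numbers.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure (fewer is better) to ns/day (more is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


if __name__ == "__main__":
    namd_days_per_ns = 4.53251  # NAMD 2.14 ATPase result reported above
    print(f"{days_per_ns_to_ns_per_day(namd_days_per_ns):.4f} ns/day")
```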

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

Tachyon 0.99b6 - Total Time
Seconds, Fewer Is Better
AWSU20-Multicore: 224.74 (SE +/- 0.38, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

OSPray

OSPray 1.8.5 - Demo: XFrog Forest - Renderer: SciVis
FPS, More Is Better
AWSU20-Multicore: 0.99 (SE +/- 0.00, N = 3)
MIN: 0.96 / MAX: 1

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0 - Sustained Floating-Point Rate
GFLOP/s, More Is Better
AWSU20-Multicore: 1.12 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0 - Video Input: Chimera 1080p 10-bit
FPS, More Is Better
AWSU20-Multicore: 64.15 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -pthread

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C
Total Mop/s, More Is Better
AWSU20-Multicore: 14338.01 (SE +/- 45.21, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.7.3 - Encoder Speed: 0
Seconds, Fewer Is Better
AWSU20-Multicore: 201.65 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -O3 -fPIC

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.4 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 193.97 (SE +/- 1.19, N = 3)

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better
AWSU20-Multicore: 178.78 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation 9.1 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 156.37 (SE +/- 0.30, N = 3)

Embree

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon Obj
Frames Per Second, More Is Better
AWSU20-Multicore: 4.69 (SE +/- 0.00, N = 3)
MIN: 4.63 / MAX: 4.74

OSPray

OSPray 1.8.5 - Demo: NASA Streamlines - Renderer: Path Tracer
FPS, More Is Better
AWSU20-Multicore: 1.51 (SE +/- 0.00, N = 3)
MIN: 1.44 / MAX: 1.54

Embree

Embree 3.9.0 - Binary: Pathtracer - Model: Crown
Frames Per Second, More Is Better
AWSU20-Multicore: 4.25 (SE +/- 0.01, N = 3)
MIN: 4.2 / MAX: 4.31

OpenVKL

OpenVKL 0.9 - Benchmark: vklBenchmark
Items / Sec, More Is Better
AWSU20-Multicore: 66.03 (SE +/- 0.10, N = 3)

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 - Trace Time
Seconds, Fewer Is Better
AWSU20-Multicore: 129.42 (SE +/- 0.49, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Embree

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown
Frames Per Second, More Is Better
AWSU20-Multicore: 4.71 (SE +/- 0.06, N = 3)
MIN: 4.53 / MAX: 4.85

Timed FFmpeg Compilation

This test times how long it takes to build FFmpeg. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.2.2 - Time To Compile
Seconds, Fewer Is Better
AWSU20-Multicore: 125.54 (SE +/- 0.21, N = 3)

Rodinia

Rodinia 3.1 - Test: OpenMP HotSpot3D
Seconds, Fewer Is Better
AWSU20-Multicore: 124.33 (SE +/- 1.83, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Embree

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon
Frames Per Second, More Is Better
AWSU20-Multicore: 4.98 (SE +/- 0.03, N = 3)
MIN: 4.83 / MAX: 5.09

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding 1.8.2 - Speed: Speed 0
Frames Per Second, More Is Better
AWSU20-Multicore: 5.25 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

libavif avifenc

libavif avifenc 0.7.3 - Encoder Speed: 2
Seconds, Fewer Is Better
AWSU20-Multicore: 116.85 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -O3 -fPIC

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12 - Total Time (Nodes Per Second, More Is Better)
AWSU20-Multicore: 7829672 (SE +/- 104055.19, N = 4)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
AWSU20-Multicore: 5.30 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed PHP Compilation

This test times how long it takes to build PHP 7. Learn more via the OpenBenchmarking.org test page.

Timed PHP Compilation 7.4.2 - Time To Compile (Seconds, Fewer Is Better)
AWSU20-Multicore: 110.44 (SE +/- 0.12, N = 3)

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
AWSU20-Multicore: 19264.09 (SE +/- 9.22, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5 - Compression Level: 19 (MB/s, More Is Better)
AWSU20-Multicore: 25.20 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient benchmark, a newer scientific benchmark from Sandia National Labs aimed at supercomputer testing with modern real-world workloads, in contrast to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
AWSU20-Multicore: 5.24 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4 - Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
AWSU20-Multicore: 5.91 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
AWSU20-Multicore: 6.21 (SE +/- 0.01, N = 3; MIN: 6.13 / MAX: 6.33)

Timed MPlayer Compilation

This test times how long it takes to build the MPlayer open-source media player program. Learn more via the OpenBenchmarking.org test page.

Timed MPlayer Compilation 1.4 - Time To Compile (Seconds, Fewer Is Better)
AWSU20-Multicore: 87.15 (SE +/- 0.06, N = 3)

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

Apache Cassandra 3.11.4 - Test: Writes (Op/s, More Is Better)
AWSU20-Multicore: 18930 (SE +/- 114.08, N = 3)

Radiance Benchmark

This is a benchmark of NREL Radiance, a synthetic imaging system that is open-source and developed by the Lawrence Berkeley National Laboratory in California. Learn more via the OpenBenchmarking.org test page.

Radiance Benchmark 5.0 - Test: SMP Parallel (Seconds, Fewer Is Better)
AWSU20-Multicore: 251.61

Xsbench

XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.

Xsbench 2017-07-06 (Lookups/s, More Is Better)
AWSU20-Multicore: 1065443 (SE +/- 11453.02, N = 3)
1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Rust Mandelbrot

This test profile is of the combined time for the serial and parallel Mandelbrot sets written in Rustlang via willi-kappler/mandel-rust. Learn more via the OpenBenchmarking.org test page.

Rust Mandelbrot - Time To Complete Serial/Parallel Mandelbrot (Seconds, Fewer Is Better)
AWSU20-Multicore: 76.67 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 4549.46 (SE +/- 8.04, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Person Detection 0106 FP16 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 0.88 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)
AWSU20-Multicore: 8.14 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 4459.10 (SE +/- 22.14, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Person Detection 0106 FP32 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 0.89 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 0.9 - Benchmark: vklBenchmarkVdbVolume (Items / Sec, More Is Better)
AWSU20-Multicore: 10700944.89 (SE +/- 26107.84, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
AWSU20-Multicore: 8.39 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Face Detection 0106 FP16 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 2720.26 (SE +/- 4.13, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Face Detection 0106 FP16 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 1.47 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

TTSIOD 3D Renderer 2.3b - Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)
AWSU20-Multicore: 169.43 (SE +/- 0.40, N = 3)
1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP LBM (Seconds, Fewer Is Better)
AWSU20-Multicore: 66.42 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 5 (Frames Per Second, More Is Better)
AWSU20-Multicore: 0.87 (SE +/- 0.00, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Face Detection 0106 FP32 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 2727.22 (SE +/- 7.43, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Face Detection 0106 FP32 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 1.46 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2 - Scene: Hair (Seconds, Fewer Is Better)
AWSU20-Multicore: 67.68 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5 - Demo: San Miguel - Renderer: SciVis (FPS, More Is Better)
AWSU20-Multicore: 6.06 (SE +/- 0.00, N = 3; MIN: 5.95 / MAX: 6.13)

ebizzy

This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.

ebizzy 0.3 (Records/s, More Is Better)
AWSU20-Multicore: 213184 (SE +/- 5556.54, N = 10)
1. (CC) gcc options: -pthread -lpthread -O3 -march=native

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 1 (Frames Per Second, More Is Better)
AWSU20-Multicore: 0.31 (SE +/- 0.00, N = 3)

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
AWSU20-Multicore: 1.29 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

Timed ImageMagick Compilation 6.9.0 - Time To Compile (Seconds, Fewer Is Better)
AWSU20-Multicore: 62.52 (SE +/- 0.22, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
AWSU20-Multicore: 9.65 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.3 - Scene: DLSC (M samples/sec, More Is Better)
AWSU20-Multicore: 0.60 (SE +/- 0.00, N = 3)

LuxCoreRender 2.3 - Scene: Rainbow Colors and Prism (M samples/sec, More Is Better)
AWSU20-Multicore: 0.65 (SE +/- 0.00, N = 3)

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Sharpen (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 32
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 99
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

libgav1 2019-10-05 - Video Input: Summer Nature 1080p (FPS, More Is Better)
AWSU20-Multicore: 62.32 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -lpthread

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1 - Test: MD5 (Real C/S, More Is Better)
AWSU20-Multicore: 668125 (SE +/- 2389.90, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 0.95 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 4128.98 (SE +/- 2.98, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Swirl (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 154 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 0.98 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO 2021.1 - Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU (FPS, More Is Better)
AWSU20-Multicore: 4025.51 (SE +/- 5.14, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33 - Operation: Enhanced (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 80
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 487 (SE +/- 1.20, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Resizing (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 371 (SE +/- 1.45, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33 - Operation: Rotate (Iterations Per Minute, More Is Better)
AWSU20-Multicore: 427
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

x264 2019-12-17 - H.264 Video Encoding (Frames Per Second, More Is Better)
AWSU20-Multicore: 33.67 (SE +/- 0.31, N = 10)
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 - Harness: IP Batch All - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 273.96 (SE +/- 0.05, N = 3; MIN: 272.67)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 - Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 45.83 (SE +/- 0.04, N = 3; MIN: 44.38)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5 - Harness: IP Batch All - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 105.13 (SE +/- 0.23, N = 3; MIN: 102.29)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 0.9 - Benchmark: vklBenchmarkStructuredVolume (Items / Sec, More Is Better)
AWSU20-Multicore: 30327318.90 (SE +/- 263669.39, N = 3)

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Primesieve 7.4 - 1e12 Prime Number Generation (Seconds, Fewer Is Better)
AWSU20-Multicore: 54.48 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -lpthread
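
For readers unfamiliar with the sieve of Eratosthenes named in the Primesieve description, here is a minimal, unoptimized sketch of the basic technique. It is not Primesieve's implementation, which is segmented and cache-tuned; the function name `primes_up_to` is a hypothetical chosen for illustration, and this version is only practical for limits far below the 1e12 used in the test.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a plain sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag byte per integer in [0, limit]; 1 means "still assumed prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross out multiples of p starting at p*p; smaller multiples
            # were already crossed out by smaller primes.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [i for i, flag in enumerate(is_prime) if flag]


print(primes_up_to(30))  # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

The flag array is the reason the description ties this benchmark to L1/L2 cache performance: a real implementation keeps each working segment small enough to stay in cache, whereas this sketch allocates the whole range at once.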

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2.0 - Encoder Mode: Speed 0 Two-Pass (Frames Per Second, More Is Better)
AWSU20-Multicore: 0.19 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0 - Video Input: Summer Nature 4K (FPS, More Is Better)
AWSU20-Multicore: 72.66 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -pthread

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2.0 - Encoder Mode: Speed 6 Realtime (Frames Per Second, More Is Better)
AWSU20-Multicore: 12.39 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 6 (Frames Per Second, More Is Better)
AWSU20-Multicore: 1.16 (SE +/- 0.00, N = 3)

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
AWSU20-Multicore: 50.38 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0 - Video Input: Chimera 1080p (FPS, More Is Better)
AWSU20-Multicore: 265.84 (SE +/- 0.69, N = 3)
1. (CC) gcc options: -pthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2 - Scene: Water Caustic (Seconds, Fewer Is Better)
AWSU20-Multicore: 44.77 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
AWSU20-Multicore: 44.90 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
AWSU20-Multicore: 9494.89 (SE +/- 12.64, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

rays1bench

This is a test of rays1bench, a simple path tracer / ray tracer that supports SSE and AVX instructions, multi-threading, and other features. This test profile measures the performance of the "large scene" in rays1bench. Learn more via the OpenBenchmarking.org test page.

rays1bench 2020-01-09 - Large Scene (mrays/s, More Is Better)
AWSU20-Multicore: 20.44 (SE +/- 0.00, N = 3)

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2.0 - Encoder Mode: Speed 6 Two-Pass (Frames Per Second, More Is Better)
AWSU20-Multicore: 2.49 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 16.02 - Compress Speed Test (MIPS, More Is Better)
AWSU20-Multicore: 23114
1. (CXX) g++ options: -pipe -lpthread

Sysbench

This is a benchmark of Sysbench with CPU and memory sub-tests. Learn more via the OpenBenchmarking.org test page.

Sysbench 2018-07-28 - Test: Memory (Events Per Second, More Is Better)
AWSU20-Multicore: 10015398.89 (SE +/- 191391.64, N = 12)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

AOBench - Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
AWSU20-Multicore: 39.23 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -lm -O3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 579.33 (SE +/- 9.94, N = 3; MIN: 555.64)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu ISO) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.5 - Compression Level: 3 (MB/s, More Is Better)
AWSU20-Multicore: 2845.87 (SE +/- 10.86, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
AWSU20-Multicore: 275.41 (SE +/- 1.11, N = 3; MIN: 261.43)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

Cpuminer-Opt 3.8.8.1 - Algorithm: m7m (kH/s - Hash Speed, More Is Better)
AWSU20-Multicore: 108.15 (SE +/- 0.35, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Parallel BZIP2 Compression

This test measures the time needed to compress a file (a .tar package of the Linux kernel source code) using BZIP2 compression. Learn more via the OpenBenchmarking.org test page.

Parallel BZIP2 Compression 1.1.12 - 256MB File Compression (Seconds, Fewer Is Better)
AWSU20-Multicore: 9.02 (SE +/- 0.12, N = 12)
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.

N-Queens 1.0 - Elapsed Time (Seconds, Fewer Is Better)
AWSU20-Multicore: 35.20 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -static -fopenmp -O3 -march=native
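
As an illustration of the problem this test solves, the sketch below counts N-queens solutions with a serial bitmask backtracking search. It is an assumption-laden stand-in, not the benchmark's code: the tested program is an OpenMP (parallel) implementation run at board size 18, while the hypothetical `count_n_queens` function here is single-threaded and only practical for small boards.

```python
def count_n_queens(n):
    """Count placements of n non-attacking queens on an n x n board."""
    full = (1 << n) - 1  # bitmask with one bit set per column

    def place(cols, diag1, diag2):
        # cols, diag1, diag2 mark columns attacked in the current row by
        # queens already placed (vertically and along both diagonals).
        if cols == full:
            return 1  # every column holds a queen, so every row is filled
        free = full & ~(cols | diag1 | diag2)
        total = 0
        while free:
            bit = free & -free  # lowest free column in this row
            free -= bit
            total += place(
                cols | bit,
                ((diag1 | bit) << 1) & full,  # diagonal shifts one way
                (diag2 | bit) >> 1,           # anti-diagonal the other way
            )
        return total

    return place(0, 0, 0)


print(count_n_queens(8))  # 92
```

The search tree grows very quickly with board size, which is why size 18 takes tens of seconds even with all threads busy; the independent subtrees under each first-row placement are what make the problem easy to split across OpenMP threads.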

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

Cpuminer-Opt 3.8.8.1 - Algorithm: deep (kH/s - Hash Speed, More Is Better)
AWSU20-Multicore: 3657.76 (SE +/- 0.88, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
AWSU20-Multicore: 10476.46 (SE +/- 12.87, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 10 (Frames Per Second, More Is Better)
AWSU20-Multicore: 2.64 (SE +/- 0.01, N = 3)

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.41, Time To Compile (Seconds, Fewer Is Better). AWSU20-Multicore: 33.03 (SE +/- 0.07, N = 3).

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: Magnetic Reconnection - Renderer: SciVis (FPS, More Is Better). AWSU20-Multicore: 6.80 (SE +/- 0.00, N = 3; MIN: 6.67 / MAX: 6.9).

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding 1.8.2, Speed: Speed 5 (Frames Per Second, More Is Better). AWSU20-Multicore: 19.13 (SE +/- 0.10, N = 3). (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.4.1, 1080p 8-bit YUV To HEVC Video Encode (Frames Per Second, More Is Better). AWSU20-Multicore: 19.22 (SE +/- 0.13, N = 3). (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

Cpuminer-Opt 3.8.8.1, Algorithm: skein (kH/s - Hash Speed, More Is Better). AWSU20-Multicore: 14387 (SE +/- 24.04, N = 3). (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.8.8.1, Algorithm: sha256t (kH/s - Hash Speed, More Is Better). AWSU20-Multicore: 29040. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.8.8.1, Algorithm: myr-gr (kH/s - Hash Speed, More Is Better). AWSU20-Multicore: 5120.76 (SE +/- 5.96, N = 3). (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2.0, Encoder Mode: Speed 4 Two-Pass (Frames Per Second, More Is Better). AWSU20-Multicore: 1.57 (SE +/- 0.00, N = 3). (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Cpuminer-Opt

Cpuminer benchmark. Learn more via the OpenBenchmarking.org test page.

Cpuminer-Opt 3.8.8.1, Algorithm: lbry (kH/s - Hash Speed, More Is Better). AWSU20-Multicore: 12280 (SE +/- 120.14, N = 3). (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, More Is Better). AWSU20-Multicore: 10.95 (SE +/- 0.07, N = 3). (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: NASA Streamlines - Renderer: SciVis (FPS, More Is Better). AWSU20-Multicore: 7.87 (SE +/- 0.00, N = 3; MIN: 7.63 / MAX: 8.06).

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1, Test: Blowfish (Real C/S, More Is Better). AWSU20-Multicore: 6887 (SE +/- 1.00, N = 3). (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1, Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). AWSU20-Multicore: 76.13 (SE +/- 3.15, N = 10). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better). AWSU20-Multicore: 5324.40 (SE +/- 16.23, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 4.0.3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better). AWSU20-Multicore: 21.37 (SE +/- 0.01, N = 3). (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better). AWSU20-Multicore: 135719.84 (SE +/- 369.94, N = 3). (CC) gcc options: -O2 -lrt" -lrt

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 1.2.0, Scene: Memorial (Images / Sec, More Is Better). AWSU20-Multicore: 7.33 (SE +/- 0.01, N = 3).

Rust Prime Benchmark

Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.

Rust Prime Benchmark, Prime Number Test To 200,000,000 (Seconds, Fewer Is Better). AWSU20-Multicore: 26.54 (SE +/- 0.00, N = 3). (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil
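To give a sense of the kind of work a prime-number test up to a fixed limit performs, here is a minimal single-threaded Sieve of Eratosthenes sketch. The actual benchmark is multi-threaded Rust and its algorithm is not described here, so both the sieve approach and the name `count_primes` are assumptions for illustration only.

```python
def count_primes(limit: int) -> int:
    """Count the primes p with 2 <= p <= limit using a Sieve of Eratosthenes."""
    if limit < 2:
        return 0
    sieve = bytearray([1]) * (limit + 1)  # sieve[i] == 1 means i is still a candidate
    sieve[0] = sieve[1] = 0
    p = 2
    while p * p <= limit:
        if sieve[p]:
            # Clear every multiple of p starting at p*p in one slice assignment.
            multiples = len(range(p * p, limit + 1, p))
            sieve[p * p::p] = bytes(multiples)
        p += 1
    return sum(sieve)
```

A limit of 200,000,000 would need roughly 200 MB for this byte-per-number layout, so smaller limits are more practical for trying it out.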

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

Smallpt 1.0, Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better). AWSU20-Multicore: 26.37 (SE +/- 0.02, N = 3). (CXX) g++ options: -fopenmp -O3

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Volumetric Caustic (Seconds, Fewer Is Better). AWSU20-Multicore: 24.83 (SE +/- 0.00, N = 3). (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

FFmpeg

This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.

FFmpeg 4.0.2, H.264 HD To NTSC DV (Seconds, Fewer Is Better). AWSU20-Multicore: 6.69 (SE +/- 0.06, N = 11). (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lasound -pthread -lva -lbz2 -llzma -lva-drm -lva-x11 -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP Streamcluster (Seconds, Fewer Is Better). AWSU20-Multicore: 22.40 (SE +/- 0.01, N = 3). (CXX) g++ options: -O2 -lOpenCL

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2.0, Encoder Mode: Speed 8 Realtime (Frames Per Second, More Is Better). AWSU20-Multicore: 28.45 (SE +/- 0.06, N = 3). (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better). AWSU20-Multicore: 27.89 (SE +/- 0.34, N = 3). (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

Intel MPI Benchmarks 2019.3, Test: IMB-P2P PingPong (Average Msg/sec, More Is Better). AWSU20-Multicore: 2873017.46 (SE +/- 9448.61, N = 3). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 41.47 (SE +/- 0.02, N = 3; MIN: 41.27). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Non-Exponential (Seconds, Fewer Is Better). AWSU20-Multicore: 21.25 (SE +/- 0.11, N = 3). (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 4.40 (SE +/- 0.01, N = 3; MIN: 4.33). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 7.74 (SE +/- 0.08, N = 3; MIN: 7.45). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1358.54 (SE +/- 7.77, N = 3). (CXX) g++ options: -lpthread

ASKAP 2018-11-10, Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1675.46 (SE +/- 3.16, N = 3). (CXX) g++ options: -lpthread

ASKAP 2018-11-10, Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1444.61 (SE +/- 11.36, N = 3). (CXX) g++ options: -lpthread

ASKAP 2018-11-10, Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1521.76 (SE +/- 15.06, N = 3). (CXX) g++ options: -lpthread

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

Intel MPI Benchmarks 2019.3, Test: IMB-MPI1 Exchange (Average usec, Fewer Is Better). AWSU20-Multicore: 81.39 (SE +/- 1.04, N = 3; MIN: 78.56 / MAX: 83.5). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks 2019.3, Test: IMB-MPI1 Exchange (Average Mbytes/sec, More Is Better). AWSU20-Multicore: 9127.90 (SE +/- 40.69, N = 3). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better). AWSU20-Multicore: 524.41 (SE +/- 2.64, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 4.0.3

Swet

Swet is a synthetic CPU/RAM benchmark that includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

Swet 1.5.16, Average (Operations Per Second, More Is Better). AWSU20-Multicore: 606901805 (SE +/- 3465848.75, N = 3). (CC) gcc options: -lm -lpthread -lcurses -lrt

ArrayFire

ArrayFire is a GPU and CPU numeric processing library; this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

ArrayFire 3.7, Test: BLAS CPU (GFLOPS, More Is Better). AWSU20-Multicore: 572.35 (SE +/- 1.93, N = 3). (CXX) g++ options: -rdynamic

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve MPI - Degridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1367.18 (SE +/- 1.76, N = 3). (CXX) g++ options: -lpthread

ASKAP 2018-11-10, Test: tConvolve MPI - Gridding (Million Grid Points Per Second, More Is Better). AWSU20-Multicore: 1694.31 (SE +/- 13.33, N = 3). (CXX) g++ options: -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better). AWSU20-Multicore: 38.34 (SE +/- 0.06, N = 3). (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: IP Batch 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 8.80 (SE +/- 0.03, N = 3; MIN: 8.33). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: IP Batch 1D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 20.76 (SE +/- 0.01, N = 3; MIN: 20.43). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 3.38 (SE +/- 0.02, N = 3; MIN: 3.19). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.7.0, Video Input: Summer Nature 1080p (FPS, More Is Better). AWSU20-Multicore: 245.52 (SE +/- 0.54, N = 3). (CC) gcc options: -pthread

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better). AWSU20-Multicore: 12.38 (SE +/- 0.20, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

Intel MPI Benchmarks 2019.3, Test: IMB-MPI1 Sendrecv (Average usec, Fewer Is Better). AWSU20-Multicore: 54.65 (SE +/- 1.06, N = 3; MIN: 50.65 / MAX: 56.81). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks 2019.3, Test: IMB-MPI1 Sendrecv (Average Mbytes/sec, More Is Better). AWSU20-Multicore: 7938.96 (SE +/- 47.48, N = 3). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better). AWSU20-Multicore: 14483.73 (SE +/- 25.24, N = 3). (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi; Open MPI 4.0.3

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 3.02 (SE +/- 0.01, N = 3; MIN: 2.87). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 7.67 (SE +/- 0.00, N = 3; MIN: 7.49). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 2.20 (SE +/- 0.01, N = 3; MIN: 2.13). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

This is a benchmark of Sysbench with CPU and memory sub-tests. Learn more via the OpenBenchmarking.org test page.

Sysbench 2018-07-28, Test: CPU (Events Per Second, More Is Better). AWSU20-Multicore: 7885.38 (SE +/- 1.61, N = 3). (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=core2 -rdynamic -ldl -laio -lm

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1, Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). AWSU20-Multicore: 63.38 (SE +/- 0.04, N = 3). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.7.3, Encoder Speed: 8 (Seconds, Fewer Is Better). AWSU20-Multicore: 9.63 (SE +/- 0.01, N = 3). (CXX) g++ options: -O3 -fPIC

libavif avifenc 0.7.3, Encoder Speed: 10 (Seconds, Fewer Is Better). AWSU20-Multicore: 8.98 (SE +/- 0.01, N = 3). (CXX) g++ options: -O3 -fPIC

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better). AWSU20-Multicore: 8.87 (SE +/- 0.02, N = 3). (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

Intel MPI Benchmarks 2019.3, Test: IMB-MPI1 PingPong (Average Mbytes/sec, More Is Better). AWSU20-Multicore: 5640.12 (SE +/- 42.92, N = 3). (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1, Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better). AWSU20-Multicore: 80.34 (SE +/- 0.05, N = 3). (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 36.16 (SE +/- 0.02, N = 3; MIN: 35.8). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better). AWSU20-Multicore: 2.77 (SE +/- 0.01, N = 3). (CXX) g++ options: -O3 -pthread -lm

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

NeatBench 5 (FPS, More Is Better). AWSU20-Multicore: 9.02 (SE +/- 0.04, N = 3).

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 1.5, Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 7.23 (SE +/- 0.02, N = 3; MIN: 7.05). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 6.86 (SE +/- 0.03, N = 3; MIN: 6.68). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Deconvolution Batch deconv_3d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 38.47 (SE +/- 0.30, N = 3; MIN: 38.05). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 6.59 (SE +/- 0.01, N = 3; MIN: 6.31). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN 1.5, Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better). AWSU20-Multicore: 9.37 (SE +/- 0.01, N = 3; MIN: 9.23). (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: Magnetic Reconnection - Renderer: Path Tracer (FPS, More Is Better). AWSU20-Multicore: 100 (MIN: 83.33 / MAX: 111.11).

214 Results Shown
