y

yy

HTML result view exported from: https://openbenchmarking.org/result/2103201-HA-Y8268526161&grr.

yProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionSystem LayeryY2 x Intel Xeon Platinum 8275CL (48 Cores / 96 Threads)Amazon EC2 c5d.24xlarge (1.0 BIOS)Intel 440FX 82441FX PMC186GB550GB Amazon Elastic Block Store + 4 x 900GB Amazon EC2 NVMe Instance StoragellvmpipeAmazon ElasticUbuntu 20.045.4.0-1039-aws (x86_64)GNOME Shell 3.36.7X Server 1.20.94.5 Mesa 20.2.6 (LLVM 11.0.0 256 bits)GCC 9.3.0ext41280x768KVMOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- y: NONE / discard,relatime,rw / Block Size: 4096Processor Details- CPU Microcode: 0x5003005Java Details- OpenJDK Runtime Environment (build 11.0.10+9-Ubuntu-0ubuntu1.20.04)Python Details- Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

yopenvkl: vklBenchmarkUnstructuredVolumebuild-gcc: Time To Compileopenvkl: vklBenchmarkmysqlslap: 512cassandra: Readsmysqlslap: 256build-clash: Time To Compileparboil: OpenMP MRI Griddinglibgav1: Chimera 1080p 10-bitcassandra: Writesmysqlslap: 128onednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUpovray: Trace Timeyafaray: Total Time For Sample Scenelammps: 20k Atomsonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUblogbench: Readmysqlslap: 64onednn: Recurrent Neural Network Training - u8s8f32 - CPUgraphics-magick: Resizingmysqlslap: 32mysqlslap: 16mysqlslap: 8rodinia: OpenMP Leukocytemysqlslap: 4radiance: Serialluxcorerender: Rainbow Colors and Prismbuild-llvm: Time To Compilelibgav1: Summer Nature 1080paskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingsvt-av1: Enc Mode 0 - 1080pcompress-zstd: 3 - Decompression Speedcompress-zstd: 3 - Compression Speedblender: Barbershop - CPU-Onlyhpcg: libgav1: Summer Nature 4Kappleseed: Material Testermysqlslap: 1libgav1: Chimera 1080pblender: Pabellon Barcelona - CPU-Onlyparboil: OpenMP LBMbuild-erlang: Time To Compileasmfish: 1024 Hash Memory, 26 Depthaom-av1: Speed 4 Two-Passblender: Classroom - CPU-Onlyrodinia: OpenMP HotSpot3Dintel-mpi: IMB-MPI1 Exchangeintel-mpi: IMB-MPI1 Exchangeappleseed: Emilyvpxenc: Speed 0npb: EP.Dsysbench: CPUbuild-eigen: Time To Compileaom-av1: Speed 0 Two-Passbuild-gdb: Time To Compileradiance: SMP Parallelintel-mpi: IMB-MPI1 Sendrecvintel-mpi: IMB-MPI1 Sendrecvonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUarrayfire: BLAS CPUopenvino: Person Detection 0106 FP16 - CPUopenvino: Person Detection 0106 FP16 - CPUcompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedopenvino: Person Detection 0106 FP32 - CPUopenvino: Person Detection 0106 FP32 - CPUbuild-wasmer: Time To Compileopenvino: Face Detection 0106 FP32 - CPUopenvino: Face Detection 0106 FP32 - CPUopenvino: Face Detection 0106 FP16 - CPUopenvino: Face Detection 0106 FP16 - CPUopenvkl: vklBenchmarkVdbVolumeospray: San Miguel - Path Tracerbuild2: Time To Compiledav1d: Chimera 1080p 10-bitcompress-7zip: Compress Speed Testbuild-godot: Time To Compileavifenc: 0luxcorerender: DLSCramspeed: Triad - Floating Pointramspeed: Scale - Floating Pointaobench: 2048 x 2048 - Total Timeblender: Fishy Cat - CPU-Onlyramspeed: Add - Floating Pointramspeed: Average - Floating Pointramspeed: Copy - Floating Pointrodinia: OpenMP LavaMDjohn-the-ripper: MD5openvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUopenvino: Age Gender Recognition Retail 0013 FP32 - CPUgraphics-magick: Sharpengraphics-magick: HWB Color Spacegraphics-magick: Rotategraphics-magick: Noise-Gaussiangraphics-magick: Enhancedgraphics-magick: Swirlramspeed: Copy - Integerramspeed: Triad - Integerramspeed: Scale - Integerramspeed: Add - Integerramspeed: Average - Integerrav1e: 5stream: Copyaom-av1: Speed 6 Two-Passrav1e: 1compress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedopenvkl: vklBenchmarkStructuredVolumeaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumbuild-php: Time To Compilerust-mandel: Time To Complete Serial/Parallel Mandelbrotappleseed: Disney Materialblender: BMW27 - CPU-Onlyrav1e: 6namd: ATPase Simulation - 327,506 Atomscompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddinggromacs: water_GMX50_baresvt-av1: Enc Mode 4 - 1080pospray: XFrog Forest - Path Traceronednn: IP Shapes 3D - u8s8f32 - CPUstockfish: Total Timeavifenc: 6, Losslessavifenc: 2tungsten: Non-Exponentialbuild-linux-kernel: Time To Compileospray: San Miguel - SciVisvpxenc: Speed 5john-the-ripper: Blowfishtungsten: Water Causticaom-av1: Speed 6 Realtimerav1e: 10embree: Pathtracer - Asian Dragon Objx265: Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondcompilebench: Compileintel-mpi: IMB-P2P PingPongembree: Pathtracer ISPC - Asian Dragon Objbuild-ffmpeg: Time To Compilebuild-apache: Time To Compileospray: XFrog Forest - SciVissvt-hevc: 1 - Bosphorus 1080ppennant: sedovbigtachyon: Total Timeaskap: Hogbom Clean OpenMPonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUttsiod-renderer: Phong Rendering With Soft-Shadow Mappingebizzy: kvazaar: Bosphorus 4K - Very Fastm-queens: Time To Solveneatbench: CPUneatbench: Allintel-mpi: IMB-MPI1 PingPongmt-dgemm: Sustained Floating-Point Ratesvt-vp9: Visual Quality Optimized - Bosphorus 1080pkvazaar: Bosphorus 4K - Ultra Fastbuild-imagemagick: Time To Compileembree: Pathtracer ISPC - Asian Dragonc-ray: Total Time - 4K, 16 Rays Per Pixelnpb: LU.Cospray: NASA Streamlines - Path Tracerswet: Averageonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUavifenc: 6aom-av1: Speed 8 Realtimeembree: Pathtracer - Crownbuild-mplayer: Time To Compiletungsten: Volumetric Causticsvt-av1: Enc Mode 8 - 1080pembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragononednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUkvazaar: Bosphorus 1080p - Slowonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Mediumdav1d: Chimera 1080psvt-vp9: VMAF Optimized - Bosphorus 1080prodinia: OpenMP Streamclusterdav1d: Summer Nature 4Kxsbench: pennant: leblancbigonednn: IP Shapes 3D - f32 - CPUx265: Bosphorus 1080ponednn: IP Shapes 3D - bf16bf16bf16 - CPUsysbench: RAM / Memoryavifenc: 10, Losslesstungsten: Hairrodinia: OpenMP CFD Solverparboil: OpenMP CUTCPonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUdav1d: Summer Nature 1080pavifenc: 10rays1bench: Large Sceneospray: NASA Streamlines - SciVisoidn: Memorialkvazaar: Bosphorus 1080p - Very Fastprimesieve: 1e12 Prime Number Generationrust-prime: Prime Number Test To 200,000,000parboil: OpenMP Stencilospray: Magnetic Reconnection - SciVissvt-hevc: 7 - Bosphorus 1080pkvazaar: Bosphorus 1080p - Ultra Fastx264: H.264 Video Encodingsvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pn-queens: Elapsed Timeonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUnpb: EP.Csmallpt: Global Illumination Renderer; 128 Samplessvt-hevc: 10 - Bosphorus 1080plammps: Rhodopsin Proteinospray: Magnetic Reconnection - Path Tracerstream: Addstream: Triadstream: Scalecompilebench: Read Compiled Treecompilebench: Initial Createblogbench: WriteyY48635831456.5224462.7527770.6228770.5831122.1231364.7731312.4527552.7532285.9531040.58184383.11500.84157347.7156304.2141319.72210.04434.38111451952615755.16354045532974465458.090434.46264721.15145222530920.049612.59571.774121.08825.5271020.832663985.054113975780683950.938880755.2366.90234.35886.887335.855019.330.1242619.02802.7164.1727.734923.65242.770447149959.90131.7922.487242124.1101227043525.65115.86115.381215.245420.49156.1391866.015835.4091305.5186.9140.2483.838235.256122.393887.501.61670594.593589.6253814.922751.458.492572.31135.92773.708.4269.0031576.4814.791551.8814.95273893715.8964.947202.1619765862.64762.2616.7137.12160.5760.70976236670.5739499.440.5739503.2441274559651983114611.08116.620.3772285.936.12224.171.08210776110445.48034.1113.3513.6243.86942.82663.93809141.841.4280.404692690.9707.82775.0805.011245.99189.025.6527.5276.214.0476668646362735.98935.0566.8892131.68866.6719.587965221.225920.933.08849.164622.281661244.3125472929274758.157325.06923.14311.3627.6822.2669922.0140691.33912.29834.636730.341781965.196152341930.8119.40630.930.44250.3212.931046262.6945.5417.64266.254215.574133491.2516.956458212811.246783.9729214.48660.5046.943613.61813.190166.62152.422553.72610.32942549.190.8447710.29772250.491029.97313.3610.339368.8788300819.2154782.4943965.382.2840711083.369.0748.349957.9471.4855704.386944.089453.36037948.235.939234.3983.3339.35113.145.1793.5492.99411666.67247.80150.66171.64312.833.1404.426661.188620.3355685755.322.671460.3525.817500OpenBenchmarking.org

OpenVKL

Benchmark: vklBenchmarkUnstructuredVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkUnstructuredVolumeY400K800K1200K1600K2000KSE +/- 6010.76, N = 31952615MIN: 18798 / MAX: 6426323

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 9.3.0Time To CompileY160320480640800SE +/- 1.92, N = 3755.16

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkY120240360480600SE +/- 4.63, N = 3540MIN: 1 / MAX: 2150

MariaDB

Clients: 512

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 512Y100200300400500SE +/- 2.33, N = 34551. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Apache Cassandra

Test: Reads

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 3.11.4Test: ReadsY7K14K21K28K35KSE +/- 12809.54, N = 932974

MariaDB

Clients: 256

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 256Y100200300400500SE +/- 0.63, N = 34651. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To CompileY100200300400500SE +/- 0.55, N = 3458.09

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingY90180270360450SE +/- 5.89, N = 3434.461. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080p 10-bitY510152025SE +/- 0.07, N = 321.151. (CXX) g++ options: -O3 -lpthread

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 3.11.4Test: WritesY30K60K90K120K150KSE +/- 2349.46, N = 15145222

MariaDB

Clients: 128

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 128Y110220330440550SE +/- 1.37, N = 35301. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUY2004006008001000SE +/- 45.02, N = 15920.05MIN: 822.631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUY130260390520650SE +/- 18.29, N = 15612.60MIN: 576.661. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeY1632486480SE +/- 7.56, N = 1571.771. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneY306090120150SE +/- 9.59, N = 9121.091. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsY612182430SE +/- 0.02, N = 325.531. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUY2004006008001000SE +/- 70.38, N = 131020.83MIN: 823.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

BlogBench

Test: Read

OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: Ready100K200K300K400K500KSE +/- 6279.10, N = 34863581. (CC) gcc options: -O2 -pthread

MariaDB

Clients: 64

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 64Y140280420560700SE +/- 2.19, N = 36631. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUY2004006008001000SE +/- 98.29, N = 12985.05MIN: 823.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingY2004006008001000SE +/- 52.51, N = 1511391. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

MariaDB

Clients: 32

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 32Y160320480640800SE +/- 3.49, N = 37571. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 16

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 16Y2004006008001000SE +/- 4.93, N = 38061. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 8

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 8Y2004006008001000SE +/- 5.90, N = 38391. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteY1122334455SE +/- 0.80, N = 1550.941. (CXX) g++ options: -O2 -lOpenCL

MariaDB

Clients: 4

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 4Y2004006008001000SE +/- 1.86, N = 38801. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

Radiance Benchmark

Test: Serial

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SerialY160320480640800755.24

LuxCoreRender

Scene: Rainbow Colors and Prism

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismY246810SE +/- 0.41, N = 126.90MIN: 2.41 / MAX: 7.52

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileY50100150200250SE +/- 1.78, N = 3234.36

libgav1

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 1080pY20406080100SE +/- 0.93, N = 1586.881. (CXX) g++ options: -O3 -lpthread

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingY16003200480064008000SE +/- 277.39, N = 37335.851. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingY11002200330044005500SE +/- 99.10, N = 35019.331. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pY0.02790.05580.08370.11160.1395SE +/- 0.000, N = 30.1241. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Decompression SpeedY6001200180024003000SE +/- 3.37, N = 82619.01. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression SpeedY6001200180024003000SE +/- 111.24, N = 122802.71. (CC) gcc options: -O3 -pthread -lz -llzma

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-OnlyY4080120160200SE +/- 0.13, N = 3164.17

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Y714212835SE +/- 0.27, N = 327.731. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 4KY612182430SE +/- 0.10, N = 323.651. (CXX) g++ options: -O3 -lpthread

Appleseed

Scene: Material Tester

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterY50100150200250242.77

MariaDB

Clients: 1

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1Y30060090012001500SE +/- 4.55, N = 314991. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

libgav1

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080pY1326395265SE +/- 0.28, N = 359.901. (CXX) g++ options: -O3 -lpthread

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-OnlyY306090120150SE +/- 0.23, N = 3131.79

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMY510152025SE +/- 0.39, N = 1522.491. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To CompileY306090120150SE +/- 0.33, N = 3124.11

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthY30M60M90M120M150MSE +/- 669866.88, N = 3122704352

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 4 Two-PassY1.27132.54263.81395.08526.3565SE +/- 0.03, N = 35.651. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-OnlyY306090120150SE +/- 0.20, N = 3115.86

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DY306090120150SE +/- 0.60, N = 3115.381. (CXX) g++ options: -O2 -lOpenCL

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeY50100150200250SE +/- 0.38, N = 3215.24MIN: 1.45 / MAX: 5555.541. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeY12002400360048006000SE +/- 27.43, N = 35420.49MAX: 27220.881. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilyY306090120150156.14

VP9 libvpx Encoding

Speed: Speed 0

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 0Y246810SE +/- 0.01, N = 36.011. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DY13002600390052006500SE +/- 231.22, N = 125835.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUY20K40K60K80K100KSE +/- 29.32, N = 391305.511. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To CompileY20406080100SE +/- 0.04, N = 386.91

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 0 Two-PassY0.0540.1080.1620.2160.27SE +/- 0.00, N = 30.241. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileY20406080100SE +/- 0.10, N = 383.84

Radiance Benchmark

Test: SMP Parallel

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SMP ParallelY50100150200250235.26

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvY306090120150SE +/- 1.02, N = 3122.39MIN: 0.9 / MAX: 2940.021. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvY8001600240032004000SE +/- 8.66, N = 33887.50MAX: 16933.761. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUY0.36380.72761.09141.45521.819SE +/- 0.02538, N = 151.61670MIN: 1.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUY130260390520650SE +/- 1.36, N = 3594.59MIN: 578.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUY130260390520650SE +/- 1.58, N = 3589.63MIN: 576.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ArrayFire

Test: BLAS CPU

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS CPUY8001600240032004000SE +/- 263.74, N = 153814.921. (CXX) g++ options: -rdynamic

OpenVINO

Model: Person Detection 0106 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPUY6001200180024003000SE +/- 7.65, N = 32751.451. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Person Detection 0106 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP16 - Device: CPUY246810SE +/- 0.04, N = 38.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression SpeedY6001200180024003000SE +/- 0.30, N = 32572.31. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression SpeedY2004006008001000SE +/- 17.62, N = 51135.91. (CC) gcc options: -O3 -pthread -lz -llzma

OpenVINO

Model: Person Detection 0106 FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPUY6001200180024003000SE +/- 14.87, N = 32773.701. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Person Detection 0106 FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Person Detection 0106 FP32 - Device: CPUY246810SE +/- 0.03, N = 38.421. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

Timed Wasmer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Wasmer Compilation 1.0.2Time To CompileY1530456075SE +/- 0.72, N = 369.001. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

OpenVINO

Model: Face Detection 0106 FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPUY30060090012001500SE +/- 2.27, N = 31576.481. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Face Detection 0106 FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP32 - Device: CPUY48121620SE +/- 0.11, N = 314.791. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Face Detection 0106 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPUY30060090012001500SE +/- 1.47, N = 31551.881. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Face Detection 0106 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Face Detection 0106 FP16 - Device: CPUY48121620SE +/- 0.13, N = 314.951. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVKL

Benchmark: vklBenchmarkVdbVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkVdbVolumeY6M12M18M24M30MSE +/- 118317.49, N = 327389371MIN: 983498 / MAX: 177949800

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerY1.32532.65063.97595.30126.6265SE +/- 0.01, N = 35.89MIN: 3.98 / MAX: 5.95

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileY1428425670SE +/- 0.21, N = 364.95

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitY4080120160200SE +/- 0.32, N = 3202.16MIN: 137.85 / MAX: 316.321. (CC) gcc options: -pthread -lm

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestY40K80K120K160K200KSE +/- 2078.39, N = 31976581. (CXX) g++ options: -pipe -lpthread

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileY1428425670SE +/- 0.20, N = 362.65

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0Y1428425670SE +/- 0.27, N = 362.261. (CXX) g++ options: -O3 -fPIC -lm

LuxCoreRender

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCY246810SE +/- 0.03, N = 36.71MIN: 6.53 / MAX: 7.12

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating Pointy7K14K21K28K35KSE +/- 42.50, N = 331456.521. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Pointy5K10K15K20K25KSE +/- 39.15, N = 324462.751. (CC) gcc options: -O3 -march=native

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeY918273645SE +/- 0.56, N = 537.121. (CC) gcc options: -lm -O3

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-OnlyY1428425670SE +/- 0.16, N = 360.57

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Pointy6K12K18K24K30KSE +/- 61.60, N = 327770.621. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Pointy6K12K18K24K30KSE +/- 118.88, N = 328770.581. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Pointy7K14K21K28K35KSE +/- 37.59, N = 331122.121. (CC) gcc options: -O3 -march=native

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDY1428425670SE +/- 0.21, N = 360.711. (CXX) g++ options: -O2 -lOpenCL

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Y1.6M3.2M4.8M6.4M8MSE +/- 7125.85, N = 376236671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUY0.12830.25660.38490.51320.6415SE +/- 0.00, N = 30.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUY8K16K24K32K40KSE +/- 21.74, N = 339499.441. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPUY0.12830.25660.38490.51320.6415SE +/- 0.00, N = 30.571. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

OpenVINO

Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2021.1Model: Age Gender Recognition Retail 0013 FP32 - Device: CPUY8K16K24K32K40KSE +/- 8.24, N = 339503.241. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -pie -pthread -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenY90180270360450SE +/- 0.33, N = 34121. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceY160320480640800SE +/- 1.00, N = 37451. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateY130260390520650SE +/- 9.21, N = 35961. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianY110220330440550SE +/- 8.33, N = 35191. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedY2004006008001000SE +/- 1.53, N = 38311. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlY30060090012001500SE +/- 6.36, N = 314611. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integery7K14K21K28K35KSE +/- 120.16, N = 331364.771. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integery7K14K21K28K35KSE +/- 145.01, N = 331312.451. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integery6K12K18K24K30KSE +/- 131.64, N = 327552.751. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integery7K14K21K28K35KSE +/- 29.87, N = 332285.951. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integery7K14K21K28K35KSE +/- 109.84, N = 331040.581. (CC) gcc options: -O3 -march=native

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5Y0.24320.48640.72960.97281.216SE +/- 0.006, N = 31.081

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copyy40K80K120K160K200KSE +/- 2763.68, N = 25184383.11. (CC) gcc options: -O3 -march=native -fopenmp

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Two-PassY48121620SE +/- 0.07, N = 316.621. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1Y0.08480.16960.25440.33920.424SE +/- 0.000, N = 30.377

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression SpeedY5001000150020002500SE +/- 5.93, N = 32285.91. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression SpeedY816243240SE +/- 0.47, N = 336.11. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression SpeedY5001000150020002500SE +/- 8.98, N = 32224.11. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression SpeedY1632486480SE +/- 0.65, N = 371.01. (CC) gcc options: -O3 -pthread -lz -llzma

OpenVKL

Benchmark: vklBenchmarkStructuredVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkStructuredVolumeY20M40M60M80M100MSE +/- 946535.00, N = 382107761MIN: 1175971 / MAX: 817665264

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingY2K4K6K8K10KSE +/- 61.75, N = 1210445.41. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingY2K4K6K8K10KSE +/- 185.25, N = 128034.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: SlowY3691215SE +/- 0.03, N = 313.351. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: MediumY48121620SE +/- 0.04, N = 313.621. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To CompileY1020304050SE +/- 0.08, N = 343.87

Rust Mandelbrot

Time To Complete Serial/Parallel Mandelbrot

OpenBenchmarking.orgSeconds, Fewer Is BetterRust MandelbrotTime To Complete Serial/Parallel MandelbrotY1020304050SE +/- 0.05, N = 342.831. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Appleseed

Scene: Disney Material

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialY142842567063.94

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-OnlyY1020304050SE +/- 0.11, N = 341.84

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6Y0.32130.64260.96391.28521.6065SE +/- 0.005, N = 31.428

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsY0.09110.18220.27330.36440.4555SE +/- 0.00324, N = 30.40469

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression SpeedY6001200180024003000SE +/- 10.40, N = 32690.91. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression SpeedY150300450600750SE +/- 3.90, N = 3707.81. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression SpeedY6001200180024003000SE +/- 8.63, N = 32775.01. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression SpeedY2004006008001000SE +/- 3.82, N = 3805.01. (CC) gcc options: -O3 -pthread -lz -llzma

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingY2K4K6K8K10KSE +/- 46.36, N = 311245.91. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingY2K4K6K8K10KSE +/- 17.84, N = 39189.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bareY1.27172.54343.81515.08686.3585SE +/- 0.025, N = 35.6521. (CXX) g++ options: -O3 -pthread

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pY246810SE +/- 0.101, N = 67.5271. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OSPray

Demo: XFrog Forest - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: Path TracerY246810SE +/- 0.00, N = 36.21MIN: 5 / MAX: 6.29

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUY0.91071.82142.73213.64284.5535SE +/- 3.226057, N = 124.047666MIN: 0.631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeY20M40M60M80M100MSE +/- 1033175.15, N = 3864636271. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessY816243240SE +/- 0.51, N = 335.991. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2Y816243240SE +/- 0.21, N = 335.061. (CXX) g++ options: -O3 -fPIC -lm

Tungsten Renderer

Scene: Non-Exponential

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialY246810SE +/- 0.28804, N = 156.889211. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To CompileY714212835SE +/- 0.64, N = 331.69

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisY1530456075SE +/- 0.00, N = 366.67MIN: 21.74

VP9 libvpx Encoding

Speed: Speed 5

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 5Y510152025SE +/- 0.06, N = 319.581. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishY20K40K60K80K100KSE +/- 75.82, N = 3796521. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Tungsten Renderer

Scene: Water Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticY510152025SE +/- 0.05, N = 321.231. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 RealtimeY510152025SE +/- 0.12, N = 320.931. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10Y0.69481.38962.08442.77923.474SE +/- 0.010, N = 33.088

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon ObjY1122334455SE +/- 0.49, N = 349.16MIN: 45.78 / MAX: 50.74

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KY510152025SE +/- 0.05, N = 322.281. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondY400K800K1200K1600K2000KSE +/- 13722.62, N = 31661244.311. (CC) gcc options: -O2 -lrt" -lrt

Compile Bench

Test: Compile

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Compiley30060090012001500SE +/- 4.69, N = 31500.84

Intel MPI Benchmarks

Test: IMB-P2P PingPong

OpenBenchmarking.orgAverage Msg/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-P2P PingPongY6M12M18M24M30MSE +/- 194733.20, N = 329292747MIN: 9384 / MAX: 828327201. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon ObjY1326395265SE +/- 0.09, N = 358.16MIN: 56.68 / MAX: 59.51

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To CompileY612182430SE +/- 0.09, N = 325.07

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To CompileY612182430SE +/- 0.01, N = 323.14

OSPray

Demo: XFrog Forest - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: XFrog Forest - Renderer: SciVisY3691215SE +/- 0.00, N = 311.36MIN: 8.06 / MAX: 11.49

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pY714212835SE +/- 0.12, N = 327.681. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigY510152025SE +/- 0.01, N = 322.271. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeY510152025SE +/- 0.39, N = 322.011. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPY150300450600750SE +/- 5.71, N = 3691.341. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUY3691215SE +/- 0.05, N = 312.30MIN: 9.611. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUY1.04332.08663.12994.17325.2165SE +/- 0.01293, N = 34.63673MIN: 4.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUY0.07690.15380.23070.30760.3845SE +/- 0.000918, N = 30.341781MIN: 0.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingY2004006008001000SE +/- 1.83, N = 3965.201. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3Y300K600K900K1200K1500KSE +/- 11471.93, N = 315234191. (CC) gcc options: -pthread -lpthread -O3 -march=native

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastY714212835SE +/- 0.10, N = 330.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveY510152025SE +/- 0.16, N = 319.411. (CXX) g++ options: -fopenmp -O2 -march=native

NeatBench

Acceleration: CPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPUY714212835SE +/- 0.15, N = 330.9

NeatBench

Acceleration: All

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: AllY714212835SE +/- 0.35, N = 330.4

Intel MPI Benchmarks

Test: IMB-MPI1 PingPong

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 PingPongY9001800270036004500SE +/- 57.73, N = 34250.32MIN: 2.77 / MAX: 15945.61. (CXX) g++ options: -O0 -pedantic -fopenmp -pthread -lmpi_cxx -lmpi

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateY3691215SE +/- 0.02, N = 312.931. (CC) gcc options: -O3 -march=native -fopenmp

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pY60120180240300SE +/- 2.41, N = 14262.691. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastY1020304050SE +/- 0.78, N = 445.541. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileY48121620SE +/- 0.10, N = 317.64

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian DragonY1530456075SE +/- 1.03, N = 566.25MIN: 58.77 / MAX: 68.71

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelY48121620SE +/- 0.05, N = 315.571. (CC) gcc options: -lm -lpthread -O3

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CY30K60K90K120K150KSE +/- 101.26, N = 3133491.251. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerY48121620SE +/- 0.00, N = 316.95MIN: 8.06 / MAX: 17.24

Swet

Average

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageY140M280M420M560M700MSE +/- 3387555.31, N = 36458212811. (CC) gcc options: -lm -lpthread -lcurses -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUY0.28050.5610.84151.1221.4025SE +/- 0.00439, N = 31.24678MIN: 1.161. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUY0.89391.78782.68173.57564.4695SE +/- 0.00457, N = 33.97292MIN: 3.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6Y48121620SE +/- 0.17, N = 314.491. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 8 RealtimeY1428425670SE +/- 0.93, N = 460.501. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: CrownY1122334455SE +/- 0.52, N = 346.94MIN: 44.3 / MAX: 49.03

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.4Time To CompileY3691215SE +/- 0.05, N = 313.62

Tungsten Renderer

Scene: Volumetric Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticY3691215SE +/- 0.07, N = 313.191. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pY1530456075SE +/- 0.42, N = 366.621. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: CrownY1224364860SE +/- 0.42, N = 352.42MIN: 50.36 / MAX: 54.91

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian DragonY1224364860SE +/- 1.07, N = 353.73MIN: 49.01 / MAX: 55.88

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUY0.07410.14820.22230.29640.3705SE +/- 0.002701, N = 30.329425MIN: 0.31. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: SlowY1122334455SE +/- 0.24, N = 349.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUY0.19010.38020.57030.76040.9505SE +/- 0.004574, N = 30.844771MIN: 0.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUY0.0670.1340.2010.2680.335SE +/- 0.004669, N = 30.297722MIN: 0.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: MediumY1122334455SE +/- 0.09, N = 350.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pY2004006008001000SE +/- 4.11, N = 31029.97MIN: 584.34 / MAX: 1300.561. (CC) gcc options: -pthread -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pY70140210280350SE +/- 3.59, N = 9313.361. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterY3691215SE +/- 0.09, N = 310.341. (CXX) g++ options: -O2 -lOpenCL

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KY80160240320400SE +/- 1.61, N = 3368.87MIN: 185.37 / MAX: 401.941. (CC) gcc options: -pthread -lm

Xsbench

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06Y2M4M6M8M10MSE +/- 62608.90, N = 388300811. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigY3691215SE +/- 0.028664, N = 39.2154781. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUY0.56121.12241.68362.24482.806SE +/- 0.00564, N = 32.49439MIN: 2.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pY1530456075SE +/- 0.24, N = 365.381. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUY0.51391.02781.54172.05562.5695SE +/- 0.00690, N = 32.28407MIN: 2.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / MemoryY2K4K6K8K10KSE +/- 126.70, N = 311083.361. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessY3691215SE +/- 0.075, N = 39.0741. (CXX) g++ options: -O3 -fPIC -lm

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairY246810SE +/- 0.03564, N = 38.349951. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverY246810SE +/- 0.082, N = 37.9471. (CXX) g++ options: -O2 -lOpenCL

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPY0.33430.66861.00291.33721.6715SE +/- 0.014419, N = 151.4855701. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUY0.98711.97422.96133.94844.9355SE +/- 0.01481, N = 34.38694MIN: 4.251. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUY0.92011.84022.76033.68044.6005SE +/- 0.00117, N = 34.08945MIN: 3.921. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUY0.75611.51222.26833.02443.7805SE +/- 0.00289, N = 33.36037MIN: 3.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pY2004006008001000SE +/- 14.62, N = 5948.23MIN: 286.82 / MAX: 1075.041. (CC) gcc options: -pthread -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10Y1.33632.67264.00895.34526.6815SE +/- 0.011, N = 35.9391. (CXX) g++ options: -O3 -fPIC -lm

rays1bench

Large Scene

OpenBenchmarking.orgmrays/s, More Is Betterrays1bench 2020-01-09Large SceneY50100150200250SE +/- 1.91, N = 3234.39

OSPray

Demo: NASA Streamlines - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisY20406080100SE +/- 0.00, N = 383.33MIN: 13.89 / MAX: 90.91

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialY918273645SE +/- 0.26, N = 339.35

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastY306090120150SE +/- 0.13, N = 3113.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationY1.16532.33063.49594.66125.8265SE +/- 0.035, N = 35.1791. (CXX) g++ options: -O3 -lpthread

Rust Prime Benchmark

Prime Number Test To 200,000,000

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000Y0.79851.5972.39553.1943.9925SE +/- 0.062, N = 43.5491. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilY0.67371.34742.02112.69483.3685SE +/- 0.022981, N = 32.9941161. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OSPray

Demo: Magnetic Reconnection - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: SciVisY1530456075SE +/- 0.00, N = 366.67MIN: 9.52 / MAX: 71.43

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pY50100150200250SE +/- 0.63, N = 3247.801. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastY306090120150SE +/- 1.83, N = 3150.661. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingY4080120160200SE +/- 1.16, N = 3171.641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pY70140210280350SE +/- 2.93, N = 3312.831. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeY0.70651.4132.11952.8263.5325SE +/- 0.063, N = 33.1401. (CC) gcc options: -static -fopenmp -O3 -march=native

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUY0.9961.9922.9883.9844.98SE +/- 0.00888, N = 34.42666MIN: 4.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUY0.26740.53480.80221.06961.337SE +/- 0.00487, N = 31.18862MIN: 1.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUY0.07550.1510.22650.3020.3775SE +/- 0.001459, N = 30.335568MIN: 0.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CY12002400360048006000SE +/- 99.69, N = 45755.321. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesY0.6011.2021.8032.4043.005SE +/- 0.030, N = 32.6711. (CXX) g++ options: -fopenmp -O3

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pY100200300400500SE +/- 8.77, N = 3460.351. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinY612182430SE +/- 0.33, N = 325.821. (CXX) g++ options: -O3 -pthread -lm

OSPray

Demo: Magnetic Reconnection - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: Magnetic Reconnection - Renderer: Path TracerY110220330440550500MIN: 200 / MAX: 1000

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Addy30K60K90K120K150KSE +/- 1800.63, N = 5157347.71. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triady30K60K90K120K150KSE +/- 1525.59, N = 5156304.21. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scaley30K60K90K120K150KSE +/- 1689.42, N = 5141319.71. (CC) gcc options: -O3 -march=native -fopenmp

Compile Bench

Test: Read Compiled Tree

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Read Compiled Treey5001000150020002500SE +/- 11.70, N = 32210.04

Compile Bench

Test: Initial Create

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Initial Createy90180270360450SE +/- 5.14, N = 3434.38

BlogBench

Test: Write

OpenBenchmarking.orgFinal Score, More Is BetterBlogBench 1.1Test: Writey2K4K6K8K10KSE +/- 910.09, N = 3111451. (CC) gcc options: -O2 -pthread


Phoronix Test Suite v10.8.4