EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

patch testing by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102055-HA-EPYC7F52L26

Run | Date | Test Duration
Ondemand | February 04 2021 | 2 Hours, 34 Minutes
Schedutil | February 04 2021 | 2 Hours, 36 Minutes
Performance | February 05 2021 | 2 Hours, 32 Minutes
Average | | 2 Hours, 34 Minutes



EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

System details (Ondemand, Schedutil, Performance):
  Processor: AMD EPYC 7F52 16-Core @ 3.91GHz (16 Cores / 32 Threads)
  Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 8 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN
  Disk: 280GB INTEL SSDPE21D280GA
  Graphics: ASPEED
  Monitor: VE228
  OS: Ubuntu 20.04
  Kernel: 5.11.0-rc6-phx (x86_64) 20210203
  Desktop: GNOME Shell 3.36.1
  Display Server: X Server 1.20.7
  Display Driver: aspeed
  Compiler: GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080 / 1024x768

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  - Ondemand: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034
  - Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034
  - Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034

Python Details
  - Python 2.7.18rc1 + Python 3.8.2

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  - srbds: Not affected
  - tsx_async_abort: Not affected
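
The Processor Details above record which cpufreq scaling governor each run used. Before comparing your own system, it can help to confirm which governor is active. The following is a minimal Python sketch, assuming the standard Linux cpufreq sysfs layout (`/sys/devices/system/cpu/cpuN/cpufreq/scaling_governor`); the function name `read_governors` and its `root` parameter are illustrative and not part of the Phoronix Test Suite.

```python
from pathlib import Path


def read_governors(root="/sys/devices/system/cpu"):
    """Return {cpu_name: governor} for every CPU exposing cpufreq under root."""
    governors = {}
    # Only cpu0, cpu1, ... directories match; the top-level "cpufreq" dir does not.
    for gov_file in sorted(Path(root).glob("cpu[0-9]*/cpufreq/scaling_governor")):
        cpu_name = gov_file.parent.parent.name  # e.g. "cpu0"
        governors[cpu_name] = gov_file.read_text().strip()
    return governors


if __name__ == "__main__":
    found = read_governors()
    # Print the distinct governors in use across all CPUs.
    print(set(found.values()) or "no cpufreq information found")
```

On a system configured like one of the runs here, all CPUs would be expected to report the same governor name (ondemand, schedutil or performance).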

[Figure: Result Overview (Phoronix Test Suite). Relative performance of Ondemand, Schedutil and Performance, axis from 100% to 129%. Tests shown: dav1d, Timed GDB GNU Debugger Compilation, RAR Compression, x265, SVT-AV1, Kvazaar, OpenVKL, Redis, x264, CLOMP, LuxCoreRender, GraphicsMagick, Timed Godot Game Engine Compilation, OCRMyPDF, Rodinia, Pennant, Hugin, Intel Open Image Denoise, OSPray, Blender, TensorFlow Lite, John The Ripper.]

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

Test | Ondemand | Schedutil | Performance
blender: Barbershop - CPU-Only | 357.57 | 357.49 | 356.90
openvkl: vklBenchmark | 219 | 217 | 223
ospray: San Miguel - Path Tracer | 1.86 | 1.87 | 1.87
rodinia: OpenMP LavaMD | 126.389 | 125.977 | 125.881
dav1d: Chimera 1080p 10-bit | 119.13 | 92.21 | 131.74
rodinia: OpenMP HotSpot3D | 90.042 | 91.310 | 92.087
rodinia: OpenMP Leukocyte | 89.269 | 90.431 | 89.709
build-gdb: Time To Compile | 95.492 | 89.606 | 83.709
blender: BMW27 - CPU-Only | 83.66 | 83.57 | 83.73
build-godot: Time To Compile | 84.472 | 83.313 | 83.164
tensorflow-lite: Inception V4 | 1504413 | 1503093 | 1503297
compress-rar: Linux Source Tree Archiving To RAR | 74.983 | 76.921 | 67.676
tensorflow-lite: Inception ResNet V2 | 1351440 | 1350653 | 1350667
luxcorerender: DLSC | 3.15 | 3.19 | 3.18
luxcorerender: Rainbow Colors and Prism | 3.48 | 3.57 | 3.51
tensorflow-lite: SqueezeNet | 107260 | 107294 | 107390
tensorflow-lite: NASNet Mobile | 126878 | 125981 | 126203
tensorflow-lite: Mobilenet Quant | 70655.1 | 70760.6 | 70739.1
tensorflow-lite: Mobilenet Float | 69039.3 | 69012.9 | 68976.9
john-the-ripper: MD5 | 1719667 | 1721000 | 1721333
graphics-magick: Sharpen | 236 | 236 | 237
graphics-magick: Enhanced | 376 | 376 | 377
graphics-magick: Noise-Gaussian | 421 | 421 | 420
graphics-magick: Rotate | 638 | 639 | 629
graphics-magick: Resizing | 1582 | 1587 | 1609
graphics-magick: Swirl | 922 | 923 | 928
graphics-magick: HWB Color Space | 1271 | 1121 | 1239
hugin: Panorama Photo Assistant + Stitching Time | 50.654 | 50.464 | 50.408
clomp: Static OMP Speedup | 49.7 | 50.3 | 50.7
ospray: San Miguel - SciVis | 24.39 | 24.39 | 24.39
ospray: NASA Streamlines - Path Tracer | 7.09 | 7.11 | 7.11
john-the-ripper: Blowfish | 26298 | 26310 | 26323
x265: Bosphorus 4K | 21.65 | 21.05 | 21.50
pennant: sedovbig | 25.61074 | 25.59670 | 25.54257
kvazaar: Bosphorus 4K - Very Fast | 25.35 | 26.27 | 26.39
redis: SET | 1308179.83 | 1313860.03 | 1318380.42
ocrmypdf: Processing 60 Page PDF Document | 19.546 | 19.483 | 19.281
dav1d: Chimera 1080p | 625.43 | 536.65 | 713.18
svt-av1: Enc Mode 4 - 1080p | 5.244 | 5.379 | 5.431
pennant: leblancbig | 16.02967 | 15.99347 | 15.90980
redis: GET | 1545026.54 | 1599356.29 | 1604323.00
dav1d: Summer Nature 4K | 254.29 | 246.86 | 264.98
oidn: Memorial | 14.19 | 14.26 | 14.23
rodinia: OpenMP Streamcluster | 13.769 | 13.754 | 13.705
rodinia: OpenMP CFD Solver | 13.243 | 13.247 | 13.331
kvazaar: Bosphorus 4K - Ultra Fast | 46.86 | 47.63 | 47.80
x265: Bosphorus 1080p | 65.47 | 55.07 | 68.48
svt-av1: Enc Mode 8 - 1080p | 41.023 | 43.040 | 42.823
ospray: NASA Streamlines - SciVis | 33.33 | 33.33 | 33.33
kvazaar: Bosphorus 1080p - Very Fast | 87.16 | 86.89 | 92.22
dav1d: Summer Nature 1080p | 592.34 | 488.95 | 668.50
kvazaar: Bosphorus 1080p - Ultra Fast | 155.97 | 153.01 | 161.97
x264: H.264 Video Encoding | 172.64 | 172.60 | 176.54
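
One common way to summarize results like those above is a geometric mean of per-test ratios against a baseline run. The sketch below does this for four tests whose values are copied from the results above. Choosing Ondemand as the baseline, the subset of tests, and the helper names are all assumptions made for illustration; this is not necessarily how OpenBenchmarking.org computes its own overview.

```python
import math

# (test, higher_is_better, ondemand, schedutil, performance)
ROWS = [
    ("dav1d: Chimera 1080p 10-bit", True, 119.13, 92.21, 131.74),
    ("build-gdb: Time To Compile", False, 95.492, 89.606, 83.709),
    ("compress-rar: Linux Source Tree Archiving To RAR", False, 74.983, 76.921, 67.676),
    ("x265: Bosphorus 1080p", True, 65.47, 55.07, 68.48),
]


def relative_score(value, baseline, higher_is_better):
    """Ratio > 1 means faster than the baseline, whatever the unit's direction."""
    return value / baseline if higher_is_better else baseline / value


def geometric_mean(values):
    """Geometric mean via the mean of logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def summarize(rows):
    """Geometric mean of each governor's score relative to Ondemand."""
    sched = [relative_score(s, o, hib) for _, hib, o, s, _ in rows]
    perf = [relative_score(p, o, hib) for _, hib, o, _, p in rows]
    return {"Schedutil": geometric_mean(sched), "Performance": geometric_mean(perf)}


if __name__ == "__main__":
    for governor, score in summarize(ROWS).items():
        print(f"{governor}: {score:.3f}x of Ondemand")
```

Inverting the ratio for "Fewer Is Better" tests keeps every score on the same scale, so timed compilations and frame rates can be averaged together.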

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90, Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
  Ondemand: 357.57 (SE +/- 0.51, N = 3)
  Schedutil: 357.49 (SE +/- 0.45, N = 3)
  Performance: 356.90 (SE +/- 0.68, N = 3)
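
Each result on this page is reported with a standard error (SE) over N runs. As a reminder of what that figure expresses, here is a small sketch that computes the standard error of the mean from individual run times. The three sample values are hypothetical, since the page publishes only the mean and SE, and the estimator (sample standard deviation divided by the square root of N) is an assumption about how the Phoronix Test Suite derives the number.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical run times in seconds; NOT taken from the result file.
    runs = [357.1, 357.4, 358.2]
    mean = statistics.mean(runs)
    print(f"{mean:.2f} s, SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```

When the difference between two governors is smaller than their standard errors, as in several of the results here, the ordering between them should be read as noise rather than a real gap.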

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 0.9, Benchmark: vklBenchmark (Items / Sec, More Is Better)
  Schedutil: 217 (MIN: 1 / MAX: 786)
  Ondemand: 219 (MIN: 1 / MAX: 776)
  Performance: 223 (MIN: 1 / MAX: 785)
  Reported errors: SE +/- 0.88, N = 3 and SE +/- 0.58, N = 3 (only two are listed; the source does not show which runs they belong to)

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: San Miguel - Renderer: Path Tracer (FPS, More Is Better)
  Ondemand: 1.86 (SE +/- 0.00, N = 3; MIN: 1.85 / MAX: 1.88)
  Schedutil: 1.87 (SE +/- 0.00, N = 3; MIN: 1.85 / MAX: 1.88)
  Performance: 1.87 (SE +/- 0.00, N = 3; MIN: 1.86 / MAX: 1.88)

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
  Ondemand: 126.39 (SE +/- 0.25, N = 3)
  Schedutil: 125.98 (SE +/- 0.27, N = 3)
  Performance: 125.88 (SE +/- 0.28, N = 3)
  (CXX) g++ options: -O2 -lOpenCL

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
  Schedutil: 92.21 (SE +/- 0.08, N = 3; MIN: 59.88 / MAX: 204.44)
  Ondemand: 119.13 (SE +/- 0.19, N = 3; MIN: 78.3 / MAX: 259.88)
  Performance: 131.74 (SE +/- 0.09, N = 3; MIN: 85.18 / MAX: 277.62)
  (CC) gcc options: -pthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
  Performance: 92.09 (SE +/- 0.46, N = 3)
  Schedutil: 91.31 (SE +/- 0.24, N = 3)
  Ondemand: 90.04 (SE +/- 0.08, N = 3)
  (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
  Schedutil: 90.43 (SE +/- 0.33, N = 3)
  Performance: 89.71 (SE +/- 0.29, N = 3)
  Ondemand: 89.27 (SE +/- 0.58, N = 3)
  (CXX) g++ options: -O2 -lOpenCL

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation 9.1, Time To Compile (Seconds, Fewer Is Better)
  Ondemand: 95.49 (SE +/- 0.04, N = 3)
  Schedutil: 89.61 (SE +/- 0.10, N = 3)
  Performance: 83.71 (SE +/- 0.08, N = 3)

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
  Performance: 83.73 (SE +/- 0.50, N = 3)
  Ondemand: 83.66 (SE +/- 0.38, N = 3)
  Schedutil: 83.57 (SE +/- 0.25, N = 3)

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 3.2.3, Time To Compile (Seconds, Fewer Is Better)
  Ondemand: 84.47 (SE +/- 0.04, N = 3)
  Schedutil: 83.31 (SE +/- 0.05, N = 3)
  Performance: 83.16 (SE +/- 0.07, N = 3)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23, Model: Inception V4 (Microseconds, Fewer Is Better)
  Ondemand: 1504413 (SE +/- 385.24, N = 3)
  Performance: 1503297 (SE +/- 236.24, N = 3)
  Schedutil: 1503093 (SE +/- 262.95, N = 3)

RAR Compression

This test measures the time needed to archive/compress two copies of the Linux 4.13 kernel source tree using RAR/WinRAR compression. Learn more via the OpenBenchmarking.org test page.

RAR Compression 5.6.1, Linux Source Tree Archiving To RAR (Seconds, Fewer Is Better)
  Schedutil: 76.92 (SE +/- 0.41, N = 3)
  Ondemand: 74.98 (SE +/- 0.18, N = 3)
  Performance: 67.68 (SE +/- 0.24, N = 3)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23, Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
  Ondemand: 1351440 (SE +/- 365.29, N = 3)
  Performance: 1350667 (SE +/- 483.75, N = 3)
  Schedutil: 1350653 (SE +/- 394.05, N = 3)

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.3, Scene: DLSC (M samples/sec, More Is Better)
  Ondemand: 3.15 (SE +/- 0.01, N = 3; MIN: 3.06 / MAX: 3.29)
  Performance: 3.18 (SE +/- 0.04, N = 3; MIN: 3.05 / MAX: 3.39)
  Schedutil: 3.19 (SE +/- 0.05, N = 3; MIN: 3.04 / MAX: 3.41)

LuxCoreRender 2.3, Scene: Rainbow Colors and Prism (M samples/sec, More Is Better)
  Ondemand: 3.48 (SE +/- 0.01, N = 3; MIN: 3.44 / MAX: 3.5)
  Performance: 3.51 (SE +/- 0.01, N = 3; MIN: 3.48 / MAX: 3.55)
  Schedutil: 3.57 (SE +/- 0.01, N = 3; MIN: 3.48 / MAX: 3.6)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23, Model: SqueezeNet (Microseconds, Fewer Is Better)
  Performance: 107390 (SE +/- 43.47, N = 3)
  Schedutil: 107294 (SE +/- 80.49, N = 3)
  Ondemand: 107260 (SE +/- 84.90, N = 3)

TensorFlow Lite 2020-08-23, Model: NASNet Mobile (Microseconds, Fewer Is Better)
  Ondemand: 126878 (SE +/- 136.25, N = 3)
  Performance: 126203 (SE +/- 287.23, N = 3)
  Schedutil: 125981 (SE +/- 40.55, N = 3)

TensorFlow Lite 2020-08-23, Model: Mobilenet Quant (Microseconds, Fewer Is Better)
  Schedutil: 70760.6 (SE +/- 30.64, N = 3)
  Performance: 70739.1 (SE +/- 30.62, N = 3)
  Ondemand: 70655.1 (SE +/- 45.84, N = 3)

TensorFlow Lite 2020-08-23, Model: Mobilenet Float (Microseconds, Fewer Is Better)
  Ondemand: 69039.3 (SE +/- 42.21, N = 3)
  Schedutil: 69012.9 (SE +/- 21.41, N = 3)
  Performance: 68976.9 (SE +/- 36.63, N = 3)

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1, Test: MD5 (Real C/S, More Is Better)
  Ondemand: 1719667 (SE +/- 3179.80, N = 3)
  Schedutil: 1721000 (SE +/- 3000.00, N = 3)
  Performance: 1721333 (SE +/- 2962.73, N = 3)
  (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

All GraphicsMagick results below were built with (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick 1.3.33, Operation: Sharpen (Iterations Per Minute, More Is Better)
  Ondemand: 236
  Schedutil: 236
  Performance: 237

GraphicsMagick 1.3.33, Operation: Enhanced (Iterations Per Minute, More Is Better)
  Ondemand: 376
  Schedutil: 376
  Performance: 377
  Reported error: SE +/- 0.33, N = 3 (only one is listed; the source does not show which run it belongs to)

GraphicsMagick 1.3.33, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
  Performance: 420 (SE +/- 0.67, N = 3)
  Ondemand: 421 (SE +/- 0.67, N = 3)
  Schedutil: 421 (SE +/- 0.58, N = 3)

GraphicsMagick 1.3.33, Operation: Rotate (Iterations Per Minute, More Is Better)
  Performance: 629 (SE +/- 4.18, N = 3)
  Ondemand: 638 (SE +/- 4.70, N = 3)
  Schedutil: 639 (SE +/- 4.18, N = 3)

GraphicsMagick 1.3.33, Operation: Resizing (Iterations Per Minute, More Is Better)
  Ondemand: 1582 (SE +/- 5.49, N = 3)
  Schedutil: 1587 (SE +/- 18.52, N = 3)
  Performance: 1609 (SE +/- 1.20, N = 3)

GraphicsMagick 1.3.33, Operation: Swirl (Iterations Per Minute, More Is Better)
  Ondemand: 922
  Schedutil: 923
  Performance: 928
  Reported error: SE +/- 0.88, N = 3 (only one is listed; the source does not show which run it belongs to)

GraphicsMagick 1.3.33, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  Schedutil: 1121 (SE +/- 2.40, N = 3)
  Performance: 1239 (SE +/- 3.67, N = 3)
  Ondemand: 1271 (SE +/- 2.52, N = 3)

Hugin

Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.

Hugin, Panorama Photo Assistant + Stitching Time (Seconds, Fewer Is Better)
  Ondemand: 50.65 (SE +/- 0.33, N = 3)
  Schedutil: 50.46 (SE +/- 0.17, N = 3)
  Performance: 50.41 (SE +/- 0.29, N = 3)

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2, Static OMP Speedup (Speedup, More Is Better)
  Ondemand: 49.7 (SE +/- 0.25, N = 3)
  Schedutil: 50.3 (SE +/- 0.33, N = 3)
  Performance: 50.7 (SE +/- 0.43, N = 3)
  (CC) gcc options: -fopenmp -O3 -lm

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: San Miguel - Renderer: SciVis (FPS, More Is Better)
  Ondemand: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)
  Schedutil: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)
  Performance: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)

OSPray 1.8.5, Demo: NASA Streamlines - Renderer: Path Tracer (FPS, More Is Better)
  Ondemand: 7.09 (SE +/- 0.00, N = 3; MIN: 6.99 / MAX: 7.19)
  Schedutil: 7.11 (SE +/- 0.02, N = 3; MIN: 6.99 / MAX: 7.19)
  Performance: 7.11 (SE +/- 0.02, N = 3; MIN: 6.99 / MAX: 7.25)

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1, Test: Blowfish (Real C/S, More Is Better)
  Ondemand: 26298 (SE +/- 22.67, N = 3)
  Schedutil: 26310 (SE +/- 22.45, N = 3)
  Performance: 26323 (SE +/- 23.97, N = 3)
  (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Schedutil: 21.05 (SE +/- 0.04, N = 3)
  Performance: 21.50 (SE +/- 0.15, N = 3)
  Ondemand: 21.65 (SE +/- 0.12, N = 3)
  (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Ondemand: 25.61 (SE +/- 0.02, N = 3)
  Schedutil: 25.60 (SE +/- 0.06, N = 3)
  Performance: 25.54 (SE +/- 0.02, N = 3)
  (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
  Ondemand: 25.35 (SE +/- 0.03, N = 3)
  Schedutil: 26.27 (SE +/- 0.02, N = 3)
  Performance: 26.39 (SE +/- 0.05, N = 3)
  (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
  Ondemand: 1308179.83 (SE +/- 3347.62, N = 3)
  Schedutil: 1313860.03 (SE +/- 16949.35, N = 4)
  Performance: 1318380.42 (SE +/- 17244.90, N = 3)
  (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OCRMyPDF

OCRMyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs whose text can be selected, searched, and copied. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.

OCRMyPDF 9.6.0+dfsg, Processing 60 Page PDF Document (Seconds, Fewer Is Better)
  Ondemand: 19.55 (SE +/- 0.04, N = 3)
  Schedutil: 19.48 (SE +/- 0.11, N = 3)
  Performance: 19.28 (SE +/- 0.04, N = 3)

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Chimera 1080p (FPS, More Is Better)
  Schedutil: 536.65 (SE +/- 0.42, N = 3; MIN: 417.73 / MAX: 665.43)
  Ondemand: 625.43 (SE +/- 1.15, N = 3; MIN: 491.48 / MAX: 770.18)
  Performance: 713.18 (SE +/- 1.61, N = 3; MIN: 554.58 / MAX: 886.76)
  (CC) gcc options: -pthread

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
  Ondemand: 5.244 (SE +/- 0.049, N = 3)
  Schedutil: 5.379 (SE +/- 0.025, N = 3)
  Performance: 5.431 (SE +/- 0.023, N = 3)
  (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  Ondemand: 16.03 (SE +/- 0.12, N = 3)
  Schedutil: 15.99 (SE +/- 0.05, N = 3)
  Performance: 15.91 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
  Ondemand: 1545026.54 (SE +/- 12901.14, N = 3)
  Schedutil: 1599356.29 (SE +/- 26190.66, N = 3)
  Performance: 1604323.00 (SE +/- 13020.90, N = 3)
  (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Summer Nature 4K (FPS, More Is Better)
  Schedutil: 246.86 (SE +/- 0.18, N = 3; MIN: 205.59 / MAX: 285.26)
  Ondemand: 254.29 (SE +/- 0.06, N = 3; MIN: 211.06 / MAX: 289.79)
  Performance: 264.98 (SE +/- 0.38, N = 3; MIN: 222.23 / MAX: 302.43)
  (CC) gcc options: -pthread

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 1.2.0, Scene: Memorial (Images / Sec, More Is Better)
  Ondemand: 14.19 (SE +/- 0.03, N = 3)
  Performance: 14.23 (SE +/- 0.04, N = 3)
  Schedutil: 14.26 (SE +/- 0.02, N = 3)

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  Ondemand: 13.77 (SE +/- 0.02, N = 3)
  Schedutil: 13.75 (SE +/- 0.04, N = 3)
  Performance: 13.71 (SE +/- 0.02, N = 3)
  (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  Performance: 13.33 (SE +/- 0.01, N = 3)
  Schedutil: 13.25 (SE +/- 0.01, N = 3)
  Ondemand: 13.24 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O2 -lOpenCL

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
  Ondemand: 46.86 (SE +/- 0.14, N = 3)
  Schedutil: 47.63 (SE +/- 0.05, N = 3)
  Performance: 47.80 (SE +/- 0.02, N = 3)
  (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Schedutil: 55.07 (SE +/- 0.34, N = 3)
  Ondemand: 65.47 (SE +/- 0.15, N = 3)
  Performance: 68.48 (SE +/- 0.12, N = 3)
  (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, More Is Better)
  Ondemand: 41.02 (SE +/- 0.11, N = 3)
  Performance: 42.82 (SE +/- 0.12, N = 3)
  Schedutil: 43.04 (SE +/- 0.22, N = 3)
  (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: NASA Streamlines - Renderer: SciVis (FPS, More Is Better)
  Ondemand: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25 / MAX: 34.48)
  Schedutil: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25)
  Performance: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25 / MAX: 34.48)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
  Schedutil: 86.89 (SE +/- 0.21, N = 3)
  Ondemand: 87.16 (SE +/- 0.29, N = 3)
  Performance: 92.22 (SE +/- 0.08, N = 3)
  (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Summer Nature 1080p (FPS, More Is Better)
  Schedutil: 488.95 (SE +/- 1.06, N = 3; MIN: 390.53 / MAX: 532.25)
  Ondemand: 592.34 (SE +/- 2.45, N = 3; MIN: 443.85 / MAX: 644.65)
  Performance: 668.50 (SE +/- 0.74, N = 3; MIN: 502.68 / MAX: 730.8)
  (CC) gcc options: -pthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
  Schedutil: 153.01 (SE +/- 0.28, N = 3)
  Ondemand: 155.97 (SE +/- 0.36, N = 3)
  Performance: 161.97 (SE +/- 0.32, N = 3)
  (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

x264 2019-12-17, H.264 Video Encoding (Frames Per Second, More Is Better)
  Schedutil: 172.60 (SE +/- 1.39, N = 3)
  Ondemand: 172.64 (SE +/- 1.12, N = 3)
  Performance: 176.54 (SE +/- 1.40, N = 3)
  (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize