EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

patch testing by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102055-HA-EPYC7F52L26
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
Ondemand
February 04 2021
  2 Hours, 34 Minutes
Schedutil
February 04 2021
  2 Hours, 36 Minutes
Performance
February 05 2021
  2 Hours, 32 Minutes
Invert Behavior (Only Show Selected Data)
  2 Hours, 34 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionOndemandSchedutilPerformanceAMD EPYC 7F52 16-Core @ 3.91GHz (16 Cores / 32 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse8 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN280GB INTEL SSDPE21D280GAASPEEDVE228Ubuntu 20.045.11.0-rc6-phx (x86_64) 20210203GNOME Shell 3.36.1X Server 1.20.7aspeedGCC 9.3.0ext41920x10801024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Ondemand: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034- Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Python Details- Python 2.7.18rc1 + Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

OndemandSchedutilPerformanceResult OverviewPhoronix Test Suite100%107%115%122%129%dav1dTimed GDB GNU Debugger CompilationRAR Compressionx265SVT-AV1KvazaarOpenVKLRedisx264CLOMPLuxCoreRenderGraphicsMagickTimed Godot Game Engine CompilationOCRMyPDFRodiniaPennantHuginIntel Open Image DenoiseOSPrayBlenderTensorFlow LiteJohn The Ripper

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11dav1d: Chimera 1080p 10-bitdav1d: Summer Nature 1080pdav1d: Chimera 1080px265: Bosphorus 1080pbuild-gdb: Time To Compilecompress-rar: Linux Source Tree Archiving To RARgraphics-magick: HWB Color Spacedav1d: Summer Nature 4Kkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastsvt-av1: Enc Mode 8 - 1080pkvazaar: Bosphorus 4K - Very Fastredis: GETsvt-av1: Enc Mode 4 - 1080px265: Bosphorus 4Kopenvkl: vklBenchmarkluxcorerender: Rainbow Colors and Prismx264: H.264 Video Encodingrodinia: OpenMP HotSpot3Dclomp: Static OMP Speedupkvazaar: Bosphorus 4K - Ultra Fastgraphics-magick: Resizinggraphics-magick: Rotatebuild-godot: Time To Compileocrmypdf: Processing 60 Page PDF Documentrodinia: OpenMP Leukocyteluxcorerender: DLSCredis: SETpennant: leblancbigtensorflow-lite: NASNet Mobilerodinia: OpenMP CFD Solvergraphics-magick: Swirlospray: San Miguel - Path Traceroidn: Memorialhugin: Panorama Photo Assistant + Stitching Timerodinia: OpenMP Streamclustergraphics-magick: Sharpenrodinia: OpenMP LavaMDospray: NASA Streamlines - Path Tracerpennant: sedovbiggraphics-magick: Enhancedgraphics-magick: Noise-Gaussianblender: BMW27 - CPU-Onlyblender: Barbershop - CPU-Onlytensorflow-lite: Mobilenet Quanttensorflow-lite: SqueezeNetjohn-the-ripper: MD5john-the-ripper: Blowfishtensorflow-lite: Mobilenet Floattensorflow-lite: Inception V4tensorflow-lite: Inception ResNet V2ospray: NASA Streamlines - SciVisospray: San Miguel - SciVisOndemandSchedutilPerformance119.13592.34625.4365.4795.49274.9831271254.2987.16155.9741.02325.351545026.545.24421.652193.48172.6490.04249.746.86158263884.47219.54689.2693.151308179.8316.0296712687813.2439221.8614.1950.65413.769236126.3897.0925.6107437642183.66357.5770655.110726017196672629869039.31504413135144033.3324.3992.21488.95536.6555.0789.60676.9211121246.8686.89153.0143.04026.271599356.295.37921.052173.57172.6091.31050.347.63158763983.31319.48390.4313.191313860.0315.9934712598113.2479231.8714.2650.46413.754236125.9777.1125.5967037642183.57357.4970760.610729417210002631069012.91503093135065333.3324.39131.74668.50713.1868.4883.70967.6761239264.9892.22161.9742.82326.391604323.005.43121.502233.51176.5492.08750.747.80160962983.16419.28189.7093.181318380.4215.9098012620313.3319281.8714.2350.40813.705237125.8817.1125.5425737742083.73356.9070739.110739017213332632368976.91503297135066733.3324.39OpenBenchmarking.org

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitSchedutilOndemandPerformance306090120150SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.09, N = 392.21119.13131.74MIN: 59.88 / MAX: 204.44MIN: 78.3 / MAX: 259.88MIN: 85.18 / MAX: 277.621. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080pSchedutilOndemandPerformance140280420560700SE +/- 1.06, N = 3SE +/- 2.45, N = 3SE +/- 0.74, N = 3488.95592.34668.50MIN: 390.53 / MAX: 532.25MIN: 443.85 / MAX: 644.65MIN: 502.68 / MAX: 730.81. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080pSchedutilOndemandPerformance150300450600750SE +/- 0.42, N = 3SE +/- 1.15, N = 3SE +/- 1.61, N = 3536.65625.43713.18MIN: 417.73 / MAX: 665.43MIN: 491.48 / MAX: 770.18MIN: 554.58 / MAX: 886.761. (CC) gcc options: -pthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pSchedutilOndemandPerformance1530456075SE +/- 0.34, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 355.0765.4768.481. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileOndemandSchedutilPerformance20406080100SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 395.4989.6183.71

RAR Compression

This test measures the time needed to archive/compress two copies of the Linux 4.13 kernel source tree using RAR/WinRAR compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRAR Compression 5.6.1Linux Source Tree Archiving To RARSchedutilOndemandPerformance20406080100SE +/- 0.41, N = 3SE +/- 0.18, N = 3SE +/- 0.24, N = 376.9274.9867.68

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceSchedutilPerformanceOndemand30060090012001500SE +/- 2.40, N = 3SE +/- 3.67, N = 3SE +/- 2.52, N = 31121123912711. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KSchedutilOndemandPerformance60120180240300SE +/- 0.18, N = 3SE +/- 0.06, N = 3SE +/- 0.38, N = 3246.86254.29264.98MIN: 205.59 / MAX: 285.26MIN: 211.06 / MAX: 289.79MIN: 222.23 / MAX: 302.431. (CC) gcc options: -pthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastSchedutilOndemandPerformance20406080100SE +/- 0.21, N = 3SE +/- 0.29, N = 3SE +/- 0.08, N = 386.8987.1692.221. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastSchedutilOndemandPerformance4080120160200SE +/- 0.28, N = 3SE +/- 0.36, N = 3SE +/- 0.32, N = 3153.01155.97161.971. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pOndemandPerformanceSchedutil1020304050SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 341.0242.8243.041. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastOndemandSchedutilPerformance612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 325.3526.2726.391. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETOndemandSchedutilPerformance300K600K900K1200K1500KSE +/- 12901.14, N = 3SE +/- 26190.66, N = 3SE +/- 13020.90, N = 31545026.541599356.291604323.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pOndemandSchedutilPerformance1.2222.4443.6664.8886.11SE +/- 0.049, N = 3SE +/- 0.025, N = 3SE +/- 0.023, N = 35.2445.3795.4311. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KSchedutilPerformanceOndemand510152025SE +/- 0.04, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 321.0521.5021.651. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkSchedutilOndemandPerformance50100150200250SE +/- 0.88, N = 3SE +/- 0.58, N = 3217219223MIN: 1 / MAX: 786MIN: 1 / MAX: 776MIN: 1 / MAX: 785

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismOndemandPerformanceSchedutil0.80331.60662.40993.21324.0165SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.483.513.57MIN: 3.44 / MAX: 3.5MIN: 3.48 / MAX: 3.55MIN: 3.48 / MAX: 3.6

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingSchedutilOndemandPerformance4080120160200SE +/- 1.39, N = 3SE +/- 1.12, N = 3SE +/- 1.40, N = 3172.60172.64176.541. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DPerformanceSchedutilOndemand20406080100SE +/- 0.46, N = 3SE +/- 0.24, N = 3SE +/- 0.08, N = 392.0991.3190.041. (CXX) g++ options: -O2 -lOpenCL

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupOndemandSchedutilPerformance1122334455SE +/- 0.25, N = 3SE +/- 0.33, N = 3SE +/- 0.43, N = 349.750.350.71. (CC) gcc options: -fopenmp -O3 -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastOndemandSchedutilPerformance1122334455SE +/- 0.14, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 346.8647.6347.801. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingOndemandSchedutilPerformance30060090012001500SE +/- 5.49, N = 3SE +/- 18.52, N = 3SE +/- 1.20, N = 31582158716091. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotatePerformanceOndemandSchedutil140280420560700SE +/- 4.18, N = 3SE +/- 4.70, N = 3SE +/- 4.18, N = 36296386391. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileOndemandSchedutilPerformance20406080100SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 384.4783.3183.16

OCRMyPDF

OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 9.6.0+dfsgProcessing 60 Page PDF DocumentOndemandSchedutilPerformance510152025SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 319.5519.4819.28

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteSchedutilPerformanceOndemand20406080100SE +/- 0.33, N = 3SE +/- 0.29, N = 3SE +/- 0.58, N = 390.4389.7189.271. (CXX) g++ options: -O2 -lOpenCL

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCOndemandPerformanceSchedutil0.71781.43562.15342.87123.589SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 33.153.183.19MIN: 3.06 / MAX: 3.29MIN: 3.05 / MAX: 3.39MIN: 3.04 / MAX: 3.41

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETOndemandSchedutilPerformance300K600K900K1200K1500KSE +/- 3347.62, N = 3SE +/- 16949.35, N = 4SE +/- 17244.90, N = 31308179.831313860.031318380.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigOndemandSchedutilPerformance48121620SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 316.0315.9915.911. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileOndemandPerformanceSchedutil30K60K90K120K150KSE +/- 136.25, N = 3SE +/- 287.23, N = 3SE +/- 40.55, N = 3126878126203125981

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverPerformanceSchedutilOndemand3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.3313.2513.241. (CXX) g++ options: -O2 -lOpenCL

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlOndemandSchedutilPerformance2004006008001000SE +/- 0.88, N = 39229239281. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerOndemandSchedutilPerformance0.42080.84161.26241.68322.104SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.861.871.87MIN: 1.85 / MAX: 1.88MIN: 1.85 / MAX: 1.88MIN: 1.86 / MAX: 1.88

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialOndemandPerformanceSchedutil48121620SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 314.1914.2314.26

Hugin

Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterHuginPanorama Photo Assistant + Stitching TimeOndemandSchedutilPerformance1122334455SE +/- 0.33, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 350.6550.4650.41

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterOndemandSchedutilPerformance48121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 313.7713.7513.711. (CXX) g++ options: -O2 -lOpenCL

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenOndemandSchedutilPerformance501001502002502362362371. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDOndemandSchedutilPerformance306090120150SE +/- 0.25, N = 3SE +/- 0.27, N = 3SE +/- 0.28, N = 3126.39125.98125.881. (CXX) g++ options: -O2 -lOpenCL

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerOndemandSchedutilPerformance246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.097.117.11MIN: 6.99 / MAX: 7.19MIN: 6.99 / MAX: 7.19MIN: 6.99 / MAX: 7.25

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigOndemandSchedutilPerformance612182430SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 325.6125.6025.541. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedOndemandSchedutilPerformance80160240320400SE +/- 0.33, N = 33763763771. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianPerformanceOndemandSchedutil90180270360450SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 34204214211. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-OnlyPerformanceOndemandSchedutil20406080100SE +/- 0.50, N = 3SE +/- 0.38, N = 3SE +/- 0.25, N = 383.7383.6683.57

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlyOndemandSchedutilPerformance80160240320400SE +/- 0.51, N = 3SE +/- 0.45, N = 3SE +/- 0.68, N = 3357.57357.49356.90

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantSchedutilPerformanceOndemand15K30K45K60K75KSE +/- 30.64, N = 3SE +/- 30.62, N = 3SE +/- 45.84, N = 370760.670739.170655.1

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetPerformanceSchedutilOndemand20K40K60K80K100KSE +/- 43.47, N = 3SE +/- 80.49, N = 3SE +/- 84.90, N = 3107390107294107260

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5OndemandSchedutilPerformance400K800K1200K1600K2000KSE +/- 3179.80, N = 3SE +/- 3000.00, N = 3SE +/- 2962.73, N = 31719667172100017213331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishOndemandSchedutilPerformance6K12K18K24K30KSE +/- 22.67, N = 3SE +/- 22.45, N = 3SE +/- 23.97, N = 32629826310263231. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatOndemandSchedutilPerformance15K30K45K60K75KSE +/- 42.21, N = 3SE +/- 21.41, N = 3SE +/- 36.63, N = 369039.369012.968976.9

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4OndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 385.24, N = 3SE +/- 236.24, N = 3SE +/- 262.95, N = 3150441315032971503093

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2OndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 365.29, N = 3SE +/- 483.75, N = 3SE +/- 394.05, N = 3135144013506671350653

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisOndemandSchedutilPerformance816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 333.3333.3333.33MIN: 31.25 / MAX: 34.48MIN: 31.25MIN: 31.25 / MAX: 34.48

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisOndemandSchedutilPerformance612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 324.3924.3924.39MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32