EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

patch testing by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102055-HA-EPYC7F52L26
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
Ondemand
February 04 2021
  2 Hours, 34 Minutes
Schedutil
February 04 2021
  2 Hours, 36 Minutes
Performance
February 05 2021
  2 Hours, 32 Minutes
Invert Behavior (Only Show Selected Data)
  2 Hours, 34 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionOndemandSchedutilPerformanceAMD EPYC 7F52 16-Core @ 3.91GHz (16 Cores / 32 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse8 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN280GB INTEL SSDPE21D280GAASPEEDVE228Ubuntu 20.045.11.0-rc6-phx (x86_64) 20210203GNOME Shell 3.36.1X Server 1.20.7aspeedGCC 9.3.0ext41920x10801024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Ondemand: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034- Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Python Details- Python 2.7.18rc1 + Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

OndemandSchedutilPerformanceResult OverviewPhoronix Test Suite100%107%115%122%129%dav1dTimed GDB GNU Debugger CompilationRAR Compressionx265SVT-AV1KvazaarOpenVKLRedisx264CLOMPLuxCoreRenderGraphicsMagickTimed Godot Game Engine CompilationOCRMyPDFRodiniaPennantHuginIntel Open Image DenoiseOSPrayBlenderTensorFlow LiteJohn The Ripper

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11blender: BMW27 - CPU-Onlyblender: Barbershop - CPU-Onlyclomp: Static OMP Speedupdav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacehugin: Panorama Photo Assistant + Stitching Timeoidn: Memorialjohn-the-ripper: Blowfishjohn-the-ripper: MD5kvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastluxcorerender: DLSCluxcorerender: Rainbow Colors and Prismocrmypdf: Processing 60 Page PDF Documentopenvkl: vklBenchmarkospray: San Miguel - SciVisospray: San Miguel - Path Tracerospray: NASA Streamlines - SciVisospray: NASA Streamlines - Path Tracerpennant: sedovbigpennant: leblancbigcompress-rar: Linux Source Tree Archiving To RARredis: GETredis: SETrodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclustersvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080ptensorflow-lite: SqueezeNettensorflow-lite: Inception V4tensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2build-gdb: Time To Compilebuild-godot: Time To Compilex264: H.264 Video Encodingx265: Bosphorus 4Kx265: Bosphorus 1080pOndemandSchedutilPerformance83.66357.5749.7625.43254.29592.34119.139226382363761582421127150.65414.1926298171966725.3546.8687.16155.973.153.4819.54621924.391.8633.337.0925.6107416.0296774.9831545026.541308179.83126.38990.04289.26913.24313.7695.24441.023107260150441312687869039.370655.1135144095.49284.472172.6421.6565.4783.57357.4950.3536.65246.86488.9592.219236392363761587421112150.46414.2626310172100026.2747.6386.89153.013.193.5719.48321724.391.8733.337.1125.5967015.9934776.9211599356.291313860.03125.97791.31090.43113.24713.7545.37943.040107294150309312598169012.970760.6135065389.60683.313172.6021.0555.0783.73356.9050.7713.18264.98668.50131.749286292373771609420123950.40814.2326323172133326.3947.8092.22161.973.183.5119.28122324.391.8733.337.1125.5425715.9098067.6761604323.001318380.42125.88192.08789.70913.33113.7055.43142.823107390150329712620368976.970739.1135066783.70983.164176.5421.5068.48OpenBenchmarking.org

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-OnlySchedutilPerformanceOndemand20406080100SE +/- 0.25, N = 3SE +/- 0.50, N = 3SE +/- 0.38, N = 383.5783.7383.66

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlySchedutilPerformanceOndemand80160240320400SE +/- 0.45, N = 3SE +/- 0.68, N = 3SE +/- 0.51, N = 3357.49356.90357.57

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupSchedutilPerformanceOndemand1122334455SE +/- 0.33, N = 3SE +/- 0.43, N = 3SE +/- 0.25, N = 350.350.749.71. (CC) gcc options: -fopenmp -O3 -lm

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080pSchedutilPerformanceOndemand150300450600750SE +/- 0.42, N = 3SE +/- 1.61, N = 3SE +/- 1.15, N = 3536.65713.18625.43MIN: 417.73 / MAX: 665.43MIN: 554.58 / MAX: 886.76MIN: 491.48 / MAX: 770.181. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KSchedutilPerformanceOndemand60120180240300SE +/- 0.18, N = 3SE +/- 0.38, N = 3SE +/- 0.06, N = 3246.86264.98254.29MIN: 205.59 / MAX: 285.26MIN: 222.23 / MAX: 302.43MIN: 211.06 / MAX: 289.791. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080pSchedutilPerformanceOndemand140280420560700SE +/- 1.06, N = 3SE +/- 0.74, N = 3SE +/- 2.45, N = 3488.95668.50592.34MIN: 390.53 / MAX: 532.25MIN: 502.68 / MAX: 730.8MIN: 443.85 / MAX: 644.651. (CC) gcc options: -pthread

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitSchedutilPerformanceOndemand306090120150SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 392.21131.74119.13MIN: 59.88 / MAX: 204.44MIN: 85.18 / MAX: 277.62MIN: 78.3 / MAX: 259.881. (CC) gcc options: -pthread

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlSchedutilPerformanceOndemand2004006008001000SE +/- 0.88, N = 39239289221. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateSchedutilPerformanceOndemand140280420560700SE +/- 4.18, N = 3SE +/- 4.18, N = 3SE +/- 4.70, N = 36396296381. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenSchedutilPerformanceOndemand501001502002502362372361. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedSchedutilPerformanceOndemand80160240320400SE +/- 0.33, N = 33763773761. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingSchedutilPerformanceOndemand30060090012001500SE +/- 18.52, N = 3SE +/- 1.20, N = 3SE +/- 5.49, N = 31587160915821. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianSchedutilPerformanceOndemand90180270360450SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 34214204211. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceSchedutilPerformanceOndemand30060090012001500SE +/- 2.40, N = 3SE +/- 3.67, N = 3SE +/- 2.52, N = 31121123912711. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Hugin

Hugin is an open-source, cross-platform panorama photo stitcher software package. This test profile times how long it takes to run the assistant and panorama photo stitching on a set of images. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterHuginPanorama Photo Assistant + Stitching TimeSchedutilPerformanceOndemand1122334455SE +/- 0.17, N = 3SE +/- 0.29, N = 3SE +/- 0.33, N = 350.4650.4150.65

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialSchedutilPerformanceOndemand48121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 314.2614.2314.19

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishSchedutilPerformanceOndemand6K12K18K24K30KSE +/- 22.45, N = 3SE +/- 23.97, N = 3SE +/- 22.67, N = 32631026323262981. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5SchedutilPerformanceOndemand400K800K1200K1600K2000KSE +/- 3000.00, N = 3SE +/- 2962.73, N = 3SE +/- 3179.80, N = 31721000172133317196671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastSchedutilPerformanceOndemand612182430SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 326.2726.3925.351. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastSchedutilPerformanceOndemand1122334455SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 347.6347.8046.861. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastSchedutilPerformanceOndemand20406080100SE +/- 0.21, N = 3SE +/- 0.08, N = 3SE +/- 0.29, N = 386.8992.2287.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastSchedutilPerformanceOndemand4080120160200SE +/- 0.28, N = 3SE +/- 0.32, N = 3SE +/- 0.36, N = 3153.01161.97155.971. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCSchedutilPerformanceOndemand0.71781.43562.15342.87123.589SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 33.193.183.15MIN: 3.04 / MAX: 3.41MIN: 3.05 / MAX: 3.39MIN: 3.06 / MAX: 3.29

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismSchedutilPerformanceOndemand0.80331.60662.40993.21324.0165SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.573.513.48MIN: 3.48 / MAX: 3.6MIN: 3.48 / MAX: 3.55MIN: 3.44 / MAX: 3.5

OCRMyPDF

OCRMyPDF is an optical character recognition (OCR) text layer to scanned PDF files, producing new PDFs with the text now selectable/searchable/copy-paste capable. OCRMyPDF leverages the Tesseract OCR engine and is written in Python. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 9.6.0+dfsgProcessing 60 Page PDF DocumentSchedutilPerformanceOndemand510152025SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 319.4819.2819.55

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkSchedutilPerformanceOndemand50100150200250SE +/- 0.88, N = 3SE +/- 0.58, N = 3217223219MIN: 1 / MAX: 786MIN: 1 / MAX: 785MIN: 1 / MAX: 776

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidenlity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisSchedutilPerformanceOndemand612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 324.3924.3924.39MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerSchedutilPerformanceOndemand0.42080.84161.26241.68322.104SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.871.871.86MIN: 1.85 / MAX: 1.88MIN: 1.86 / MAX: 1.88MIN: 1.85 / MAX: 1.88

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisSchedutilPerformanceOndemand816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 333.3333.3333.33MIN: 31.25MIN: 31.25 / MAX: 34.48MIN: 31.25 / MAX: 34.48

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerSchedutilPerformanceOndemand246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 37.117.117.09MIN: 6.99 / MAX: 7.19MIN: 6.99 / MAX: 7.25MIN: 6.99 / MAX: 7.19

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigSchedutilPerformanceOndemand612182430SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 325.6025.5425.611. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigSchedutilPerformanceOndemand48121620SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 315.9915.9116.031. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

RAR Compression

This test measures the time needed to archive/compress two copies of the Linux 4.13 kernel source tree using RAR/WinRAR compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRAR Compression 5.6.1Linux Source Tree Archiving To RARSchedutilPerformanceOndemand20406080100SE +/- 0.41, N = 3SE +/- 0.24, N = 3SE +/- 0.18, N = 376.9267.6874.98

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETSchedutilPerformanceOndemand300K600K900K1200K1500KSE +/- 26190.66, N = 3SE +/- 13020.90, N = 3SE +/- 12901.14, N = 31599356.291604323.001545026.541. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETSchedutilPerformanceOndemand300K600K900K1200K1500KSE +/- 16949.35, N = 4SE +/- 17244.90, N = 3SE +/- 3347.62, N = 31313860.031318380.421308179.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDSchedutilPerformanceOndemand306090120150SE +/- 0.27, N = 3SE +/- 0.28, N = 3SE +/- 0.25, N = 3125.98125.88126.391. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DSchedutilPerformanceOndemand20406080100SE +/- 0.24, N = 3SE +/- 0.46, N = 3SE +/- 0.08, N = 391.3192.0990.041. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteSchedutilPerformanceOndemand20406080100SE +/- 0.33, N = 3SE +/- 0.29, N = 3SE +/- 0.58, N = 390.4389.7189.271. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverSchedutilPerformanceOndemand3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.2513.3313.241. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterSchedutilPerformanceOndemand48121620SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 313.7513.7113.771. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pSchedutilPerformanceOndemand1.2222.4443.6664.8886.11SE +/- 0.025, N = 3SE +/- 0.023, N = 3SE +/- 0.049, N = 35.3795.4315.2441. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pSchedutilPerformanceOndemand1020304050SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 343.0442.8241.021. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetSchedutilPerformanceOndemand20K40K60K80K100KSE +/- 80.49, N = 3SE +/- 43.47, N = 3SE +/- 84.90, N = 3107294107390107260

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4SchedutilPerformanceOndemand300K600K900K1200K1500KSE +/- 262.95, N = 3SE +/- 236.24, N = 3SE +/- 385.24, N = 3150309315032971504413

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileSchedutilPerformanceOndemand30K60K90K120K150KSE +/- 40.55, N = 3SE +/- 287.23, N = 3SE +/- 136.25, N = 3125981126203126878

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatSchedutilPerformanceOndemand15K30K45K60K75KSE +/- 21.41, N = 3SE +/- 36.63, N = 3SE +/- 42.21, N = 369012.968976.969039.3

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantSchedutilPerformanceOndemand15K30K45K60K75KSE +/- 30.64, N = 3SE +/- 30.62, N = 3SE +/- 45.84, N = 370760.670739.170655.1

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2SchedutilPerformanceOndemand300K600K900K1200K1500KSE +/- 394.05, N = 3SE +/- 483.75, N = 3SE +/- 365.29, N = 3135065313506671351440

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileSchedutilPerformanceOndemand20406080100SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 389.6183.7195.49

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileSchedutilPerformanceOndemand20406080100SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 383.3183.1684.47

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingSchedutilPerformanceOndemand4080120160200SE +/- 1.39, N = 3SE +/- 1.40, N = 3SE +/- 1.12, N = 3172.60176.54172.641. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KSchedutilPerformanceOndemand510152025SE +/- 0.04, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 321.0521.5021.651. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pSchedutilPerformanceOndemand1530456075SE +/- 0.34, N = 3SE +/- 0.12, N = 3SE +/- 0.15, N = 355.0768.4865.471. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma