EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

patch testing by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2102055-HA-EPYC7F52L26&sro&grr.

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionOndemandSchedutilPerformanceAMD EPYC 7F52 16-Core @ 3.91GHz (16 Cores / 32 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse8 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN280GB INTEL SSDPE21D280GAASPEEDVE228Ubuntu 20.045.11.0-rc6-phx (x86_64) 20210203GNOME Shell 3.36.1X Server 1.20.7aspeedGCC 9.3.0ext41920x10801024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Ondemand: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034- Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034- Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034Python Details- Python 2.7.18rc1 + Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11blender: Barbershop - CPU-Onlyopenvkl: vklBenchmarkospray: San Miguel - Path Tracerrodinia: OpenMP LavaMDdav1d: Chimera 1080p 10-bitrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocytebuild-gdb: Time To Compileblender: BMW27 - CPU-Onlybuild-godot: Time To Compiletensorflow-lite: Inception V4compress-rar: Linux Source Tree Archiving To RARtensorflow-lite: Inception ResNet V2luxcorerender: DLSCluxcorerender: Rainbow Colors and Prismtensorflow-lite: SqueezeNettensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floatjohn-the-ripper: MD5graphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Noise-Gaussiangraphics-magick: Rotategraphics-magick: Resizinggraphics-magick: Swirlgraphics-magick: HWB Color Spacehugin: Panorama Photo Assistant + Stitching Timeclomp: Static OMP Speedupospray: San Miguel - SciVisospray: NASA Streamlines - Path Tracerjohn-the-ripper: Blowfishx265: Bosphorus 4Kpennant: sedovbigkvazaar: Bosphorus 4K - Very Fastredis: SETocrmypdf: Processing 60 Page PDF Documentdav1d: Chimera 1080psvt-av1: Enc Mode 4 - 1080ppennant: leblancbigredis: GETdav1d: Summer Nature 4Koidn: Memorialrodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solverkvazaar: Bosphorus 4K - Ultra Fastx265: Bosphorus 1080psvt-av1: Enc Mode 8 - 1080pospray: NASA Streamlines - SciViskvazaar: Bosphorus 1080p - Very Fastdav1d: Summer Nature 1080pkvazaar: Bosphorus 1080p - Ultra Fastx264: H.264 Video EncodingOndemandSchedutilPerformance357.572191.86126.389119.1390.04289.26995.49283.6684.472150441374.98313514403.153.4810726012687870655.169039.317196672363764216381582922127150.65449.724.397.092629821.6525.6107425.351308179.8319.546625.435.24416.029671545026.54254.2914.1913.76913.24346.8665.4741.02333.3387.16592.34155.97172.64357.492171.87125.97792.2191.31090.43189.60683.5783.313150309376.92113506533.193.5710729412598170760.669012.917210002363764216391587923112150.46450.324.397.112631021.0525.5967026.271313860.0319.483536.655.37915.993471599356.29246.8614.2613.75413.24747.6355.0743.04033.3386.89488.95153.01172.60356.902231.87125.881131.7492.08789.70983.70983.7383.164150329767.67613506673.183.5110739012620370739.168976.917213332373774206291609928123950.40850.724.397.112632321.5025.5425726.391318380.4219.281713.185.43115.909801604323.00264.9814.2313.70513.33147.8068.4842.82333.3392.22668.50161.97176.54OpenBenchmarking.org

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-OnlyOndemandPerformanceSchedutil80160240320400SE +/- 0.51, N = 3SE +/- 0.68, N = 3SE +/- 0.45, N = 3357.57356.90357.49

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkOndemandPerformanceSchedutil50100150200250SE +/- 0.58, N = 3SE +/- 0.88, N = 3219223217MIN: 1 / MAX: 776MIN: 1 / MAX: 785MIN: 1 / MAX: 786

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerOndemandPerformanceSchedutil0.42080.84161.26241.68322.104SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.861.871.87MIN: 1.85 / MAX: 1.88MIN: 1.86 / MAX: 1.88MIN: 1.85 / MAX: 1.88

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDOndemandPerformanceSchedutil306090120150SE +/- 0.25, N = 3SE +/- 0.28, N = 3SE +/- 0.27, N = 3126.39125.88125.981. (CXX) g++ options: -O2 -lOpenCL

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitOndemandPerformanceSchedutil306090120150SE +/- 0.19, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3119.13131.7492.21MIN: 78.3 / MAX: 259.88MIN: 85.18 / MAX: 277.62MIN: 59.88 / MAX: 204.441. (CC) gcc options: -pthread

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DOndemandPerformanceSchedutil20406080100SE +/- 0.08, N = 3SE +/- 0.46, N = 3SE +/- 0.24, N = 390.0492.0991.311. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteOndemandPerformanceSchedutil20406080100SE +/- 0.58, N = 3SE +/- 0.29, N = 3SE +/- 0.33, N = 389.2789.7190.431. (CXX) g++ options: -O2 -lOpenCL

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To CompileOndemandPerformanceSchedutil20406080100SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 395.4983.7189.61

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-OnlyOndemandPerformanceSchedutil20406080100SE +/- 0.38, N = 3SE +/- 0.50, N = 3SE +/- 0.25, N = 383.6683.7383.57

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileOndemandPerformanceSchedutil20406080100SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 384.4783.1683.31

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4OndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 385.24, N = 3SE +/- 236.24, N = 3SE +/- 262.95, N = 3150441315032971503093

RAR Compression

Linux Source Tree Archiving To RAR

OpenBenchmarking.orgSeconds, Fewer Is BetterRAR Compression 5.6.1Linux Source Tree Archiving To RAROndemandPerformanceSchedutil20406080100SE +/- 0.18, N = 3SE +/- 0.24, N = 3SE +/- 0.41, N = 374.9867.6876.92

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2OndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 365.29, N = 3SE +/- 483.75, N = 3SE +/- 394.05, N = 3135144013506671350653

LuxCoreRender

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCOndemandPerformanceSchedutil0.71781.43562.15342.87123.589SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 33.153.183.19MIN: 3.06 / MAX: 3.29MIN: 3.05 / MAX: 3.39MIN: 3.04 / MAX: 3.41

LuxCoreRender

Scene: Rainbow Colors and Prism

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismOndemandPerformanceSchedutil0.80331.60662.40993.21324.0165SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.483.513.57MIN: 3.44 / MAX: 3.5MIN: 3.48 / MAX: 3.55MIN: 3.48 / MAX: 3.6

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetOndemandPerformanceSchedutil20K40K60K80K100KSE +/- 84.90, N = 3SE +/- 43.47, N = 3SE +/- 80.49, N = 3107260107390107294

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileOndemandPerformanceSchedutil30K60K90K120K150KSE +/- 136.25, N = 3SE +/- 287.23, N = 3SE +/- 40.55, N = 3126878126203125981

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantOndemandPerformanceSchedutil15K30K45K60K75KSE +/- 45.84, N = 3SE +/- 30.62, N = 3SE +/- 30.64, N = 370655.170739.170760.6

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatOndemandPerformanceSchedutil15K30K45K60K75KSE +/- 42.21, N = 3SE +/- 36.63, N = 3SE +/- 21.41, N = 369039.368976.969012.9

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5OndemandPerformanceSchedutil400K800K1200K1600K2000KSE +/- 3179.80, N = 3SE +/- 2962.73, N = 3SE +/- 3000.00, N = 31719667172133317210001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenOndemandPerformanceSchedutil501001502002502362372361. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedOndemandPerformanceSchedutil80160240320400SE +/- 0.33, N = 33763773761. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianOndemandPerformanceSchedutil90180270360450SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 34214204211. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateOndemandPerformanceSchedutil140280420560700SE +/- 4.70, N = 3SE +/- 4.18, N = 3SE +/- 4.18, N = 36386296391. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingOndemandPerformanceSchedutil30060090012001500SE +/- 5.49, N = 3SE +/- 1.20, N = 3SE +/- 18.52, N = 31582160915871. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlOndemandPerformanceSchedutil2004006008001000SE +/- 0.88, N = 39229289231. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceOndemandPerformanceSchedutil30060090012001500SE +/- 2.52, N = 3SE +/- 3.67, N = 3SE +/- 2.40, N = 31271123911211. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Hugin

Panorama Photo Assistant + Stitching Time

OpenBenchmarking.orgSeconds, Fewer Is BetterHuginPanorama Photo Assistant + Stitching TimeOndemandPerformanceSchedutil1122334455SE +/- 0.33, N = 3SE +/- 0.29, N = 3SE +/- 0.17, N = 350.6550.4150.46

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupOndemandPerformanceSchedutil1122334455SE +/- 0.25, N = 3SE +/- 0.43, N = 3SE +/- 0.33, N = 349.750.750.31. (CC) gcc options: -fopenmp -O3 -lm

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisOndemandPerformanceSchedutil612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 324.3924.3924.39MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32MIN: 23.26 / MAX: 26.32

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: Path TracerOndemandPerformanceSchedutil246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.097.117.11MIN: 6.99 / MAX: 7.19MIN: 6.99 / MAX: 7.25MIN: 6.99 / MAX: 7.19

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishOndemandPerformanceSchedutil6K12K18K24K30KSE +/- 22.67, N = 3SE +/- 23.97, N = 3SE +/- 22.45, N = 32629826323263101. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KOndemandPerformanceSchedutil510152025SE +/- 0.12, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 321.6521.5021.051. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigOndemandPerformanceSchedutil612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 325.6125.5425.601. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastOndemandPerformanceSchedutil612182430SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 325.3526.3926.271. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETOndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 3347.62, N = 3SE +/- 17244.90, N = 3SE +/- 16949.35, N = 41308179.831318380.421313860.031. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OCRMyPDF

Processing 60 Page PDF Document

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 9.6.0+dfsgProcessing 60 Page PDF DocumentOndemandPerformanceSchedutil510152025SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 319.5519.2819.48

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080pOndemandPerformanceSchedutil150300450600750SE +/- 1.15, N = 3SE +/- 1.61, N = 3SE +/- 0.42, N = 3625.43713.18536.65MIN: 491.48 / MAX: 770.18MIN: 554.58 / MAX: 886.76MIN: 417.73 / MAX: 665.431. (CC) gcc options: -pthread

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pOndemandPerformanceSchedutil1.2222.4443.6664.8886.11SE +/- 0.049, N = 3SE +/- 0.023, N = 3SE +/- 0.025, N = 35.2445.4315.3791. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigOndemandPerformanceSchedutil48121620SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 316.0315.9115.991. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETOndemandPerformanceSchedutil300K600K900K1200K1500KSE +/- 12901.14, N = 3SE +/- 13020.90, N = 3SE +/- 26190.66, N = 31545026.541604323.001599356.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KOndemandPerformanceSchedutil60120180240300SE +/- 0.06, N = 3SE +/- 0.38, N = 3SE +/- 0.18, N = 3254.29264.98246.86MIN: 211.06 / MAX: 289.79MIN: 222.23 / MAX: 302.43MIN: 205.59 / MAX: 285.261. (CC) gcc options: -pthread

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialOndemandPerformanceSchedutil48121620SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 314.1914.2314.26

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterOndemandPerformanceSchedutil48121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 313.7713.7113.751. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverOndemandPerformanceSchedutil3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 313.2413.3313.251. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastOndemandPerformanceSchedutil1122334455SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 346.8647.8047.631. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pOndemandPerformanceSchedutil1530456075SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.34, N = 365.4768.4855.071. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pOndemandPerformanceSchedutil1020304050SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 341.0242.8243.041. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OSPray

Demo: NASA Streamlines - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: NASA Streamlines - Renderer: SciVisOndemandPerformanceSchedutil816243240SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 333.3333.3333.33MIN: 31.25 / MAX: 34.48MIN: 31.25 / MAX: 34.48MIN: 31.25

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastOndemandPerformanceSchedutil20406080100SE +/- 0.29, N = 3SE +/- 0.08, N = 3SE +/- 0.21, N = 387.1692.2286.891. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080pOndemandPerformanceSchedutil140280420560700SE +/- 2.45, N = 3SE +/- 0.74, N = 3SE +/- 1.06, N = 3592.34668.50488.95MIN: 443.85 / MAX: 644.65MIN: 502.68 / MAX: 730.8MIN: 390.53 / MAX: 532.251. (CC) gcc options: -pthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastOndemandPerformanceSchedutil4080120160200SE +/- 0.36, N = 3SE +/- 0.32, N = 3SE +/- 0.28, N = 3155.97161.97153.011. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingOndemandPerformanceSchedutil4080120160200SE +/- 1.12, N = 3SE +/- 1.40, N = 3SE +/- 1.39, N = 3172.64176.54172.601. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.4