Linux 4.7 CPUFreq Schedutil Testing

Linux 4.7 kernel benchmarking. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1605180-HA-LINUX47CP71&rdt&grs.

Linux 4.7 CPUFreq Schedutil TestingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State PerformanceIntel Xeon E5-2687W v3 @ 3.10GHz (20 Cores)MSI X99S SLI PLUS (MS-7885) v1.0Intel Xeon E7 v3/Xeon16384MBPNY CS1211 120GB + 80GB INTEL SSDSCKGW08AMD FirePro V7900 2048MBRealtek ALC892ASUS PB278Intel ConnectionUbuntu 16.044.6.0-phx-schedutil (x86_64)Unity 7.4.0X Server 1.18.3modesetting 1.18.34.1 Mesa 11.2.0 Gallium 0.4GCC 5.3.1 20160413ext42560x1440Intel Xeon E5-2687W v3 @ 3.50GHz (20 Cores)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- CPUFreq Schedutil: Scaling Governor: acpi-cpufreq schedutil- CPUFreq Ondemand: Scaling Governor: acpi-cpufreq ondemand- CPUFreq Conservative: Scaling Governor: acpi-cpufreq conservative- CPUFreq: Powersave: Scaling Governor: acpi-cpufreq powersave- CPUFreq Performance: Scaling Governor: acpi-cpufreq performance- P-State Powersave: Scaling Governor: intel_pstate powersave- P-State Performance: Scaling Governor: intel_pstate performanceGraphics Details- EXA

Linux 4.7 CPUFreq Schedutil Testingnpb: EP.Bhpcc: G-HPLjohn-the-ripper: Blowfishencode-mp3: WAV To MP3graphics-magick: Resizingffte: N=64, 1D Complex FFT Routinehimeno: Poisson Pressure Solverlammps: Rhodopsin Proteinc-ray: Total Timeencode-flac: WAV To FLACx264: H.264 Video Encodingbuild-linux-kernel: Time To Compileopm-git: Upscale-Relperm - 16apache: Static Web Page Servingmultichase: 256MB Array, 256 Byte Strideredis: GETmultichase: 1GB Array, 256 Byte Stride, 4 Threadsclomp: Static OMP Speeduppgbench: Buffer Test - Heavy Contention - Read Writetesseract: 2560 x 1440xonotic: 2560 x 1440 - LowCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance368.29103.823671413613.891784965.761732.8140.3010.777.69350.6054.3113.3920379.4665.24606865.1474.956.405273.8063.6659.38370.18103.778001411212.601955276.471772.8340.0610.776.92309.9554.4713.5022338.8868.19615364.5872.746.405315.3564.3459.12360.80103.937001412612.861635232.431749.2640.1510.907.05316.8756.2513.7621548.4164.77526209.7474.356.594695.5863.7859.36139.0940.40830528137.55701814.06624.45113.4828.7118.92142.94135.3526.6113062.79106.10379283.76110.976.954905.5263.4459.33370.35103.868001411712.971984991.631748.5441.0010.736.90369.2752.4313.5624517.7068.08609017.6271.936.425327.9663.6759.12111.9731.57813458437.29681814.70625.02113.4829.5718.95136.02136.3313282.16107.03387649.31112.776.114852.1063.7159.65105.23104.851671413637.41684988.74623.92112.4429.5918.93134.53135.8913077.7669.46387296.9473.166.424832.1963.2959.26OpenBenchmarking.org

NAS Parallel Benchmarks

Test / Class: EP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: EP.BCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance80160240320400SE +/- 2.78, N = 3SE +/- 1.48, N = 3SE +/- 5.17, N = 3SE +/- 0.74, N = 3SE +/- 1.54, N = 3SE +/- 8.66, N = 3SE +/- 8.12, N = 3368.29370.18360.80139.09370.35111.97105.231. (F9X) gfortran options: -fopenmp

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: G-HPLCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance20406080100SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.40, N = 3SE +/- 0.03, N = 3103.82103.78103.9440.41103.8731.58104.851. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance3K6K9K12K15KSE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 9.67, N = 3SE +/- 6.94, N = 3SE +/- 9.67, N = 3SE +/- 411.91, N = 3SE +/- 0.00, N = 31413614112141265281141174584141361. (CC) gcc options: -fopenmp -lcrypt

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3CPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance918273645SE +/- 0.54, N = 5SE +/- 0.02, N = 5SE +/- 0.14, N = 5SE +/- 0.17, N = 5SE +/- 0.04, N = 5SE +/- 0.21, N = 5SE +/- 0.12, N = 513.8912.6012.8637.5512.9737.2937.411. (CC) gcc options: -O3 -ffast-math -funroll-loops -pipe -lncurses -lm

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: ResizingCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance4080120160200SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31781951637019868681. (CC) gcc options: -fopenmp -O2 -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread

FFTE

Test: N=64, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 5.0Test: N=64, 1D Complex FFT RoutineCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance11002200330044005500SE +/- 82.32, N = 3SE +/- 3.96, N = 3SE +/- 46.09, N = 3SE +/- 0.04, N = 3SE +/- 161.87, N = 3SE +/- 0.06, N = 3SE +/- 146.87, N = 34965.765276.475232.431814.064991.631814.704988.741. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance400800120016002000SE +/- 7.34, N = 3SE +/- 2.65, N = 3SE +/- 9.47, N = 3SE +/- 0.20, N = 3SE +/- 7.56, N = 3SE +/- 0.81, N = 3SE +/- 0.44, N = 31732.811772.831749.26624.451748.54625.02623.921. (CC) gcc options: -O3 -mavx2

LAMMPS Molecular Dynamics Simulator

Test: Rhodopsin Protein

OpenBenchmarking.orgLoop Time, Fewer Is BetterLAMMPS Molecular Dynamics Simulator 1.0Test: Rhodopsin ProteinCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance306090120150SE +/- 0.33, N = 3SE +/- 0.27, N = 3SE +/- 0.12, N = 3SE +/- 0.26, N = 3SE +/- 0.17, N = 3SE +/- 0.21, N = 3SE +/- 0.32, N = 340.3040.0640.15113.4841.00113.48112.441. (CXX) g++ options: -lfftw -lmpich

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance714212835SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 310.7710.7710.9028.7110.7329.5729.591. (CC) gcc options: -lm -lpthread -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLACCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance510152025SE +/- 0.46, N = 5SE +/- 0.01, N = 5SE +/- 0.07, N = 5SE +/- 0.01, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 57.696.927.0518.926.9018.9518.931. (CXX) g++ options: -O2 -fvisibility=hidden -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2015-11-02H.264 Video EncodingCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance80160240320400SE +/- 2.84, N = 5SE +/- 1.38, N = 5SE +/- 27.47, N = 5SE +/- 0.68, N = 5SE +/- 2.51, N = 5SE +/- 0.57, N = 5SE +/- 0.56, N = 5350.60309.95316.87142.94369.27136.02134.531. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.3Time To CompileCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance306090120150SE +/- 1.35, N = 3SE +/- 0.88, N = 3SE +/- 0.74, N = 3SE +/- 2.43, N = 3SE +/- 0.80, N = 3SE +/- 1.92, N = 3SE +/- 2.42, N = 354.3154.4756.25135.3552.43136.33135.89

Open Porous Media Git

OPM Benchmark: Upscale-Relperm - Threads: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen Porous Media GitOPM Benchmark: Upscale-Relperm - Threads: 16CPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq Performance612182430SE +/- 0.20, N = 3SE +/- 0.22, N = 3SE +/- 0.13, N = 3SE +/- 0.21, N = 3SE +/- 0.20, N = 313.3913.5013.7626.6113.561. Build Time Tue May 17 12:05:40 EDT 2016;

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance5K10K15K20K25KSE +/- 82.65, N = 3SE +/- 117.89, N = 3SE +/- 982.79, N = 3SE +/- 9.66, N = 3SE +/- 183.42, N = 3SE +/- 31.83, N = 3SE +/- 73.74, N = 320379.4622338.8821548.4113062.7924517.7013282.1613077.761. (CC) gcc options: -shared -fPIC -O2 -pthread

Multichase Pointer Chaser

Test: 256MB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 256MB Array, 256 Byte StrideCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance20406080100SE +/- 2.29, N = 3SE +/- 1.23, N = 3SE +/- 2.80, N = 3SE +/- 0.81, N = 3SE +/- 1.02, N = 3SE +/- 0.27, N = 3SE +/- 0.05, N = 365.2468.1964.77106.1068.08107.0369.461. (CC) gcc options: -O2 -static -pthread -lrt

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance130K260K390K520K650KSE +/- 4579.25, N = 3SE +/- 10591.07, N = 3SE +/- 9602.66, N = 3SE +/- 3493.78, N = 3SE +/- 1131.35, N = 3SE +/- 674.73, N = 3SE +/- 229.02, N = 3606865.14615364.58526209.74379283.76609017.62387649.31387296.941. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl -std=gnu99 -pipe -g3 -O3 -funroll-loops

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 4 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 4 ThreadsCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance306090120150SE +/- 0.13, N = 3SE +/- 0.92, N = 3SE +/- 0.27, N = 3SE +/- 1.09, N = 3SE +/- 1.60, N = 3SE +/- 0.12, N = 3SE +/- 0.64, N = 374.9572.7474.35110.9771.93112.7773.161. (CC) gcc options: -O2 -static -pthread -lrt

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 3.3Static OMP SpeedupCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance246810SE +/- 0.08, N = 5SE +/- 0.10, N = 5SE +/- 0.23, N = 5SE +/- 0.05, N = 5SE +/- 0.10, N = 5SE +/- 0.85, N = 5SE +/- 0.11, N = 56.406.406.596.956.426.116.421. (CC) gcc options: --openmp -O3 -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Heavy Contention - Mode: Read WriteCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance11002200330044005500SE +/- 50.60, N = 3SE +/- 8.74, N = 3SE +/- 205.11, N = 3SE +/- 5.76, N = 3SE +/- 302.93, N = 3SE +/- 9.49, N = 3SE +/- 4.75, N = 35273.805315.354695.584905.525327.964852.104832.191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Tesseract

Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is BetterTesseract 2014-05-12Resolution: 2560 x 1440CPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance1428425670SE +/- 0.23, N = 3SE +/- 0.50, N = 3SE +/- 0.31, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.26, N = 3SE +/- 0.16, N = 363.6664.3463.7863.4463.6763.7163.29

Xonotic

Resolution: 2560 x 1440 - Effects Quality: Low

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8Resolution: 2560 x 1440 - Effects Quality: LowCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State PowersaveP-State Performance1326395265SE +/- 0.14, N = 3SE +/- 0.00, N = 3SE +/- 0.18, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 359.3859.1259.3659.3359.1259.6559.26MIN: 42 / MAX: 61MIN: 41 / MAX: 60MIN: 42 / MAX: 61MIN: 41 / MAX: 61MIN: 42 / MAX: 60MIN: 42 / MAX: 61MIN: 42 / MAX: 62

CPU Frequency (CPU0) Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgMegahertzCPU Frequency (CPU0) MonitorPhoronix Test Suite System MonitoringCPUFreq SchedutilCPUFreq OndemandCPUFreq ConservativeCPUFreq: PowersaveCPUFreq PerformanceP-State Performance6001200180024003000Min: 1200 / Avg: 2475.16 / Max: 3101Min: 1200 / Avg: 2318.98 / Max: 3101Min: 1200 / Avg: 2249.66 / Max: 3101Min: 1200 / Avg: 1200 / Max: 1200Min: 3101 / Avg: 3101 / Max: 3101Min: 0 / Avg: 1512.39 / Max: 3500.93


Phoronix Test Suite v10.8.4