Linux 3.12 Kernel IO Scheduler Performance Benchmarks Deadline, CFQ, and Noop I/O scheduler benchmarks from the Linux 3.12 Git kernel using an Intel Core i7 4960X Ivy Bridge Extreme processor on Ubuntu 13.10 base. Benchmarking by Michael Larabel of Phoronix.com. Just some quick tests for now... More on the way :)
HTML result view exported from: https://openbenchmarking.org/result/1311139-UT-AUTOCOMPA33 .
Linux 3.12 Kernel IO Scheduler Performance Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Deadline CFQ Noop bdver2 bdver1 btver2 btver1 barcelona athlon64 Intel Core i5 750 azw-haupt Test-12-11-2013 Intel Core i7-4960X @ 4.00GHz (12 Cores) MSI X79MA-GD45 (MS-7738) v1.0 Intel Xeon E5 v2/Core 8192MB 320GB Seagate ST3320620AS Sapphire AMD Radeon HD 4870 512MB Realtek ALC892 AL2223W Realtek RTL8111/8168/8411 Ubuntu 13.10 3.12.0-999-generic (x86_64) Unity 7.1.1 X Server 1.14.3 radeon 7.2.0 3.1 Mesa 9.3.0-devel (git-e166a58) Gallium 0.4 GCC 4.8 + LLVM 3.2 ext4 1680x1050 AMD A10-6800K APU @ 4.70GHz (4 Cores) MSI FM2-A85XA-G65 (MS-7793) v1.0 AMD Family 15h 7168MB 64GB OCZ AGILITY AMD Radeon HD 8670D 768MB (844/1066MHz) ATI R6xx HDMI SyncMaster Realtek RTL8111/8168 3.9.0-6-generic (x86_64) Xfce 4.10 X Server 1.13.3 fglrx 13.10.10 4.2.12337 GCC 4.8 2560x1600 Intel Core i5 750 @ 2.67GHz (4 Cores) MSI P55-CD53 (MS-7586) v1.0 Intel Core DMI 8192MB 160GB INTEL SSDSA2M160 + 1000GB Seagate ST1000DM005 HD10 + 1500GB Western Digital WD15EADS-00P Gigabyte NVIDIA GeForce GTX 670 2048MB (705/3004MHz) Realtek ALC889 Realtek RTL8111/8168/8411 3.11.0-12-generic (x86_64) GNOME Shell 3.8.4 X Server 1.14.3 NVIDIA 319.32 4.3.0 1680x1050 AMD A6-5400K APU @ 3.60GHz (2 Cores) BIOSTAR TA75MH2 AMD Family 15h 1000GB Seagate ST1000DL002-9TT1 NVIDIA GeForce GTX 650 1024MB (1202/2500MHz) Realtek ALC662 rev1 3.8.0-30-generic (x86_64) Unity 7.1.2 1600x900 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Disk Details - Deadline: DEADLINE / data=ordered,errors=remount-ro,relatime,rw - CFQ: CFQ / data=ordered,errors=remount-ro,relatime,rw - Noop: NOOP / data=ordered,errors=remount-ro,relatime,rw Processor Details - Deadline: Scaling Governor: intel_pstate powersave - CFQ: Scaling Governor: intel_pstate powersave - Noop: Scaling Governor: intel_pstate powersave - Intel Core i5 750: Scaling Governor: acpi-cpufreq ondemand - azw-haupt: Scaling Governor: acpi-cpufreq ondemand - Test-12-11-2013: Scaling Governor: acpi- freq ondemand System Details - Deadline: Disk Scheduler: DEADLINE. Python 2.7.5+. - CFQ: Disk Scheduler: CFQ. Python 2.7.5+. - Noop: Disk Scheduler: NOOP. Python 2.7.5+. Disk Mount Options Details - azw-haupt: data=ordered,relatime,rw
Linux 3.12 Kernel IO Scheduler Performance Benchmarks build-linux-kernel: Time To Compile build-php: Time To Compile scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping x264: H.264 Video Encoding graphics-magick: Blur graphics-magick: Sharpen graphics-magick: Resizing graphics-magick: HWB Color Space graphics-magick: Local Adaptive Thresholding himeno: Poisson Pressure Solver c-ray: Total Time primesieve: 1e12 Prime Number Generation smallpt: Global Illumination Renderer; 100 Samples encode-flac: WAV To FLAC encode-mp3: WAV To MP3 ffmpeg: H.264 HD To NTSC DV Deadline CFQ Noop bdver2 bdver1 btver2 btver1 barcelona athlon64 Intel Core i5 750 azw-haupt Test-12-11-2013 61.91 21.88 62.23 21.80 62.73 21.88 505.89 67.22 1008.88 1221.85 88.10 97.88 116 92 152 159 83 835.46 33.10 298.53 57 6.81 16.81 15.57 496.21 69.46 999.02 1219.08 88.86 98.24 119 92 151 168 83 838.19 32.87 302.03 58 6.80 16.80 17.32 484.80 65.37 888.52 1106.55 48.29 84.96 84 56 135 169 80 823.58 42.43 293.42 59 17.24 481.50 69.12 999.17 1227.48 87.15 98.25 91 48 135 174 79 806.26 62.39 302.42 63 7.11 17.95 15.43 468.89 69.40 984.90 1199.73 49.07 85.27 98 80 140 171 78 812.90 44.01 306.52 61 7.11 18.24 15.73 487.51 76.47 974.17 960.11 87.09 97.94 90 48 135 162 83 812.40 73.83 290.87 72 6.91 17.31 17.34 78.24 79.50 123 1288.91 36.88 8.28 20.06 120.54 395.14 59.34 783.94 928.80 30.74 36.71 645.16 117.47 752.04 439 7.98 21.75 29.30 OpenBenchmarking.org
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 3.1 Time To Compile Deadline CFQ Noop 14 28 42 56 70 SE +/- 0.71, N = 3 SE +/- 0.80, N = 3 SE +/- 0.69, N = 3 61.91 62.23 62.73
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 5.2.9 Time To Compile Deadline CFQ Noop Test-12-11-2013 30 60 90 120 150 SE +/- 0.48, N = 6 SE +/- 0.35, N = 4 SE +/- 0.43, N = 6 SE +/- 0.45, N = 3 21.88 21.80 21.88 120.54 1. (CC) gcc options: -O2 -pedantic -ldl -lz -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 110 220 330 440 550 SE +/- 0.59, N = 4 SE +/- 6.51, N = 4 SE +/- 2.60, N = 4 SE +/- 0.00, N = 4 SE +/- 0.84, N = 4 SE +/- 0.55, N = 4 SE +/- 1.60, N = 4 505.89 496.21 484.80 481.50 468.89 487.51 395.14 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 1. (CXX) g++ options:
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 20 40 60 80 100 SE +/- 0.10, N = 4 SE +/- 0.22, N = 4 SE +/- 0.28, N = 4 SE +/- 0.56, N = 2 SE +/- 0.32, N = 4 SE +/- 0.26, N = 4 SE +/- 0.05, N = 4 67.22 69.46 65.37 69.12 69.40 76.47 59.34 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 1. (CXX) g++ options:
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 200 400 600 800 1000 SE +/- 2.03, N = 4 SE +/- 0.00, N = 3 SE +/- 2.49, N = 4 SE +/- 6.96, N = 4 SE +/- 21.58, N = 4 SE +/- 5.51, N = 4 SE +/- 1.88, N = 4 1008.88 999.02 888.52 999.17 984.90 974.17 783.94 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 1. (CXX) g++ options:
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 300 600 900 1200 1500 SE +/- 1.39, N = 4 SE +/- 3.52, N = 4 SE +/- 4.19, N = 4 SE +/- 2.71, N = 4 SE +/- 31.28, N = 4 SE +/- 2.18, N = 4 SE +/- 19.42, N = 4 1221.85 1219.08 1106.55 1227.48 1199.73 960.11 928.80 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 1. (CXX) g++ options:
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2z Phong Rendering With Soft-Shadow Mapping bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.11, N = 3 SE +/- 0.27, N = 3 SE +/- 0.93, N = 3 SE +/- 0.03, N = 3 SE +/- 0.76, N = 3 SE +/- 0.05, N = 3 88.10 88.86 48.29 87.15 49.07 87.09 30.74 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2013-06-08 H.264 Video Encoding bdver2 bdver1 btver2 btver1 barcelona athlon64 Intel Core i5 750 azw-haupt Test-12-11-2013 20 40 60 80 100 SE +/- 0.12, N = 5 SE +/- 0.15, N = 5 SE +/- 0.20, N = 5 SE +/- 0.27, N = 5 SE +/- 0.24, N = 5 SE +/- 0.29, N = 5 SE +/- 0.23, N = 5 SE +/- 0.70, N = 5 SE +/- 0.15, N = 5 97.88 98.24 84.96 98.25 85.27 97.94 78.24 79.50 36.71 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Blur bdver2 bdver1 btver2 btver1 barcelona athlon64 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 116 119 84 91 98 90 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Sharpen bdver2 bdver1 btver2 btver1 barcelona athlon64 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 92 92 56 48 80 48 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Resizing bdver2 bdver1 btver2 btver1 barcelona athlon64 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 152 151 135 135 140 135 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: HWB Color Space bdver2 bdver1 btver2 btver1 barcelona athlon64 azw-haupt 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.58, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 159 168 169 174 171 162 123 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 -O2 -ljbig -ljpeg -llzma 1. (CC) gcc options: -std=gnu99 -fopenmp -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Local Adaptive Thresholding bdver2 bdver1 btver2 btver1 barcelona athlon64 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 83 83 80 79 78 83 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -pthread -lXext -lSM -lICE -lX11 -lz -lm -lgomp -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver bdver2 bdver1 btver2 btver1 barcelona athlon64 azw-haupt Test-12-11-2013 300 600 900 1200 1500 SE +/- 0.75, N = 3 SE +/- 0.40, N = 3 SE +/- 1.96, N = 3 SE +/- 1.85, N = 3 SE +/- 1.32, N = 3 SE +/- 2.90, N = 3 SE +/- 1.43, N = 3 SE +/- 0.31, N = 3 835.46 838.19 823.58 806.26 812.90 812.40 1288.91 645.16 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -O3
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time bdver2 bdver1 btver2 btver1 barcelona athlon64 azw-haupt Test-12-11-2013 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 33.10 32.87 42.43 62.39 44.01 73.83 36.88 117.47 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -lm -lpthread -O3
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 4.2 1e12 Prime Number Generation bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 160 320 480 640 800 SE +/- 4.88, N = 4 SE +/- 3.22, N = 3 SE +/- 5.56, N = 3 SE +/- 5.13, N = 3 SE +/- 0.39, N = 3 SE +/- 4.18, N = 5 SE +/- 0.52, N = 3 298.53 302.03 293.42 302.42 306.52 290.87 752.04 1. (CXX) g++ options: -O2 -fopenmp
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples bdver2 bdver1 btver2 btver1 barcelona athlon64 Test-12-11-2013 100 200 300 400 500 SE +/- 0.00, N = 3 SE +/- 1.40, N = 6 SE +/- 0.33, N = 3 SE +/- 0.93, N = 5 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 57 58 59 63 61 72 439 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver2 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 1. (CXX) g++ options: -fopenmp
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.0 WAV To FLAC bdver2 bdver1 btver1 barcelona athlon64 azw-haupt Test-12-11-2013 2 4 6 8 10 SE +/- 0.01, N = 5 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 6.81 6.80 7.11 7.11 6.91 8.28 7.98 -O3 -march=bdver2 -O3 -march=bdver1 -O3 -march=btver1 -O3 -march=barcelona -O3 -march=athlon64 -O2 -O2 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 bdver2 bdver1 btver2 btver1 barcelona athlon64 azw-haupt Test-12-11-2013 5 10 15 20 25 SE +/- 0.03, N = 5 SE +/- 0.04, N = 5 SE +/- 0.03, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.02, N = 5 SE +/- 0.00, N = 5 SE +/- 0.02, N = 5 16.81 16.80 17.24 17.95 18.24 17.31 20.06 21.75 -march=bdver2 -march=bdver1 -march=btver2 -march=btver1 -march=barcelona -march=athlon64 -fomit-frame-pointer -ffast-math -fomit-frame-pointer -ffast-math 1. (CC) gcc options: -pipe -O3 -lm
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 1.1 H.264 HD To NTSC DV bdver2 bdver1 btver1 barcelona athlon64 Test-12-11-2013 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 15.57 17.32 15.43 15.73 17.34 29.30 -march=bdver2 -march=bdver1 -march=btver1 -march=barcelona -march=athlon64 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -O3 -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
Phoronix Test Suite v10.8.4