LLVM Clang 3.4 SVN - SLP Vectorizer

Benchmarking the SLP Vectorizer via -fslp-vectorize on LLVM Clang 3.4 SVN for a future article on Phoronix.com

HTML result view exported from: https://openbenchmarking.org/result/1307291-SO-FSLPVECTO83&grt&sro.

LLVM Clang 3.4 SVN - SLP Vectorizer: System Details

Runs: LLVM Clang 3.4 SVN, -fslp-vectorize

Processor: Intel Core i7 720Q @ 1.60GHz (8 Cores)
Motherboard: LENOVO 4318CTO
Chipset: Intel Core DMI
Memory: 4096MB
Disk: 160GB INTEL SSDSA2M160
Graphics: NVIDIA Quadro FX 880M 1024MB (405/324MHz)
Audio: Conexant CX20585
Network: Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300
OS: Ubuntu 13.10
Kernel: 3.11.0-031100rc2-generic (x86_64)
Desktop: Xfce 4.10
Display Server: X Server 1.14.2
Display Driver: nouveau 1.0.8
OpenGL: 3.0 Mesa 9.1.4 Gallium 0.4
Compiler: Clang 3.4 (SVN 187338) + LLVM 3.4svn
File-System: ext4
Screen Resolution: 1600x900

Compiler Details
- Optimized build; Built Jul 28 2013 (21:43:17); Default target: x86_64-unknown-linux-gnu; Host CPU: corei7

Processor Details
- Scaling Governor: acpi-cpufreq ondemand

LLVM Clang 3.4 SVN - SLP Vectorizer: Results Overview

Test                                                LLVM Clang 3.4 SVN  -fslp-vectorize
apache: Static Web Page Serving                     9830.61             9715.20
blake2: Phoronix Test Suite v4.8.0m4                4.20                4.20
c-ray: Total Time                                   75.98               75.87
encode-flac: WAV To FLAC                            8.86                8.86
graphics-magick: Blur                               53                  53
graphics-magick: Sharpen                            36                  36
graphics-magick: Resizing                           66                  66
graphics-magick: HWB Color Space                    80                  80
graphics-magick: Local Adaptive Thresholding        32                  32
himeno: Poisson Pressure Solver                     1055.08             1127.37
encode-mp3: WAV To MP3                              22.64               22.78
n-queens: Elapsed Time                              345.38              343.15
primesieve: 1e12 Prime Number Generation            614.07              614.34
scimark2: Composite                                 976.45              977.27
scimark2: Monte Carlo                               378.75              379.41
scimark2: Fast Fourier Transform                    173.57              189.10
scimark2: Sparse Matrix Multiply                    1234.66             1228.19
scimark2: Dense LU Matrix Factorization             2003.23             1995.65
scimark2: Jacobi Successive Over-Relaxation         1092.03             1093.98
smallpt: Global Illumination Renderer; 100 Samples  293                 301
hmmer: Pfam Database Search                         27.33               27.30
build-imagemagick: Time To Compile                  65.88               66.31
mafft: Multiple Sequence Alignment                  15.42               15.44
build-php: Time To Compile                          41.97               42.07
x264: H.264 Video Encoding                          59.63               59.83
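The overview figures are easier to compare as relative differences. The sketch below is an illustration only: the helper name `relative_gain`, the choice of four rows, and the sign convention (positive means the -fslp-vectorize run did better) are assumptions made for this example and are not part of the OpenBenchmarking.org export. The numbers are copied from the overview, and the "more is better" / "fewer is better" direction comes from the per-test charts.

```python
def relative_gain(baseline, candidate, higher_is_better):
    """Percent change of candidate vs. baseline; positive means candidate is better."""
    if higher_is_better:
        return (candidate - baseline) / baseline * 100.0
    # For time-like metrics a smaller value is the better one.
    return (baseline - candidate) / baseline * 100.0


# (test, "LLVM Clang 3.4 SVN" result, "-fslp-vectorize" result, higher_is_better)
RESULTS = [
    ("apache: Static Web Page Serving", 9830.61, 9715.20, True),
    ("himeno: Poisson Pressure Solver", 1055.08, 1127.37, True),
    ("scimark2: Fast Fourier Transform", 173.57, 189.10, True),
    ("smallpt: Global Illumination Renderer; 100 Samples", 293.0, 301.0, False),
]

if __name__ == "__main__":
    for name, base, slp, higher_is_better in RESULTS:
        print(f"{name}: {relative_gain(base, slp, higher_is_better):+.2f}%")
```

With these inputs, Himeno and the SciMark FFT come out roughly 7% and 9% ahead with -fslp-vectorize, while Apache and Smallpt land about 1% and 3% behind.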

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.3, Static Web Page Serving. Requests Per Second, More Is Better.
-fslp-vectorize: 9715.20 (SE +/- 44.32, N = 3)
LLVM Clang 3.4 SVN: 9830.61 (SE +/- 60.14, N = 3)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

BLAKE2

Phoronix Test Suite v4.8.0m4

BLAKE2 20121223, Phoronix Test Suite v4.8.0m4. Cycles Per Byte, Fewer Is Better.
-fslp-vectorize: 4.20 (SE +/- 0.00, N = 3)
LLVM Clang 3.4 SVN: 4.20 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

C-Ray

Total Time

C-Ray 1.1, Total Time. Seconds, Fewer Is Better.
-fslp-vectorize: 75.87 (SE +/- 0.01, N = 3)
LLVM Clang 3.4 SVN: 75.98 (SE +/- 0.04, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -lm -lpthread -O3 -march=native

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0, WAV To FLAC. Seconds, Fewer Is Better.
-fslp-vectorize: 8.86 (SE +/- 0.01, N = 5)
LLVM Clang 3.4 SVN: 8.86 (SE +/- 0.01, N = 5)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.16, Operation: Blur. Iterations Per Minute, More Is Better.
-fslp-vectorize: 53 (SE +/- 0.00, N = 3)
LLVM Clang 3.4 SVN: 53 (SE +/- 0.00, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.16, Operation: Sharpen. Iterations Per Minute, More Is Better.
-fslp-vectorize: 36 (SE +/- 0.00, N = 3)
LLVM Clang 3.4 SVN: 36 (SE +/- 0.00, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.16, Operation: Resizing. Iterations Per Minute, More Is Better.
-fslp-vectorize: 66 (SE +/- 0.33, N = 3)
LLVM Clang 3.4 SVN: 66 (SE +/- 0.00, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.16, Operation: HWB Color Space. Iterations Per Minute, More Is Better.
-fslp-vectorize: 80 (SE +/- 0.33, N = 3)
LLVM Clang 3.4 SVN: 80 (SE +/- 0.00, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.16, Operation: Local Adaptive Thresholding. Iterations Per Minute, More Is Better.
-fslp-vectorize: 32 (SE +/- 0.00, N = 3)
LLVM Clang 3.4 SVN: 32 (SE +/- 0.00, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver. MFLOPS, More Is Better.
-fslp-vectorize: 1127.37 (SE +/- 0.79, N = 3)
LLVM Clang 3.4 SVN: 1055.08 (SE +/- 5.78, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native
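Each chart reports a standard error (SE) next to the mean, which gives a rough sense of whether a gap between the two runs exceeds run-to-run noise. The sketch below is one simple way to look at that, not a method used by the Phoronix Test Suite: the function name `gap_in_standard_errors` is hypothetical, and dividing the difference by the combined standard error assumes the two runs are independent. With only N = 3 samples per run, treat the output as a rough guide rather than a significance test.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    combined_se = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    if combined_se == 0.0:
        # Both SEs are reported as zero: identical means give 0, anything else is unbounded.
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined_se


# Himeno figures from the chart: -fslp-vectorize run vs. LLVM Clang 3.4 SVN run.
himeno_gap = gap_in_standard_errors(1127.37, 0.79, 1055.08, 5.78)

if __name__ == "__main__":
    print(f"Himeno gap: {himeno_gap:.1f} combined standard errors")
```

For the Himeno figures the gap works out to about twelve combined standard errors, so it is large compared with the reported noise.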

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3, WAV To MP3. Seconds, Fewer Is Better.
-fslp-vectorize: 22.78 (SE +/- 0.01, N = 5)
LLVM Clang 3.4 SVN: 22.64 (SE +/- 0.02, N = 5)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -pipe -O3 -march=native -lm

N-Queens

Elapsed Time

N-Queens 1.0, Elapsed Time. Seconds, Fewer Is Better.
-fslp-vectorize: 343.15 (SE +/- 0.03, N = 3)
LLVM Clang 3.4 SVN: 345.38 (SE +/- 0.89, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -static -fopenmp -O3 -march=native

Primesieve

1e12 Prime Number Generation

Primesieve 4.2, 1e12 Prime Number Generation. Seconds, Fewer Is Better.
-fslp-vectorize: 614.34 (SE +/- 0.14, N = 3)
LLVM Clang 3.4 SVN: 614.07 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -O2

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite. Mflops, More Is Better.
-fslp-vectorize: 977.27 (SE +/- 0.78, N = 4)
LLVM Clang 3.4 SVN: 976.45 (SE +/- 1.47, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo. Mflops, More Is Better.
-fslp-vectorize: 379.41 (SE +/- 0.00, N = 4)
LLVM Clang 3.4 SVN: 378.75 (SE +/- 0.38, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform. Mflops, More Is Better.
-fslp-vectorize: 189.10 (SE +/- 0.40, N = 4)
LLVM Clang 3.4 SVN: 173.57 (SE +/- 0.61, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply. Mflops, More Is Better.
-fslp-vectorize: 1228.19 (SE +/- 1.06, N = 4)
LLVM Clang 3.4 SVN: 1234.66 (SE +/- 0.93, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization. Mflops, More Is Better.
-fslp-vectorize: 1995.65 (SE +/- 3.58, N = 4)
LLVM Clang 3.4 SVN: 2003.23 (SE +/- 8.33, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation. Mflops, More Is Better.
-fslp-vectorize: 1093.98 (SE +/- 0.98, N = 4)
LLVM Clang 3.4 SVN: 1092.03 (SE +/- 0.98, N = 4)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples. Seconds, Fewer Is Better.
-fslp-vectorize: 301 (SE +/- 0.88, N = 3)
LLVM Clang 3.4 SVN: 293 (SE +/- 0.58, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CXX) g++ options: -fopenmp -O3 -march=native

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2, Pfam Database Search. Seconds, Fewer Is Better.
-fslp-vectorize: 27.30 (SE +/- 0.03, N = 3)
LLVM Clang 3.4 SVN: 27.33 (SE +/- 0.02, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.8.1-10, Time To Compile. Seconds, Fewer Is Better.
-fslp-vectorize: 66.31 (SE +/- 0.13, N = 3)
LLVM Clang 3.4 SVN: 65.88 (SE +/- 0.18, N = 3)

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864, Multiple Sequence Alignment. Seconds, Fewer Is Better.
-fslp-vectorize: 15.44 (SE +/- 0.23, N = 5)
LLVM Clang 3.4 SVN: 15.42 (SE +/- 0.25, N = 6)
1. (CC) gcc options: -O3 -lm -lpthread

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9, Time To Compile. Seconds, Fewer Is Better.
-fslp-vectorize: 42.07 (SE +/- 0.07, N = 3)
LLVM Clang 3.4 SVN: 41.97 (SE +/- 0.10, N = 3)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

x264

H.264 Video Encoding

x264 2013-06-08, H.264 Video Encoding. Frames Per Second, More Is Better.
-fslp-vectorize: 59.83 (SE +/- 0.11, N = 5)
LLVM Clang 3.4 SVN: 59.63 (SE +/- 0.14, N = 5)
Per-run flag noted on chart: -fslp-vectorize
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.5