LLVM Clang 3.4 SVN - SLP Vectorizer

Benchmarking the SLP Vectorizer via -fslp-vectorize on LLVM Clang 3.4 SVN for a future article on Phoronix.com

HTML result view exported from: https://openbenchmarking.org/result/1307291-SO-FSLPVECTO83&grs&rdt.

LLVM Clang 3.4 SVN - SLP Vectorizer: system details
Runs: LLVM Clang 3.4 SVN, -fslp-vectorize

Processor: Intel Core i7 720Q @ 1.60GHz (8 Cores)
Motherboard: LENOVO 4318CTO
Chipset: Intel Core DMI
Memory: 4096MB
Disk: 160GB INTEL SSDSA2M160
Graphics: NVIDIA Quadro FX 880M 1024MB (405/324MHz)
Audio: Conexant CX20585
Network: Intel 82577LM Gigabit Connection + Intel Centrino Ultimate-N 6300
OS: Ubuntu 13.10
Kernel: 3.11.0-031100rc2-generic (x86_64)
Desktop: Xfce 4.10
Display Server: X Server 1.14.2
Display Driver: nouveau 1.0.8
OpenGL: 3.0 Mesa 9.1.4 Gallium 0.4
Compiler: Clang 3.4 (SVN 187338) + LLVM 3.4svn
File-System: ext4
Screen Resolution: 1600x900

Compiler Details: Optimized build; Built Jul 28 2013 (21:43:17); Default target: x86_64-unknown-linux-gnu; Host CPU: corei7
Processor Details: Scaling Governor: acpi-cpufreq ondemand

LLVM Clang 3.4 SVN - SLP Vectorizer: results overview

Test                                                LLVM Clang 3.4 SVN  -fslp-vectorize
scimark2: Fast Fourier Transform                    173.57              189.10
himeno: Poisson Pressure Solver                     1055.08             1127.37
smallpt: Global Illumination Renderer; 100 Samples  293                 301
apache: Static Web Page Serving                     9830.61             9715.20
build-imagemagick: Time To Compile                  65.88               66.31
n-queens: Elapsed Time                              345.38              343.15
encode-mp3: WAV To MP3                              22.64               22.78
scimark2: Sparse Matrix Multiply                    1234.66             1228.19
scimark2: Dense LU Matrix Factorization             2003.23             1995.65
x264: H.264 Video Encoding                          59.63               59.83
build-php: Time To Compile                          41.97               42.07
scimark2: Jacobi Successive Over-Relaxation         1092.03             1093.98
scimark2: Monte Carlo                               378.75              379.41
c-ray: Total Time                                   75.98               75.87
mafft: Multiple Sequence Alignment                  15.42               15.44
hmmer: Pfam Database Search                         27.33               27.30
scimark2: Composite                                 976.45              977.27
primesieve: 1e12 Prime Number Generation            614.07              614.34
encode-flac: WAV To FLAC                            8.86                8.86
graphics-magick: Local Adaptive Thresholding        32                  32
graphics-magick: HWB Color Space                    80                  80
graphics-magick: Resizing                           66                  66
graphics-magick: Sharpen                            36                  36
graphics-magick: Blur                               53                  53
blake2: Phoronix Test Suite v4.8.0m4                4.20                4.20
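
The overview figures are easier to compare as relative changes. The following is a minimal sketch, not part of the exported result: the function name relative_gain and the results dictionary are hypothetical, the numbers are copied from the first four rows of the overview, and the "higher is better" flags follow the units given in the per-test charts (Mflops, MFLOPS and Requests Per Second: more is better; Seconds: fewer is better).

```python
# Illustrative only: values copied from the results overview.
# Tuple layout: (LLVM Clang 3.4 SVN, -fslp-vectorize, higher_is_better)
results = {
    "scimark2: Fast Fourier Transform": (173.57, 189.10, True),
    "himeno: Poisson Pressure Solver": (1055.08, 1127.37, True),
    "smallpt: Global Illumination Renderer; 100 Samples": (293.0, 301.0, False),
    "apache: Static Web Page Serving": (9830.61, 9715.20, True),
}


def relative_gain(baseline, candidate, higher_is_better):
    """Percent improvement of candidate over baseline.

    A positive result means the candidate run is better, regardless of
    whether the metric is "more is better" or "fewer is better".
    """
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    return (baseline / candidate - 1.0) * 100.0


if __name__ == "__main__":
    for name, (base, slp, hib) in results.items():
        print(f"{name}: {relative_gain(base, slp, hib):+.2f}%")
```

A relative change on its own says nothing about significance; the standard errors reported with each chart should be read alongside it.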

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 - Computational Test: Fast Fourier Transform
Mflops, More Is Better
LLVM Clang 3.4 SVN: 173.57 (SE +/- 0.61, N = 4)
-fslp-vectorize: 189.10 (SE +/- 0.40, N = 4)
1. (CXX) g++ options: -O3 -march=native

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 - Poisson Pressure Solver
MFLOPS, More Is Better
LLVM Clang 3.4 SVN: 1055.08 (SE +/- 5.78, N = 3)
-fslp-vectorize: 1127.37 (SE +/- 0.79, N = 3)
1. (CC) gcc options: -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 - Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 293 (SE +/- 0.58, N = 3)
-fslp-vectorize: 301 (SE +/- 0.88, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.3 - Static Web Page Serving
Requests Per Second, More Is Better
LLVM Clang 3.4 SVN: 9830.61 (SE +/- 60.14, N = 3)
-fslp-vectorize: 9715.20 (SE +/- 44.32, N = 3)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.8.1-10 - Time To Compile
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 65.88 (SE +/- 0.18, N = 3)
-fslp-vectorize: 66.31 (SE +/- 0.13, N = 3)

N-Queens

Elapsed Time

N-Queens 1.0 - Elapsed Time
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 345.38 (SE +/- 0.89, N = 3)
-fslp-vectorize: 343.15 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -static -fopenmp -O3 -march=native

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3 - WAV To MP3
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 22.64 (SE +/- 0.02, N = 5)
-fslp-vectorize: 22.78 (SE +/- 0.01, N = 5)
1. (CC) gcc options: -pipe -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 - Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
LLVM Clang 3.4 SVN: 1234.66 (SE +/- 0.93, N = 4)
-fslp-vectorize: 1228.19 (SE +/- 1.06, N = 4)
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 - Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
LLVM Clang 3.4 SVN: 2003.23 (SE +/- 8.33, N = 4)
-fslp-vectorize: 1995.65 (SE +/- 3.58, N = 4)
1. (CXX) g++ options: -O3 -march=native

x264

H.264 Video Encoding

x264 2013-06-08 - H.264 Video Encoding
Frames Per Second, More Is Better
LLVM Clang 3.4 SVN: 59.63 (SE +/- 0.14, N = 5)
-fslp-vectorize: 59.83 (SE +/- 0.11, N = 5)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9 - Time To Compile
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 41.97 (SE +/- 0.10, N = 3)
-fslp-vectorize: 42.07 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
LLVM Clang 3.4 SVN: 1092.03 (SE +/- 0.98, N = 4)
-fslp-vectorize: 1093.98 (SE +/- 0.98, N = 4)
1. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Monte Carlo

SciMark 2.0 - Computational Test: Monte Carlo
Mflops, More Is Better
LLVM Clang 3.4 SVN: 378.75 (SE +/- 0.38, N = 4)
-fslp-vectorize: 379.41 (SE +/- 0.00, N = 4)
1. (CXX) g++ options: -O3 -march=native

C-Ray

Total Time

C-Ray 1.1 - Total Time
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 75.98 (SE +/- 0.04, N = 3)
-fslp-vectorize: 75.87 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864 - Multiple Sequence Alignment
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 15.42 (SE +/- 0.25, N = 6)
-fslp-vectorize: 15.44 (SE +/- 0.23, N = 5)
1. (CC) gcc options: -O3 -lm -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 - Pfam Database Search
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 27.33 (SE +/- 0.02, N = 3)
-fslp-vectorize: 27.30 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

SciMark

Computational Test: Composite

SciMark 2.0 - Computational Test: Composite
Mflops, More Is Better
LLVM Clang 3.4 SVN: 976.45 (SE +/- 1.47, N = 4)
-fslp-vectorize: 977.27 (SE +/- 0.78, N = 4)
1. (CXX) g++ options: -O3 -march=native

Primesieve

1e12 Prime Number Generation

Primesieve 4.2 - 1e12 Prime Number Generation
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 614.07 (SE +/- 0.26, N = 3)
-fslp-vectorize: 614.34 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O2

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0 - WAV To FLAC
Seconds, Fewer Is Better
LLVM Clang 3.4 SVN: 8.86 (SE +/- 0.01, N = 5)
-fslp-vectorize: 8.86 (SE +/- 0.01, N = 5)
1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.16 - Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better
LLVM Clang 3.4 SVN: 32 (SE +/- 0.00, N = 3)
-fslp-vectorize: 32 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.16 - Operation: HWB Color Space
Iterations Per Minute, More Is Better
LLVM Clang 3.4 SVN: 80 (SE +/- 0.00, N = 3)
-fslp-vectorize: 80 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.16 - Operation: Resizing
Iterations Per Minute, More Is Better
LLVM Clang 3.4 SVN: 66 (SE +/- 0.00, N = 3)
-fslp-vectorize: 66 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.16 - Operation: Sharpen
Iterations Per Minute, More Is Better
LLVM Clang 3.4 SVN: 36 (SE +/- 0.00, N = 3)
-fslp-vectorize: 36 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.16 - Operation: Blur
Iterations Per Minute, More Is Better
LLVM Clang 3.4 SVN: 53 (SE +/- 0.00, N = 3)
-fslp-vectorize: 53 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread

BLAKE2

Phoronix Test Suite v4.8.0m4

BLAKE2 20121223 - Phoronix Test Suite v4.8.0m4
Cycles Per Byte, Fewer Is Better
LLVM Clang 3.4 SVN: 4.20 (SE +/- 0.01, N = 3)
-fslp-vectorize: 4.20 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz


Phoronix Test Suite v10.8.5