NVIDIA Tegra K1 GCC Clang Compilers

GCC 4.8.2 vs. Clang 3.4 on NVIDIA's Tegra K1 SoC via the NVIDIA Jetson TK1 ARM quad-core Cortex-A15 development board. Benchmarks by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1405138-KH-NVIDIATEG53.

System details (the same for both runs except where noted):

Processor: ARMv7 rev 3 @ 2.32GHz (4 Cores)
Motherboard: laguna
Memory: 2048MB
Disk: 16GB SEM16G
Graphics: GK20A/AXI
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 14.04
Kernel: 3.10.24-g6a2d13a (armv7l)
Desktop: Unity 7.2.0
Display Server: X Server 1.15.1
Display Driver: NVIDIA 19.2
OpenGL: 4.3.0
Compiler (Clang 3.4 run): Clang 3.4-1ubuntu3
Compiler (GCC 4.8.2 run): GCC 4.8.2 + Clang 3.4-1ubuntu3
File-System: ext4
Screen Resolution: 1920x1080

Processor Details
- Scaling Governor: tegra ondemand

Compiler Details
- GCC 4.8.2: --build=arm-linux-gnueabihf --disable-browser-plugin --disable-libitm --disable-libmudflap --disable-libquadmath --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch-directory=arm --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v

Results overview:

Test                                           Clang 3.4   GCC 4.8.2
scimark2: Composite                               393.13      351.71
scimark2: Monte Carlo                             275.42      231.91
scimark2: Fast Fourier Transform                   58.04       45.98
scimark2: Sparse Matrix Multiply                  369.93      374.74
scimark2: Dense LU Matrix Factorization           624.29      625.32
scimark2: Jacobi Successive Over-Relaxation       637.99      480.59
graphics-magick: Blur                                 30          48
graphics-magick: Sharpen                              13          46
graphics-magick: Resizing                             29          65
graphics-magick: HWB Color Space                      55          72
graphics-magick: Local Adaptive Thresholding          30          36
himeno: Poisson Pressure Solver                   224.67      142.37
build-apache: Time To Compile                      83.68      128.21
build-imagemagick: Time To Compile                146.80      346.30
c-ray: Total Time                                 207.48       86.01
encode-flac: WAV To FLAC                           17.93       15.16
encode-mp3: WAV To MP3                             37.99       36.63
apache: Static Web Page Serving                  4565.50     4601.34
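The overview figures mix "more is better" units (Mflops, iterations per minute, requests per second) with "fewer is better" timings in seconds, so the two columns cannot be compared by size alone. The following is a minimal Python sketch of one way to express each result as a ratio in favor of the better compiler. It is not part of the Phoronix Test Suite; the names `RESULTS`, `relative_advantage`, and `summarize` are hypothetical, and the numbers are copied from the overview above. The ratio ignores the reported standard errors, so near-ties (for example the Apache web-serving result) should not be read as meaningful wins.

```python
# Hypothetical helper, not part of the Phoronix Test Suite.
# Each entry: (test name, Clang 3.4 result, GCC 4.8.2 result, higher_is_better).
RESULTS = [
    ("scimark2: Composite", 393.13, 351.71, True),
    ("scimark2: Monte Carlo", 275.42, 231.91, True),
    ("scimark2: Fast Fourier Transform", 58.04, 45.98, True),
    ("scimark2: Sparse Matrix Multiply", 369.93, 374.74, True),
    ("scimark2: Dense LU Matrix Factorization", 624.29, 625.32, True),
    ("scimark2: Jacobi Successive Over-Relaxation", 637.99, 480.59, True),
    ("graphics-magick: Blur", 30, 48, True),
    ("graphics-magick: Sharpen", 13, 46, True),
    ("graphics-magick: Resizing", 29, 65, True),
    ("graphics-magick: HWB Color Space", 55, 72, True),
    ("graphics-magick: Local Adaptive Thresholding", 30, 36, True),
    ("himeno: Poisson Pressure Solver", 224.67, 142.37, True),
    ("build-apache: Time To Compile", 83.68, 128.21, False),
    ("build-imagemagick: Time To Compile", 146.80, 346.30, False),
    ("c-ray: Total Time", 207.48, 86.01, False),
    ("encode-flac: WAV To FLAC", 17.93, 15.16, False),
    ("encode-mp3: WAV To MP3", 37.99, 36.63, False),
    ("apache: Static Web Page Serving", 4565.50, 4601.34, True),
]


def relative_advantage(clang, gcc, higher_is_better):
    """Return (winner, ratio), where ratio >= 1 is the winner's advantage."""
    if clang == gcc:
        return ("tie", 1.0)
    # For timings, the smaller number is the better result.
    clang_wins = (clang > gcc) if higher_is_better else (clang < gcc)
    ratio = max(clang, gcc) / min(clang, gcc)
    return ("Clang 3.4" if clang_wins else "GCC 4.8.2", ratio)


def summarize(results):
    """Apply relative_advantage to every row of the results list."""
    return [(name, *relative_advantage(c, g, hib)) for name, c, g, hib in results]


if __name__ == "__main__":
    for name, winner, ratio in summarize(RESULTS):
        print(f"{name:<46} {winner:<10} {ratio:.2f}x")
```

Note that the two "Time To Compile" rows measure how fast each compiler builds a project, while the remaining rows measure the speed of the code each compiler generated.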

SciMark

Computational Test: Composite

SciMark 2.0, Mflops (more is better):
Clang 3.4: 393.13 (SE +/- 0.21, N = 4)
GCC 4.8.2: 351.71 (SE +/- 0.11, N = 4)

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Mflops (more is better):
Clang 3.4: 275.42 (SE +/- 0.07, N = 4)
GCC 4.8.2: 231.91 (SE +/- 0.01, N = 4)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Mflops (more is better):
Clang 3.4: 58.04 (SE +/- 0.04, N = 4)
GCC 4.8.2: 45.98 (SE +/- 0.28, N = 4)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Mflops (more is better):
Clang 3.4: 369.93 (SE +/- 0.40, N = 4)
GCC 4.8.2: 374.74 (SE +/- 0.22, N = 4)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Mflops (more is better):
Clang 3.4: 624.29 (SE +/- 0.31, N = 4)
GCC 4.8.2: 625.32 (SE +/- 0.40, N = 4)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Mflops (more is better):
Clang 3.4: 637.99 (SE +/- 0.45, N = 4)
GCC 4.8.2: 480.59 (SE +/- 0.03, N = 4)

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.19, Iterations Per Minute (more is better):
Clang 3.4: 30 (SE +/- 0.00, N = 3)
GCC 4.8.2: 48 (SE +/- 0.00, N = 3)
Additional options shown on the chart: -std=gnu99 -fopenmp -lgomp
(CC) gcc options: -O2 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.19, Iterations Per Minute (more is better):
Clang 3.4: 13 (SE +/- 0.21, N = 6)
GCC 4.8.2: 46 (SE +/- 0.00, N = 3)
Additional options shown on the chart: -std=gnu99 -fopenmp -lgomp
(CC) gcc options: -O2 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.19, Iterations Per Minute (more is better):
Clang 3.4: 29 (SE +/- 0.00, N = 3)
GCC 4.8.2: 65 (SE +/- 0.33, N = 3)
Additional options shown on the chart: -std=gnu99 -fopenmp -lgomp
(CC) gcc options: -O2 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.19, Iterations Per Minute (more is better):
Clang 3.4: 55 (SE +/- 0.00, N = 3)
GCC 4.8.2: 72 (SE +/- 0.00, N = 3)
Additional options shown on the chart: -std=gnu99 -fopenmp -lgomp
(CC) gcc options: -O2 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.19, Iterations Per Minute (more is better):
Clang 3.4: 30 (SE +/- 0.00, N = 3)
GCC 4.8.2: 36 (SE +/- 0.33, N = 3)
Additional options shown on the chart: -std=gnu99 -fopenmp -lgomp
(CC) gcc options: -O2 -pthread -ljbig -lwebp -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, MFLOPS (more is better):
Clang 3.4: 224.67 (SE +/- 3.56, N = 3)
GCC 4.8.2: 142.37 (SE +/- 2.19, N = 5)
(CC) gcc options: -O3

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7, Seconds (fewer is better):
Clang 3.4: 83.68 (SE +/- 0.45, N = 3)
GCC 4.8.2: 128.21 (SE +/- 0.68, N = 3)

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.8.1-10, Seconds (fewer is better):
Clang 3.4: 146.80 (SE +/- 0.62, N = 3)
GCC 4.8.2: 346.30 (SE +/- 0.20, N = 3)

C-Ray

Total Time

C-Ray 1.1, Seconds (fewer is better):
Clang 3.4: 207.48 (SE +/- 1.04, N = 3)
GCC 4.8.2: 86.01 (SE +/- 0.18, N = 3)
(CC) gcc options: -lm -lpthread -O3

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0, Seconds (fewer is better):
Clang 3.4: 17.93 (SE +/- 0.15, N = 5)
GCC 4.8.2: 15.16 (SE +/- 0.12, N = 5)
(CXX) g++ options: -O2 -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3, Seconds (fewer is better):
Clang 3.4: 37.99 (SE +/- 0.16, N = 5)
GCC 4.8.2: 36.63 (SE +/- 0.14, N = 5)
Additional options shown on the chart: -funroll-loops, -fomit-frame-pointer
(CC) gcc options: -O3 -ffast-math -pipe -lm

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.7, Requests Per Second (more is better):
Clang 3.4: 4565.50 (SE +/- 23.54, N = 3)
GCC 4.8.2: 4601.34 (SE +/- 45.72, N = 3)
(CC) gcc options: -shared -fPIC -O2 -pthread


Phoronix Test Suite v10.8.4