Xavier Carmel CPU Core Compiler Tests

NVIDIA Jetson Xavier ARMv8 compiler benchmarks on GCC and LLVM Clang for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1901055-SP-XAVIERCAR07&sor&grr.

Xavier Carmel CPU Core Compiler TestsProcessorMotherboardMemoryDiskGraphicsOSKernelDesktopDisplay ServerDisplay DriverVulkanCompilerFile-SystemScreen ResolutionGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVNARMv8 rev 0 @ 2.27GHz (8 Cores)jetson-xavier16384MB31GB HBG4a2NVIDIA TEGRAUbuntu 18.044.9.108-tegra (aarch64)Unity 7.5.0X Server 1.19.6NVIDIA 1.0.01.1.76GCC 7.3.0 + CUDA 10.0ext41920x2160GCC 8.2.0 + clang (GCC) 8.2.0 + CUDA 10.0GCC 9.0.0 20181230 + clang (GCC) 9.0.0 20181230 (experimental) + CUDA 10.0Clang 6.0.0-1ubuntu2 + CUDA 10.0Clang 7.0.1 + LLVM 7.0.1 + CUDA 10.0Clang 8.0.0 (SVN 350356) + LLVM 8.0.0svn + CUDA 10.0OpenBenchmarking.orgCompiler Details- GCC 7.3.0: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -v - GCC 8.2.0: --enable-checking=release- GCC 9.0.0: --enable-checking=release- Clang 7.0.1: Optimized build; Default target: aarch64-unknown-linux-gnu; Host CPU: (unknown)- Clang 8.0 SVN: Optimized build; Default target: aarch64-unknown-linux-gnu; Host CPU: (unknown)Processor Details- Scaling Governor: tegra_cpufreq schedutil

Xavier Carmel CPU Core Compiler Testsbuild-linux-kernel: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelasmfish: 1024 Hash Memory, 26 Depthbuild-php: Time To Compileprimesieve: 1e12 Prime Number Generationencode-flac: WAV To FLACttsiod-renderer: Phong Rendering With Soft-Shadow Mappingapache: Static Web Page Servingcompress-7zip: Compress Speed Testencode-mp3: WAV To MP3himeno: Poisson Pressure Solverscimark2: Compositebullet: Raytestsredis: SETredis: SADDredis: LPUSHredis: GETredis: LPOPbullet: Convex Trimeshbullet: Prim Trimeshbullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte CarloGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN831258829504427610657.27146107821813931.832695725.5375655089906670523297350011052702.322.127.309.9512.8612.6887996262018321713124697195166409134152.90127621820239160.3624855413.922740123108862548963910004176425.523.8512.9326.9625.0119.238799665651641979634514991900140.1260098651139.8636812613.022743023031002393893637003773195.043.3511.4725.5022.1617.1324515111869.7947.45408830539831910654.611371074118.86265540816377923930744500113849312546729358405461632147965083834337144166.52604049.8628110226878230232923491036630137727414819778.1152.8934.337965046980355137166.90606153.2627010226265029973123743936687037295914819878.1852.8634.30OpenBenchmarking.org

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileGCC 7.3.0GCC 9.0.0GCC 8.2.030060090012001500SE +/- 4.78, N = 3SE +/- 4.39, N = 3SE +/- 10.52, N = 38319631312

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 7.3.0Clang 6.0GCC 9.0.0GCC 8.2.0Clang 7.0.1Clang 8.0 SVN2004006008001000SE +/- 15.86, N = 9SE +/- 9.68, N = 9SE +/- 0.12, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 3SE +/- 0.21, N = 32584084514697967961. (CC) gcc options: -lm -lpthread -O3

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthClang 6.0GCC 7.3.0GCC 8.2.0Clang 7.0.1Clang 8.0 SVNGCC 9.0.02M4M6M8M10MSE +/- 140386.45, N = 3SE +/- 52344.77, N = 3SE +/- 146948.80, N = 9SE +/- 36318.07, N = 3SE +/- 37685.79, N = 3SE +/- 36548.95, N = 3830539882950447195166508383450469804991900

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 7.3.0Clang 6.0Clang 7.0.1Clang 8.0 SVNGCC 8.2.090180270360450SE +/- 0.49, N = 3SE +/- 4.66, N = 3SE +/- 0.35, N = 3SE +/- 0.95, N = 3SE +/- 0.74, N = 3276319337355409

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.21e12 Prime Number GenerationGCC 7.3.0Clang 6.0GCC 8.2.0Clang 8.0 SVNClang 7.0.1306090120150SE +/- 2.56, N = 12SE +/- 1.17, N = 12SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 31061061341371441. (CXX) g++ options: -O3 -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACClang 6.0GCC 7.3.0GCC 9.0.0GCC 8.2.0Clang 7.0.1Clang 8.0 SVN4080120160200SE +/- 0.39, N = 5SE +/- 0.26, N = 5SE +/- 0.30, N = 5SE +/- 0.12, N = 5SE +/- 0.28, N = 5SE +/- 0.29, N = 554.6157.27140.12152.90166.52166.90-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O2 -logg -lm

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingGCC 7.3.0Clang 6.0GCC 8.2.0306090120150SE +/- 2.80, N = 3SE +/- 1.58, N = 3SE +/- 2.94, N = 12146137127-mtune=native-ltbb-mtune=native1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -flto -lSDL -fopenmp -fwhole-program -lstdc++

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 7.3.0Clang 6.0GCC 8.2.0Clang 8.0 SVNClang 7.0.1GCC 9.0.02K4K6K8K10KSE +/- 46.91, N = 3SE +/- 79.09, N = 3SE +/- 18.16, N = 3SE +/- 13.10, N = 3SE +/- 12.48, N = 3SE +/- 26.79, N = 3107821074162186061604060091. (CC) gcc options: -shared -fPIC -O2 -pthread

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.2.0GCC 7.3.0GCC 9.0.04K8K12K16K20KSE +/- 203.21, N = 12SE +/- 479.86, N = 12SE +/- 10.27, N = 3202391813986511. (CXX) g++ options: -pipe -lpthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3Clang 6.0GCC 7.3.0Clang 7.0.1Clang 8.0 SVNGCC 9.0.0GCC 8.2.04080120160200SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.81, N = 318.8631.8349.8653.26139.86160.36-O3 -pipe-O3 -pipe-O3 -pipe1. (CC) gcc options: -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 9.0.0Clang 7.0.1Clang 8.0 SVNGCC 7.3.0Clang 6.0GCC 8.2.080160240320400SE +/- 0.28, N = 3SE +/- 1.87, N = 3SE +/- 0.20, N = 3SE +/- 4.23, N = 3SE +/- 0.99, N = 3SE +/- 3.17, N = 73682812702692652481. (CC) gcc options: -O3

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 7.3.0GCC 8.2.0Clang 6.0GCC 9.0.0Clang 8.0 SVNClang 7.0.1120240360480600SE +/- 3.05, N = 3SE +/- 0.74, N = 3SE +/- 1.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 35725545401261021021. (CC) gcc options: -lm

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 7.3.0GCC 9.0.0GCC 8.2.048121620SE +/- 0.07, N = 8SE +/- 0.00, N = 3SE +/- 0.07, N = 35.5313.0213.92-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETClang 6.0GCC 7.3.0GCC 9.0.0GCC 8.2.0Clang 7.0.1Clang 8.0 SVN200K400K600K800K1000KSE +/- 14106.11, N = 3SE +/- 13045.74, N = 4SE +/- 749.65, N = 3SE +/- 3860.69, N = 6SE +/- 1314.35, N = 3SE +/- 1710.87, N = 38163777565502743022740122687822626501. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADDClang 6.0GCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 7.0.1Clang 8.0 SVN200K400K600K800K1000KSE +/- 12868.83, N = 5SE +/- 12029.26, N = 3SE +/- 2388.40, N = 3SE +/- 2439.85, N = 3SE +/- 410.29, N = 3SE +/- 333.80, N = 39239308990663108863031003023292997311. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSHClang 6.0GCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 8.0 SVNClang 7.0.1160K320K480K640K800KSE +/- 10043.38, N = 3SE +/- 7980.16, N = 3SE +/- 1181.72, N = 3SE +/- 514.78, N = 3SE +/- 622.41, N = 3SE +/- 1581.62, N = 37445007052322548962393892374392349101. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETClang 6.0GCC 7.3.0GCC 8.2.0Clang 8.0 SVNClang 7.0.1GCC 9.0.0200K400K600K800K1000KSE +/- 21615.34, N = 3SE +/- 14230.07, N = 3SE +/- 1882.65, N = 3SE +/- 2409.99, N = 3SE +/- 5813.65, N = 4SE +/- 4402.98, N = 311384939735003910003668703663013637001. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOPClang 6.0GCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 7.0.1Clang 8.0 SVN300K600K900K1200K1500KSE +/- 25123.07, N = 3SE +/- 19718.57, N = 3SE +/- 3139.15, N = 3SE +/- 1253.32, N = 3SE +/- 4473.18, N = 3SE +/- 4358.43, N = 3125467211052704176423773193772743729591. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 7.3.0GCC 9.0.0GCC 8.2.01.2422.4843.7264.9686.21SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 32.325.045.52-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 7.3.0GCC 9.0.0GCC 8.2.00.86631.73262.59893.46524.3315SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 32.123.353.85-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 7.3.0GCC 9.0.0GCC 8.2.03691215SE +/- 0.33, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 37.3011.4712.93-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 7.3.0GCC 9.0.0GCC 8.2.0612182430SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.31, N = 39.9525.5026.96-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 7.3.0GCC 9.0.0GCC 8.2.0612182430SE +/- 0.39, N = 3SE +/- 0.03, N = 3SE +/- 0.31, N = 312.8622.1625.01-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 7.3.0GCC 9.0.0GCC 8.2.0510152025SE +/- 1.11, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 312.6817.1319.23-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClang 6.0GCC 8.2.0GCC 7.3.0GCC 9.0.0Clang 8.0 SVNClang 7.0.12004006008001000SE +/- 1.40, N = 3SE +/- 1.42, N = 3SE +/- 1.25, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 39358798792451481481. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 8.2.0GCC 7.3.0Clang 6.0Clang 8.0 SVNClang 7.0.1GCC 9.0.02004006008001000SE +/- 4.72, N = 3SE +/- 2.27, N = 3SE +/- 1.97, N = 3SE +/- 0.04, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 39669628401981971511. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 7.3.0GCC 8.2.0Clang 6.0GCC 9.0.0Clang 8.0 SVNClang 7.0.1130260390520650SE +/- 10.12, N = 3SE +/- 1.06, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3620.00565.00546.00118.0078.1878.111. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 7.3.0GCC 8.2.0Clang 6.0GCC 9.0.0Clang 7.0.1Clang 8.0 SVN4080120160200SE +/- 1.66, N = 3SE +/- 1.17, N = 3SE +/- 1.69, N = 3SE +/- 0.15, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3183.00164.00163.0069.7952.8952.861. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 7.3.0Clang 6.0GCC 8.2.0GCC 9.0.0Clang 7.0.1Clang 8.0 SVN50100150200250SE +/- 5.39, N = 3SE +/- 0.65, N = 3SE +/- 0.68, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3217.00214.00197.0047.4534.3334.301. (CC) gcc options: -lm


Phoronix Test Suite v10.8.5