Xavier Carmel CPU Core Compiler Tests

NVIDIA Jetson Xavier ARMv8 compiler benchmarks on GCC and LLVM Clang for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1901055-SP-XAVIERCAR07.

Xavier Carmel CPU Core Compiler TestsProcessorMotherboardMemoryDiskGraphicsOSKernelDesktopDisplay ServerDisplay DriverVulkanCompilerFile-SystemScreen ResolutionGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVNARMv8 rev 0 @ 2.27GHz (8 Cores)jetson-xavier16384MB31GB HBG4a2NVIDIA TEGRAUbuntu 18.044.9.108-tegra (aarch64)Unity 7.5.0X Server 1.19.6NVIDIA 1.0.01.1.76GCC 7.3.0 + CUDA 10.0ext41920x2160GCC 8.2.0 + clang (GCC) 8.2.0 + CUDA 10.0GCC 9.0.0 20181230 + clang (GCC) 9.0.0 20181230 (experimental) + CUDA 10.0Clang 6.0.0-1ubuntu2 + CUDA 10.0Clang 7.0.1 + LLVM 7.0.1 + CUDA 10.0Clang 8.0.0 (SVN 350356) + LLVM 8.0.0svn + CUDA 10.0OpenBenchmarking.orgCompiler Details- GCC 7.3.0: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -v - GCC 8.2.0: --enable-checking=release- GCC 9.0.0: --enable-checking=release- Clang 7.0.1: Optimized build; Default target: aarch64-unknown-linux-gnu; Host CPU: (unknown)- Clang 8.0 SVN: Optimized build; Default target: aarch64-unknown-linux-gnu; Host CPU: (unknown)Processor Details- Scaling Governor: tegra_cpufreq schedutil

Xavier Carmel CPU Core Compiler Testsscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationttsiod-renderer: Phong Rendering With Soft-Shadow Mappinghimeno: Poisson Pressure Solvercompress-7zip: Compress Speed Testasmfish: 1024 Hash Memory, 26 Depthbuild-linux-kernel: Time To Compilebuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelprimesieve: 1e12 Prime Number Generationbullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshencode-flac: WAV To FLACencode-mp3: WAV To MP3redis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETapache: Static Web Page ServingGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN5722171836209628791462691813982950448312762581065.5312.6812.869.957.302.122.3257.2731.83110527089906670523297350075655010782554197164565966879127248202397195166131240946913413.9219.2325.0126.9612.933.855.52152.90160.36417642310886254896391000274012621812647.4569.791181512453688651499190096345113.0217.1322.1625.5011.473.355.04140.12139.863773193031002393893637002743026009540214163546840935137265830539831940810654.6118.86125467292393074450011384938163771074110234.3352.8978.111971482815083834337796144166.5249.86377274302329234910366301268782604010234.3052.8678.181981482705046980355796137166.9053.263729592997312374393668702626506061OpenBenchmarking.org

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN120240360480600SE +/- 3.05, N = 3SE +/- 0.74, N = 3SE +/- 0.04, N = 3SE +/- 1.05, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 35725541265401021021. (CC) gcc options: -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN50100150200250SE +/- 5.39, N = 3SE +/- 0.68, N = 3SE +/- 0.01, N = 3SE +/- 0.65, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3217.00197.0047.45214.0034.3334.301. (CC) gcc options: -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN4080120160200SE +/- 1.66, N = 3SE +/- 1.17, N = 3SE +/- 0.15, N = 3SE +/- 1.69, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3183.00164.0069.79163.0052.8952.861. (CC) gcc options: -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN130260390520650SE +/- 10.12, N = 3SE +/- 1.06, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3620.00565.00118.00546.0078.1178.181. (CC) gcc options: -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN2004006008001000SE +/- 2.27, N = 3SE +/- 4.72, N = 3SE +/- 0.04, N = 3SE +/- 1.97, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 39629661518401971981. (CC) gcc options: -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN2004006008001000SE +/- 1.25, N = 3SE +/- 1.42, N = 3SE +/- 0.04, N = 3SE +/- 1.40, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 38798792459351481481. (CC) gcc options: -lm

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingGCC 7.3.0GCC 8.2.0Clang 6.0306090120150SE +/- 2.80, N = 3SE +/- 2.94, N = 12SE +/- 1.58, N = 3146127137-mtune=native-mtune=native-ltbb1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -flto -lSDL -fopenmp -fwhole-program -lstdc++

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN80160240320400SE +/- 4.23, N = 3SE +/- 3.17, N = 7SE +/- 0.28, N = 3SE +/- 0.99, N = 3SE +/- 1.87, N = 3SE +/- 0.20, N = 32692483682652812701. (CC) gcc options: -O3

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 7.3.0GCC 8.2.0GCC 9.0.04K8K12K16K20KSE +/- 479.86, N = 12SE +/- 203.21, N = 12SE +/- 10.27, N = 3181392023986511. (CXX) g++ options: -pipe -lpthread

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN2M4M6M8M10MSE +/- 52344.77, N = 3SE +/- 146948.80, N = 9SE +/- 36548.95, N = 3SE +/- 140386.45, N = 3SE +/- 36318.07, N = 3SE +/- 37685.79, N = 3829504471951664991900830539850838345046980

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileGCC 7.3.0GCC 8.2.0GCC 9.0.030060090012001500SE +/- 4.78, N = 3SE +/- 10.52, N = 3SE +/- 4.39, N = 38311312963

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 7.3.0GCC 8.2.0Clang 6.0Clang 7.0.1Clang 8.0 SVN90180270360450SE +/- 0.49, N = 3SE +/- 0.74, N = 3SE +/- 4.66, N = 3SE +/- 0.35, N = 3SE +/- 0.95, N = 3276409319337355

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN2004006008001000SE +/- 15.86, N = 9SE +/- 0.29, N = 3SE +/- 0.12, N = 3SE +/- 9.68, N = 9SE +/- 0.24, N = 3SE +/- 0.21, N = 32584694514087967961. (CC) gcc options: -lm -lpthread -O3

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.21e12 Prime Number GenerationGCC 7.3.0GCC 8.2.0Clang 6.0Clang 7.0.1Clang 8.0 SVN306090120150SE +/- 2.56, N = 12SE +/- 0.05, N = 3SE +/- 1.17, N = 12SE +/- 0.05, N = 3SE +/- 0.09, N = 31061341061441371. (CXX) g++ options: -O3 -lpthread

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 7.3.0GCC 8.2.0GCC 9.0.048121620SE +/- 0.07, N = 8SE +/- 0.07, N = 3SE +/- 0.00, N = 35.5313.9213.02-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 7.3.0GCC 8.2.0GCC 9.0.0510152025SE +/- 1.11, N = 3SE +/- 0.17, N = 3SE +/- 0.05, N = 312.6819.2317.13-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 7.3.0GCC 8.2.0GCC 9.0.0612182430SE +/- 0.39, N = 3SE +/- 0.31, N = 3SE +/- 0.03, N = 312.8625.0122.16-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 7.3.0GCC 8.2.0GCC 9.0.0612182430SE +/- 0.13, N = 3SE +/- 0.31, N = 3SE +/- 0.01, N = 39.9526.9625.50-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 7.3.0GCC 8.2.0GCC 9.0.03691215SE +/- 0.33, N = 3SE +/- 0.19, N = 3SE +/- 0.02, N = 37.3012.9311.47-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 7.3.0GCC 8.2.0GCC 9.0.00.86631.73262.59893.46524.3315SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 32.123.853.35-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 7.3.0GCC 8.2.0GCC 9.0.01.2422.4843.7264.9686.21SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 32.325.525.04-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -rdynamic

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN4080120160200SE +/- 0.26, N = 5SE +/- 0.12, N = 5SE +/- 0.30, N = 5SE +/- 0.39, N = 5SE +/- 0.28, N = 5SE +/- 0.29, N = 557.27152.90140.1254.61166.52166.90-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O2 -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN4080120160200SE +/- 0.22, N = 3SE +/- 0.81, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 331.83160.36139.8618.8649.8653.26-O3 -pipe-O3 -pipe-O3 -pipe1. (CC) gcc options: -lm

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOPGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN300K600K900K1200K1500KSE +/- 19718.57, N = 3SE +/- 3139.15, N = 3SE +/- 1253.32, N = 3SE +/- 25123.07, N = 3SE +/- 4473.18, N = 3SE +/- 4358.43, N = 3110527041764237731912546723772743729591. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADDGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN200K400K600K800K1000KSE +/- 12029.26, N = 3SE +/- 2388.40, N = 3SE +/- 2439.85, N = 3SE +/- 12868.83, N = 5SE +/- 410.29, N = 3SE +/- 333.80, N = 38990663108863031009239303023292997311. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSHGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN160K320K480K640K800KSE +/- 7980.16, N = 3SE +/- 1181.72, N = 3SE +/- 514.78, N = 3SE +/- 10043.38, N = 3SE +/- 1581.62, N = 3SE +/- 622.41, N = 37052322548962393897445002349102374391. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN200K400K600K800K1000KSE +/- 14230.07, N = 3SE +/- 1882.65, N = 3SE +/- 4402.98, N = 3SE +/- 21615.34, N = 3SE +/- 5813.65, N = 4SE +/- 2409.99, N = 397350039100036370011384933663013668701. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN200K400K600K800K1000KSE +/- 13045.74, N = 4SE +/- 3860.69, N = 6SE +/- 749.65, N = 3SE +/- 14106.11, N = 3SE +/- 1314.35, N = 3SE +/- 1710.87, N = 37565502740122743028163772687822626501. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 7.3.0GCC 8.2.0GCC 9.0.0Clang 6.0Clang 7.0.1Clang 8.0 SVN2K4K6K8K10KSE +/- 46.91, N = 3SE +/- 18.16, N = 3SE +/- 26.79, N = 3SE +/- 79.09, N = 3SE +/- 12.48, N = 3SE +/- 13.10, N = 3107826218600910741604060611. (CC) gcc options: -shared -fPIC -O2 -pthread


Phoronix Test Suite v10.8.4