NVIDIA Tegra 3 vs. Jetson TX1

Simple benchmarks of NVIDIA Linux For Tegra (L4T), comparing the latest L4T R16 (hardfp) release and its Ubuntu 12.04 sample filesystem against L4T R15 (softfp) and its Ubuntu 11.04 filesystem. Benchmarks for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1511151-HA-TEGRA352560

Run dates:
- Cardhu Tegra 3: November 29 2012
- Jetson TX1: November 12 2015

NVIDIA Tegra 3 vs. Jetson TX1

Cardhu Tegra 3
- Processor: ARMv7 rev 9 @ 1.40GHz (4 Cores)
- Motherboard: cardhu
- Memory: 1024MB
- Disk: 16GB SEM16G + 32GB SD32G
- Graphics: NVIDIA TEGRA
- Network: Realtek RTL8111/8168B
- OS: Ubuntu 12.04
- Kernel: 3.1.10-gfc993d9 (armv7l)
- Compiler: GCC 4.6
- File-System: ext3
- Screen Resolution: 1366x1536

Jetson TX1
- Processor: Cortex A57 rev 1 @ 1.91GHz (4 Cores)
- Motherboard: jetson_tx1
- Memory: 4096MB
- Disk: 16GB 016G32 + 16GB SL16G
- OS: Ubuntu 14.04
- Kernel: 3.10.67-g3a5c467 (aarch64)
- Desktop: Unity 7.2.2
- Display Server: X Server 1.15.1
- Display Driver: NVIDIA 1.0.0
- Compiler: GCC 4.8.4
- File-System: ext4
- Screen Resolution: 1920x1080

Compiler Details
- Cardhu Tegra 3: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v
- Jetson TX1: --build=arm-linux-gnueabihf --disable-browser-plugin --disable-libitm --disable-libmudflap --disable-libquadmath --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch-directory=arm --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v

Processor Details
- Cardhu Tegra 3: Scaling Governor: ondemand
- Jetson TX1: Scaling Governor: tegra interactive

Cardhu Tegra 3 vs. Jetson TX1 Comparison (Phoronix Test Suite; Jetson TX1 gain over the Cardhu Tegra 3 baseline)
- Smallpt (G.I.R.1.S): 592.2%
- C-Ray (Total Time): 474.7%
- CLOMP (Static OMP Speedup): 376.7%
- CacheBench (R.M.W): 282.2%
- 7-Zip Compression (C.S.T): 182.8%

NVIDIA Tegra 3 vs. Jetson TX1

Test                                                  Unit       Cardhu Tegra 3   Jetson TX1
c-ray: Total Time                                     Seconds    549.40           95.59
smallpt: Global Illumination Renderer; 100 Samples    Seconds    4257             615
cachebench: Read / Modify / Write                     MB/s       2914.99          11141.45
clomp: Static OMP Speedup                             Speedup    0.73             3.48
compress-7zip: Compress Speed Test                    MIPS       1525             4313
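
The percentages in the comparison chart can be reproduced from the raw results above. The following is a minimal sketch, not part of the Phoronix Test Suite: the helper name improvement_percent and the higher_is_better flag are assumptions introduced here for illustration. For "fewer is better" tests (seconds) the ratio is inverted so that a larger number always means the Jetson TX1 is ahead.

```python
# Results copied from the table above:
# name -> (Cardhu Tegra 3, Jetson TX1, higher_is_better)
RESULTS = {
    "smallpt (Seconds)": (4257, 615, False),
    "c-ray (Seconds)": (549.40, 95.59, False),
    "clomp (Speedup)": (0.73, 3.48, True),
    "cachebench (MB/s)": (2914.99, 11141.45, True),
    "compress-7zip (MIPS)": (1525, 4313, True),
}


def improvement_percent(baseline, candidate, higher_is_better):
    """Percent by which candidate beats baseline (hypothetical helper)."""
    if higher_is_better:
        ratio = candidate / baseline
    else:
        # Fewer is better: a smaller candidate time is an improvement.
        ratio = baseline / candidate
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    for name, (tegra3, tx1, higher) in RESULTS.items():
        print(f"{name}: {improvement_percent(tegra3, tx1, higher):.1f}%")
```

Rounded to one decimal place, this yields 592.2%, 474.7%, 376.7%, 282.2% and 182.8%, matching the chart.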

C-Ray

This is a test of C-Ray, a simple raytracer designed to test floating-point CPU performance. This test is multi-threaded (16 threads per core), shoots 8 rays per pixel for anti-aliasing, and generates a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
- Cardhu Tegra 3: 549.40 (SE +/- 8.47, N = 5; Min: 515.82 / Avg: 549.4 / Max: 560.87)
- Jetson TX1: 95.59 (SE +/- 5.00, N = 6; Min: 83.88 / Avg: 95.59 / Max: 107.02)
1. (CC) gcc options: -lm -lpthread -O3
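
To put the C-Ray workload description into numbers, here is a small sketch of the arithmetic. It assumes 4 cores (as reported for both systems) and counts only the primary anti-aliasing rays; secondary rays such as reflections or shadows are not included, so the throughput figure is a rough lower bound rather than something C-Ray itself reports. The function name rays_per_second is hypothetical.

```python
CORES = 4                   # both systems report 4 cores
THREADS_PER_CORE = 16       # from the test description
WIDTH, HEIGHT = 1600, 1200  # output image size
RAYS_PER_PIXEL = 8          # anti-aliasing samples per pixel

threads = CORES * THREADS_PER_CORE               # 64 threads
primary_rays = WIDTH * HEIGHT * RAYS_PER_PIXEL   # 15,360,000 primary rays


def rays_per_second(total_seconds):
    """Primary rays traced per second for a given total time (assumption: primary rays only)."""
    return primary_rays / total_seconds


if __name__ == "__main__":
    print(f"threads: {threads}, primary rays: {primary_rays}")
    for system, seconds in (("Cardhu Tegra 3", 549.40), ("Jetson TX1", 95.59)):
        print(f"{system}: {rays_per_second(seconds):,.0f} primary rays/s")
```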

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
- Cardhu Tegra 3: 4257 (SE +/- 3.71, N = 3; Min: 4252 / Avg: 4256.67 / Max: 4264)
- Jetson TX1: 615 (SE +/- 1.00, N = 3; Min: 614 / Avg: 615 / Max: 617)
1. (CXX) g++ options: -fopenmp

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test memory and cache bandwidth performance. Learn more via the OpenBenchmarking.org test page.

CacheBench, Test: Read / Modify / Write (MB/s, More Is Better)
- Cardhu Tegra 3: 2914.99 (SE +/- 26.07, N = 3; Min: 2865.83 / Avg: 2914.99 / Max: 2954.63)
- Jetson TX1: 11141.45 (SE +/- 14.81, N = 3; Min: 11113.66 / Avg: 11141.45 / Max: 11164.19)
1. (CC) gcc options: -lrt

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 3.3, Static OMP Speedup (Speedup, More Is Better)
- Cardhu Tegra 3: 0.73 (SE +/- 0.01, N = 5; Min: 0.71 / Avg: 0.73 / Max: 0.74)
- Jetson TX1: 3.48 (SE +/- 0.01, N = 5; Min: 3.46 / Avg: 3.48 / Max: 3.5)
1. (CC) gcc options: --openmp -O3 -lm
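
One way to read the CLOMP speedup figures is as parallel efficiency, i.e. the speedup divided by the number of cores. This is a derived metric, not something reported in this result file, and the sketch below assumes the speedup is measured against a serial run and that all 4 cores listed for each system were used. The function name parallel_efficiency is hypothetical.

```python
CORES = 4  # both systems report 4 cores

# Static OMP Speedup results from this comparison
CLOMP_SPEEDUP = {"Cardhu Tegra 3": 0.73, "Jetson TX1": 3.48}


def parallel_efficiency(speedup, cores):
    """Fraction of ideal linear scaling achieved (1.0 would be perfect)."""
    return speedup / cores


if __name__ == "__main__":
    for system, speedup in CLOMP_SPEEDUP.items():
        eff = parallel_efficiency(speedup, CORES)
        print(f"{system}: {eff:.0%} of ideal {CORES}x scaling")
```

Under these assumptions the Jetson TX1 reaches roughly 87% of ideal four-core scaling, while the Cardhu Tegra 3 result of 0.73 is below 1.0, meaning the threaded run was slower than the serial reference.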

7-Zip Compression

7-Zip Compression 9.20.1, Compress Speed Test (MIPS, More Is Better)
- Cardhu Tegra 3: 1525 (SE +/- 26.12, N = 3; Min: 1489 / Avg: 1525.33 / Max: 1576)
- Jetson TX1: 4313 (SE +/- 18.26, N = 3; Min: 4287 / Avg: 4312.67 / Max: 4348)
1. (CXX) g++ options: -pipe -lpthread