Samsung ARM Cortex A15 Dual vs. NVIDIA Tegra 3

Exynos ARMv7 rev 4 testing on Ubuntu 12.04 via the Phoronix Test Suite. Benchmarking for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1212124-SU-1212087RA52.

Fields not listed for a system are not given separately in this export.

ARM Cortex A15 1.7GHz Dual
  Processor: Exynos ARMv7 rev 4 @ 1.70GHz (2 Cores)
  Motherboard: SAMSUNG EXYNOS5 (Flattened Device Tree)
  Memory: 2048MB
  Disk: 16GB SEM16G
  OS: Ubuntu 12.04
  Kernel: 3.4.0 (armv7l)
  Desktop: Unity 2D 5.16.0
  Display Server: X Server 1.11.3
  Compiler: GCC 4.7
  File-System: ext4
  Screen Resolution: 1366x768

NVIDIA Tegra 3 - Ubuntu 12.04
  Processor: ARMv7 rev 9 @ 1.40GHz (4 Cores)
  Motherboard: cardhu
  Memory: 1024MB
  Disk: 16GB SEM16G + 32GB SD32G
  Graphics: NVIDIA TEGRA
  Network: Realtek RTL8111/8168B
  Kernel: 3.1.10-gfc993d9 (armv7l)
  Compiler: GCC 4.6
  File-System: ext3
  Screen Resolution: 1366x1536

NVIDIA Tegra 3 - Ubuntu 12.10
  OS: Ubuntu 12.10

Hiisku1
  Processor: Intel Pentium M @ 1.70GHz (1 Core)
  Motherboard: IBM 2672CB3
  Chipset: Intel 82855PM to I/O + ICH4-M
  Memory: 1536MB
  Disk: 40GB FUJITSU MHV2040A
  Graphics: AMD Mobility Radeon 7000
  Audio: Intel 82801DB/DBL/DBM
  Network: Intel 82540EP Gigabit + Intel PRO/Wireless
  OS: Ubuntu 12.04
  Kernel: 3.2.0-34-generic (i686)
  Desktop: LXDE 0.5.8
  Display Server: X Server 1.11.3
  Display Driver: radeon 6.14.99
  File-System: ext4
  Screen Resolution: 1024x768

Compiler Details
- ARM Cortex A15 1.7GHz Dual: --build=arm-linux-gnueabihf --disable-libitm --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v
- NVIDIA Tegra 3 - Ubuntu 12.04: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v
- NVIDIA Tegra 3 - Ubuntu 12.10: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v

Processor Details
- Scaling Governor: ondemand

System Details
- Python 2.7.3.

Result overview. Columns: A15 = ARM Cortex A15 1.7GHz Dual; T3-12.04 = NVIDIA Tegra 3 - Ubuntu 12.04; T3-12.10 = NVIDIA Tegra 3 - Ubuntu 12.10; Hiisku1. A "-" marks a test with no result for that system. Units and direction (more or fewer is better) are given in the per-test sections.

Test                                                A15       T3-12.04  T3-12.10  Hiisku1
stream: Copy                                        6251.24   1523.82   1387.16   875.46
stream: Scale                                       6114.15   1519.35   1339.67   871.23
stream: Triad                                       5189.58   1273.09   1268.61   1021.82
stream: Add                                         5656.60   1286.18   1285.26   1019.81
clomp: Static OMP Speedup                           1.64      0.74      0.74      -
dolfyn: Computational Fluid Dynamics                159.98    477.22    462.43    140.25
ffte: N=64, 1D Complex FFT Routine                  1484.87   544.44    -         1080.68
hmmer: Pfam Database Search                         115.11    366.29    367.38    207.04
mafft: Multiple Sequence Alignment                  37.35     59.45     51.20     81.82
cachebench: Read                                    2572.44   1933.82   1939.55   894.12
cachebench: Write                                   3723.55   2827.60   2848.20   3042.13
cachebench: Read / Modify / Write                   7115.32   2878.20   2862.32   5284.26
scimark2: Composite                                 270.01    -         -         179.03
scimark2: Monte Carlo                               160.86    -         -         115.33
scimark2: Fast Fourier Transform                    36.35     -         -         23.23
scimark2: Sparse Matrix Multiply                    339.64    -         -         193.34
scimark2: Dense LU Matrix Factorization             455.45    -         -         185.22
scimark2: Jacobi Successive Over-Relaxation         357.75    -         -         378.07
tscp: AI Chess Performance                          146265    -         -         158381
vpxenc: vpxenc                                      1.43      0.40      0.40      1.71
x264: H.264 Video Encoding                          10.14     8.32      8.18      6.94
compress-7zip: Compress Speed Test                  1685      1499      1513      1261
c-ray: Total Time                                   312.00    559.11    539.61    1261.15
compress-pbzip2: 256MB File Compression             84.57     113.99    109.38    176.41
primesieve: 1e12 Prime Number Generation            832.95    -         -         1583.16
smallpt: Global Illumination Renderer; 100 Samples  1177      4276      4264      2033
crafty: Elapsed Time                                0.06      -         -         0.06
encode-mp3: WAV To MP3                              142.27    91.72     91.47     55.18
encode-ogg: WAV To Ogg                              31.53     -         -         -
ffmpeg: H.264 HD To NTSC DV                         159.16    422.99    424.07    122.30
povray: Total Time                                  1937      4731      4704      2304
openssl: RSA 4096-bit Performance                   9.90      5.90      5.90      7.20
pybench: Total For Average Test Times               8421      13964     13346     7327
nginx: Static Web Page Serving                      3273.57   2146.61   2131.78   4434.13
phpbench: PHP Benchmark Suite                       22050     13224     13486     6196
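Because some tests report throughput (more is better) and others report elapsed time (fewer is better), raw values cannot be compared across tests directly. The following is a minimal Python sketch, not part of the Phoronix Test Suite, that turns a pair of results into a "times faster" ratio. The helper name `speedup` and the choice of four tests are illustrative assumptions; the numbers are the ARM Cortex A15 and NVIDIA Tegra 3 - Ubuntu 12.04 results from this page.

```python
def speedup(a, b, higher_is_better):
    """Return how many times faster system A is than system B.

    For "more is better" metrics the ratio is a / b; for
    "fewer is better" metrics (e.g. seconds) it is b / a.
    A value below 1.0 means system A is slower.
    """
    return a / b if higher_is_better else b / a


# (test, A15 result, Tegra 3 12.04 result, higher_is_better)
results = [
    ("stream: Copy (MB/s)", 6251.24, 1523.82, True),
    ("x264 (frames per second)", 10.14, 8.32, True),
    ("c-ray (seconds)", 312.00, 559.11, False),
    ("encode-mp3 (seconds)", 142.27, 91.72, False),
]

for name, a15, tegra3, higher_is_better in results:
    ratio = speedup(a15, tegra3, higher_is_better)
    print(f"{name}: A15 is {ratio:.2f}x the speed of Tegra 3")
```

A ratio above 1.0 means the first system is faster regardless of the unit, which makes it possible to line up bandwidth, frame-rate and timing tests side by side.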

Stream

Type: Copy

Stream 2009-04-11 (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 6251.24 (SE +/- 52.68, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.04: 1523.82 (SE +/- 3.98, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.10: 1387.16 (SE +/- 35.01, N = 20)
Hiisku1: 875.46 (SE +/- 1.34, N = 10)
(CC) gcc options: -O2 -fopenmp
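Each result on this page is reported with "SE +/- x, N = y", where N is the number of runs. As a rough illustration of what such a figure means, the sketch below computes a standard error of the mean as the sample standard deviation divided by the square root of N. This is an assumption about the usual definition; the export does not state which exact formula the Phoronix Test Suite applies, and the run values below are hypothetical, not taken from this result file.

```python
import math
import statistics


def standard_error(runs):
    """Standard error of the mean: sample std dev / sqrt(N)."""
    return statistics.stdev(runs) / math.sqrt(len(runs))


# Hypothetical per-run values (not from this result file).
runs = [100.0, 102.0, 98.0, 101.0, 99.0]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs agreed closely; a large one, as in some of the Hiisku1 results, suggests noisy runs.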

Stream

Type: Scale

Stream 2009-04-11 (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 6114.15 (SE +/- 45.43, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.04: 1519.35 (SE +/- 4.74, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.10: 1339.67 (SE +/- 44.70, N = 20)
Hiisku1: 871.23 (SE +/- 1.10, N = 10)
(CC) gcc options: -O2 -fopenmp

Stream

Type: Triad

Stream 2009-04-11 (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 5189.58 (SE +/- 32.03, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.04: 1273.09 (SE +/- 0.51, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.10: 1268.61 (SE +/- 4.48, N = 10)
Hiisku1: 1021.82 (SE +/- 1.41, N = 10)
(CC) gcc options: -O2 -fopenmp

Stream

Type: Add

Stream 2009-04-11 (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 5656.60 (SE +/- 41.07, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.04: 1286.18 (SE +/- 0.68, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.10: 1285.26 (SE +/- 2.65, N = 10)
Hiisku1: 1019.81 (SE +/- 0.75, N = 10)
(CC) gcc options: -O2 -fopenmp

CLOMP

Static OMP Speedup

CLOMP 3.3 (Speedup, more is better)
ARM Cortex A15 1.7GHz Dual: 1.64 (SE +/- 0.13, N = 10)
NVIDIA Tegra 3 - Ubuntu 12.04: 0.74 (SE +/- 0.00, N = 5)
NVIDIA Tegra 3 - Ubuntu 12.10: 0.74 (SE +/- 0.00, N = 5)
(CC) gcc options: --openmp -O3 -lm

Dolfyn

Computational Fluid Dynamics

Dolfyn 0.527 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 159.98 (SE +/- 0.31, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 477.22 (SE +/- 4.90, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 462.43 (SE +/- 4.13, N = 3)
Hiisku1: 140.25 (SE +/- 2.73, N = 3)

FFTE

Test: N=64, 1D Complex FFT Routine

FFTE 5.0 (MFLOPS, more is better)
ARM Cortex A15 1.7GHz Dual: 1484.87 (SE +/- 0.35, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 544.44 (SE +/- 0.02, N = 3)
Hiisku1: 1080.68 (SE +/- 76.60, N = 6)
(F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Additional options listed for two of the three results: -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 115.11 (SE +/- 0.36, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 366.29 (SE +/- 4.08, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 367.38 (SE +/- 5.46, N = 3)
Hiisku1: 207.04 (SE +/- 0.32, N = 3)
(CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 37.35 (SE +/- 0.17, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 59.45 (SE +/- 2.43, N = 6)
NVIDIA Tegra 3 - Ubuntu 12.10: 51.20 (SE +/- 0.79, N = 3)
Hiisku1: 81.82 (SE +/- 0.32, N = 3)
(CC) gcc options: -O3 -lm -lpthread

CacheBench

Test: Read

CacheBench (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 2572.44 (SE +/- 8.68, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 1933.82 (SE +/- 22.43, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 1939.55 (SE +/- 12.37, N = 3)
Hiisku1: 894.12 (SE +/- 69.27, N = 6)
(CC) gcc options: -lrt

CacheBench

Test: Write

CacheBench (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 3723.55 (SE +/- 15.25, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 2827.60 (SE +/- 27.74, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 2848.20 (SE +/- 25.25, N = 3)
Hiisku1: 3042.13 (SE +/- 163.37, N = 6)
(CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

CacheBench (MB/s, more is better)
ARM Cortex A15 1.7GHz Dual: 7115.32 (SE +/- 6.72, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 2878.20 (SE +/- 49.68, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 2862.32 (SE +/- 41.68, N = 3)
Hiisku1: 5284.26 (SE +/- 756.31, N = 6)
(CC) gcc options: -lrt

SciMark

Computational Test: Composite

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 270.01 (SE +/- 0.18, N = 4)
Hiisku1: 179.03 (SE +/- 0.77, N = 4)

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 160.86 (SE +/- 0.12, N = 4)
Hiisku1: 115.33 (SE +/- 0.12, N = 4)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 36.35 (SE +/- 0.67, N = 4)
Hiisku1: 23.23 (SE +/- 0.10, N = 3)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 339.64 (SE +/- 0.73, N = 4)
Hiisku1: 193.34 (SE +/- 2.90, N = 4)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 455.45 (SE +/- 0.39, N = 4)
Hiisku1: 185.22 (SE +/- 5.23, N = 4)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 (Mflops, more is better)
ARM Cortex A15 1.7GHz Dual: 357.75 (SE +/- 0.42, N = 4)
Hiisku1: 378.07 (SE +/- 1.92, N = 4)

TSCP

AI Chess Performance

TSCP 1.81 (Nodes Per Second, more is better)
ARM Cortex A15 1.7GHz Dual: 146265 (SE +/- 95.53, N = 5)
Hiisku1: 158381 (SE +/- 4362.35, N = 10)

VP8 libvpx Encoding

vpxenc

VP8 libvpx Encoding 1.1.0 (Frames Per Second, more is better)
ARM Cortex A15 1.7GHz Dual: 1.43 (SE +/- 0.01, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 0.40 (SE +/- 0.00, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 0.40 (SE +/- 0.00, N = 3)
Hiisku1: 1.71 (SE +/- 0.03, N = 3)
(CC) gcc options: -lvpx -lm -lpthread

x264

H.264 Video Encoding

x264 2012-10-03 (Frames Per Second, more is better)
ARM Cortex A15 1.7GHz Dual: 10.14 (SE +/- 0.06, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 8.32 (SE +/- 0.15, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 8.18 (SE +/- 0.18, N = 6)
Hiisku1: 6.94 (SE +/- 0.26, N = 6)

7-Zip Compression

Compress Speed Test

7-Zip Compression 9.20.1 (MIPS, more is better)
ARM Cortex A15 1.7GHz Dual: 1685 (SE +/- 6.01, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 1499 (SE +/- 11.46, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 1513 (SE +/- 12.66, N = 3)
Hiisku1: 1261 (SE +/- 3.51, N = 3)
(CXX) g++ options: -pipe -lpthread

C-Ray

Total Time

C-Ray 1.1 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 312.00 (SE +/- 0.68, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 559.11 (SE +/- 0.14, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 539.61 (SE +/- 5.19, N = 3)
Hiisku1: 1261.15 (SE +/- 65.13, N = 3)
(CC) gcc options: -lm -lpthread -O3

Parallel BZIP2 Compression

256MB File Compression

Parallel BZIP2 Compression 1.1.6 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 84.57 (SE +/- 0.80, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 113.99 (SE +/- 1.25, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 109.38 (SE +/- 0.71, N = 3)
Hiisku1: 176.41 (SE +/- 9.56, N = 6)
(CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Primesieve

1e12 Prime Number Generation

Primesieve 4.0 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 832.95 (SE +/- 13.74, N = 6)
Hiisku1: 1583.16 (SE +/- 3.37, N = 3)
(CXX) g++ options: -O2 -fopenmp

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 1177 (SE +/- 4.04, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 4276 (SE +/- 16.71, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 4264 (SE +/- 7.00, N = 3)
Hiisku1: 2033 (SE +/- 5.84, N = 3)
(CXX) g++ options: -fopenmp
Additional options listed for one result: -mtune=cortex-a15 -marm -mfpu=neon

Crafty

Elapsed Time

Crafty 23.4 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 0.06 (SE +/- 0.00, N = 3)
Hiisku1: 0.06 (SE +/- 0.01, N = 6)
(CC) gcc options: -lstdc++ -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 142.27 (SE +/- 0.08, N = 5)
NVIDIA Tegra 3 - Ubuntu 12.04: 91.72 (SE +/- 0.13, N = 5)
NVIDIA Tegra 3 - Ubuntu 12.10: 91.47 (SE +/- 0.21, N = 5)
Hiisku1: 55.18 (SE +/- 0.82, N = 5)

Ogg Encoding

WAV To Ogg

Ogg Encoding 1.3.0 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 31.53 (SE +/- 0.11, N = 5)
(CC) gcc options: -O2 -ffast-math -fsigned-char -mtune=cortex-a15 -marm -mfpu=neon -lm -logg

FFmpeg

H.264 HD To NTSC DV

FFmpeg 1.0 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 159.16 (SE +/- 0.95, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 422.99 (SE +/- 2.26, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 424.07 (SE +/- 2.65, N = 3)
Hiisku1: 122.30 (SE +/- 2.09, N = 3)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lm -pthread -lrt
Additional option listed for one result: -lbz2

POV-Ray

Total Time

POV-Ray 3.6.1 (Seconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 1937
NVIDIA Tegra 3 - Ubuntu 12.04: 4731
NVIDIA Tegra 3 - Ubuntu 12.10: 4704
Hiisku1: 2304
(CXX) g++ options: -pipe -O3 -lz -lm
Additional per-result options: -msse -mfpmath=sse -msse2 -march=pentium-m -mtune=pentium-m -malign-double -lSM -lICE -lX11

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.1c (Signs Per Second, more is better)
ARM Cortex A15 1.7GHz Dual: 9.90 (SE +/- 0.00, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 5.90 (SE +/- 0.00, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 5.90 (SE +/- 0.00, N = 3)
Hiisku1: 7.20 (SE +/- 0.06, N = 3)
(CC) gcc options: -march=armv7-a -O3 -lssl -lcrypto -ldl

PyBench

Total For Average Test Times

PyBench 2008-08-14 (Milliseconds, fewer is better)
ARM Cortex A15 1.7GHz Dual: 8421 (SE +/- 11.05, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 13964 (SE +/- 4.06, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 13346 (SE +/- 6.24, N = 3)
Hiisku1: 7327 (SE +/- 15.94, N = 3)

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.0.11 (Requests Per Second, more is better)
ARM Cortex A15 1.7GHz Dual: 3273.57 (SE +/- 109.38, N = 6)
NVIDIA Tegra 3 - Ubuntu 12.04: 2146.61 (SE +/- 13.99, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 2131.78 (SE +/- 39.92, N = 3)
Hiisku1: 4434.13 (SE +/- 51.35, N = 3)
(CC) gcc options: -lpthread -lcrypt -lz
Additional option listed for one result: -lcrypto

PHPBench

PHP Benchmark Suite

PHPBench 0.8.1 (Score, more is better)
ARM Cortex A15 1.7GHz Dual: 22050 (SE +/- 158.19, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.04: 13224 (SE +/- 36.10, N = 3)
NVIDIA Tegra 3 - Ubuntu 12.10: 13486 (SE +/- 19.92, N = 3)
Hiisku1: 6196 (SE +/- 96.11, N = 4)


Phoronix Test Suite v10.8.4