Samsung ARM Cortex A15 Dual vs. NVIDIA Tegra 3

Exynos ARMv7 rev 4 testing on Ubuntu 12.04 via the Phoronix Test Suite. Benchmarking for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1212235-SU-1212087RA42.

Samsung ARM Cortex A15 Dual vs. NVIDIA Tegra 3ProcessorMotherboardMemoryDiskGraphicsNetworkChipsetAudioOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionDisplay DriverOpenGLARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60Exynos ARMv7 rev 4 @ 1.70GHz (2 Cores)SAMSUNG EXYNOS5 (Flattened Device Tree)2048MB16GB SEM16GUbuntu 12.043.4.0 (armv7l)Unity 2D 5.16.0X Server 1.11.3GCC 4.7ext41366x768ARMv7 rev 9 @ 1.40GHz (4 Cores)cardhu1024MB16GB SEM16G + 32GB SD32GNVIDIA TEGRARealtek RTL8111/8168B3.1.10-gfc993d9 (armv7l)GCC 4.6ext31366x1536Ubuntu 12.10Intel T2500 @ 2.00GHz (2 Cores)LENOVO 20075TGIntel Mobile 945GM/PM/GMS + ICH7-M80GB FUJITSU MHV2080BAMD ATI Radeon Mobility X1400Analog Devices AD1981Intel 82573L Gigabit + Intel PRO/Wireless3.5.0-19-generic (i686)Unity 6.10.0X Server 1.13.0radeon 6.99.992.1 Mesa 9.0 Gallium 0.4GCC 4.7ext41400x1050OpenBenchmarking.orgCompiler Details- ARM Cortex A15 1.7GHz Dual: --build=arm-linux-gnueabihf --disable-libitm --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v - NVIDIA Tegra 3 - Ubuntu 12.04: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v - NVIDIA Tegra 3 - Ubuntu 12.10: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v - IBM_T60: --build=i686-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-targets=all --enable-threads=posix --host=i686-linux-gnu --target=i686-linux-gnu --with-arch-32=i686 --with-tune=generic -v Processor Details- Scaling Governor: ondemandSystem Details- ARM Cortex A15 1.7GHz Dual: Python 2.7.3.- NVIDIA Tegra 3 - Ubuntu 12.04: Python 2.7.3.- NVIDIA Tegra 3 - Ubuntu 12.10: Python 2.7.3.- IBM_T60: Python 2.7.3. Compiz and Firefox were running on this system.

Samsung ARM Cortex A15 Dual vs. NVIDIA Tegra 3stream: Copystream: Scalestream: Triadstream: Addclomp: Static OMP Speedupdolfyn: Computational Fluid Dynamicsffte: N=64, 1D Complex FFT Routinehmmer: Pfam Database Searchmafft: Multiple Sequence Alignmentcachebench: Readcachebench: Writecachebench: Read / Modify / Writescimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancevpxenc: vpxencx264: H.264 Video Encodingcompress-7zip: Compress Speed Testc-ray: Total Timecompress-pbzip2: 256MB File Compressionprimesieve: 1e12 Prime Number Generationsmallpt: Global Illumination Renderer; 100 Samplescrafty: Elapsed Timeencode-mp3: WAV To MP3encode-ogg: WAV To Oggffmpeg: H.264 HD To NTSC DVpovray: Total Timeopenssl: RSA 4096-bit Performancepybench: Total For Average Test Timesnginx: Static Web Page Servingphpbench: PHP Benchmark SuiteARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T606251.246114.155189.585656.601.64159.981484.87115.1137.352572.443723.557115.32270.01160.8636.35339.64455.45357.751462651.4310.141685312.0084.57832.9511770.06142.2731.53159.1619379.9084213273.57220501523.821519.351273.091286.180.74477.22544.44366.2959.451933.822827.602878.200.408.321499559.11113.99427691.72422.9947315.90139642146.61132241387.161339.671268.611285.260.74462.43367.3851.201939.552848.202862.320.408.181513539.61109.38426491.47424.0747045.90133462131.78134862147.522146.532531.222525.82114.971622.7382.1938.011883.244993.298933.53310.24116.8449.69401.57372.96611.344.6414.612472308.2469.71649.547700.0241.2452.6915828.6064029986.6434447OpenBenchmarking.org

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2009-04-11Type: CopyARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6013002600390052006500SE +/- 52.68, N = 10SE +/- 3.98, N = 10SE +/- 35.01, N = 20SE +/- 1.80, N = 106251.241523.821387.162147.521. (CC) gcc options: -O2 -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2009-04-11Type: ScaleARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6013002600390052006500SE +/- 45.43, N = 10SE +/- 4.74, N = 10SE +/- 44.70, N = 20SE +/- 8.48, N = 106114.151519.351339.672146.531. (CC) gcc options: -O2 -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2009-04-11Type: TriadARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6011002200330044005500SE +/- 32.03, N = 10SE +/- 0.51, N = 10SE +/- 4.48, N = 10SE +/- 1.83, N = 105189.581273.091268.612531.221. (CC) gcc options: -O2 -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2009-04-11Type: AddARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6012002400360048006000SE +/- 41.07, N = 10SE +/- 0.68, N = 10SE +/- 2.65, N = 10SE +/- 2.77, N = 105656.601286.181285.262525.821. (CC) gcc options: -O2 -fopenmp

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 3.3Static OMP SpeedupARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.100.3690.7381.1071.4761.845SE +/- 0.13, N = 10SE +/- 0.00, N = 5SE +/- 0.00, N = 51.640.740.741. (CC) gcc options: --openmp -O3 -lm

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60100200300400500SE +/- 0.31, N = 3SE +/- 4.90, N = 3SE +/- 4.13, N = 3SE +/- 0.33, N = 3159.98477.22462.43114.97

FFTE

Test: N=64, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 5.0Test: N=64, 1D Complex FFT RoutineARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04IBM_T6030060090012001500SE +/- 0.35, N = 3SE +/- 0.02, N = 3SE +/- 15.64, N = 31484.87544.441622.73-lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread-lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6080160240320400SE +/- 0.36, N = 3SE +/- 4.08, N = 3SE +/- 5.46, N = 3SE +/- 0.61, N = 3115.11366.29367.3882.191. (CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence AlignmentARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T601326395265SE +/- 0.17, N = 3SE +/- 2.43, N = 6SE +/- 0.79, N = 3SE +/- 0.20, N = 337.3559.4551.2038.011. (CC) gcc options: -O3 -lm -lpthread

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T606001200180024003000SE +/- 8.68, N = 3SE +/- 22.43, N = 3SE +/- 12.37, N = 3SE +/- 12.12, N = 32572.441933.821939.551883.241. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WriteARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6011002200330044005500SE +/- 15.25, N = 3SE +/- 27.74, N = 3SE +/- 25.25, N = 3SE +/- 4.65, N = 33723.552827.602848.204993.291. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T602K4K6K8K10KSE +/- 6.72, N = 3SE +/- 49.68, N = 3SE +/- 41.68, N = 3SE +/- 1.48, N = 37115.322878.202862.328933.531. (CC) gcc options: -lrt

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeARM Cortex A15 1.7GHz DualIBM_T6070140210280350SE +/- 0.18, N = 4SE +/- 0.10, N = 4270.01310.24

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloARM Cortex A15 1.7GHz DualIBM_T604080120160200SE +/- 0.12, N = 4SE +/- 0.13, N = 4160.86116.84

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformARM Cortex A15 1.7GHz DualIBM_T601122334455SE +/- 0.67, N = 4SE +/- 0.06, N = 436.3549.69

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyARM Cortex A15 1.7GHz DualIBM_T6090180270360450SE +/- 0.73, N = 4339.64401.57

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationARM Cortex A15 1.7GHz DualIBM_T60100200300400500SE +/- 0.39, N = 4SE +/- 0.52, N = 4455.45372.96

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationARM Cortex A15 1.7GHz DualIBM_T60130260390520650SE +/- 0.42, N = 4SE +/- 0.61, N = 4357.75611.34

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceARM Cortex A15 1.7GHz Dual30K60K90K120K150KSE +/- 95.53, N = 5146265

VP8 libvpx Encoding

vpxenc

OpenBenchmarking.orgFrames Per Second, More Is BetterVP8 libvpx Encoding 1.1.0vpxencARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T601.0442.0883.1324.1765.22SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 31.430.400.404.64-m321. (CC) gcc options: -lvpx -lm -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2012-10-03H.264 Video EncodingARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6048121620SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.18, N = 6SE +/- 0.20, N = 310.148.328.1814.61

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 9.20.1Compress Speed TestARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T605001000150020002500SE +/- 6.01, N = 3SE +/- 11.46, N = 3SE +/- 12.66, N = 3SE +/- 32.75, N = 316851499151324721. (CXX) g++ options: -pipe -lpthread

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60120240360480600SE +/- 0.68, N = 3SE +/- 0.14, N = 3SE +/- 5.19, N = 3SE +/- 0.22, N = 3312.00559.11539.61308.241. (CC) gcc options: -lm -lpthread -O3

Parallel BZIP2 Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.6256MB File CompressionARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60306090120150SE +/- 0.80, N = 3SE +/- 1.25, N = 3SE +/- 0.71, N = 3SE +/- 1.16, N = 684.57113.99109.3869.711. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 4.01e12 Prime Number GenerationARM Cortex A15 1.7GHz DualIBM_T602004006008001000SE +/- 13.74, N = 6SE +/- 0.43, N = 3832.95649.541. (CXX) g++ options: -O2 -fopenmp

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T609001800270036004500SE +/- 4.04, N = 3SE +/- 16.71, N = 3SE +/- 7.00, N = 3SE +/- 0.33, N = 3117742764264770-mtune=cortex-a15 -marm -mfpu=neon1. (CXX) g++ options: -fopenmp

Crafty

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterCrafty 23.4Elapsed TimeARM Cortex A15 1.7GHz DualIBM_T600.01350.0270.04050.0540.0675SE +/- 0.00, N = 3SE +/- 0.00, N = 60.060.021. (CC) gcc options: -lstdc++ -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3ARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60306090120150SE +/- 0.08, N = 5SE +/- 0.13, N = 5SE +/- 0.21, N = 5SE +/- 0.19, N = 5142.2791.7291.4741.24

Ogg Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Encoding 1.3.0WAV To OggARM Cortex A15 1.7GHz Dual714212835SE +/- 0.11, N = 531.531. (CC) gcc options: -O2 -ffast-math -fsigned-char -mtune=cortex-a15 -marm -mfpu=neon -lm -logg

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 1.0H.264 HD To NTSC DVARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T6090180270360450SE +/- 0.95, N = 3SE +/- 2.26, N = 3SE +/- 2.65, N = 3SE +/- 0.59, N = 3159.16422.99424.0752.69-lbz21. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lm -pthread -lrt

POV-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.6.1Total TimeARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T60100020003000400050001937473147041582-msse -mfpmath=sse -msse2 -march=i686 -mtune=i686 -malign-double1. (CXX) g++ options: -pipe -O3 -lz -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1cRSA 4096-bit PerformanceARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T603691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 39.905.905.908.60-march=armv7-a-march=armv7-a-fomit-frame-pointer1. (CC) gcc options: -O3 -lssl -lcrypto -ldl

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2008-08-14Total For Average Test TimesARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T603K6K9K12K15KSE +/- 11.05, N = 3SE +/- 4.06, N = 3SE +/- 6.24, N = 3SE +/- 6.81, N = 3842113964133466402

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.0.11Static Web Page ServingARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T602K4K6K8K10KSE +/- 109.38, N = 6SE +/- 13.99, N = 3SE +/- 39.92, N = 3SE +/- 18.08, N = 33273.572146.612131.789986.64-lcrypto1. (CC) gcc options: -lpthread -lcrypt -lz

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteARM Cortex A15 1.7GHz DualNVIDIA Tegra 3 - Ubuntu 12.04NVIDIA Tegra 3 - Ubuntu 12.10IBM_T607K14K21K28K35KSE +/- 158.19, N = 3SE +/- 36.10, N = 3SE +/- 19.92, N = 3SE +/- 20.65, N = 322050132241348634447


Phoronix Test Suite v10.8.4