CompuLab Utilite Quad-Core ARM vs. Fit-PC2, TrimSlice

Testing by Michael Larabel of Phoronix for a future article of the different CompuLab devices... More benchmarks coming this is just a teaser.

HTML result view exported from: https://openbenchmarking.org/result/1401254-PL-TRIMSLICE07&sro&grt.

CompuLab Utilite Quad-Core ARM vs. Fit-PC2, TrimSliceProcessorMotherboardMemoryDiskGraphicsNetworkChipsetAudioMonitorOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionDisplay DriverCompuLab UtiliteCompuLab Fit-PC2CompuLab Trim-SliceARMv7 rev 10 @ 1.00GHz (4 Cores)Compulab CM-FX62048MB32GB SanDisk SSD U100GC2000 EngineIntel I211 Gigabit ConnectionUbuntu 12.043.0.35-cm-fx6-5.1 (armv7l)GNOME 3.2.1X Server 1.11.32.1GCC 4.6ext41920x1080Intel Atom Z530 @ 1.60GHz (2 Cores)Intel SBC-FITPC2Intel Hub + SCH1024MB160GB Hitachi HTS54501Intel Hub (SCH Poulsbo)Realtek ALC260DELL S2409WRealtek RTL8111/8168B + Ralink RT3090 Wireless 802.11n 1T/1R3.8.0-29-generic (i686)modesetting 0.7.0ARMv7 rev 0 @ 1.00GHz (2 Cores)trimslice593MB250GB Samsung HM251HINVIDIA TEGRARealtek RTL8111/8168B3.1.10-l4t.r16.01 (armv7l)ext21366x1536OpenBenchmarking.orgCompiler Details- CompuLab Utilite: --build=arm-linux-gnueabi --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabi --target=arm-linux-gnueabi --with-arch=armv7-a --with-float=softfp --with-fpu=vfpv3-d16 --with-mode=thumb -v - CompuLab Fit-PC2: --build=i686-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-targets=all --enable-threads=posix --host=i686-linux-gnu --target=i686-linux-gnu --with-arch-32=i686 --with-tune=generic -v - CompuLab Trim-Slice: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v Processor Details- CompuLab Utilite: Scaling Governor: imx ondemand- CompuLab Fit-PC2: Scaling Governor: acpi-cpufreq ondemand- CompuLab Trim-Slice: Scaling Governor: tegra ondemand

CompuLab Utilite Quad-Core ARM vs. Fit-PC2, TrimSliceapache: Static Web Page Servingc-ray: Total Timeclomp: Static OMP Speedupcrafty: Elapsed Timedolfyn: Computational Fluid Dynamicsffmpeg: H.264 HD To NTSC DVffte: N=64, 1D Complex FFT Routineencode-flac: WAV To FLACcompress-gzip: 2GB File Compressionhimeno: Poisson Pressure Solverhpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthencode-mp3: WAV To MP3n-queens: Elapsed Timeopenssl: RSA 4096-bit Performancepolybench-c: Covariance Computationpolybench-c: Correlation Computationpolybench-c: 3 Matrix Multiplicationsprimesieve: 1e12 Prime Number Generationscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationsmallpt: Global Illumination Renderer; 100 Samplesstream: Copystream: Scalestream: Triadstream: Addbuild-apache: Time To Compiletscp: AI Chess Performancex264: H.264 Video EncodingCompuLab UtiliteCompuLab Fit-PC2CompuLab Trim-Slice2512.24630.474.630.11578.60357.63387.3456.82102.70109.122.016550.249770.569710.089710.514840.001624.328370.14196711.956128.79246.5916.6024.5324.59571.481382.4162.1549.8016.5752.7367.53124.1214191486.551438.231652.731678.17390.001279799.061823.450.03325.91287.6856.7086.34104.050.827620.187200.441010.114040.850120.000438.758240.22299521.511135.4930.8831.22485.772264.13128.8648.1116.15156.46153.00270.6025591447.701506.411672.491670.97473.041979361261.451.930.23551.51489.5361.44146.01103.471.046440.156940.566790.131990.446870.000695.162310.587021280.553135.75518.4421.8721.97551.932513.1664.5356.3317.4854.7753.98140.103049890.48846.85977.26985.78537.40126451OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingCompuLab Utilite5001000150020002500SE +/- 1.78, N = 32512.241. (CC) gcc options: -shared -fPIC -O2 -pthread

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite400800120016002000SE +/- 0.19, N = 3SE +/- 0.24, N = 3SE +/- 4.34, N = 31823.451261.45630.471. (CC) gcc options: -lm -lpthread -O3

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 3.3Static OMP SpeedupCompuLab Trim-SliceCompuLab Utilite1.04182.08363.12544.16725.209SE +/- 0.01, N = 5SE +/- 0.27, N = 101.934.631. (CC) gcc options: --openmp -O3 -lm

Crafty

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterCrafty 23.4Elapsed TimeCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.05180.10360.15540.20720.259SE +/- 0.00, N = 6SE +/- 0.03, N = 6SE +/- 0.00, N = 60.030.230.111. (CC) gcc options: -lstdc++ -lm

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite130260390520650SE +/- 0.34, N = 3SE +/- 2.88, N = 3SE +/- 1.29, N = 3325.91551.51578.60

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 2.1.1H.264 HD To NTSC DVCompuLab Trim-SliceCompuLab Utilite110220330440550SE +/- 6.63, N = 3SE +/- 1.40, N = 3489.53357.631. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lm -pthread -lrt -march=armv7-a -std=c99 -fomit-frame-pointer -mthumb -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FFTE

Test: N=64, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 5.0Test: N=64, 1D Complex FFT RoutineCompuLab Fit-PC2CompuLab Utilite80160240320400SE +/- 0.06, N = 3SE +/- 0.47, N = 3287.68387.341. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.0WAV To FLACCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite1428425670SE +/- 0.06, N = 5SE +/- 0.32, N = 5SE +/- 0.20, N = 556.7061.4456.821. (CXX) g++ options: -O2 -fvisibility=hidden -lm

Gzip Compression

2GB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterGzip Compression2GB File CompressionCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite306090120150SE +/- 0.42, N = 3SE +/- 0.62, N = 3SE +/- 0.14, N = 386.34146.01102.70

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite20406080100SE +/- 0.35, N = 3SE +/- 0.62, N = 3SE +/- 1.14, N = 3104.05103.47109.121. (CC) gcc options: -O3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: G-HPLCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.45370.90741.36111.81482.2685SE +/- 0.00373, N = 3SE +/- 0.00605, N = 3SE +/- 0.00153, N = 30.827621.046442.016551. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: G-FfteCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.05620.11240.16860.22480.281SE +/- 0.00030, N = 3SE +/- 0.00186, N = 3SE +/- 0.00089, N = 30.187200.156940.249771. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: EP-DGEMMCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.12820.25640.38460.51280.641SE +/- 0.01115, N = 3SE +/- 0.00877, N = 3SE +/- 0.00029, N = 30.441010.566790.569711. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.4.3Test / Class: G-PtransCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.02970.05940.08910.11880.1485SE +/- 0.00025, N = 3SE +/- 0.00197, N = 3SE +/- 0.00203, N = 30.114040.131990.089711. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.4.3Test / Class: EP-STREAM TriadCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.19130.38260.57390.76520.9565SE +/- 0.00628, N = 3SE +/- 0.00207, N = 3SE +/- 0.00478, N = 30.850120.446870.514841. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.4.3Test / Class: G-Random AccessCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.00040.00080.00120.00160.002SE +/- 0.00000, N = 3SE +/- 0.00000, N = 3SE +/- 0.00001, N = 30.000430.000690.001621. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.4.3Test / Class: Random Ring LatencyCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite246810SE +/- 0.05629, N = 3SE +/- 0.01181, N = 3SE +/- 0.02778, N = 38.758245.162314.328371. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.4.3Test / Class: Random Ring BandwidthCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite0.13210.26420.39630.52840.6605SE +/- 0.01334, N = 3SE +/- 0.00207, N = 3SE +/- 0.00126, N = 30.222990.587020.141961. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.4.3Test / Class: Max Ping Pong BandwidthCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite30060090012001500SE +/- 29.05, N = 3SE +/- 5.35, N = 3SE +/- 7.87, N = 3521.511280.55711.961. (CC) gcc options: -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil -fomit-frame-pointer -O3 -march=native -funroll-loops 2. BLAS + Open MPI 1.4.3

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3CompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite306090120150SE +/- 0.08, N = 5SE +/- 0.09, N = 5SE +/- 0.52, N = 5135.49135.75128.791. (CC) gcc options: -O3 -fomit-frame-pointer -ffast-math -pipe -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeCompuLab Trim-SliceCompuLab Utilite110220330440550SE +/- 0.07, N = 3SE +/- 4.80, N = 3518.44246.591. (CC) gcc options: -static -fopenmp -O3

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1fRSA 4096-bit PerformanceCompuLab Utilite48121620SE +/- 0.00, N = 316.601. (CC) gcc options: -march=armv7-a -O3 -lssl -lcrypto -ldl

PolyBench-C

Test: Covariance Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 3.2Test: Covariance ComputationCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite714212835SE +/- 0.24, N = 3SE +/- 0.19, N = 3SE +/- 0.08, N = 330.8821.8724.531. (CC) gcc options: -O3

PolyBench-C

Test: Correlation Computation

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 3.2Test: Correlation ComputationCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite714212835SE +/- 0.46, N = 6SE +/- 0.21, N = 3SE +/- 0.01, N = 331.2221.9724.591. (CC) gcc options: -O3

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 3.2Test: 3 Matrix MultiplicationsCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite120240360480600SE +/- 0.41, N = 3SE +/- 0.45, N = 3SE +/- 0.96, N = 3485.77551.93571.481. (CC) gcc options: -O3

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 4.21e12 Prime Number GenerationCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite5001000150020002500SE +/- 3.22, N = 3SE +/- 258.63, N = 3SE +/- 44.53, N = 32264.132513.161382.411. (CXX) g++ options: -O2 -fopenmp

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite306090120150SE +/- 0.06, N = 4SE +/- 0.53, N = 4SE +/- 0.07, N = 4128.8664.5362.15

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite1326395265SE +/- 0.00, N = 4SE +/- 0.95, N = 4SE +/- 0.09, N = 448.1156.3349.80

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite48121620SE +/- 0.01, N = 4SE +/- 0.20, N = 4SE +/- 0.07, N = 416.1517.4816.57

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite306090120150SE +/- 0.23, N = 4SE +/- 0.41, N = 4SE +/- 0.05, N = 4156.4654.7752.73

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite306090120150SE +/- 0.09, N = 4SE +/- 0.29, N = 4SE +/- 0.06, N = 4153.0053.9867.53

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite60120180240300SE +/- 0.24, N = 4SE +/- 3.31, N = 4SE +/- 0.10, N = 4270.60140.10124.12

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite7001400210028003500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 32559304914191. (CXX) g++ options: -fopenmp

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: CopyCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite30060090012001500SE +/- 1.48, N = 10SE +/- 2.04, N = 10SE +/- 2.65, N = 101447.70890.481486.551. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: ScaleCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite30060090012001500SE +/- 0.90, N = 9SE +/- 0.56, N = 8SE +/- 2.72, N = 81506.41846.851438.231. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: TriadCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite400800120016002000SE +/- 1.19, N = 7SE +/- 0.63, N = 10SE +/- 3.10, N = 101672.49977.261652.731. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: AddCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite400800120016002000SE +/- 0.97, N = 10SE +/- 0.63, N = 9SE +/- 2.89, N = 101670.97985.781678.171. (CC) gcc options: -O3 -march=native -fopenmp

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To CompileCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite120240360480600SE +/- 0.10, N = 3SE +/- 0.33, N = 3SE +/- 0.79, N = 3473.04537.40390.00

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceCompuLab Fit-PC2CompuLab Trim-SliceCompuLab Utilite40K80K120K160K200KSE +/- 42.80, N = 5SE +/- 1100.08, N = 5SE +/- 843.03, N = 51979361264511279791. (CC) gcc options: -O3 -march=native

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2014-01-09H.264 Video EncodingCompuLab Utilite3691215SE +/- 0.08, N = 59.061. (CC) gcc options: -ldl -lm -lpthread


Phoronix Test Suite v10.8.4