R32_5_1_Xavier_AX720_1

ARMv8 rev 0 testing with a Jetson-AGX and NVIDIA Tegra Xavier on Ubuntu 18.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2106114-HA-R3251XAVI56
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 2 Tests
Common Kernel Benchmarks 2 Tests
Server CPU Tests 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
eMMC
June 10 2021
  25 Minutes
M.2 SSD
June 10 2021
  28 Minutes
USB 3.0
June 10 2021
  23 Minutes
USB2.0
June 10 2021
  24 Minutes
Micro SD Card
June 10 2021
  30 Minutes
Type C
June 10 2021
  14 Minutes
CPU Stress
June 10 2021
  29 Minutes
CUDA Test
June 11 2021
  27 Minutes
LPDDR4
June 11 2021
  2 Minutes
RS-232UART
June 11 2021
  1 Minute
LAN
June 11 2021
  2 Minutes
LAN1
June 11 2021
  2 Minutes
LAN2
June 11 2021
  2 Minutes
Invert Hiding All Results Option
  16 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


R32_5_1_Xavier_AX720_1ProcessorMotherboardMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionDisplay DriverOpenGLeMMCM.2 SSDUSB 3.0USB2.0Micro SD CardType CCPU StressCUDA TestLPDDR4RS-232UARTLANLAN1LAN2ARMv8 rev 0 @ 2.27GHz (8 Cores)Jetson-AGX16GB62GB Ultra USB 3.0 + 31GB HBG4a2NVIDIA TEGRAPHL 247E62 x Intel I210Ubuntu 18.044.9.201-tegra (aarch64)Unity 7.5.0X Server 1.19.61.2.131GCC 7.5.0 + CUDA 10.2ext41920x1080256GB INTEL SSDPEKKW256G8 + 62GB Ultra USB 3.0 + 31GB HBG4a2NVIDIA Tegra XavierNVIDIA 32.5.14.6.0256GB INTEL SSDPEKKW256G8 + 256GB Tech + 62GB Ultra USB 3.0 + 31GB HBG4a2 + 15GB i-TF256GB INTEL SSDPEKKW256G8 + 31GB HBG4a2 + 15GB i-TFEDIIG02192 x Intel I210 + Intel 8265 / 8275OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- eMMC, M.2 SSD, USB 3.0, USB2.0, Micro SD Card, Type C, CPU Stress, CUDA Test, LPDDR4, LAN, LAN1, LAN2: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only -v Disk Details- eMMC: CFQ / data=ordered,relatime,rw / Block Size: 4096- M.2 SSD: none / data=ordered,relatime,rw / Block Size: 4096- USB 3.0: none / data=ordered,relatime,rw / Block Size: 4096- USB2.0: none / data=ordered,relatime,rw / Block Size: 4096- Micro SD Card: none / data=ordered,relatime,rw / Block Size: 4096- Type C: none / data=ordered,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: tegra_cpufreq schedutil

R32_5_1_Xavier_AX720_1cuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerostress-ng: Cryptostress-ng: Forkingstress-ng: CPU Stressstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyingstress-ng: Glibc C String Functionsstress-ng: Glibc Qsort Data Sortingfio: Seq Read - Linux AIO - No - Yes - 4KB - Default Test Directoryfio: Seq Read - Linux AIO - No - Yes - 8MB - Default Test Directoryfio: Seq Write - Linux AIO - No - Yes - 4KB - Default Test Directoryfio: Seq Write - Linux AIO - No - Yes - 8MB - Default Test Directoryfio: Seq Read - Linux AIO - No - Yes - 4KB - Default Test Directoryfio: Seq Read - Linux AIO - No - Yes - 8MB - Default Test Directoryfio: Seq Write - Linux AIO - No - Yes - 4KB - Default Test Directoryfio: Seq Write - Linux AIO - No - Yes - 8MB - Default Test Directorystream: Copystream: Scalestream: Triadstream: Addiperf: 192.168.1.3 - 5201 - 10 Seconds - UDP - 100Mbit Objective - 1iperf: 192.168.1.3 - 5201 - 10 Seconds - UDP - 1000Mbit Objective - 1eMMCM.2 SSDUSB 3.0USB2.0Micro SD CardType CCPU StressCUDA TestLPDDR4RS-232UARTLANLAN1LAN24133334238461416230493.2137944132935690032369237622228224233126364294.512124.638.6722836426328.253.025.148.41410096054555.210223.666.8758679565800322997902572851189.906591.39766.006287.9024874.741760.83115612.3920.3435.79836.85736.92236.67636.67748644.049412.148397.548763.5100947100956100956OpenBenchmarking.org

Serial Loopback Test

This test will do a simple write/read test on all detected serial interfaces. For this test to work, the relevant serial ports should have a serial loopback plug or have otherwise wired the appropriate pins. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSerial Loopback Test/dev/ttyS2, /dev/ttyTHS0, /dev/ttyTHS1, /dev/ttyTHS4FAILPASSPASSFAIL/dev/ttyTHS4/dev/ttyTHS1/dev/ttyTHS0/dev/ttyS2RS-232UART

CUDA Mini-Nbody

The CUDA version of Harrism's mini-nbody tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalCUDA Test816243240SE +/- 0.00, N = 335.80

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingCUDA Test816243240SE +/- 0.00, N = 336.86

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingCUDA Test816243240SE +/- 0.00, N = 336.92

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutCUDA Test816243240SE +/- 0.00, N = 336.68

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroCUDA Test816243240SE +/- 0.00, N = 336.68

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CryptoCPU Stress30060090012001500SE +/- 7.64, N = 31189.901. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: ForkingCPU Stress14002800420056007000SE +/- 56.89, N = 36591.391. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: CPU StressCPU Stress170340510680850SE +/- 64.49, N = 12766.001. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Matrix MathCPU Stress13002600390052006500SE +/- 13.47, N = 36287.901. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Vector MathCPU Stress5K10K15K20K25KSE +/- 59.69, N = 324874.741. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Memory CopyingCPU Stress400800120016002000SE +/- 1.75, N = 31760.831. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc C String FunctionsCPU Stress20K40K60K80K100KSE +/- 964.21, N = 15115612.391. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.11.07Test: Glibc Qsort Data SortingCPU Stress510152025SE +/- 0.16, N = 1520.341. (CC) gcc options: -O2 -std=gnu99 -lm -laio -lcrypt -lrt -lz -ldl -lpthread -lc

Flexible IO Tester

FIO, the Flexible I/O Tester, is an advanced Linux disk benchmark supporting multiple I/O engines and a wealth of options. FIO was written by Jens Axboe for testing of the Linux I/O subsystem and schedulers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD20K40K60K80K100KSE +/- 11.46, N = 3SE +/- 33.33, N = 3SE +/- 272.85, N = 3SE +/- 466.67, N = 3SE +/- 764.69, N = 15722814100242334133375867944131. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD16K32K48K64K80KMin: 7214 / Avg: 7228.33 / Max: 7251Min: 24200 / Avg: 24233.33 / Max: 24300Min: 40800 / Avg: 41333.33 / Max: 41700Min: 75000 / Avg: 75866.67 / Max: 76600Min: 85300 / Avg: 94413.33 / Max: 985001. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD60120180240300SE +/- 1.33, N = 3391234952931. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD50100150200250Min: 292 / Avg: 293.33 / Max: 2961. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryMicro SD CardUSB 3.0USB2.0eMMCM.2 SSDType C14K28K42K56K70KSE +/- 72.49, N = 15SE +/- 16.44, N = 3SE +/- 12.39, N = 3SE +/- 336.20, N = 13SE +/- 57.74, N = 3SE +/- 568.62, N = 36054636464262384656900658001. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryMicro SD CardUSB 3.0USB2.0eMMCM.2 SSDType C11K22K33K44K55KMin: 5577 / Avg: 6053.8 / Max: 6394Min: 6339 / Avg: 6364 / Max: 6395Min: 6401 / Avg: 6425.67 / Max: 6440Min: 20000 / Avg: 23846.15 / Max: 24800Min: 56800 / Avg: 56900 / Max: 57000Min: 64700 / Avg: 65800 / Max: 666001. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB 3.0USB2.0Micro SD CardeMMCM.2 SSDType C7142128352351432321. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD80160240320400SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.88, N = 3SE +/- 1.15, N = 3SE +/- 3.02, N = 1528.255.294.5162.0299.0369.01. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD70140210280350Min: 28.2 / Avg: 28.23 / Max: 28.3Min: 55.1 / Avg: 55.17 / Max: 55.2Min: 94.4 / Avg: 94.53 / Max: 94.7Min: 160 / Avg: 161.67 / Max: 163Min: 297 / Avg: 299 / Max: 301Min: 333 / Avg: 368.93 / Max: 3851. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD5001000150020002500SE +/- 0.33, N = 3SE +/- 10.07, N = 353.0102.0121.0304.0790.02376.01. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB2.0Micro SD CardUSB 3.0eMMCType CM.2 SSD400800120016002000Min: 52.3 / Avg: 52.97 / Max: 53.3Min: 2364 / Avg: 2376 / Max: 23961. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryMicro SD CardUSB 3.0USB2.0eMMCM.2 SSDType C60120180240300SE +/- 0.27, N = 15SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 1.31, N = 13SE +/- 0.33, N = 3SE +/- 2.08, N = 323.624.625.193.2222.0257.01. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test DirectoryMicro SD CardUSB 3.0USB2.0eMMCM.2 SSDType C50100150200250Min: 21.8 / Avg: 23.6 / Max: 24.9Min: 24 / Avg: 24.57 / Max: 24.9Min: 25 / Avg: 25.13 / Max: 25.2Min: 78.2 / Avg: 93.18 / Max: 96.8Min: 222 / Avg: 222.33 / Max: 223Min: 253 / Avg: 257 / Max: 2601. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB 3.0USB2.0Micro SD CardeMMCM.2 SSDType C60120180240300SE +/- 0.12, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 1.15, N = 338.648.466.8137.0282.0285.01. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE
OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.25Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Disk Target: Default Test DirectoryUSB 3.0USB2.0Micro SD CardeMMCM.2 SSDType C50100150200250Min: 38.4 / Avg: 38.63 / Max: 38.8Min: 48 / Avg: 48.37 / Max: 48.9Min: 66.7 / Avg: 66.83 / Max: 67Min: 135 / Avg: 137 / Max: 1391. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: CopyLPDDR410K20K30K40K50KSE +/- 54.85, N = 548644.01. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: ScaleLPDDR411K22K33K44K55KSE +/- 32.57, N = 549412.11. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: TriadLPDDR410K20K30K40K50KSE +/- 103.13, N = 548397.51. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: AddLPDDR410K20K30K40K50KSE +/- 123.76, N = 548763.51. (CC) gcc options: -O3 -march=native -fopenmp

iPerf

iPerf is a network bandwidth throughput testing software. This test profile is used for automated testing of an iperf client and requires you have access to an iperf server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMbits/sec, More Is BetteriPerf 3.7Server Address: 192.168.1.3 - Server Port: 5201 - Duration: 10 Seconds - Test: UDP - 100Mbit Objective - Parallel: 1LANLAN1LAN2204060801001001001001. (CC) gcc options: -O3 -march=native -lm

OpenBenchmarking.orgMbits/sec, More Is BetteriPerf 3.7Server Address: 192.168.1.3 - Server Port: 5201 - Duration: 10 Seconds - Test: UDP - 1000Mbit Objective - Parallel: 1LANLAN1LAN22004006008001000SE +/- 1.20, N = 39479569561. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgMbits/sec, More Is BetteriPerf 3.7Server Address: 192.168.1.3 - Server Port: 5201 - Duration: 10 Seconds - Test: UDP - 1000Mbit Objective - Parallel: 1LANLAN1LAN22004006008001000Min: 945 / Avg: 947.33 / Max: 9491. (CC) gcc options: -O3 -march=native -lm