GCC Compiler Intel Sandy Bridge AVX fpmath

GCC testing for a future article on Phoronix.com. Testing with CFLAGS/CXXFLAGS set to -O3 and -march=native. Benchmarks of GCC 4.7 RC1 when comparing the effects of building the GNU Compiler Collection with the --with-fpmath=avx to see how GCC 4.7 is impacted by the AVX floating-point arithmetic.

HTML result view exported from: https://openbenchmarking.org/result/1203140-BY-MATHAVX1508.

GCC Compiler Intel Sandy Bridge AVX fpmathProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionStockAVX fpmathIntel Core i7-3960X @ 3.20GHz (12 Cores)Intel DX79SIIntel Sandy DMI216384MB240GB OCZ VERTEX3AMD Radeon HD 5700 1024MBRealtek ALC892DELL P2210HIntel 82579LM Gigabit ConnectionUbuntu 12.043.2.0-18-generic (x86_64)Unity 5.4.0X Server 1.11.3radeon 6.14.992.1 Mesa 8.0.1 Gallium 0.4GCC 4.7.0ext41920x1080OpenBenchmarking.orgCompiler Details- Stock: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto- AVX fpmath: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto --with-fpmath=avx System Details- Compiz was running on this system.

GCC Compiler Intel Sandy Bridge AVX fpmathc-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplesbyte: Dhrystone 2fhourstones: Complex Connect-4 Solvingclomp: Static OMP Speedupcompress-lzma: 256MB File Compressionencode-ogg: WAV To Oggencode-flac: WAV To FLACffmpeg: AVI To NTSC VCDx264: H.264 Video Encodingvpxenc: vpxencopenssl: RSA 4096-bit Performancegmpbench: Total Timehmmer: Pfam Database Searchmafft: Multiple Sequence Alignmenthimeno: Poisson Pressure Solverminion: Bibdminion: Gracefulminion: Quasigroupminion: Solitairenero2d: Total Timegraphics-magick: HWB Color Spacegraphics-magick: Blurgraphics-magick: Local Adaptive Thresholdinggraphics-magick: Resizinggraphics-magick: Sharpenapache: Static Web Page Servingttsiod-renderer: Phong Rendering With Soft-Shadow Mappingnpb: BT.Anpb: FT.Bnpb: LU.Anpb: MG.Bnpb: SP.Anpb: UA.Abuild-linux-kernel: Time To Compilebuild-php: Time To CompileStockAVX fpmath25.702021255660.7310353.836.38155.149.436.2912.34171.6823.5875.452833.409.254.891377.66161.2290.02188.09145.82534.09163123851519816263.52158.6717009.4910069.8516396.3510409.759410.4173.6670.9224.0825.742021254777.5710235.506.34155.149.446.3812.31172.8923.4275.4528349.225.061378.74163.4289.23189.65148.53532.10163123851519816427.85158.9518631.5010613.4316999.5210408.699625.7074.7570.9224.17OpenBenchmarking.org

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeStockAVX fpmath612182430SE +/- 0.04, N = 3SE +/- 0.08, N = 325.7025.741. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesStockAVX fpmath510152025SE +/- 0.33, N = 3SE +/- 0.33, N = 320201. (CXX) g++ options: -fopenmp -O3 -march=native

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2StockAVX fpmath5M10M15M20M25MSE +/- 40039.93, N = 3SE +/- 32934.69, N = 321255660.7321254777.571. (CC) gcc options: -O3 -march=native

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingStockAVX fpmath2K4K6K8K10KSE +/- 13.21, N = 3SE +/- 32.53, N = 310353.8310235.501. (CC) gcc options: -O3

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 3.3Static OMP SpeedupStockAVX fpmath246810SE +/- 0.03, N = 5SE +/- 0.05, N = 56.386.341. (CC) gcc options: --openmp -O3 -lm

LZMA Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterLZMA Compression256MB File CompressionStockAVX fpmath306090120150SE +/- 0.29, N = 3SE +/- 0.38, N = 3155.14155.141. (CC) gcc options: -O3 -march=native

Ogg Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Encoding 1.3.0WAV To OggStockAVX fpmath3691215SE +/- 0.00, N = 5SE +/- 0.01, N = 59.439.441. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native -lvorbis -lm -logg

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.2.1WAV To FLACStockAVX fpmath246810SE +/- 0.00, N = 5SE +/- 0.09, N = 66.296.381. (CXX) g++ options: -O3 -march=native -logg -lm

FFmpeg

AVI To NTSC VCD

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 0.10AVI To NTSC VCDStockAVX fpmath3691215SE +/- 0.06, N = 3SE +/- 0.01, N = 312.3412.311. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2011-12-06H.264 Video EncodingStockAVX fpmath4080120160200SE +/- 1.29, N = 3SE +/- 1.56, N = 3171.68172.89

VP8 libvpx Encoding

vpxenc

OpenBenchmarking.orgFrames Per Second, More Is BetterVP8 libvpx Encoding 0.9.7-p1vpxencStockAVX fpmath612182430SE +/- 0.08, N = 3SE +/- 0.01, N = 323.5823.421. (CC) gcc options: -m64 -lvpx -lm -lpthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.0eRSA 4096-bit PerformanceStockAVX fpmath20406080100SE +/- 0.19, N = 4SE +/- 0.25, N = 475.4575.451. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGMPbench 0.2Total TimeStockAVX fpmath60012001800240030002833.402834.001. (CC) gcc options: -O3 -march=native

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchStockAVX fpmath3691215SE +/- 0.03, N = 3SE +/- 0.05, N = 39.259.221. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence AlignmentStockAVX fpmath1.13852.2773.41554.5545.6925SE +/- 0.05, N = 3SE +/- 0.06, N = 34.895.061. (CC) gcc options: -O3 -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverStockAVX fpmath30060090012001500SE +/- 0.77, N = 3SE +/- 1.49, N = 31377.661378.741. (CC) gcc options: -O3 -march=native

Minion

Benchmark: Bibd

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 0.12Benchmark: BibdStockAVX fpmath4080120160200SE +/- 0.53, N = 3SE +/- 1.07, N = 3161.22163.421. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Graceful

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 0.12Benchmark: GracefulStockAVX fpmath20406080100SE +/- 0.49, N = 3SE +/- 0.45, N = 390.0289.231. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Quasigroup

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 0.12Benchmark: QuasigroupStockAVX fpmath4080120160200SE +/- 0.12, N = 3SE +/- 0.64, N = 3188.09189.651. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Solitaire

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 0.12Benchmark: SolitaireStockAVX fpmath306090120150SE +/- 0.80, N = 3SE +/- 0.86, N = 3145.82148.531. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Open FMM Nero2D

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeStockAVX fpmath120240360480600534.09532.101. (CXX) g++ options: -O3 -march=native -lfftw3 -llapack -lblas -lm

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: HWB Color SpaceStockAVX fpmath4080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 31631631. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: BlurStockAVX fpmath306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 31231231. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: Local Adaptive ThresholdingStockAVX fpmath20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 385851. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: ResizingStockAVX fpmath306090120150SE +/- 0.00, N = 3SE +/- 0.33, N = 31511511. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.12Operation: SharpenStockAVX fpmath20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 398981. (CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.2.21Static Web Page ServingStockAVX fpmath4K8K12K16K20KSE +/- 70.28, N = 3SE +/- 71.91, N = 316263.5216427.851. (CC) gcc options: -pthread -O3 -march=native -lm -lexpat -lrt -lcrypt -lpthread -ldl

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.2wPhong Rendering With Soft-Shadow MappingStockAVX fpmath4080120160200SE +/- 0.44, N = 3SE +/- 0.47, N = 3158.67158.951. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: BT.AStockAVX fpmath4K8K12K16K20KSE +/- 15.33, N = 3SE +/- 23.03, N = 317009.4918631.501. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: FT.BStockAVX fpmath2K4K6K8K10KSE +/- 11.49, N = 3SE +/- 20.86, N = 310069.8510613.431. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: LU.AStockAVX fpmath4K8K12K16K20KSE +/- 20.47, N = 3SE +/- 42.68, N = 316396.3516999.521. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: MG.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: MG.BStockAVX fpmath2K4K6K8K10KSE +/- 21.92, N = 3SE +/- 37.33, N = 310409.7510408.691. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: SP.AStockAVX fpmath2K4K6K8K10KSE +/- 18.78, N = 3SE +/- 26.51, N = 39410.419625.701. (F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: UA.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3Test / Class: UA.AStockAVX fpmath20406080100SE +/- 0.22, N = 3SE +/- 0.15, N = 373.6674.751. (F9X) gfortran options: -fopenmp

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 3.1Time To CompileStockAVX fpmath1632486480SE +/- 0.74, N = 3SE +/- 0.59, N = 370.9270.92

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To CompileStockAVX fpmath612182430SE +/- 0.04, N = 3SE +/- 0.04, N = 324.0824.171. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm


Phoronix Test Suite v10.8.4