GCC Compiler Intel Sandy Bridge AVX fpmath

GCC testing for a future article on Phoronix.com. All tests were built with -O3 and -march=native in CFLAGS/CXXFLAGS. These benchmarks of GCC 4.7 RC1 compare a stock build of the GNU Compiler Collection against one configured with --with-fpmath=avx, to see how GCC 4.7 is affected by AVX floating-point arithmetic.

HTML result view exported from: https://openbenchmarking.org/result/1203140-BY-MATHAVX1508&sor&grs.

System configuration (runs: Stock, AVX fpmath)

Processor: Intel Core i7-3960X @ 3.20GHz (12 Cores)
Motherboard: Intel DX79SI
Chipset: Intel Sandy DMI2
Memory: 16384MB
Disk: 240GB OCZ VERTEX3
Graphics: AMD Radeon HD 5700 1024MB
Audio: Realtek ALC892
Monitor: DELL P2210H
Network: Intel 82579LM Gigabit Connection
OS: Ubuntu 12.04
Kernel: 3.2.0-18-generic (x86_64)
Desktop: Unity 5.4.0
Display Server: X Server 1.11.3
Display Driver: radeon 6.14.99
OpenGL: 2.1 Mesa 8.0.1 Gallium 0.4
Compiler: GCC 4.7.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- Stock: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto
- AVX fpmath: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto --with-fpmath=avx

System Details
- Compiz was running on this system.

Result overview

Stock         AVX fpmath    Test
17009.49      18631.50      npb: BT.A
10069.85      10613.43      npb: FT.B
16396.35      16999.52      npb: LU.A
4.89          5.06          mafft: Multiple Sequence Alignment
9410.41       9625.70       npb: SP.A
145.82        148.53        minion: Solitaire
73.66         74.75         npb: UA.A
6.29          6.38          encode-flac: WAV To FLAC
161.22        163.42        minion: Bibd
10353.83      10235.50      fhourstones: Complex Connect-4 Solving
16263.52      16427.85      apache: Static Web Page Serving
90.02         89.23         minion: Graceful
188.09        189.65        minion: Quasigroup
171.68        172.89        x264: H.264 Video Encoding
23.58         23.42         vpxenc: vpxenc
6.38          6.34          clomp: Static OMP Speedup
534.09        532.10        nero2d: Total Time
24.08         24.17         build-php: Time To Compile
9.25          9.22          hmmer: Pfam Database Search
12.34         12.31         ffmpeg: AVI To NTSC VCD
158.67        158.95        ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
25.70         25.74         c-ray: Total Time
9.43          9.44          encode-ogg: WAV To Ogg
1377.66       1378.74       himeno: Poisson Pressure Solver
2833.40       2834          gmpbench: Total Time
10409.75      10408.69      npb: MG.B
21255660.73   21254777.57   byte: Dhrystone 2
70.92         70.92         build-linux-kernel: Time To Compile
98            98            graphics-magick: Sharpen
151           151           graphics-magick: Resizing
85            85            graphics-magick: Local Adaptive Thresholding
123           123           graphics-magick: Blur
163           163           graphics-magick: HWB Color Space
75.45         75.45         openssl: RSA 4096-bit Performance
155.14        155.14        compress-lzma: 256MB File Compression
20            20            smallpt: Global Illumination Renderer; 100 Samples
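To read the overview as relative changes, the short Python sketch below computes the percentage difference of the AVX fpmath build against Stock for a handful of rows copied from the overview above. The `results` layout and the `percent_change` helper are illustrative names, not part of the Phoronix Test Suite; the sign is normalized so that a positive number means the AVX fpmath build did better, whichever direction the unit runs.

```python
# Rows copied from the result overview: (stock, avx_fpmath, higher_is_better).
# The selection of rows is arbitrary and only meant as an illustration.
results = {
    "npb: BT.A": (17009.49, 18631.50, True),
    "npb: FT.B": (10069.85, 10613.43, True),
    "mafft: Multiple Sequence Alignment": (4.89, 5.06, False),
    "fhourstones: Complex Connect-4 Solving": (10353.83, 10235.50, True),
    "npb: MG.B": (10409.75, 10408.69, True),
}


def percent_change(stock, avx, higher_is_better):
    """Return the AVX fpmath gain over Stock in percent (positive = AVX better)."""
    change = (avx - stock) / stock * 100.0
    # For "fewer is better" units (seconds), a larger value is a regression.
    return change if higher_is_better else -change


for name, (stock, avx, higher) in results.items():
    print(f"{name}: {percent_change(stock, avx, higher):+.2f}%")
```

On these rows the NAS Parallel Benchmarks BT.A gain is the largest by a wide margin, while MAFFT and Fhourstones come out slightly negative.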

NAS Parallel Benchmarks

Test / Class: BT.A

NAS Parallel Benchmarks 3.3, Test / Class: BT.A
Total Mop/s, More Is Better
AVX fpmath: 18631.50 (SE +/- 23.03, N = 3)
Stock: 17009.49 (SE +/- 15.33, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: FT.B

NAS Parallel Benchmarks 3.3, Test / Class: FT.B
Total Mop/s, More Is Better
AVX fpmath: 10613.43 (SE +/- 20.86, N = 3)
Stock: 10069.85 (SE +/- 11.49, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.A

NAS Parallel Benchmarks 3.3, Test / Class: LU.A
Total Mop/s, More Is Better
AVX fpmath: 16999.52 (SE +/- 42.68, N = 3)
Stock: 16396.35 (SE +/- 20.47, N = 3)
(F9X) gfortran options: -fopenmp

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864, Multiple Sequence Alignment
Seconds, Fewer Is Better
Stock: 4.89 (SE +/- 0.05, N = 3)
AVX fpmath: 5.06 (SE +/- 0.06, N = 3)
(CC) gcc options: -O3 -lm -lpthread

NAS Parallel Benchmarks

Test / Class: SP.A

NAS Parallel Benchmarks 3.3, Test / Class: SP.A
Total Mop/s, More Is Better
AVX fpmath: 9625.70 (SE +/- 26.51, N = 3)
Stock: 9410.41 (SE +/- 18.78, N = 3)
(F9X) gfortran options: -fopenmp

Minion

Benchmark: Solitaire

Minion 0.12, Benchmark: Solitaire
Seconds, Fewer Is Better
Stock: 145.82 (SE +/- 0.80, N = 3)
AVX fpmath: 148.53 (SE +/- 0.86, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

NAS Parallel Benchmarks

Test / Class: UA.A

NAS Parallel Benchmarks 3.3, Test / Class: UA.A
Total Mop/s, More Is Better
AVX fpmath: 74.75 (SE +/- 0.15, N = 3)
Stock: 73.66 (SE +/- 0.22, N = 3)
(F9X) gfortran options: -fopenmp

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.2.1, WAV To FLAC
Seconds, Fewer Is Better
Stock: 6.29 (SE +/- 0.00, N = 5)
AVX fpmath: 6.38 (SE +/- 0.09, N = 6)
(CXX) g++ options: -O3 -march=native -logg -lm

Minion

Benchmark: Bibd

Minion 0.12, Benchmark: Bibd
Seconds, Fewer Is Better
Stock: 161.22 (SE +/- 0.53, N = 3)
AVX fpmath: 163.42 (SE +/- 1.07, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Fhourstones

Complex Connect-4 Solving

Fhourstones 3.1, Complex Connect-4 Solving
Kpos / sec, More Is Better
Stock: 10353.83 (SE +/- 13.21, N = 3)
AVX fpmath: 10235.50 (SE +/- 32.53, N = 3)
(CC) gcc options: -O3

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.2.21, Static Web Page Serving
Requests Per Second, More Is Better
AVX fpmath: 16427.85 (SE +/- 71.91, N = 3)
Stock: 16263.52 (SE +/- 70.28, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm -lexpat -lrt -lcrypt -lpthread -ldl

Minion

Benchmark: Graceful

Minion 0.12, Benchmark: Graceful
Seconds, Fewer Is Better
AVX fpmath: 89.23 (SE +/- 0.45, N = 3)
Stock: 90.02 (SE +/- 0.49, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Quasigroup

Minion 0.12, Benchmark: Quasigroup
Seconds, Fewer Is Better
Stock: 188.09 (SE +/- 0.12, N = 3)
AVX fpmath: 189.65 (SE +/- 0.64, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

x264

H.264 Video Encoding

x264 2011-12-06, H.264 Video Encoding
Frames Per Second, More Is Better
AVX fpmath: 172.89 (SE +/- 1.56, N = 3)
Stock: 171.68 (SE +/- 1.29, N = 3)

VP8 libvpx Encoding

vpxenc

VP8 libvpx Encoding 0.9.7-p1, vpxenc
Frames Per Second, More Is Better
Stock: 23.58 (SE +/- 0.08, N = 3)
AVX fpmath: 23.42 (SE +/- 0.01, N = 3)
(CC) gcc options: -m64 -lvpx -lm -lpthread

CLOMP

Static OMP Speedup

CLOMP 3.3, Static OMP Speedup
Speedup, More Is Better
Stock: 6.38 (SE +/- 0.03, N = 5)
AVX fpmath: 6.34 (SE +/- 0.05, N = 5)
(CC) gcc options: --openmp -O3 -lm

Open FMM Nero2D

Total Time

Open FMM Nero2D 2.0.2, Total Time
Seconds, Fewer Is Better
AVX fpmath: 532.10
Stock: 534.09
(CXX) g++ options: -O3 -march=native -lfftw3 -llapack -lblas -lm

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9, Time To Compile
Seconds, Fewer Is Better
Stock: 24.08 (SE +/- 0.04, N = 3)
AVX fpmath: 24.17 (SE +/- 0.04, N = 3)
(CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2, Pfam Database Search
Seconds, Fewer Is Better
AVX fpmath: 9.22 (SE +/- 0.05, N = 3)
Stock: 9.25 (SE +/- 0.03, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

FFmpeg

AVI To NTSC VCD

FFmpeg 0.10, AVI To NTSC VCD
Seconds, Fewer Is Better
AVX fpmath: 12.31 (SE +/- 0.01, N = 3)
Stock: 12.34 (SE +/- 0.06, N = 3)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.2w, Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better
AVX fpmath: 158.95 (SE +/- 0.47, N = 3)
Stock: 158.67 (SE +/- 0.44, N = 3)
(CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

C-Ray

Total Time

C-Ray 1.1, Total Time
Seconds, Fewer Is Better
Stock: 25.70 (SE +/- 0.04, N = 3)
AVX fpmath: 25.74 (SE +/- 0.08, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native

Ogg Encoding

WAV To Ogg

Ogg Encoding 1.3.0, WAV To Ogg
Seconds, Fewer Is Better
Stock: 9.43 (SE +/- 0.00, N = 5)
AVX fpmath: 9.44 (SE +/- 0.01, N = 5)
(CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native -lvorbis -lm -logg

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver
MFLOPS, More Is Better
AVX fpmath: 1378.74 (SE +/- 1.49, N = 3)
Stock: 1377.66 (SE +/- 0.77, N = 3)
(CC) gcc options: -O3 -march=native

GMPbench

Total Time

GMPbench 0.2, Total Time
GMPbench Score, More Is Better
AVX fpmath: 2834.00
Stock: 2833.40
(CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: MG.B

NAS Parallel Benchmarks 3.3, Test / Class: MG.B
Total Mop/s, More Is Better
Stock: 10409.75 (SE +/- 21.92, N = 3)
AVX fpmath: 10408.69 (SE +/- 37.33, N = 3)
(F9X) gfortran options: -fopenmp

BYTE Unix Benchmark

Computational Test: Dhrystone 2

BYTE Unix Benchmark 3.6, Computational Test: Dhrystone 2
LPS, More Is Better
Stock: 21255660.73 (SE +/- 40039.93, N = 3)
AVX fpmath: 21254777.57 (SE +/- 32934.69, N = 3)
(CC) gcc options: -O3 -march=native

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 3.1, Time To Compile
Seconds, Fewer Is Better
Stock: 70.92 (SE +/- 0.74, N = 3)
AVX fpmath: 70.92 (SE +/- 0.59, N = 3)

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.12, Operation: Sharpen
Iterations Per Minute, More Is Better
AVX fpmath: 98 (SE +/- 0.00, N = 3)
Stock: 98 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.12, Operation: Resizing
Iterations Per Minute, More Is Better
AVX fpmath: 151 (SE +/- 0.33, N = 3)
Stock: 151 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.12, Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better
AVX fpmath: 85 (SE +/- 0.00, N = 3)
Stock: 85 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.12, Operation: Blur
Iterations Per Minute, More Is Better
AVX fpmath: 123 (SE +/- 0.00, N = 3)
Stock: 123 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.12, Operation: HWB Color Space
Iterations Per Minute, More Is Better
AVX fpmath: 163 (SE +/- 0.00, N = 3)
Stock: 163 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.0e, RSA 4096-bit Performance
Signs Per Second, More Is Better
AVX fpmath: 75.45 (SE +/- 0.25, N = 4)
Stock: 75.45 (SE +/- 0.19, N = 4)
(CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

LZMA Compression

256MB File Compression

LZMA Compression, 256MB File Compression
Seconds, Fewer Is Better
Stock: 155.14 (SE +/- 0.29, N = 3)
AVX fpmath: 155.14 (SE +/- 0.38, N = 3)
(CC) gcc options: -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
Stock: 20 (SE +/- 0.33, N = 3)
AVX fpmath: 20 (SE +/- 0.33, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=native
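Many of the gaps in the results above are small next to the reported standard errors, so one rough way to judge them is to express each difference in units of the combined standard error of the two runs. The Python sketch below does this for two NAS Parallel Benchmarks results, using values copied from this page. The `gap_in_standard_errors` helper is a hypothetical name, and treating the two runs as independent (so that their errors add in quadrature) is an assumption, not something the Phoronix Test Suite reports.

```python
import math


def gap_in_standard_errors(a, se_a, b, se_b):
    """Absolute difference between two means, in units of their combined SE.

    Assumes the two runs are independent, so the errors add in quadrature.
    """
    return abs(a - b) / math.sqrt(se_a ** 2 + se_b ** 2)


# (AVX fpmath mean, AVX fpmath SE, Stock mean, Stock SE), copied from the results.
bt_a = (18631.50, 23.03, 17009.49, 15.33)
mg_b = (10408.69, 37.33, 10409.75, 21.92)

print(f"BT.A: {gap_in_standard_errors(*bt_a):.1f} combined standard errors apart")
print(f"MG.B: {gap_in_standard_errors(*mg_b):.2f} combined standard errors apart")
```

A gap of many combined standard errors, as for BT.A, is unlikely to be run-to-run noise, while a gap well below one, as for MG.B, cannot be told apart from it. With only N = 3 runs per result this is a rough guide rather than a formal significance test.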


Phoronix Test Suite v10.8.4