GCC Compiler Intel Sandy Bridge AVX fpmath

GCC testing for a future article on Phoronix.com. All tests were run with CFLAGS/CXXFLAGS set to -O3 -march=native. These benchmarks of GCC 4.7 RC1 compare a stock build of the GNU Compiler Collection against one configured with --with-fpmath=avx, to see how AVX floating-point arithmetic affects GCC 4.7.

HTML result view exported from: https://openbenchmarking.org/result/1203140-BY-MATHAVX1508&grr&sor.

System configuration (identical for the Stock and AVX fpmath runs)

Processor: Intel Core i7-3960X @ 3.20GHz (12 Cores)
Motherboard: Intel DX79SI
Chipset: Intel Sandy DMI2
Memory: 16384MB
Disk: 240GB OCZ VERTEX3
Graphics: AMD Radeon HD 5700 1024MB
Audio: Realtek ALC892
Monitor: DELL P2210H
Network: Intel 82579LM Gigabit Connection
OS: Ubuntu 12.04
Kernel: 3.2.0-18-generic (x86_64)
Desktop: Unity 5.4.0
Display Server: X Server 1.11.3
Display Driver: radeon 6.14.99
OpenGL: 2.1 Mesa 8.0.1 Gallium 0.4
Compiler: GCC 4.7.0
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- Stock: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto
- AVX fpmath: --enable-checking=release --enable-languages=c,c++,fortran --enable-lto --with-fpmath=avx

System Details
- Compiz was running on this system.

Results overview

| Test | Stock | AVX fpmath |
|---|---|---|
| build-php: Time To Compile | 24.08 | 24.17 |
| build-linux-kernel: Time To Compile | 70.92 | 70.92 |
| npb: UA.A | 73.66 | 74.75 |
| npb: SP.A | 9410.41 | 9625.70 |
| npb: MG.B | 10409.75 | 10408.69 |
| npb: LU.A | 16396.35 | 16999.52 |
| npb: FT.B | 10069.85 | 10613.43 |
| npb: BT.A | 17009.49 | 18631.50 |
| ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 158.67 | 158.95 |
| apache: Static Web Page Serving | 16263.52 | 16427.85 |
| graphics-magick: Sharpen | 98 | 98 |
| graphics-magick: Resizing | 151 | 151 |
| graphics-magick: Local Adaptive Thresholding | 85 | 85 |
| graphics-magick: Blur | 123 | 123 |
| graphics-magick: HWB Color Space | 163 | 163 |
| nero2d: Total Time | 534.09 | 532.10 |
| minion: Solitaire | 145.82 | 148.53 |
| minion: Quasigroup | 188.09 | 189.65 |
| minion: Graceful | 90.02 | 89.23 |
| minion: Bibd | 161.22 | 163.42 |
| himeno: Poisson Pressure Solver | 1377.66 | 1378.74 |
| mafft: Multiple Sequence Alignment | 4.89 | 5.06 |
| hmmer: Pfam Database Search | 9.25 | 9.22 |
| gmpbench: Total Time | 2833.40 | 2834 |
| openssl: RSA 4096-bit Performance | 75.45 | 75.45 |
| vpxenc: vpxenc | 23.58 | 23.42 |
| x264: H.264 Video Encoding | 171.68 | 172.89 |
| ffmpeg: AVI To NTSC VCD | 12.34 | 12.31 |
| encode-flac: WAV To FLAC | 6.29 | 6.38 |
| encode-ogg: WAV To Ogg | 9.43 | 9.44 |
| compress-lzma: 256MB File Compression | 155.14 | 155.14 |
| clomp: Static OMP Speedup | 6.38 | 6.34 |
| fhourstones: Complex Connect-4 Solving | 10353.83 | 10235.50 |
| byte: Dhrystone 2 | 21255660.73 | 21254777.57 |
| smallpt: Global Illumination Renderer; 100 Samples | 20 | 20 |
| c-ray: Total Time | 25.70 | 25.74 |
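The Stock and AVX fpmath figures reported above mix "more is better" and "fewer is better" units, so raw differences are hard to compare across tests. The following is a minimal sketch, not part of the original export, of one way to turn a pair of results into a signed percentage. The helper name `percent_gain` and the sign convention (positive means the AVX fpmath build did better) are assumptions made for illustration; the numbers are copied from this report.

```python
# Relative change of the AVX fpmath build versus the Stock build.
# Values are copied from this report; the helper name and the sign
# convention are illustrative assumptions, not part of the export.

def percent_gain(stock, avx, higher_is_better):
    """Return the AVX fpmath gain over Stock in percent (positive = AVX better)."""
    if higher_is_better:
        return (avx - stock) / stock * 100.0
    # For time-based results a lower value is better, so flip the sign.
    return (stock - avx) / stock * 100.0

results = [
    # (test, stock, avx_fpmath, higher_is_better)
    ("npb: BT.A", 17009.49, 18631.50, True),
    ("npb: FT.B", 10069.85, 10613.43, True),
    ("npb: LU.A", 16396.35, 16999.52, True),
    ("mafft: Multiple Sequence Alignment", 4.89, 5.06, False),
    ("build-linux-kernel: Time To Compile", 70.92, 70.92, False),
]

gains = {name: percent_gain(s, a, hib) for name, s, a, hib in results}
for name, gain in gains.items():
    print(f"{name}: {gain:+.2f}%")
```

With these inputs, BT.A works out to a gain of roughly 9.5% for the AVX fpmath build, the MAFFT time is slightly worse (negative gain), and the kernel compile time is unchanged.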

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9, Time To Compile. Seconds, Fewer Is Better.
Stock: 24.08 (SE +/- 0.04, N = 3)
AVX fpmath: 24.17 (SE +/- 0.04, N = 3)
(CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 3.1, Time To Compile. Seconds, Fewer Is Better.
Stock: 70.92 (SE +/- 0.74, N = 3)
AVX fpmath: 70.92 (SE +/- 0.59, N = 3)

NAS Parallel Benchmarks

Test / Class: UA.A

NAS Parallel Benchmarks 3.3, Test / Class: UA.A. Total Mop/s, More Is Better.
AVX fpmath: 74.75 (SE +/- 0.15, N = 3)
Stock: 73.66 (SE +/- 0.22, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: SP.A

NAS Parallel Benchmarks 3.3, Test / Class: SP.A. Total Mop/s, More Is Better.
AVX fpmath: 9625.70 (SE +/- 26.51, N = 3)
Stock: 9410.41 (SE +/- 18.78, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: MG.B

NAS Parallel Benchmarks 3.3, Test / Class: MG.B. Total Mop/s, More Is Better.
Stock: 10409.75 (SE +/- 21.92, N = 3)
AVX fpmath: 10408.69 (SE +/- 37.33, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.A

NAS Parallel Benchmarks 3.3, Test / Class: LU.A. Total Mop/s, More Is Better.
AVX fpmath: 16999.52 (SE +/- 42.68, N = 3)
Stock: 16396.35 (SE +/- 20.47, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: FT.B

NAS Parallel Benchmarks 3.3, Test / Class: FT.B. Total Mop/s, More Is Better.
AVX fpmath: 10613.43 (SE +/- 20.86, N = 3)
Stock: 10069.85 (SE +/- 11.49, N = 3)
(F9X) gfortran options: -fopenmp

NAS Parallel Benchmarks

Test / Class: BT.A

NAS Parallel Benchmarks 3.3, Test / Class: BT.A. Total Mop/s, More Is Better.
AVX fpmath: 18631.50 (SE +/- 23.03, N = 3)
Stock: 17009.49 (SE +/- 15.33, N = 3)
(F9X) gfortran options: -fopenmp
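Each NAS Parallel Benchmarks result above comes with a standard error (SE) over N = 3 runs, which gives a rough way to separate real differences from run-to-run noise. The sketch below is an illustration only and not something the export itself computes: it divides the difference of the two means by the combined standard error, assuming the two builds were measured independently and that the SE values pair with the build labels in the order they are listed. The helper name `difference_in_se_units` is hypothetical. With only three runs per build, treat the output as a coarse screen rather than a formal significance test.

```python
import math

def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by the standard error of that difference.

    Assumes the two measurements are independent, so their variances add.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se

# (test, avx_mean, avx_se, stock_mean, stock_se), Total Mop/s from this report.
npb = [
    ("UA.A", 74.75, 0.15, 73.66, 0.22),
    ("SP.A", 9625.70, 26.51, 9410.41, 18.78),
    ("MG.B", 10408.69, 37.33, 10409.75, 21.92),
    ("LU.A", 16999.52, 42.68, 16396.35, 20.47),
    ("FT.B", 10613.43, 20.86, 10069.85, 11.49),
    ("BT.A", 18631.50, 23.03, 17009.49, 15.33),
]

ratios = {
    name: difference_in_se_units(avx, avx_se, stock, stock_se)
    for name, avx, avx_se, stock, stock_se in npb
}
for name, ratio in ratios.items():
    print(f"{name}: AVX fpmath minus Stock = {ratio:+.1f} combined SE")
```

Under these assumptions the MG.B difference is far smaller than one combined SE, so the two builds are indistinguishable there, while the BT.A difference is dozens of combined SEs and is unlikely to be noise.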

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.2w, Phong Rendering With Soft-Shadow Mapping. FPS, More Is Better.
AVX fpmath: 158.95 (SE +/- 0.47, N = 3)
Stock: 158.67 (SE +/- 0.44, N = 3)
(CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.2.21, Static Web Page Serving. Requests Per Second, More Is Better.
AVX fpmath: 16427.85 (SE +/- 71.91, N = 3)
Stock: 16263.52 (SE +/- 70.28, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm -lexpat -lrt -lcrypt -lpthread -ldl

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.12, Operation: Sharpen. Iterations Per Minute, More Is Better.
AVX fpmath: 98 (SE +/- 0.00, N = 3)
Stock: 98 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.12, Operation: Resizing. Iterations Per Minute, More Is Better.
AVX fpmath: 151 (SE +/- 0.33, N = 3)
Stock: 151 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

GraphicsMagick 1.3.12, Operation: Local Adaptive Thresholding. Iterations Per Minute, More Is Better.
AVX fpmath: 85 (SE +/- 0.00, N = 3)
Stock: 85 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: Blur

GraphicsMagick 1.3.12, Operation: Blur. Iterations Per Minute, More Is Better.
AVX fpmath: 123 (SE +/- 0.00, N = 3)
Stock: 123 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.12, Operation: HWB Color Space. Iterations Per Minute, More Is Better.
AVX fpmath: 163 (SE +/- 0.00, N = 3)
Stock: 163 (SE +/- 0.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -march=native -pthread -ltiff -lfreetype -lXext -lSM -lICE -lX11 -lbz2 -lz -lm -lrt -lpthread

Open FMM Nero2D

Total Time

Open FMM Nero2D 2.0.2, Total Time. Seconds, Fewer Is Better.
AVX fpmath: 532.10
Stock: 534.09
(CXX) g++ options: -O3 -march=native -lfftw3 -llapack -lblas -lm

Minion

Benchmark: Solitaire

Minion 0.12, Benchmark: Solitaire. Seconds, Fewer Is Better.
Stock: 145.82 (SE +/- 0.80, N = 3)
AVX fpmath: 148.53 (SE +/- 0.86, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Quasigroup

Minion 0.12, Benchmark: Quasigroup. Seconds, Fewer Is Better.
Stock: 188.09 (SE +/- 0.12, N = 3)
AVX fpmath: 189.65 (SE +/- 0.64, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Graceful

Minion 0.12, Benchmark: Graceful. Seconds, Fewer Is Better.
AVX fpmath: 89.23 (SE +/- 0.45, N = 3)
Stock: 90.02 (SE +/- 0.49, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Minion

Benchmark: Bibd

Minion 0.12, Benchmark: Bibd. Seconds, Fewer Is Better.
Stock: 161.22 (SE +/- 0.53, N = 3)
AVX fpmath: 163.42 (SE +/- 1.07, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams-mt

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver. MFLOPS, More Is Better.
AVX fpmath: 1378.74 (SE +/- 1.49, N = 3)
Stock: 1377.66 (SE +/- 0.77, N = 3)
(CC) gcc options: -O3 -march=native

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864, Multiple Sequence Alignment. Seconds, Fewer Is Better.
Stock: 4.89 (SE +/- 0.05, N = 3)
AVX fpmath: 5.06 (SE +/- 0.06, N = 3)
(CC) gcc options: -O3 -lm -lpthread

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2, Pfam Database Search. Seconds, Fewer Is Better.
AVX fpmath: 9.22 (SE +/- 0.05, N = 3)
Stock: 9.25 (SE +/- 0.03, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

GMPbench

Total Time

GMPbench 0.2, Total Time. GMPbench Score, More Is Better.
AVX fpmath: 2834.00
Stock: 2833.40
(CC) gcc options: -O3 -march=native

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.0e, RSA 4096-bit Performance. Signs Per Second, More Is Better.
AVX fpmath: 75.45 (SE +/- 0.25, N = 4)
Stock: 75.45 (SE +/- 0.19, N = 4)
(CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

VP8 libvpx Encoding

vpxenc

VP8 libvpx Encoding 0.9.7-p1, vpxenc. Frames Per Second, More Is Better.
Stock: 23.58 (SE +/- 0.08, N = 3)
AVX fpmath: 23.42 (SE +/- 0.01, N = 3)
(CC) gcc options: -m64 -lvpx -lm -lpthread

x264

H.264 Video Encoding

x264 2011-12-06, H.264 Video Encoding. Frames Per Second, More Is Better.
AVX fpmath: 172.89 (SE +/- 1.56, N = 3)
Stock: 171.68 (SE +/- 1.29, N = 3)

FFmpeg

AVI To NTSC VCD

FFmpeg 0.10, AVI To NTSC VCD. Seconds, Fewer Is Better.
AVX fpmath: 12.31 (SE +/- 0.01, N = 3)
Stock: 12.34 (SE +/- 0.06, N = 3)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.2.1, WAV To FLAC. Seconds, Fewer Is Better.
Stock: 6.29 (SE +/- 0.00, N = 5)
AVX fpmath: 6.38 (SE +/- 0.09, N = 6)
(CXX) g++ options: -O3 -march=native -logg -lm

Ogg Encoding

WAV To Ogg

Ogg Encoding 1.3.0, WAV To Ogg. Seconds, Fewer Is Better.
Stock: 9.43 (SE +/- 0.00, N = 5)
AVX fpmath: 9.44 (SE +/- 0.01, N = 5)
(CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native -lvorbis -lm -logg

LZMA Compression

256MB File Compression

LZMA Compression, 256MB File Compression. Seconds, Fewer Is Better.
Stock: 155.14 (SE +/- 0.29, N = 3)
AVX fpmath: 155.14 (SE +/- 0.38, N = 3)
(CC) gcc options: -O3 -march=native

CLOMP

Static OMP Speedup

CLOMP 3.3, Static OMP Speedup. Speedup, More Is Better.
Stock: 6.38 (SE +/- 0.03, N = 5)
AVX fpmath: 6.34 (SE +/- 0.05, N = 5)
(CC) gcc options: --openmp -O3 -lm

Fhourstones

Complex Connect-4 Solving

Fhourstones 3.1, Complex Connect-4 Solving. Kpos / sec, More Is Better.
Stock: 10353.83 (SE +/- 13.21, N = 3)
AVX fpmath: 10235.50 (SE +/- 32.53, N = 3)
(CC) gcc options: -O3

BYTE Unix Benchmark

Computational Test: Dhrystone 2

BYTE Unix Benchmark 3.6, Computational Test: Dhrystone 2. LPS, More Is Better.
Stock: 21255660.73 (SE +/- 40039.93, N = 3)
AVX fpmath: 21254777.57 (SE +/- 32934.69, N = 3)
(CC) gcc options: -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples. Seconds, Fewer Is Better.
Stock: 20 (SE +/- 0.33, N = 3)
AVX fpmath: 20 (SE +/- 0.33, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=native

C-Ray

Total Time

C-Ray 1.1, Total Time. Seconds, Fewer Is Better.
Stock: 25.70 (SE +/- 0.04, N = 3)
AVX fpmath: 25.74 (SE +/- 0.08, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native


Phoronix Test Suite v10.8.4