AMD Kaveri Compiler Tests AMD A10-7850K Kaveri APU on Linux running some quick tests between GCC and LLVM Clang of varying releases and obtained from the Ubuntu Linux archive as well as the LLVM.org APT repository for LLVM/Clang. Compiler AMD Linux tests by Michael Larabel.
HTML result view exported from: https://openbenchmarking.org/result/1401197-PL-AMDKAVERI37&grr .
AMD Kaveri Compiler Tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores) ASUS A88X-PRO AMD Device 1422 7168MB 240GB OCZ VERTEX3 ASUS AMD Radeon R7 1024MB (960/1066MHz) ATI R6xx HDMI VA2431 Realtek RTL8111/8168/8411 Ubuntu 13.10 3.12.0-031200-generic (x86_64) Unity 7.1.2 X Server 1.14.5 fglrx 13.30.1 4.3.12682 GCC 4.8 ext4 1920x1080 Clang 3.2-7ubuntu1 Clang 3.3-5ubuntu4 Clang 3.4-1~gd~s Clang 3.5-1~exp1 OpenBenchmarking.org Environment Details - LIBGL_DRIVERS_PATH=/usr/lib/i386-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/dri Compiler Details - GCC 4.8.2: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - Scaling Governor: acpi-cpufreq ondemand
AMD Kaveri Compiler Tests apache: Static Web Page Serving nginx: Static Web Page Serving hint: FLOAT openssl: RSA 4096-bit Performance povray: Total Time minion: Quasigroup minion: Solitaire minion: Graceful ffmpeg: H.264 HD To NTSC DV encode-mp3: WAV To MP3 encode-flac: WAV To FLAC smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time build-php: Time To Compile build-apache: Time To Compile himeno: Poisson Pressure Solver x264: H.264 Video Encoding ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping john-the-ripper: Blowfish tscp: AI Chess Performance scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo scimark2: Composite botan: X9.19-MAC botan: CAST-256 botan: Twofish botan: AES-256 botan: KASUMI botan: Tiger blake2: Phoronix Test Suite v5.0.0m0 mrbayes: Primate Phylogeny Analysis hmmer: Pfam Database Search rodinia: OpenMP Streamcluster rodinia: OpenMP CFD Solver parboil: OpenMP Stencil parboil: OpenMP CUTCP parboil: OpenMP LBM stream: Add stream: Triad stream: Scale stream: Copy GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 18626.27 23202.65 167321466.12 272.63 305.66 197.36 148.46 91.28 22.18 22.05 9.11 83 47.94 62.92 62.51 814.79 77.08 60.20 3172 697373 619.37 955.80 709.27 96.36 374.78 551.12 57.46 71.72 153.02 3484.21 57.60 330.03 11.11 22.66 22.64 69.60 181.99 57.17 39.12 492.16 6950.99 6964.29 6545.78 10514.31 18582.56 22909.44 126855139.19 275.70 210.30 162.35 90.57 21.89 22.74 10.38 304 84.93 35.91 39.77 778.87 777 602032 989.00 1001.57 730.13 97.23 289.57 621.50 11.15 27.88 27.69 7417.64 7430.89 6322.46 10144.09 18670.42 22865.18 116609505.33 276.20 209.50 153.27 89.67 22.04 22.37 9.87 307 84.82 35.85 39.07 789.57 77.14 779 601419 976.56 998.79 727.54 97.38 365.03 633.06 10.04 28.49 22.83 6896.16 6850.19 6313.74 9438.23 18530.06 22858.95 131700715.89 277.97 22.04 20.18 8.72 314 83.23 36.53 41.42 796.14 77.50 809 573201 987.41 1046.78 730.16 96.36 362.15 644.57 8.87 28.10 22.59 224.98 403.36 6867.13 6856.47 6342.96 9398.01 18519.78 22886.06 106584321.58 277.87 22.17 25.96 8.58 317 83.25 37.21 41.65 784.69 76.57 806 581799 976.41 1035.60 733.40 96.14 355.84 639.48 9.92 28.17 22.76 225.51 400.26 7193.37 7258.64 6615.34 9942.82 OpenBenchmarking.org
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.7 Static Web Page Serving GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 4K 8K 12K 16K 20K SE +/- 112.81, N = 3 SE +/- 41.22, N = 3 SE +/- 85.07, N = 3 SE +/- 98.38, N = 3 SE +/- 92.71, N = 3 18626.27 18582.56 18670.42 18530.06 18519.78 1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.0.11 Static Web Page Serving GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 5K 10K 15K 20K 25K SE +/- 115.62, N = 3 SE +/- 135.22, N = 3 SE +/- 133.79, N = 3 SE +/- 190.79, N = 3 SE +/- 149.50, N = 3 23202.65 22909.44 22865.18 22858.95 22886.06 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 40M 80M 120M 160M 200M SE +/- 1544279.35, N = 3 SE +/- 2101750.07, N = 3 SE +/- 394691.08, N = 3 SE +/- 972549.28, N = 3 SE +/- 862664.30, N = 3 167321466.12 126855139.19 116609505.33 131700715.89 106584321.58 1. (CC) gcc options: -O3 -march=native -lm
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.1f RSA 4096-bit Performance GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 60 120 180 240 300 SE +/- 0.41, N = 3 SE +/- 2.50, N = 3 SE +/- 2.97, N = 3 SE +/- 3.47, N = 3 SE +/- 3.56, N = 3 272.63 275.70 276.20 277.97 277.87 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
POV-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0 Total Time GCC 4.8.2 70 140 210 280 350 SE +/- 0.32, N = 3 305.66 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Minion Benchmark: Quasigroup OpenBenchmarking.org Seconds, Fewer Is Better Minion 0.15 Benchmark: Quasigroup GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 50 100 150 200 250 SE +/- 1.47, N = 3 SE +/- 1.28, N = 3 SE +/- 1.27, N = 3 197.36 210.30 209.50 1. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams -lz -lbz2
Minion Benchmark: Solitaire OpenBenchmarking.org Seconds, Fewer Is Better Minion 0.15 Benchmark: Solitaire GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 40 80 120 160 200 SE +/- 2.09, N = 3 SE +/- 1.42, N = 3 SE +/- 0.52, N = 3 148.46 162.35 153.27 1. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams -lz -lbz2
Minion Benchmark: Graceful OpenBenchmarking.org Seconds, Fewer Is Better Minion 0.15 Benchmark: Graceful GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 20 40 60 80 100 SE +/- 0.29, N = 3 SE +/- 0.30, N = 3 SE +/- 0.12, N = 3 91.28 90.57 89.67 1. (CXX) g++ options: -O3 -fomit-frame-pointer -rdynamic -lboost_iostreams -lz -lbz2
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 2.1.1 H.264 HD To NTSC DV GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.16, N = 3 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 22.18 21.89 22.04 22.04 22.17 -fno-tree-vectorize -MF -MT -Qunused-arguments -Qunused-arguments -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lXv -lX11 -lXext -lasound -lSDL -lm -pthread -lbz2 -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -MMD
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 6 12 18 24 30 SE +/- 0.07, N = 5 SE +/- 0.07, N = 5 SE +/- 0.01, N = 5 SE +/- 0.03, N = 5 SE +/- 0.08, N = 5 22.05 22.74 22.37 20.18 25.96 1. (CC) gcc options: -pipe -O3 -march=native -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.0 WAV To FLAC GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 3 6 9 12 15 SE +/- 0.13, N = 10 SE +/- 0.03, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 9.11 10.38 9.87 8.72 8.58 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 70 140 210 280 350 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.45, N = 3 83 304 307 314 317 1. (CXX) g++ options: -fopenmp -O3 -march=native
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.14, N = 3 47.94 84.93 84.82 83.23 83.25 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 5.2.9 Time To Compile GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 14 28 42 56 70 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 62.92 35.91 35.85 36.53 37.21 1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.7 Time To Compile GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 14 28 42 56 70 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 62.51 39.77 39.07 41.42 41.65
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 200 400 600 800 1000 SE +/- 5.26, N = 3 SE +/- 9.67, N = 3 SE +/- 1.85, N = 3 SE +/- 3.44, N = 3 SE +/- 0.84, N = 3 814.79 778.87 789.57 796.14 784.69 1. (CC) gcc options: -O3 -march=native
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2014-01-09 H.264 Video Encoding GCC 4.8.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 20 40 60 80 100 SE +/- 0.19, N = 5 SE +/- 0.11, N = 5 SE +/- 0.27, N = 5 SE +/- 0.24, N = 5 77.08 77.14 77.50 76.57 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.2z Phong Rendering With Soft-Shadow Mapping GCC 4.8.2 13 26 39 52 65 SE +/- 0.17, N = 3 60.20 1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.8.0 Test: Blowfish GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 700 1400 2100 2800 3500 SE +/- 16.46, N = 3 SE +/- 3.53, N = 3 SE +/- 3.53, N = 3 SE +/- 3.51, N = 3 SE +/- 1.00, N = 3 3172 777 779 809 806 1. (CC) gcc options: -fopenmp -lcrypt
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 150K 300K 450K 600K 750K SE +/- 1294.55, N = 5 SE +/- 3673.05, N = 5 SE +/- 525.20, N = 5 SE +/- 3223.52, N = 5 SE +/- 3269.61, N = 5 697373 602032 601419 573201 581799 1. (CC) gcc options: -O3 -march=native
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 200 400 600 800 1000 SE +/- 0.72, N = 4 SE +/- 1.30, N = 4 SE +/- 7.42, N = 4 SE +/- 2.05, N = 4 SE +/- 2.21, N = 4 619.37 989.00 976.56 987.41 976.41 1. (CXX) g++ options: -O3 -march=native
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 200 400 600 800 1000 SE +/- 1.40, N = 4 SE +/- 0.94, N = 4 SE +/- 3.42, N = 4 SE +/- 1.03, N = 4 SE +/- 1.16, N = 4 955.80 1001.57 998.79 1046.78 1035.60 1. (CXX) g++ options: -O3 -march=native
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 160 320 480 640 800 SE +/- 0.62, N = 4 SE +/- 0.75, N = 4 SE +/- 1.67, N = 4 SE +/- 2.71, N = 4 SE +/- 1.26, N = 4 709.27 730.13 727.54 730.16 733.40 1. (CXX) g++ options: -O3 -march=native
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 20 40 60 80 100 SE +/- 0.18, N = 4 SE +/- 0.31, N = 4 SE +/- 2.12, N = 4 SE +/- 0.25, N = 4 SE +/- 0.28, N = 4 96.36 97.23 97.38 96.36 96.14 1. (CXX) g++ options: -O3 -march=native
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 80 160 240 320 400 SE +/- 0.38, N = 4 SE +/- 12.30, N = 4 SE +/- 9.59, N = 4 SE +/- 0.79, N = 4 SE +/- 1.00, N = 4 374.78 289.57 365.03 362.15 355.84 1. (CXX) g++ options: -O3 -march=native
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 140 280 420 560 700 SE +/- 0.48, N = 4 SE +/- 2.45, N = 4 SE +/- 3.13, N = 4 SE +/- 1.19, N = 4 SE +/- 1.07, N = 4 551.12 621.50 633.06 644.57 639.48 1. (CXX) g++ options: -O3 -march=native
Botan Test: X9.19-MAC OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: X9.19-MAC GCC 4.8.2 13 26 39 52 65 57.46 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan Test: CAST-256 OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: CAST-256 GCC 4.8.2 16 32 48 64 80 71.72 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan Test: Twofish OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: Twofish GCC 4.8.2 30 60 90 120 150 153.02 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan Test: AES-256 OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: AES-256 GCC 4.8.2 700 1400 2100 2800 3500 3484.21 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan Test: KASUMI OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: KASUMI GCC 4.8.2 13 26 39 52 65 57.60 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan Test: Tiger OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.11.6 Test: Tiger GCC 4.8.2 70 140 210 280 350 330.03 1. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
BLAKE2 Phoronix Test Suite v5.0.0m0 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20130131 Phoronix Test Suite v5.0.0m0 GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 3 6 9 12 15 SE +/- 0.68, N = 6 SE +/- 0.74, N = 6 SE +/- 0.00, N = 3 SE +/- 0.34, N = 6 SE +/- 0.53, N = 6 11.11 11.15 10.04 8.87 9.92 1. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.1.2 Primate Phylogeny Analysis GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 7 14 21 28 35 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 22.66 27.88 28.49 28.10 28.17
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.41, N = 3 SE +/- 0.35, N = 3 SE +/- 0.09, N = 3 SE +/- 0.30, N = 3 22.64 27.69 22.83 22.59 22.76 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster GCC 4.8.2 LLVM Clang 3.4 LLVM Clang 3.5 SVN 50 100 150 200 250 SE +/- 0.11, N = 3 SE +/- 0.54, N = 3 SE +/- 0.66, N = 3 69.60 224.98 225.51 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver GCC 4.8.2 LLVM Clang 3.4 LLVM Clang 3.5 SVN 90 180 270 360 450 SE +/- 0.13, N = 3 SE +/- 1.49, N = 3 SE +/- 0.52, N = 3 181.99 403.36 400.26 1. (CXX) g++ options: -O2 -lOpenCL
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil GCC 4.8.2 13 26 39 52 65 SE +/- 0.06, N = 3 57.17 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP GCC 4.8.2 9 18 27 36 45 SE +/- 0.16, N = 3 39.12 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM GCC 4.8.2 110 220 330 440 550 SE +/- 0.43, N = 3 492.16 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 1600 3200 4800 6400 8000 SE +/- 1.19, N = 10 SE +/- 5.99, N = 10 SE +/- 4.10, N = 10 SE +/- 5.69, N = 10 SE +/- 13.55, N = 9 6950.99 7417.64 6896.16 6867.13 7193.37 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 1600 3200 4800 6400 8000 SE +/- 1.34, N = 9 SE +/- 2.25, N = 7 SE +/- 4.80, N = 10 SE +/- 3.32, N = 9 SE +/- 12.56, N = 8 6964.29 7430.89 6850.19 6856.47 7258.64 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 1400 2800 4200 5600 7000 SE +/- 2.93, N = 8 SE +/- 9.36, N = 9 SE +/- 2.34, N = 10 SE +/- 3.66, N = 10 SE +/- 21.00, N = 9 6545.78 6322.46 6313.74 6342.96 6615.34 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy GCC 4.8.2 LLVM Clang 3.2 LLVM Clang 3.3 LLVM Clang 3.4 LLVM Clang 3.5 SVN 2K 4K 6K 8K 10K SE +/- 3.78, N = 10 SE +/- 2.50, N = 10 SE +/- 5.61, N = 10 SE +/- 6.09, N = 10 SE +/- 2.97, N = 10 10514.31 10144.09 9438.23 9398.01 9942.82 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.5