Amazon EC2 Cloud Linux Performance Comparison New Amazon c3.large vs. c3.xlarge vs. c3.2xlarge vs. c3.4xlarge vs. c3.8xlarge instance type with Ivy Bridge CPUs being benchmarked at Phoronix by Michael Larabel for a future article. Compared to various systems locally.
HTML result view exported from: https://openbenchmarking.org/result/1311230-SO-1311232SO55&sro&grs .
Amazon EC2 Cloud Linux Performance Comparison Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution System Layer AMD Opteron 2384 AMD FX-8150 AMD FX-8350 Intel Core i7 4960X c3.large c3.xlarge c3.2xlarge c3.4xlarge c3.8xlarge E5-2680v2-256GB c3.4xlarge.mattb AMD Opteron 2384 @ 2.70GHz (4 Cores) TYAN S2927/S2927-E NVIDIA MCP55 4096MB 64GB AGILITY-EX AMD Radeon HD 4870 512MB ATI R6xx HDMI Acer P243W Ubuntu 13.10 3.11.0-12-generic (x86_64) Unity 7.1.2 X Server 1.14.3 radeon 7.2.0 3.1 Mesa 9.2.1 Gallium 0.4 GCC 4.8 ext4 1920x1200 AMD FX-8150 Eight-Core @ 3.60GHz (8 Cores) MSI 760GM-P34(FX) (MS-7641) v4.0 AMD RS780 + SB7x0/SB8x0/SB9x0 500GB Western Digital WD5000AZRX-0 Gigabyte NVIDIA GeForce 8600 GTS 256MB (675/1008MHz) Realtek ALC887-VD Realtek RTL8111/8168/8411 3.11.0-13-generic (x86_64) NVIDIA 304.88 3.3.0 NVIDIA 304.88 GCC 4.8 + Clang 3.2-7ubuntu1 1920x1080 AMD FX-8350 Eight-Core @ 4.00GHz (8 Cores) ASUS Crosshair V Formula AMD RD890 bridge 8192MB 64GB OCZ AGILITY Sapphire AMD Radeon HD 5000/6000/7350/8350 512MB Realtek ALC889 VA2431 Intel 82583V Gigabit Connection 3.11.0-12-generic (x86_64) radeon 7.2.0 3.1 Mesa 9.2.1 Gallium 0.4 GCC 4.8 Intel Core i7-4960X @ 3.60GHz (12 Cores) MSI X79MA-GD45 (MS-7738) v1.0 Intel Xeon E5 v2/Core 320GB Seagate ST3320620AS Sapphire AMD Radeon HD 4870 512MB Realtek ALC892 Acer P243W Realtek RTL8111/8168/8411 3.11.0-11-generic (x86_64) Unity 7.1.1 3.1 Mesa 9.3.0-devel (git-e166a58) Gallium 0.4 GCC 4.8 + LLVM 3.2 1920x1200 Intel Xeon E5-2680 v2 @ 2.79GHz (2 Cores) 4096MB 8GB 3.11.0-12-generic (x86_64) Xen 4.2.amazon Hypervisor Intel Xeon E5-2680 v2 @ 2.79GHz (4 Cores) 8192MB GCC 4.8 Intel Xeon E5-2680 v2 @ 2.79GHz (8 Cores) 15360MB Intel Xeon E5-2680 v2 @ 2.80GHz (16 Cores) 30720MB Intel Xeon E5-2680 v2 @ 2.80GHz (32 Cores) 60416MB 2 x Intel Xeon E5-2680 v2 @ 2.80GHz (40 Cores) Supermicro X9DRW v0123456789 Intel Ivytown DMI2 258048MB 256GB Samsung SSD 840 Matrox s G200eR2 Intel I350 Gigabit Connection Ubuntu 12.04 3.2.0-23-generic (x86_64) GCC 4.6 Intel Xeon E5-2680 v2 @ 2.79GHz (16 Cores) 30720MB 8GB 3.2.0-54-virtual (x86_64) Xen 4.2.amazon Hypervisor OpenBenchmarking.org Compiler Details - AMD Opteron 2384: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - AMD FX-8150: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - AMD FX-8350: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - Intel Core i7 4960X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - c3.xlarge: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - c3.2xlarge: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - c3.4xlarge: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - c3.8xlarge: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v - E5-2680v2-256GB: --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v - c3.4xlarge.mattb: --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v Processor Details - AMD Opteron 2384, AMD FX-8150, AMD FX-8350, Intel Core i7 4960X, E5-2680v2-256GB: Scaling Governor: acpi-cpufreq ondemand OpenCL Details - AMD FX-8150: GPU Compute Cores: 32 System Details - AMD FX-8150: GPU Compute Cores: 32.
Amazon EC2 Cloud Linux Performance Comparison compress-pbzip2: 256MB File Compression rodinia: OpenMP Streamcluster john-the-ripper: Traditional DES parboil: OpenMP MRI Gridding john-the-ripper: Blowfish build-linux-kernel: Time To Compile rodinia: OpenMP CFD Solver c-ray: Total Time parboil: OpenMP CUTCP parboil: OpenMP LBM parboil: OpenMP Stencl AMD Opteron 2384 AMD FX-8150 AMD FX-8350 Intel Core i7 4960X c3.large c3.xlarge c3.2xlarge c3.4xlarge c3.8xlarge E5-2680v2-256GB c3.4xlarge.mattb 16.30 67.05 9546000 79.08 2966 167.40 234.54 54.13 43.09 545.82 94.04 7.49 40.60 17224333 321.33 6415 100.34 89.84 29.97 20.47 276.37 41.94 6.57 33.86 19168667 256.29 7120 83.61 82.95 25.86 17.40 234.82 39.13 5.21 32.65 14891667 51.92 8922 62.28 66.23 21.64 10.36 239.15 35.68 34.96 100.87 2095333 14.34 1247 487.69 403.02 141.81 69.03 938.53 123.49 17.54 52.65 4190667 28.29 2512 244.67 205.14 70.87 35.34 483.24 67.67 8.97 28.03 8354000 44.74 5023 125.87 104.42 35.44 18.58 255.00 34.53 4.66 18.25 16777000 70.87 10013 69.97 56.44 17.73 9.57 133.71 19.01 13.49 33128000 144.11 19968 40.38 28.61 8.91 5.37 68.80 11.96 18.53 80709667 205.77 24267 26.51 23.38 11.19 4.56 63.60 11.20 4.99 18.41 34842667 80.74 10047 64.59 56.90 27.67 9.97 134.35 19.44 OpenBenchmarking.org
Parallel BZIP2 Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.1.6 256MB File Compression AMD FX-8150 AMD FX-8350 AMD Opteron 2384 Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.large c3.xlarge 8 16 24 32 40 SE +/- 0.14, N = 6 SE +/- 0.13, N = 3 SE +/- 0.02, N = 3 SE +/- 0.17, N = 6 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.29, N = 6 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 7.49 6.57 16.30 5.21 8.97 4.66 4.99 34.96 17.54 1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 40.60 33.86 67.05 18.53 32.65 28.03 18.25 18.41 13.49 100.87 52.65 1. (CXX) g++ options: -O2 -lOpenCL
John The Ripper Test: Traditional DES OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9-jumbo-7 Test: Traditional DES AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 20M 40M 60M 80M 100M SE +/- 10666.67, N = 3 SE +/- 50190.75, N = 3 SE +/- 46758.24, N = 3 SE +/- 747853.45, N = 3 SE +/- 49666.67, N = 3 SE +/- 17000.00, N = 3 SE +/- 0.00, N = 3 SE +/- 231062.28, N = 3 SE +/- 0.00, N = 3 SE +/- 6119.19, N = 3 SE +/- 19333.33, N = 3 17224333 19168667 9546000 80709667 14891667 8354000 16777000 34842667 33128000 2095333 4190667 1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 70 140 210 280 350 SE +/- 1.26, N = 3 SE +/- 0.95, N = 3 SE +/- 0.35, N = 3 SE +/- 3.05, N = 6 SE +/- 0.32, N = 3 SE +/- 1.87, N = 6 SE +/- 0.91, N = 3 SE +/- 0.09, N = 3 SE +/- 1.67, N = 3 SE +/- 0.82, N = 6 SE +/- 0.71, N = 6 321.33 256.29 79.08 205.77 51.92 44.74 70.87 80.74 144.11 14.34 28.29 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9-jumbo-7 Test: Blowfish AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 5K 10K 15K 20K 25K SE +/- 0.00, N = 3 SE +/- 16.00, N = 3 SE +/- 10.00, N = 3 SE +/- 53.33, N = 3 SE +/- 6.00, N = 3 SE +/- 16.67, N = 3 SE +/- 33.33, N = 3 SE +/- 33.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 8.33, N = 3 6415 7120 2966 24267 8922 5023 10013 10047 19968 1247 2512 1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 3.1 Time To Compile AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 110 220 330 440 550 SE +/- 0.99, N = 3 SE +/- 0.84, N = 3 SE +/- 0.99, N = 3 SE +/- 0.46, N = 4 SE +/- 0.58, N = 3 SE +/- 0.96, N = 3 SE +/- 0.54, N = 3 SE +/- 1.54, N = 6 SE +/- 0.57, N = 5 SE +/- 1.51, N = 3 SE +/- 1.26, N = 3 100.34 83.61 167.40 26.51 62.28 125.87 69.97 64.59 40.38 487.69 244.67
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 90 180 270 360 450 SE +/- 0.21, N = 3 SE +/- 0.56, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.20, N = 3 SE +/- 0.24, N = 3 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 SE +/- 0.48, N = 3 SE +/- 0.33, N = 3 89.84 82.95 234.54 23.38 66.23 104.42 56.44 56.90 28.61 403.02 205.14 1. (CXX) g++ options: -O2 -lOpenCL
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 29.97 25.86 54.13 11.19 21.64 35.44 17.73 27.67 8.91 141.81 70.87 -march=native 1. (CC) gcc options: -lm -lpthread -O3
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 15 30 45 60 75 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 20.47 17.40 43.09 4.56 10.36 18.58 9.57 9.97 5.37 69.03 35.34 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 200 400 600 800 1000 SE +/- 4.27, N = 3 SE +/- 2.32, N = 3 SE +/- 1.90, N = 3 SE +/- 1.14, N = 3 SE +/- 0.32, N = 3 SE +/- 1.87, N = 3 SE +/- 0.37, N = 3 SE +/- 0.14, N = 3 SE +/- 0.50, N = 3 SE +/- 0.54, N = 3 SE +/- 3.08, N = 3 276.37 234.82 545.82 63.60 239.15 255.00 133.71 134.35 68.80 938.53 483.24 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: OpenMP Stencl OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencl AMD FX-8150 AMD FX-8350 AMD Opteron 2384 E5-2680v2-256GB Intel Core i7 4960X c3.2xlarge c3.4xlarge c3.4xlarge.mattb c3.8xlarge c3.large c3.xlarge 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.29, N = 3 SE +/- 0.20, N = 3 SE +/- 0.05, N = 3 SE +/- 0.59, N = 3 SE +/- 0.54, N = 5 SE +/- 0.43, N = 6 SE +/- 0.50, N = 6 SE +/- 0.33, N = 6 SE +/- 0.42, N = 3 SE +/- 1.08, N = 3 41.94 39.13 94.04 11.20 35.68 34.53 19.01 19.44 11.96 123.49 67.67 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Phoronix Test Suite v10.8.4