Linux Kernel Benchmarks AMD Opteron AMD Opteron 2384 testing on Ubuntu 13.10 x86_64 with Linux 3.0, 3.4, 3.10, 3.12 kernels. Benchmarks by Michael Larabel for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1311125-SO-LINUXKERN95&sro&gru .
Linux Kernel Benchmarks AMD Opteron Processor Motherboard Chipset Memory Disk Graphics Audio Monitor OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution OpenGL Linux 3.0.101 Linux 3.4.68 Linux 3.10.18 Linux 3.12.0 AMD Opteron 2384 @ 2.70GHz (4 Cores) TYAN S2927/S2927-E NVIDIA MCP55 4096MB 64GB AGILITY-EX AMD Radeon HD 4870 512MB ATI R6xx HDMI Acer P243W Ubuntu 13.10 3.0.101-0300101-generic (x86_64) Xfce 4.10 X Server 1.14.3 radeon 7.2.0 GCC 4.8 ext4 1920x1200 3.4.68-030468-generic (x86_64) 2.1 Mesa 9.2.1 Gallium 0.4 3.10.18-031018-generic (x86_64) 3.1 Mesa 9.2.1 Gallium 0.4 3.12.0-031200-generic (x86_64) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Disk Details - Linux 3.0.101: CFQ / barrier=1,data=ordered,errors=remount-ro,relatime,rw,user_xattr - Linux 3.4.68: CFQ / data=ordered,errors=remount-ro,relatime,rw - Linux 3.10.18: DEADLINE / data=ordered,errors=remount-ro,relatime,rw - Linux 3.12.0: DEADLINE / data=ordered,errors=remount-ro,relatime,rw Processor Details - Linux 3.0.101: Scaling Governor: powernow-k8 ondemand - Linux 3.4.68: Scaling Governor: powernow-k8 ondemand - Linux 3.10.18: Scaling Governor: acpi-cpufreq ondemand - Linux 3.12.0: Scaling Governor: acpi-cpufreq ondemand Graphics Details - EXA System Details - Linux 3.0.101: Disk Scheduler: CFQ. - Linux 3.4.68: Disk Scheduler: CFQ. - Linux 3.10.18: Disk Scheduler: DEADLINE. - Linux 3.12.0: Disk Scheduler: DEADLINE.
Linux Kernel Benchmarks AMD Opteron fs-mark: 1000 Files, 1MB Size fs-mark: 5000 Files, 1MB Size, 4 Threads fs-mark: 4000 Files, 32 Sub Dirs, 1MB Size urbanterror: 1920 x 1200 openarena: 1920 x 1200 openarena: 1920 x 1200 padman: 1920 x 1200 xonotic: 1920 x 1200 - Low xonotic: 1920 x 1200 - High hpcc: G-Ptrans hpcc: EP-STREAM Triad hpcc: Rand Ring Bandwidth hpcc: G-HPL hpcc: G-Ffte hpcc: EP-DGEMM hpcc: G-Rand Access hpcc: Max Ping Pong Bandwidth himeno: Poisson Pressure Solver postmark: Disk Transaction Performance parboil: Seven-Point Stencil parboil: Cutoff Pair Potential parboil: Lid-Driven Cavity Fluid Dynamics rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster hmmer: Pfam Database Search mafft: Multiple Sequence Alignment build-linux-kernel: Time To Compile c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples hpcc: Rand Ring Latency Linux 3.0.101 Linux 3.4.68 Linux 3.10.18 Linux 3.12.0 21.27 39.82 21.45 0.61906 2.36854 0.60114 12.25250 2.74437 5.58376 0.03677 2191.870 538.48 1497 93.40 44.76 544.28 94.47 237.45 68.06 23.35 13.15 168.60 54.41 250 0.71676 20.93 44.40 22.68 21.55 122.80 58.47 127.03 62.58 19.47 0.62281 1.80643 0.59934 12.42107 2.75175 5.86491 0.03699 2186.929 510.65 1386 92.57 43.26 544.36 92.75 234.19 67.02 23.04 13.32 166.95 54.01 250 0.73436 20.92 45.73 21.25 61.93 179.80 69.33 154.97 69.95 51.00 0.61807 1.88706 0.59729 12.42990 2.74960 6.14844 0.03676 2181.116 504.28 1258 93.06 44.27 543.28 93.74 236.30 67.68 22.84 12.87 170.68 54.02 251 0.72616 21.17 45.07 22.57 64.20 201.60 71 174.30 89.44 60.75 0.62006 1.80232 0.60093 12.45260 2.76571 6.20508 0.03680 2185.325 540.63 1518 92.87 57.68 543.10 92.11 234.35 67.02 23.52 12.30 167.90 54.04 250 0.72060 OpenBenchmarking.org
FS-Mark Test: 1000 Files, 1MB Size OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 1000 Files, 1MB Size Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 5 10 15 20 25 SE +/- 0.13, N = 3 SE +/- 0.40, N = 6 SE +/- 0.54, N = 6 SE +/- 0.27, N = 3 21.27 20.92 21.17 20.93 1. (CC) gcc options: -static
FS-Mark Test: 5000 Files, 1MB Size, 4 Threads OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 5000 Files, 1MB Size, 4 Threads Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 10 20 30 40 50 SE +/- 2.18, N = 6 SE +/- 1.52, N = 6 SE +/- 1.72, N = 6 SE +/- 2.42, N = 6 39.82 45.73 45.07 44.40 1. (CC) gcc options: -static
FS-Mark Test: 4000 Files, 32 Sub Dirs, 1MB Size OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 4000 Files, 32 Sub Dirs, 1MB Size Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 5 10 15 20 25 SE +/- 0.39, N = 6 SE +/- 0.49, N = 6 SE +/- 0.35, N = 3 SE +/- 0.37, N = 4 21.45 21.25 22.57 22.68 1. (CC) gcc options: -static
Urban Terror Resolution: 1920 x 1200 OpenBenchmarking.org Frames Per Second, More Is Better Urban Terror 4.2.013 Resolution: 1920 x 1200 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 14 28 42 56 70 SE +/- 0.22, N = 3 SE +/- 0.21, N = 3 SE +/- 7.69, N = 6 61.93 64.20 21.55
OpenArena Resolution: 1920 x 1200 OpenBenchmarking.org Frames Per Second, More Is Better OpenArena 0.8.5 Resolution: 1920 x 1200 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 40 80 120 160 200 SE +/- 0.42, N = 3 SE +/- 1.07, N = 3 SE +/- 0.70, N = 3 179.80 201.60 122.80
OpenArena Resolution: 1920 x 1200 OpenBenchmarking.org Frames Per Second, More Is Better OpenArena 0.8.8 Resolution: 1920 x 1200 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 16 32 48 64 80 SE +/- 0.41, N = 3 SE +/- 0.00, N = 3 SE +/- 0.07, N = 3 69.33 71.00 58.47 MIN: 8 MIN: 8 MIN: 8
World of Padman Resolution: 1920 x 1200 OpenBenchmarking.org Frames Per Second, More Is Better World of Padman 1.2 Resolution: 1920 x 1200 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 40 80 120 160 200 SE +/- 0.28, N = 3 SE +/- 0.31, N = 3 SE +/- 0.52, N = 3 154.97 174.30 127.03
Xonotic Resolution: 1920 x 1200 - Effects Quality: Low OpenBenchmarking.org Frames Per Second, More Is Better Xonotic 0.7 Resolution: 1920 x 1200 - Effects Quality: Low Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.11, N = 3 SE +/- 0.22, N = 3 69.95 89.44 62.58 MIN: 55 / MAX: 142 MIN: 66 / MAX: 259 MIN: 41 / MAX: 105
Xonotic Resolution: 1920 x 1200 - Effects Quality: High OpenBenchmarking.org Frames Per Second, More Is Better Xonotic 0.7 Resolution: 1920 x 1200 - Effects Quality: High Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 14 28 42 56 70 SE +/- 0.09, N = 3 SE +/- 0.26, N = 3 SE +/- 0.31, N = 6 51.00 60.75 19.47 MIN: 25 / MAX: 85 MIN: 23 / MAX: 127 MIN: 6 / MAX: 89
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.4.3 Test / Class: G-Ptrans Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.1401 0.2802 0.4203 0.5604 0.7005 SE +/- 0.00903, N = 3 SE +/- 0.00604, N = 3 SE +/- 0.00443, N = 3 SE +/- 0.00528, N = 3 0.61906 0.61807 0.62006 0.62281 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.4.3 Test / Class: EP-STREAM Triad Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.5329 1.0658 1.5987 2.1316 2.6645 SE +/- 0.05860, N = 3 SE +/- 0.09207, N = 3 SE +/- 0.00266, N = 3 SE +/- 0.00529, N = 3 2.36854 1.88706 1.80232 1.80643 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.4.3 Test / Class: Random Ring Bandwidth Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.1353 0.2706 0.4059 0.5412 0.6765 SE +/- 0.00133, N = 3 SE +/- 0.00093, N = 3 SE +/- 0.00020, N = 3 SE +/- 0.00042, N = 3 0.60114 0.59729 0.60093 0.59934 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: G-HPL Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 12.25 12.43 12.45 12.42 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: G-Ffte Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.6223 1.2446 1.8669 2.4892 3.1115 SE +/- 0.00258, N = 3 SE +/- 0.00907, N = 3 SE +/- 0.00405, N = 3 SE +/- 0.03836, N = 3 2.74437 2.74960 2.76571 2.75175 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.4.3 Test / Class: EP-DGEMM Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 2 4 6 8 10 SE +/- 0.96527, N = 3 SE +/- 0.42035, N = 3 SE +/- 0.42679, N = 3 SE +/- 0.72053, N = 3 5.58376 6.14844 6.20508 5.86491 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.4.3 Test / Class: G-Random Access Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.0083 0.0166 0.0249 0.0332 0.0415 SE +/- 0.00016, N = 3 SE +/- 0.00020, N = 3 SE +/- 0.00041, N = 3 SE +/- 0.00075, N = 3 0.03677 0.03676 0.03680 0.03699 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.4.3 Test / Class: Max Ping Pong Bandwidth Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 500 1000 1500 2000 2500 SE +/- 10.83, N = 3 SE +/- 9.63, N = 3 SE +/- 6.00, N = 3 SE +/- 5.01, N = 3 2191.87 2181.12 2185.33 2186.93 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 120 240 360 480 600 SE +/- 1.54, N = 3 SE +/- 1.94, N = 3 SE +/- 3.33, N = 3 SE +/- 2.36, N = 3 538.48 504.28 540.63 510.65 1. (CC) gcc options: -O3
PostMark Disk Transaction Performance OpenBenchmarking.org TPS, More Is Better PostMark 1.51 Disk Transaction Performance Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 300 600 900 1200 1500 SE +/- 15.59, N = 3 SE +/- 13.04, N = 3 SE +/- 7.94, N = 3 SE +/- 2.33, N = 3 1497 1258 1518 1386 1. (CC) gcc options: -O3
Urban Terror Resolution: 1920 x 1200 - Total Frame Time OpenBenchmarking.org Milliseconds, Fewer Is Better Urban Terror 4.2.013 Resolution: 1920 x 1200 - Total Frame Time Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 20 40 60 80 100 Min: 2 / Avg: 16.06 / Max: 42 Min: 2 / Avg: 15.46 / Max: 40 Min: 2 / Avg: 18.69 / Max: 99
OpenArena Resolution: 1920 x 1200 - Total Frame Time OpenBenchmarking.org Milliseconds, Fewer Is Better OpenArena 0.8.8 Resolution: 1920 x 1200 - Total Frame Time Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 10 20 30 40 50 Min: 8 / Avg: 14.37 / Max: 45 Min: 8 / Avg: 14.05 / Max: 37 Min: 8 / Avg: 17.07 / Max: 47
Parboil Test: Seven-Point Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: Seven-Point Stencil Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 93.40 93.06 92.87 92.57 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: Cutoff Pair Potential OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: Cutoff Pair Potential Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 13 26 39 52 65 SE +/- 0.68, N = 5 SE +/- 0.03, N = 3 SE +/- 0.41, N = 3 SE +/- 0.03, N = 3 44.76 44.27 57.68 43.26 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: Lid-Driven Cavity Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: Lid-Driven Cavity Fluid Dynamics Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 120 240 360 480 600 SE +/- 0.21, N = 3 SE +/- 0.27, N = 3 SE +/- 0.18, N = 3 SE +/- 1.95, N = 3 544.28 543.28 543.10 544.36 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Leukocyte Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.23, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 94.47 93.74 92.11 92.75 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 50 100 150 200 250 SE +/- 0.72, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 237.45 236.30 234.35 234.19 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP Streamcluster Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 15 30 45 60 75 SE +/- 0.80, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 68.06 67.68 67.02 67.02 1. (CXX) g++ options: -O2 -lOpenCL
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 6 12 18 24 30 SE +/- 0.28, N = 3 SE +/- 0.04, N = 3 SE +/- 0.46, N = 6 SE +/- 0.19, N = 3 23.35 22.84 23.52 23.04 1. (CC) gcc options: -O2 -pthread -lhmmer -lsquid -lm
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 3 6 9 12 15 SE +/- 0.20, N = 6 SE +/- 0.30, N = 6 SE +/- 0.29, N = 6 SE +/- 0.03, N = 3 13.15 12.87 12.30 13.32 1. (CC) gcc options: -O3 -lm -lpthread
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 3.1 Time To Compile Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 40 80 120 160 200 SE +/- 1.74, N = 3 SE +/- 1.19, N = 3 SE +/- 1.11, N = 3 SE +/- 1.21, N = 3 168.60 170.68 167.90 166.95
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 12 24 36 48 60 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 54.41 54.02 54.04 54.01 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 50 100 150 200 250 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 250 251 250 250 1. (CXX) g++ options: -fopenmp
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.4.3 Test / Class: Random Ring Latency Linux 3.0.101 Linux 3.10.18 Linux 3.12.0 Linux 3.4.68 0.1652 0.3304 0.4956 0.6608 0.826 SE +/- 0.00618, N = 3 SE +/- 0.00902, N = 3 SE +/- 0.00726, N = 3 SE +/- 0.00639, N = 3 0.71676 0.72616 0.72060 0.73436 1. (CC) gcc options: -fomit-frame-pointer -O3 -march=native -funroll-loops -lblas -lm -pthread -lmpi -lopen-rte -lopen-pal -ldl -lnsl -lutil 2. BLAS + Open MPI 1.4.5
Phoronix Test Suite v10.8.5