AMD EPYC 7642 compiler testing by Michael Larabel for a future article.
GCC 9.2.0 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 9.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
GCC 10.0.0 20191117 OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 10.0.0 20191117, File-System: ext4, Screen Resolution: 1024x768
LLVM Clang 10 Git OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: Clang 10.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2Processor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 20 40 60 80 100 SE +/- 1.63, N = 3 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 103.99 102.20 94.94 -mabm -mabm 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 2 4 6 8 10 SE +/- 0.052, N = 3 SE +/- 0.035, N = 3 SE +/- 0.085, N = 3 7.560 8.087 5.871 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 4K GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 18.37 18.28 16.26 1. (CXX) g++ options: -O3 -march=native -lpthread
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 13 26 39 52 65 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 SE +/- 0.15, N = 3 55.72 56.38 50.76 1. (CXX) g++ options: -O3 -march=native -lpthread
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Chimera 1080p 10-bit GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.02, N = 3 16.72 16.65 16.91 1. (CXX) g++ options: -O3 -march=native -lpthread
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 600 1200 1800 2400 3000 SE +/- 3.13, N = 3 SE +/- 9.84, N = 3 SE +/- 4.31, N = 3 2774.05 2754.17 2800.49 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 130 260 390 520 650 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 594.61 594.76 602.82 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 50 100 150 200 250 SE +/- 0.17, N = 3 SE +/- 0.29, N = 3 SE +/- 0.42, N = 3 196.08 197.88 218.85 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 700 1400 2100 2800 3500 SE +/- 9.11, N = 3 SE +/- 7.23, N = 3 SE +/- 3.59, N = 3 2755.87 2749.81 3341.43 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 2K 4K 6K 8K 10K SE +/- 6.90, N = 3 SE +/- 44.47, N = 3 SE +/- 22.64, N = 3 8575.57 8484.97 8231.47 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 400 800 1200 1600 2000 SE +/- 0.22, N = 3 SE +/- 0.26, N = 3 SE +/- 0.06, N = 3 1748.11 1743.45 1607.88 1. (CC) gcc options: -O3 -march=native -lm
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 200K 400K 600K 800K 1000K SE +/- 725.51, N = 5 SE +/- 586.45, N = 5 SE +/- 467.20, N = 5 1033744 1010602 1133755 1. (CC) gcc options: -O3 -march=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 400 800 1200 1600 2000 SE +/- 3.18, N = 3 SE +/- 1.53, N = 3 1675 1624 45 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 110 220 330 440 550 SE +/- 2.96, N = 3 SE +/- 0.67, N = 3 505 481 516 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 120 240 360 480 600 573 572 13 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 865 786 21 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 500 1000 1500 2000 2500 SE +/- 27.63, N = 5 SE +/- 30.90, N = 3 2138 1873 113 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 140 280 420 560 700 SE +/- 1.86, N = 3 651 634 25 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 300 600 900 1200 1500 SE +/- 4.04, N = 3 SE +/- 1.53, N = 3 1170 1070 204 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p GCC 9.2.0 LLVM Clang 10 Git 120 240 360 480 600 SE +/- 0.79, N = 3 SE +/- 2.02, N = 3 546.36 575.08 MIN: 345.34 / MAX: 673.27 MIN: 353.68 / MAX: 718.09 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 4K GCC 9.2.0 LLVM Clang 10 Git 60 120 180 240 300 SE +/- 0.54, N = 3 SE +/- 0.20, N = 3 273.96 279.11 MIN: 129.55 / MAX: 296.84 MIN: 127.8 / MAX: 303.17 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 1080p GCC 9.2.0 LLVM Clang 10 Git 130 260 390 520 650 SE +/- 1.75, N = 3 SE +/- 1.70, N = 3 588.19 595.62 MIN: 266.11 / MAX: 655.78 MIN: 260.63 / MAX: 664.58 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p 10-bit GCC 9.2.0 LLVM Clang 10 Git 20 40 60 80 100 SE +/- 0.21, N = 3 SE +/- 0.06, N = 3 102.19 75.81 MIN: 67.61 / MAX: 169.03 MIN: 50.99 / MAX: 121.26 1. (CC) gcc options: -O3 -march=native -pthread
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 0 - Input: 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 0.016 0.032 0.048 0.064 0.08 SE +/- 0.000, N = 6 SE +/- 0.000, N = 9 SE +/- 0.000, N = 9 0.065 0.067 0.071 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 4 - Input: 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 3 6 9 12 15 SE +/- 0.061, N = 3 SE +/- 0.022, N = 3 SE +/- 0.072, N = 3 9.810 9.942 10.400 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 8 - Input: 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 20 40 60 80 100 SE +/- 0.31, N = 3 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 79.73 81.10 87.14 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: VMAF Optimized - Input: Bosphorus 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 80 160 240 320 400 SE +/- 1.11, N = 3 SE +/- 1.14, N = 3 SE +/- 1.42, N = 3 370.91 372.91 379.36 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 80 160 240 320 400 SE +/- 5.05, N = 3 SE +/- 1.93, N = 3 SE +/- 2.39, N = 3 375.84 377.46 382.36 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 70 140 210 280 350 SE +/- 1.87, N = 3 SE +/- 2.79, N = 3 SE +/- 4.64, N = 3 293.33 288.98 298.55 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.1 vpxenc VP9 1080p Video Encode GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 40 80 120 160 200 SE +/- 0.16, N = 3 SE +/- 0.59, N = 3 SE +/- 0.22, N = 3 155.85 155.17 160.44 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 30 60 90 120 150 SE +/- 0.96, N = 3 SE +/- 1.50, N = 3 SE +/- 0.22, N = 3 154.14 152.23 154.61 -mstack-alignment=64 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.1.2 H.265 1080p Video Encoding GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 11 22 33 44 55 SE +/- 0.12, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 48.76 48.36 48.92 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 400K 800K 1200K 1600K 2000K SE +/- 1399.84, N = 3 SE +/- 2856.66, N = 3 SE +/- 6972.36, N = 3 1589318.73 1664749.69 1283555.69 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 13.54 13.73 21.84 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 9.2.0 GCC 10.0.0 20191117 1.1531 2.3062 3.4593 4.6124 5.7655 SE +/- 0.031, N = 3 SE +/- 0.281, N = 15 2.755 5.125 1. (CXX) g++ options: -fopenmp -O3 -march=native
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 SE +/- 0.04, N = 3 37.18 35.98 41.36 1. (CC) gcc options: -lm -O3 -march=native
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 22.36 22.20 22.41 1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 2 4 6 8 10 SE +/- 0.020, N = 3 SE +/- 0.006, N = 3 SE +/- 0.011, N = 3 8.546 8.580 8.667 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 9.2.0 LLVM Clang 10 Git 3 6 9 12 15 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 9.019 10.404 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr 1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 2K 4K 6K 8K 10K SE +/- 2.79, N = 3 SE +/- 6.05, N = 3 SE +/- 3.88, N = 3 10301.1 10290.6 7550.7 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Degridding GCC 9.2.0 GCC 10.0.0 20191117 1300 2600 3900 5200 6500 SE +/- 3.14, N = 3 SE +/- 1.85, N = 3 5892.25 5962.81 1. (CXX) g++ options: -lpthread
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Gridding GCC 9.2.0 GCC 10.0.0 20191117 1500 3000 4500 6000 7500 SE +/- 54.12, N = 3 SE +/- 59.89, N = 3 6602.28 6946.85 1. (CXX) g++ options: -lpthread
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Degridding GCC 9.2.0 GCC 10.0.0 20191117 1400 2800 4200 5600 7000 SE +/- 33.47, N = 3 SE +/- 0.00, N = 3 5187.24 6656.40 1. (CXX) g++ options: -lpthread
GROMACS The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2019.4 Water Benchmark GCC 9.2.0 GCC 10.0.0 20191117 0.9189 1.8378 2.7567 3.6756 4.5945 SE +/- 0.001, N = 3 SE +/- 0.006, N = 3 4.084 4.075 1. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 130K 260K 390K 520K 650K SE +/- 986.51, N = 3 SE +/- 467.85, N = 3 SE +/- 1073.83, N = 3 589144.11 593115.57 596205.49 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 10K 20K 30K 40K 50K SE +/- 43.17, N = 3 SE +/- 19.13, N = 3 SE +/- 21.66, N = 3 48273.12 48304.47 48421.31 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 10 20 30 40 50 SE +/- 0.04, N = 3 SE +/- 0.77, N = 15 SE +/- 0.01, N = 3 40.55 42.10 38.66 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 80 160 240 320 400 SE +/- 0.78, N = 3 SE +/- 0.22, N = 3 SE +/- 0.16, N = 3 354.48 346.36 341.74 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 400 800 1200 1600 2000 SE +/- 0.19, N = 3 SE +/- 0.03, N = 3 SE +/- 0.07, N = 3 1417.04 1448.15 1706.42 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 102.22 100.82 88.54 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 19.70 18.39 19.47 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.39, N = 6 37.75 37.59 34.84 1. (CXX) g++ options: -O3 -march=native -std=c++11
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: LPOP GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 500K 1000K 1500K 2000K 2500K SE +/- 38584.75, N = 15 SE +/- 13200.64, N = 3 SE +/- 34458.33, N = 15 2293158.05 2237292.80 2155677.58 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SADD GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 400K 800K 1200K 1600K 2000K SE +/- 28492.65, N = 15 SE +/- 24277.91, N = 15 SE +/- 33951.29, N = 15 1703114.95 1694281.30 1645122.77 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: LPUSH GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 300K 600K 900K 1200K 1500K SE +/- 11571.16, N = 3 SE +/- 18234.45, N = 15 SE +/- 18364.34, N = 15 1287761.00 1387280.44 1372099.01 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: GET GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 500K 1000K 1500K 2000K 2500K SE +/- 32908.04, N = 4 SE +/- 38824.38, N = 15 SE +/- 22857.27, N = 3 2216263.44 2029652.62 2158757.83 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SET GCC 9.2.0 GCC 10.0.0 20191117 LLVM Clang 10 Git 300K 600K 900K 1200K 1500K SE +/- 22520.49, N = 15 SE +/- 24405.72, N = 12 SE +/- 25369.83, N = 3 1465273.92 1494027.97 1494870.67 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
GCC 9.2.0 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 9.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 18 November 2019 18:10 by user phoronix.
GCC 10.0.0 20191117 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 10.0.0 20191117, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 19 November 2019 07:26 by user phoronix.
LLVM Clang 10 Git Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: Clang 10.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2Processor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 19 November 2019 16:13 by user phoronix.