AMD EPYC 7642 compiler testing by Michael Larabel for a future article.
GCC 9.2.0 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 9.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
GCC 10.0.0 20191117 OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 10.0.0 20191117, File-System: ext4, Screen Resolution: 1024x768
LLVM Clang 10 Git OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: Clang 10.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2Processor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 1.63, N = 3 94.94 102.20 103.99 -mabm -mabm 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm
Timed HMMer Search This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 2 4 6 8 10 SE +/- 0.085, N = 3 SE +/- 0.035, N = 3 SE +/- 0.052, N = 3 5.871 8.087 7.560 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 4K LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 16.26 18.28 18.37 1. (CXX) g++ options: -O3 -march=native -lpthread
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Summer Nature 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 13 26 39 52 65 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 50.76 56.38 55.72 1. (CXX) g++ options: -O3 -march=native -lpthread
OpenBenchmarking.org FPS, More Is Better libgav1 2019-10-05 Video Input: Chimera 1080p 10-bit LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 16.91 16.65 16.72 1. (CXX) g++ options: -O3 -march=native -lpthread
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 600 1200 1800 2400 3000 SE +/- 4.31, N = 3 SE +/- 9.84, N = 3 SE +/- 3.13, N = 3 2800.49 2754.17 2774.05 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 130 260 390 520 650 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 602.82 594.76 594.61 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 50 100 150 200 250 SE +/- 0.42, N = 3 SE +/- 0.29, N = 3 SE +/- 0.17, N = 3 218.85 197.88 196.08 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 700 1400 2100 2800 3500 SE +/- 3.59, N = 3 SE +/- 7.23, N = 3 SE +/- 9.11, N = 3 3341.43 2749.81 2755.87 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 2K 4K 6K 8K 10K SE +/- 22.64, N = 3 SE +/- 44.47, N = 3 SE +/- 6.90, N = 3 8231.47 8484.97 8575.57 1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 400 800 1200 1600 2000 SE +/- 0.06, N = 3 SE +/- 0.26, N = 3 SE +/- 0.22, N = 3 1607.88 1743.45 1748.11 1. (CC) gcc options: -O3 -march=native -lm
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 200K 400K 600K 800K 1000K SE +/- 467.20, N = 5 SE +/- 586.45, N = 5 SE +/- 725.51, N = 5 1133755 1010602 1033744 1. (CC) gcc options: -O3 -march=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 400 800 1200 1600 2000 SE +/- 1.53, N = 3 SE +/- 3.18, N = 3 45 1624 1675 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 110 220 330 440 550 SE +/- 0.67, N = 3 SE +/- 2.96, N = 3 516 481 505 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 120 240 360 480 600 13 572 573 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 21 786 865 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 500 1000 1500 2000 2500 SE +/- 30.90, N = 3 SE +/- 27.63, N = 5 113 1873 2138 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 140 280 420 560 700 SE +/- 1.86, N = 3 25 634 651 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 300 600 900 1200 1500 SE +/- 1.53, N = 3 SE +/- 4.04, N = 3 204 1070 1170 -fopenmp -fopenmp 1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p LLVM Clang 10 Git GCC 9.2.0 120 240 360 480 600 SE +/- 2.02, N = 3 SE +/- 0.79, N = 3 575.08 546.36 MIN: 353.68 / MAX: 718.09 MIN: 345.34 / MAX: 673.27 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 4K LLVM Clang 10 Git GCC 9.2.0 60 120 180 240 300 SE +/- 0.20, N = 3 SE +/- 0.54, N = 3 279.11 273.96 MIN: 127.8 / MAX: 303.17 MIN: 129.55 / MAX: 296.84 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Summer Nature 1080p LLVM Clang 10 Git GCC 9.2.0 130 260 390 520 650 SE +/- 1.70, N = 3 SE +/- 1.75, N = 3 595.62 588.19 MIN: 260.63 / MAX: 664.58 MIN: 266.11 / MAX: 655.78 1. (CC) gcc options: -O3 -march=native -pthread
OpenBenchmarking.org FPS, More Is Better dav1d 0.5.0 Video Input: Chimera 1080p 10-bit LLVM Clang 10 Git GCC 9.2.0 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.21, N = 3 75.81 102.19 MIN: 50.99 / MAX: 121.26 MIN: 67.61 / MAX: 169.03 1. (CC) gcc options: -O3 -march=native -pthread
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 0 - Input: 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 0.016 0.032 0.048 0.064 0.08 SE +/- 0.000, N = 9 SE +/- 0.000, N = 9 SE +/- 0.000, N = 6 0.071 0.067 0.065 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 4 - Input: 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 3 6 9 12 15 SE +/- 0.072, N = 3 SE +/- 0.022, N = 3 SE +/- 0.061, N = 3 10.400 9.942 9.810 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.7 Encoder Mode: Enc Mode 8 - Input: 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 20 40 60 80 100 SE +/- 0.17, N = 3 SE +/- 0.03, N = 3 SE +/- 0.31, N = 3 87.14 81.10 79.73 1. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: VMAF Optimized - Input: Bosphorus 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 80 160 240 320 400 SE +/- 1.42, N = 3 SE +/- 1.14, N = 3 SE +/- 1.11, N = 3 379.36 372.91 370.91 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 80 160 240 320 400 SE +/- 2.39, N = 3 SE +/- 1.93, N = 3 SE +/- 5.05, N = 3 382.36 377.46 375.84 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.1 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 70 140 210 280 350 SE +/- 4.64, N = 3 SE +/- 2.79, N = 3 SE +/- 1.87, N = 3 298.55 288.98 293.33 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.1 vpxenc VP9 1080p Video Encode LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 40 80 120 160 200 SE +/- 0.22, N = 3 SE +/- 0.59, N = 3 SE +/- 0.16, N = 3 160.44 155.17 155.85 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 30 60 90 120 150 SE +/- 0.22, N = 3 SE +/- 1.50, N = 3 SE +/- 0.96, N = 3 154.61 152.23 154.14 -mstack-alignment=64 1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.1.2 H.265 1080p Video Encoding LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 11 22 33 44 55 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.12, N = 3 48.92 48.36 48.76 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 400K 800K 1200K 1600K 2000K SE +/- 6972.36, N = 3 SE +/- 2856.66, N = 3 SE +/- 1399.84, N = 3 1283555.69 1664749.69 1589318.73 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 21.84 13.73 13.54 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 10.0.0 20191117 GCC 9.2.0 1.1531 2.3062 3.4593 4.6124 5.7655 SE +/- 0.281, N = 15 SE +/- 0.031, N = 3 5.125 2.755 1. (CXX) g++ options: -fopenmp -O3 -march=native
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 41.36 35.98 37.18 1. (CC) gcc options: -lm -O3 -march=native
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 22.41 22.20 22.36 1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 2 4 6 8 10 SE +/- 0.011, N = 3 SE +/- 0.006, N = 3 SE +/- 0.020, N = 3 8.667 8.580 8.546 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 LLVM Clang 10 Git GCC 9.2.0 3 6 9 12 15 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 10.404 9.019 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr 1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 2K 4K 6K 8K 10K SE +/- 3.88, N = 3 SE +/- 6.05, N = 3 SE +/- 2.79, N = 3 7550.7 10290.6 10301.1 -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve MT - Degridding GCC 10.0.0 20191117 GCC 9.2.0 1300 2600 3900 5200 6500 SE +/- 1.85, N = 3 SE +/- 3.14, N = 3 5962.81 5892.25 1. (CXX) g++ options: -lpthread
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Gridding GCC 10.0.0 20191117 GCC 9.2.0 1500 3000 4500 6000 7500 SE +/- 59.89, N = 3 SE +/- 54.12, N = 3 6946.85 6602.28 1. (CXX) g++ options: -lpthread
OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 2018-11-10 Test: tConvolve OpenMP - Degridding GCC 10.0.0 20191117 GCC 9.2.0 1400 2800 4200 5600 7000 SE +/- 0.00, N = 3 SE +/- 33.47, N = 3 6656.40 5187.24 1. (CXX) g++ options: -lpthread
GROMACS The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2019.4 Water Benchmark GCC 10.0.0 20191117 GCC 9.2.0 0.9189 1.8378 2.7567 3.6756 4.5945 SE +/- 0.006, N = 3 SE +/- 0.001, N = 3 4.075 4.084 1. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 130K 260K 390K 520K 650K SE +/- 1073.83, N = 3 SE +/- 467.85, N = 3 SE +/- 986.51, N = 3 596205.49 593115.57 589144.11 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 12.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 10K 20K 30K 40K 50K SE +/- 21.66, N = 3 SE +/- 19.13, N = 3 SE +/- 43.17, N = 3 48421.31 48304.47 48273.12 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.77, N = 15 SE +/- 0.04, N = 3 38.66 42.10 40.55 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 80 160 240 320 400 SE +/- 0.16, N = 3 SE +/- 0.22, N = 3 SE +/- 0.78, N = 3 341.74 346.36 354.48 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 400 800 1200 1600 2000 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.19, N = 3 1706.42 1448.15 1417.04 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 88.54 100.82 102.22 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 19.47 18.39 19.70 1. (CXX) g++ options: -O3 -march=native -std=c++11
OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 9 18 27 36 45 SE +/- 0.39, N = 6 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 34.84 37.59 37.75 1. (CXX) g++ options: -O3 -march=native -std=c++11
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: LPOP LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 500K 1000K 1500K 2000K 2500K SE +/- 34458.33, N = 15 SE +/- 13200.64, N = 3 SE +/- 38584.75, N = 15 2155677.58 2237292.80 2293158.05 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SADD LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 400K 800K 1200K 1600K 2000K SE +/- 33951.29, N = 15 SE +/- 24277.91, N = 15 SE +/- 28492.65, N = 15 1645122.77 1694281.30 1703114.95 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: LPUSH LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 300K 600K 900K 1200K 1500K SE +/- 18364.34, N = 15 SE +/- 18234.45, N = 15 SE +/- 11571.16, N = 3 1372099.01 1387280.44 1287761.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: GET LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 500K 1000K 1500K 2000K 2500K SE +/- 22857.27, N = 3 SE +/- 38824.38, N = 15 SE +/- 32908.04, N = 4 2158757.83 2029652.62 2216263.44 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
OpenBenchmarking.org Requests Per Second, More Is Better Redis 5.0.5 Test: SET LLVM Clang 10 Git GCC 10.0.0 20191117 GCC 9.2.0 300K 600K 900K 1200K 1500K SE +/- 25369.83, N = 3 SE +/- 24405.72, N = 12 SE +/- 22520.49, N = 15 1494870.67 1494027.97 1465273.92 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native
GCC 9.2.0 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 9.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 18 November 2019 18:10 by user phoronix.
GCC 10.0.0 20191117 Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: GCC 10.0.0 20191117, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: --disable-multilib --enable-checking=releaseProcessor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 19 November 2019 07:26 by user phoronix.
LLVM Clang 10 Git Processor: AMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads), Motherboard: ASRockRack EPYCD8 (P2.10 BIOS), Chipset: AMD Starship/Matisse, Memory: 129024MB, Disk: 280GB INTEL SSDPED1D280GA, Graphics: llvmpipe 126GB, Audio: AMD Starship/Matisse, Network: 2 x Intel I350
OS: Ubuntu 19.10, Kernel: 5.3.0-050300-generic (x86_64), Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, Compiler: Clang 10.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Notes: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2Processor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Notes: Python 2.7.17rc1 + Python 3.7.5rc1Security Notes: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
Testing initiated at 19 November 2019 16:13 by user phoronix.