EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 Benchmarking

AMD EPYC 7642 compiler testing by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1911208-HU-EPYC7642C55&sro&grs.

EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 BenchmarkingProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionGCC 9.2.0GCC 10.0.0 20191117LLVM Clang 10 GitAMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads)ASRockRack EPYCD8 (P2.10 BIOS)AMD Starship/Matisse129024MB280GB INTEL SSDPED1D280GAllvmpipe 126GBAMD Starship/Matisse2 x Intel I350Ubuntu 19.105.3.0-050300-generic (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.5GCC 9.2.0ext41024x768GCC 10.0.0 20191117Clang 10.0.0OpenBenchmarking.orgEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- GCC 9.2.0: --disable-multilib --enable-checking=release- GCC 10.0.0 20191117: --disable-multilib --enable-checking=release- LLVM Clang 10 Git: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2 Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101cPython Details- Python 2.7.17rc1 + Python 3.7.5rc1Security Details- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling

EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 Benchmarkinggraphics-magick: HWB Color Spacegraphics-magick: Sharpenjohn-the-ripper: Blowfishgraphics-magick: Enhancedgraphics-magick: Swirlgraphics-magick: Noise-Gaussiangraphics-magick: Resizingc-ray: Total Time - 4K, 16 Rays Per Pixelbuild-linux-kernel: Time To Compilehmmer: Pfam Database Searchopenssl: RSA 4096-bit Performancedav1d: Chimera 1080p 10-bitcoremark: CoreMark Size 666 - Iterations Per Secondaskap: tConvolve OpenMP - Degriddingbuild-llvm: Time To Compilescimark2: Sparse Matrix Multiplycpp-perf-bench: Rand Numberscpp-perf-bench: Stepanov Vectorencode-mp3: WAV To MP3aobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLAClibgav1: Summer Nature 4Ktscp: AI Chess Performancelibgav1: Chimera 1080pscimark2: Fast Fourier Transformlibgav1: Summer Nature 1080pmrbayes: Primate Phylogeny Analysissvt-av1: Enc Mode 8 - 1080psvt-av1: Enc Mode 0 - 1080pscimark2: Jacobi Successive Over-Relaxationcpp-perf-bench: Stepanov Abstractionredis: LPUSHgraphics-magick: Rotatecpp-perf-bench: Function Objectssvt-av1: Enc Mode 4 - 1080pmafft: Multiple Sequence Alignmentdav1d: Chimera 1080psqlite-speedtest: Timed Time - Size 1,000askap: tConvolve OpenMP - Griddingscimark2: Dense LU Matrix Factorizationcpp-perf-bench: Math Libraryvpxenc: vpxenc VP9 1080p Video Encodesvt-vp9: Visual Quality Optimized - Bosphorus 1080pfftw: Float + SSE - 2D FFT Size 4096svt-vp9: VMAF Optimized - Bosphorus 1080predis: SETdav1d: Summer Nature 4Ksvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pscimark2: Compositex264: H.264 Video Encodinglibgav1: Chimera 1080p 10-bitmt-dgemm: Sustained Floating-Point Ratecompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19scimark2: Monte Carlodav1d: Summer Nature 1080ppgbench: Buffer Test - Normal Load - Read Onlyaskap: tConvolve MT - Degriddingx265: H.265 1080p Video Encodingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9cpp-perf-bench: Atolminife: Smallaskap: tConvolve MT - Griddingpgbench: Buffer Test - Normal Load - Read Writexsbench: gromacs: Water Benchmarkredis: GETredis: SADDredis: LPOPcpp-perf-bench: Ctypesmallpt: Global Illumination Renderer; 128 Sampleshimeno: Poisson Pressure SolverGCC 9.2.0GCC 10.0.0 20191117LLVM Clang 10 Git1170573609058651675651213813.53730.3277.56010301.1102.191589318.7267335187.24145.4952755.871417.040102.2179.01937.17910.65318.37103374439.40196.0855.72103.98979.7300.0651748.1137.7491287761.050519.7019.8102.234546.3674.5736602.288575.57354.479155.85293.3316872370.911465273.92273.96375.842774.05154.1416.7214.0628148.546594.61588.19589144.1070595892.2548.7622.35575.9019425.54387.8448273.12291564548454.0842216263.441703114.952293158.0540.5502.7552487.7590211070572613617861624634187313.73434.5518.08710290.61664749.6905636656.4162.7832749.811448.147100.81835.97810.76818.28101060239.21197.8856.38102.19781.0950.0671743.4537.5871387280.4448118.3939.9422.11175.8766946.858484.97346.364155.17288.9817427372.911494027.97377.462754.17152.2316.6513.8560288.580594.76593115.5720435962.8148.3622.19875.55219314.94410.5548304.46595364744834.0752029652.621694281.302237292.842.1045.1252844.33963920413146821452511321.83642.6715.8717550.775.811283555.689296131.333341.431706.42188.53610.40441.3579.49816.26113375535.20218.8550.7694.93587.1390.0711607.8834.8381372099.0151619.47010.4002.196575.0878.4828231.47341.744160.44298.55379.361494870.67279.11382.362800.49154.6116.918.667602.82595.62596205.48807348.9222.41376.19948421.3112672158757.831645122.772155677.5838.6643483.855356OpenBenchmarking.org

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git30060090012001500SE +/- 1.53, N = 3SE +/- 4.04, N = 310701170204-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git12024036048060057257313-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git13K26K39K52K65KSE +/- 27.14, N = 3SE +/- 11.24, N = 361361609051468-fopenmp-fopenmp1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git2004006008001000SE +/- 0.33, N = 3SE +/- 0.67, N = 378686521-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git400800120016002000SE +/- 1.53, N = 3SE +/- 3.18, N = 31624167545-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git140280420560700SE +/- 1.86, N = 363465125-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git5001000150020002500SE +/- 30.90, N = 3SE +/- 27.63, N = 518732138113-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 313.7313.5421.841. (CC) gcc options: -lm -lpthread -O3 -march=native

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git1020304050SE +/- 0.35, N = 8SE +/- 0.24, N = 13SE +/- 0.32, N = 1434.5530.3342.67

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git246810SE +/- 0.035, N = 3SE +/- 0.052, N = 3SE +/- 0.085, N = 38.0877.5605.8711. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git2K4K6K8K10KSE +/- 6.05, N = 3SE +/- 2.79, N = 3SE +/- 3.88, N = 310290.610301.17550.7-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080p 10-bitGCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.21, N = 3SE +/- 0.06, N = 3102.1975.81MIN: 67.61 / MAX: 169.03MIN: 50.99 / MAX: 121.261. (CC) gcc options: -O3 -march=native -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git400K800K1200K1600K2000KSE +/- 2856.66, N = 3SE +/- 1399.84, N = 3SE +/- 6972.36, N = 31664749.691589318.731283555.691. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - DegriddingGCC 10.0.0 20191117GCC 9.2.014002800420056007000SE +/- 0.00, N = 3SE +/- 33.47, N = 36656.405187.241. (CXX) g++ options: -lpthread

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git4080120160200162.78145.50131.33

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git7001400210028003500SE +/- 7.23, N = 3SE +/- 9.11, N = 3SE +/- 3.59, N = 32749.812755.873341.431. (CC) gcc options: -O3 -march=native -lm

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git400800120016002000SE +/- 0.03, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 31448.151417.041706.421. (CXX) g++ options: -O3 -march=native -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3100.82102.2288.541. (CXX) g++ options: -O3 -march=native -std=c++11

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.2.0LLVM Clang 10 Git3691215SE +/- 0.001, N = 3SE +/- 0.003, N = 39.01910.404-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git918273645SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 335.9837.1841.361. (CC) gcc options: -lm -O3 -march=native

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git3691215SE +/- 0.008, N = 5SE +/- 0.007, N = 5SE +/- 0.009, N = 510.76810.6539.498-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 4KGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 318.2818.3716.261. (CXX) g++ options: -O3 -march=native -lpthread

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git200K400K600K800K1000KSE +/- 586.45, N = 5SE +/- 725.51, N = 5SE +/- 467.20, N = 51010602103374411337551. (CC) gcc options: -O3 -march=native

libgav1

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git918273645SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 339.2139.4035.201. (CXX) g++ options: -O3 -march=native -lpthread

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git50100150200250SE +/- 0.29, N = 3SE +/- 0.17, N = 3SE +/- 0.42, N = 3197.88196.08218.851. (CC) gcc options: -O3 -march=native -lm

libgav1

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git1326395265SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 356.3855.7250.761. (CXX) g++ options: -O3 -march=native -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.11, N = 3SE +/- 1.63, N = 3SE +/- 0.14, N = 3102.20103.9994.94-mabm-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 8 - Input: 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.03, N = 3SE +/- 0.31, N = 3SE +/- 0.17, N = 381.1079.7387.141. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 0 - Input: 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git0.0160.0320.0480.0640.08SE +/- 0.000, N = 9SE +/- 0.000, N = 6SE +/- 0.000, N = 90.0670.0650.0711. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git400800120016002000SE +/- 0.26, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 31743.451748.111607.881. (CC) gcc options: -O3 -march=native -lm

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.39, N = 637.5937.7534.841. (CXX) g++ options: -O3 -march=native -std=c++11

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPUSHGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git300K600K900K1200K1500KSE +/- 18234.45, N = 15SE +/- 11571.16, N = 3SE +/- 18364.34, N = 151387280.441287761.001372099.011. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git110220330440550SE +/- 0.67, N = 3SE +/- 2.96, N = 3481505516-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 318.3919.7019.471. (CXX) g++ options: -O3 -march=native -std=c++11

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 4 - Input: 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git3691215SE +/- 0.022, N = 3SE +/- 0.061, N = 3SE +/- 0.072, N = 39.9429.81010.4001. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence AlignmentGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git0.50271.00541.50812.01082.5135SE +/- 0.030, N = 15SE +/- 0.006, N = 2SE +/- 0.032, N = 22.1112.2342.1961. (CC) gcc options: -std=c99 -O3 -lm -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080pGCC 9.2.0LLVM Clang 10 Git120240360480600SE +/- 0.79, N = 3SE +/- 2.02, N = 3546.36575.08MIN: 345.34 / MAX: 673.27MIN: 353.68 / MAX: 718.091. (CC) gcc options: -O3 -march=native -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000GCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 375.8874.5778.481. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - GriddingGCC 10.0.0 20191117GCC 9.2.015003000450060007500SE +/- 59.89, N = 3SE +/- 54.12, N = 36946.856602.281. (CXX) g++ options: -lpthread

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git2K4K6K8K10KSE +/- 44.47, N = 3SE +/- 6.90, N = 3SE +/- 22.64, N = 38484.978575.578231.471. (CC) gcc options: -O3 -march=native -lm

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git80160240320400SE +/- 0.22, N = 3SE +/- 0.78, N = 3SE +/- 0.16, N = 3346.36354.48341.741. (CXX) g++ options: -O3 -march=native -std=c++11

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.1vpxenc VP9 1080p Video EncodeGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git4080120160200SE +/- 0.59, N = 3SE +/- 0.16, N = 3SE +/- 0.22, N = 3155.17155.85160.441. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git70140210280350SE +/- 2.79, N = 3SE +/- 1.87, N = 3SE +/- 4.64, N = 3288.98293.33298.551. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096GCC 10.0.0 20191117GCC 9.2.04K8K12K16K20KSE +/- 113.29, N = 3SE +/- 28.26, N = 317427168721. (CC) gcc options: -pthread -O3 -march=native -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git80160240320400SE +/- 1.14, N = 3SE +/- 1.11, N = 3SE +/- 1.42, N = 3372.91370.91379.361. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SETGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git300K600K900K1200K1500KSE +/- 24405.72, N = 12SE +/- 22520.49, N = 15SE +/- 25369.83, N = 31494027.971465273.921494870.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 4KGCC 9.2.0LLVM Clang 10 Git60120180240300SE +/- 0.54, N = 3SE +/- 0.20, N = 3273.96279.11MIN: 129.55 / MAX: 296.84MIN: 127.8 / MAX: 303.171. (CC) gcc options: -O3 -march=native -pthread

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git80160240320400SE +/- 1.93, N = 3SE +/- 5.05, N = 3SE +/- 2.39, N = 3377.46375.84382.361. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git6001200180024003000SE +/- 9.84, N = 3SE +/- 3.13, N = 3SE +/- 4.31, N = 32754.172774.052800.491. (CC) gcc options: -O3 -march=native -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git306090120150SE +/- 1.50, N = 3SE +/- 0.96, N = 3SE +/- 0.22, N = 3152.23154.14154.61-mstack-alignment=641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080p 10-bitGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git48121620SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 316.6516.7216.911. (CXX) g++ options: -O3 -march=native -lpthread

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateGCC 10.0.0 20191117GCC 9.2.048121620SE +/- 0.09, N = 3SE +/- 0.19, N = 313.8614.061. (CC) gcc options: -O3 -march=native -fopenmp

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git246810SE +/- 0.006, N = 3SE +/- 0.020, N = 3SE +/- 0.011, N = 38.5808.5468.6671. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git130260390520650SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3594.76594.61602.821. (CC) gcc options: -O3 -march=native -lm

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 1080pGCC 9.2.0LLVM Clang 10 Git130260390520650SE +/- 1.75, N = 3SE +/- 1.70, N = 3588.19595.62MIN: 266.11 / MAX: 655.78MIN: 260.63 / MAX: 664.581. (CC) gcc options: -O3 -march=native -pthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git130K260K390K520K650KSE +/- 467.85, N = 3SE +/- 986.51, N = 3SE +/- 1073.83, N = 3593115.57589144.11596205.491. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - DegriddingGCC 10.0.0 20191117GCC 9.2.013002600390052006500SE +/- 1.85, N = 3SE +/- 3.14, N = 35962.815892.251. (CXX) g++ options: -lpthread

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.1.2H.265 1080p Video EncodingGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git1122334455SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 348.3648.7648.921. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git510152025SE +/- 0.13, N = 3SE +/- 0.16, N = 3SE +/- 0.16, N = 322.2022.3622.411. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git20406080100SE +/- 0.22, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 375.5575.9076.201. (CXX) g++ options: -O3 -march=native -std=c++11

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGCC 10.0.0 20191117GCC 9.2.04K8K12K16K20KSE +/- 7.36, N = 3SE +/- 2.61, N = 319314.919425.51. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - GriddingGCC 10.0.0 20191117GCC 9.2.09001800270036004500SE +/- 2.82, N = 3SE +/- 4.79, N = 34410.554387.841. (CXX) g++ options: -lpthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git10K20K30K40K50KSE +/- 19.13, N = 3SE +/- 43.17, N = 3SE +/- 21.66, N = 348304.4748273.1248421.311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Xsbench

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06GCC 10.0.0 20191117GCC 9.2.01.4M2.8M4.2M5.6M7MSE +/- 1920.71, N = 3SE +/- 3588.47, N = 3647448364548451. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2019.4Water BenchmarkGCC 10.0.0 20191117GCC 9.2.00.91891.83782.75673.67564.5945SE +/- 0.006, N = 3SE +/- 0.001, N = 34.0754.0841. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: GETGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git500K1000K1500K2000K2500KSE +/- 38824.38, N = 15SE +/- 32908.04, N = 4SE +/- 22857.27, N = 32029652.622216263.442158757.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SADDGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git400K800K1200K1600K2000KSE +/- 24277.91, N = 15SE +/- 28492.65, N = 15SE +/- 33951.29, N = 151694281.301703114.951645122.771. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPOPGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git500K1000K1500K2000K2500KSE +/- 13200.64, N = 3SE +/- 38584.75, N = 15SE +/- 34458.33, N = 152237292.802293158.052155677.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git1020304050SE +/- 0.77, N = 15SE +/- 0.04, N = 3SE +/- 0.01, N = 342.1040.5538.661. (CXX) g++ options: -O3 -march=native -std=c++11

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 10.0.0 20191117GCC 9.2.01.15312.30623.45934.61245.7655SE +/- 0.281, N = 15SE +/- 0.031, N = 35.1252.7551. (CXX) g++ options: -fopenmp -O3 -march=native

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 10.0.0 20191117GCC 9.2.0LLVM Clang 10 Git7001400210028003500SE +/- 4.44, N = 3SE +/- 66.64, N = 15SE +/- 45.52, N = 152844.342487.763483.861. (CC) gcc options: -O3 -march=native -mavx2


Phoronix Test Suite v10.8.5