EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 Benchmarking

AMD EPYC 7642 compiler testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1911208-HU-EPYC7642C55
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Audio Encoding 2 Tests
AV1 3 Tests
Bioinformatics 4 Tests
Timed Code Compilation 2 Tests
C/C++ Compiler Tests 28 Tests
Compression Tests 2 Tests
CPU Massive 25 Tests
Creator Workloads 13 Tests
Cryptography 2 Tests
Database Test Suite 3 Tests
Encoding 9 Tests
HPC - High Performance Computing 9 Tests
Common Kernel Benchmarks 3 Tests
Molecular Dynamics 2 Tests
MPI Benchmarks 4 Tests
Multi-Core 21 Tests
OpenMPI Tests 3 Tests
Programmer / Developer System Benchmarks 5 Tests
Renderers 3 Tests
Scientific Computing 8 Tests
Server 4 Tests
Server CPU Tests 13 Tests
Single-Threaded 5 Tests
Video Encoding 7 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 9.2.0
November 18 2019
  5 Hours, 30 Minutes
GCC 10.0.0 20191117
November 19 2019
  5 Hours, 28 Minutes
LLVM Clang 10 Git
November 19 2019
  5 Hours, 23 Minutes
Invert Hiding All Results Option
  5 Hours, 27 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 BenchmarkingOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 7642 48-Core @ 2.30GHz (48 Cores / 96 Threads)ASRockRack EPYCD8 (P2.10 BIOS)AMD Starship/Matisse129024MB280GB INTEL SSDPED1D280GAllvmpipe 126GBAMD Starship/Matisse2 x Intel I350Ubuntu 19.105.3.0-050300-generic (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.5GCC 9.2.0GCC 10.0.0 20191117Clang 10.0.0ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilersFile-SystemScreen ResolutionEPYC 7642 Compiler GCC 10 Vs. LLVM Clang 10 Benchmarking PerformanceSystem Logs- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 9.2.0: --disable-multilib --enable-checking=release- GCC 10.0.0 20191117: --disable-multilib --enable-checking=release- LLVM Clang 10 Git: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101c- Python 2.7.17rc1 + Python 3.7.5rc1- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling

GCC 9.2.0GCC 10.0.0 20191117LLVM Clang 10 GitLogarithmic Result OverviewPhoronix Test SuiteJohn The RipperGraphicsMagickC-RayTimed Linux Kernel CompilationHimeno BenchmarkTimed HMMer SearchOpenSSLCoremarkTimed LLVM CompilationAOBenchFLAC Audio EncodingTSCPTimed MrBayes Analysislibgav1SVT-AV1Timed MAFFT AlignmentSQLite SpeedtestSciMarkVP9 libvpx EncodingSVT-VP9CppPerformanceBenchmarksx264Zstd Compressionx265XZ CompressionRedisPostgreSQL pgbench

EPYC 7642 Compiler GCC 10 vs. LLVM Clang 10 Benchmarkingcpp-perf-bench: Rand Numberssvt-av1: Enc Mode 0 - 1080plibgav1: Chimera 1080p 10-bitfftw: Float + SSE - 2D FFT Size 4096cpp-perf-bench: Math Libraryhimeno: Poisson Pressure Solverlibgav1: Chimera 1080plibgav1: Summer Nature 4Kdav1d: Chimera 1080p 10-bitbuild-linux-kernel: Time To Compilepgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writeaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingmrbayes: Primate Phylogeny Analysisvpxenc: vpxenc VP9 1080p Video Encodecpp-perf-bench: Stepanov Vectorcpp-perf-bench: Ctypesqlite-speedtest: Timed Time - Size 1,000cpp-perf-bench: Atolgraphics-magick: Resizinglibgav1: Summer Nature 1080pgraphics-magick: Noise-Gaussiangraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Swirlgraphics-magick: HWB Color Spacegraphics-magick: Rotateredis: SADDgromacs: Water Benchmarkbuild-llvm: Time To Compilecpp-perf-bench: Stepanov Abstractionredis: LPUSHredis: LPOPaobench: 2048 x 2048 - Total Timeredis: SETjohn-the-ripper: Blowfishscimark2: Compositecoremark: CoreMark Size 666 - Iterations Per Secondredis: GETcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9openssl: RSA 4096-bit Performancecpp-perf-bench: Function Objectsdav1d: Chimera 1080pminife: Smallencode-flac: WAV To FLACc-ray: Total Time - 4K, 16 Rays Per Pixelmt-dgemm: Sustained Floating-Point Rateaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingsmallpt: Global Illumination Renderer; 128 Samplesdav1d: Summer Nature 4Kx265: H.265 1080p Video Encodingsvt-av1: Enc Mode 4 - 1080pxsbench: encode-mp3: WAV To MP3compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19hmmer: Pfam Database Searchsvt-av1: Enc Mode 8 - 1080pdav1d: Summer Nature 1080pmafft: Multiple Sequence Alignmentx264: H.264 Video Encodingsvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ptscp: AI Chess Performancescimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte CarloGCC 9.2.0GCC 10.0.0 20191117LLVM Clang 10 Git1417.0400.06516.7216872354.4792487.75902139.4018.37102.1930.327589144.10705948273.1229155892.254387.84103.989155.85102.21740.55074.57375.90213855.72651573865167511705051703114.954.084145.49537.7491287761.02293158.0537.1791465273.92609052774.051589318.7267332216263.4422.35510301.119.701546.3619425.510.65313.53714.0628145187.246602.282.755273.9648.769.81064548459.0198.5467.56079.730588.192.234154.14293.33370.91375.8410337441748.118575.572755.87196.08594.611448.1470.06716.6517427346.3642844.33963939.2118.2834.551593115.57204348304.4659535962.814410.55102.197155.17100.81842.10475.87675.552187356.38634572786162410704811694281.304.075162.78337.5871387280.442237292.835.9781494027.97613612754.171664749.6905632029652.6222.19810290.618.39319314.910.76813.73413.8560286656.46946.855.12548.369.94264744838.5808.08781.0952.111152.23288.98372.91377.4610106021743.458484.972749.81197.88594.761706.4210.07116.91341.7443483.85535635.2016.2675.8142.671596205.48807348421.31126794.935160.4488.53638.66478.48276.19911350.76251321452045161645122.77131.3334.8381372099.012155677.5841.3571494870.6714682800.491283555.6892962158757.8322.4137550.719.470575.089.49821.836279.1148.9210.40010.4048.6675.87187.139595.622.196154.61298.55379.36382.3611337551607.888231.473341.43218.85602.82OpenBenchmarking.org

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117400800120016002000SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.03, N = 31706.421417.041448.151. (CXX) g++ options: -O3 -march=native -std=c++11

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 0 - Input: 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911170.0160.0320.0480.0640.08SE +/- 0.000, N = 9SE +/- 0.000, N = 6SE +/- 0.000, N = 90.0710.0650.0671. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080p 10-bitLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111748121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 316.9116.7216.651. (CXX) g++ options: -O3 -march=native -lpthread

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096GCC 9.2.0GCC 10.0.0 201911174K8K12K16K20KSE +/- 28.26, N = 3SE +/- 113.29, N = 316872174271. (CC) gcc options: -pthread -O3 -march=native -lm

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111780160240320400SE +/- 0.16, N = 3SE +/- 0.78, N = 3SE +/- 0.22, N = 3341.74354.48346.361. (CXX) g++ options: -O3 -march=native -std=c++11

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911177001400210028003500SE +/- 45.52, N = 15SE +/- 66.64, N = 15SE +/- 4.44, N = 33483.862487.762844.341. (CC) gcc options: -O3 -march=native -mavx2

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117918273645SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 335.2039.4039.211. (CXX) g++ options: -O3 -march=native -lpthread

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 4KLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 316.2618.3718.281. (CXX) g++ options: -O3 -march=native -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080p 10-bitLLVM Clang 10 GitGCC 9.2.020406080100SE +/- 0.06, N = 3SE +/- 0.21, N = 375.81102.19MIN: 50.99 / MAX: 121.26MIN: 67.61 / MAX: 169.031. (CC) gcc options: -O3 -march=native -pthread

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911171020304050SE +/- 0.32, N = 14SE +/- 0.24, N = 13SE +/- 0.35, N = 842.6730.3334.55

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117130K260K390K520K650KSE +/- 1073.83, N = 3SE +/- 986.51, N = 3SE +/- 467.85, N = 3596205.49589144.11593115.571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111710K20K30K40K50KSE +/- 21.66, N = 3SE +/- 43.17, N = 3SE +/- 19.13, N = 348421.3148273.1248304.471. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - DegriddingGCC 9.2.0GCC 10.0.0 2019111713002600390052006500SE +/- 3.14, N = 3SE +/- 1.85, N = 35892.255962.811. (CXX) g++ options: -lpthread

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - GriddingGCC 9.2.0GCC 10.0.0 201911179001800270036004500SE +/- 4.79, N = 3SE +/- 2.82, N = 34387.844410.551. (CXX) g++ options: -lpthread

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111720406080100SE +/- 0.14, N = 3SE +/- 1.63, N = 3SE +/- 0.11, N = 394.94103.99102.20-mabm-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.1vpxenc VP9 1080p Video EncodeLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911174080120160200SE +/- 0.22, N = 3SE +/- 0.16, N = 3SE +/- 0.59, N = 3160.44155.85155.171. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111720406080100SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 388.54102.22100.821. (CXX) g++ options: -O3 -march=native -std=c++11

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911171020304050SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.77, N = 1538.6640.5542.101. (CXX) g++ options: -O3 -march=native -std=c++11

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000LLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111720406080100SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 378.4874.5775.881. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111720406080100SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.22, N = 376.2075.9075.551. (CXX) g++ options: -O3 -march=native -std=c++11

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911175001000150020002500SE +/- 27.63, N = 5SE +/- 30.90, N = 311321381873-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911171326395265SE +/- 0.15, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 350.7655.7256.381. (CXX) g++ options: -O3 -march=native -lpthread

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117140280420560700SE +/- 1.86, N = 325651634-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111712024036048060013573572-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911172004006008001000SE +/- 0.67, N = 3SE +/- 0.33, N = 321865786-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117400800120016002000SE +/- 3.18, N = 3SE +/- 1.53, N = 34516751624-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111730060090012001500SE +/- 4.04, N = 3SE +/- 1.53, N = 320411701070-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117110220330440550SE +/- 2.96, N = 3SE +/- 0.67, N = 3516505481-fopenmp-fopenmp1. (CC) gcc options: -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SADDLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117400K800K1200K1600K2000KSE +/- 33951.29, N = 15SE +/- 28492.65, N = 15SE +/- 24277.91, N = 151645122.771703114.951694281.301. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

GROMACS

The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2019.4Water BenchmarkGCC 9.2.0GCC 10.0.0 201911170.91891.83782.75673.67564.5945SE +/- 0.001, N = 3SE +/- 0.006, N = 34.0844.0751. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911174080120160200131.33145.50162.78

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117918273645SE +/- 0.39, N = 6SE +/- 0.01, N = 3SE +/- 0.01, N = 334.8437.7537.591. (CXX) g++ options: -O3 -march=native -std=c++11

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPUSHLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117300K600K900K1200K1500KSE +/- 18364.34, N = 15SE +/- 11571.16, N = 3SE +/- 18234.45, N = 151372099.011287761.001387280.441. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPOPLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117500K1000K1500K2000K2500KSE +/- 34458.33, N = 15SE +/- 38584.75, N = 15SE +/- 13200.64, N = 32155677.582293158.052237292.801. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117918273645SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 341.3637.1835.981. (CC) gcc options: -lm -O3 -march=native

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SETLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117300K600K900K1200K1500KSE +/- 25369.83, N = 3SE +/- 22520.49, N = 15SE +/- 24405.72, N = 121494870.671465273.921494027.971. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111713K26K39K52K65KSE +/- 11.24, N = 3SE +/- 27.14, N = 314686090561361-fopenmp-fopenmp1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911176001200180024003000SE +/- 4.31, N = 3SE +/- 3.13, N = 3SE +/- 9.84, N = 32800.492774.052754.171. (CC) gcc options: -O3 -march=native -lm

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117400K800K1200K1600K2000KSE +/- 6972.36, N = 3SE +/- 1399.84, N = 3SE +/- 2856.66, N = 31283555.691589318.731664749.691. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: GETLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117500K1000K1500K2000K2500KSE +/- 22857.27, N = 3SE +/- 32908.04, N = 4SE +/- 38824.38, N = 152158757.832216263.442029652.621. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9LLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117510152025SE +/- 0.16, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 322.4122.3622.201. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911172K4K6K8K10KSE +/- 3.88, N = 3SE +/- 2.79, N = 3SE +/- 6.05, N = 37550.710301.110290.6-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 319.4719.7018.391. (CXX) g++ options: -O3 -march=native -std=c++11

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080pLLVM Clang 10 GitGCC 9.2.0120240360480600SE +/- 2.02, N = 3SE +/- 0.79, N = 3575.08546.36MIN: 353.68 / MAX: 718.09MIN: 345.34 / MAX: 673.271. (CC) gcc options: -O3 -march=native -pthread

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGCC 9.2.0GCC 10.0.0 201911174K8K12K16K20KSE +/- 2.61, N = 3SE +/- 7.36, N = 319425.519314.91. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911173691215SE +/- 0.009, N = 5SE +/- 0.007, N = 5SE +/- 0.008, N = 59.49810.65310.768-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117510152025SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 321.8413.5413.731. (CC) gcc options: -lm -lpthread -O3 -march=native

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateGCC 9.2.0GCC 10.0.0 2019111748121620SE +/- 0.19, N = 3SE +/- 0.09, N = 314.0613.861. (CC) gcc options: -O3 -march=native -fopenmp

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - DegriddingGCC 9.2.0GCC 10.0.0 2019111714002800420056007000SE +/- 33.47, N = 3SE +/- 0.00, N = 35187.246656.401. (CXX) g++ options: -lpthread

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - GriddingGCC 9.2.0GCC 10.0.0 2019111715003000450060007500SE +/- 54.12, N = 3SE +/- 59.89, N = 36602.286946.851. (CXX) g++ options: -lpthread

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 9.2.0GCC 10.0.0 201911171.15312.30623.45934.61245.7655SE +/- 0.031, N = 3SE +/- 0.281, N = 152.7555.1251. (CXX) g++ options: -fopenmp -O3 -march=native

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 4KLLVM Clang 10 GitGCC 9.2.060120180240300SE +/- 0.20, N = 3SE +/- 0.54, N = 3279.11273.96MIN: 127.8 / MAX: 303.17MIN: 129.55 / MAX: 296.841. (CC) gcc options: -O3 -march=native -pthread

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.1.2H.265 1080p Video EncodingLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911171122334455SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 348.9248.7648.361. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 4 - Input: 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911173691215SE +/- 0.072, N = 3SE +/- 0.061, N = 3SE +/- 0.022, N = 310.4009.8109.9421. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

Xsbench

XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06GCC 9.2.0GCC 10.0.0 201911171.4M2.8M4.2M5.6M7MSE +/- 3588.47, N = 3SE +/- 1920.71, N = 3645484564744831. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3LLVM Clang 10 GitGCC 9.2.03691215SE +/- 0.003, N = 3SE +/- 0.001, N = 310.4049.019-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19LLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117246810SE +/- 0.011, N = 3SE +/- 0.020, N = 3SE +/- 0.006, N = 38.6678.5468.5801. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117246810SE +/- 0.085, N = 3SE +/- 0.052, N = 3SE +/- 0.035, N = 35.8717.5608.0871. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.7Encoder Mode: Enc Mode 8 - Input: 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111720406080100SE +/- 0.17, N = 3SE +/- 0.31, N = 3SE +/- 0.03, N = 387.1479.7381.101. (CXX) g++ options: -O3 -march=native -fPIE -fPIC -pie

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 1080pLLVM Clang 10 GitGCC 9.2.0130260390520650SE +/- 1.70, N = 3SE +/- 1.75, N = 3595.62588.19MIN: 260.63 / MAX: 664.58MIN: 266.11 / MAX: 655.781. (CC) gcc options: -O3 -march=native -pthread

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence AlignmentLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911170.50271.00541.50812.01082.5135SE +/- 0.032, N = 2SE +/- 0.006, N = 2SE +/- 0.030, N = 152.1962.2342.1111. (CC) gcc options: -std=c99 -O3 -lm -lpthread

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117306090120150SE +/- 0.22, N = 3SE +/- 0.96, N = 3SE +/- 1.50, N = 3154.61154.14152.23-mstack-alignment=641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111770140210280350SE +/- 4.64, N = 3SE +/- 1.87, N = 3SE +/- 2.79, N = 3298.55293.33288.981. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111780160240320400SE +/- 1.42, N = 3SE +/- 1.11, N = 3SE +/- 1.14, N = 3379.36370.91372.911. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111780160240320400SE +/- 2.39, N = 3SE +/- 5.05, N = 3SE +/- 1.93, N = 3382.36375.84377.461. (CC) gcc options: -O3 -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117200K400K600K800K1000KSE +/- 467.20, N = 5SE +/- 725.51, N = 5SE +/- 586.45, N = 51133755103374410106021. (CC) gcc options: -O3 -march=native

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117400800120016002000SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.26, N = 31607.881748.111743.451. (CC) gcc options: -O3 -march=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911172K4K6K8K10KSE +/- 22.64, N = 3SE +/- 6.90, N = 3SE +/- 44.47, N = 38231.478575.578484.971. (CC) gcc options: -O3 -march=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 201911177001400210028003500SE +/- 3.59, N = 3SE +/- 9.11, N = 3SE +/- 7.23, N = 33341.432755.872749.811. (CC) gcc options: -O3 -march=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 2019111750100150200250SE +/- 0.42, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 3218.85196.08197.881. (CC) gcc options: -O3 -march=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloLLVM Clang 10 GitGCC 9.2.0GCC 10.0.0 20191117130260390520650SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 3602.82594.61594.761. (CC) gcc options: -O3 -march=native -lm