Threadripper EOY2019 Clang vs. GCC

AMD Ryzen Threadripper 3960X GCC vs. LLVM Clang compiler benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1912235-PTS-THREADRI74

Runs in this result file:

Result Identifier | Date Run | Test Duration
GCC 10.0.0 20191208 | December 22 2019 | 5 Hours, 16 Minutes
LLVM Clang 10.0 20191222 | December 22 2019 | 3 Hours, 44 Minutes
GCC 9.2.1 | December 23 2019 | 4 Hours, 55 Minutes
LLVM Clang 9.0.0 | December 23 2019 | 4 Hours, 47 Minutes
Average test duration: 4 Hours, 41 Minutes

System Details

Processor: AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)
Motherboard: MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32768MB
Disk: 1000GB Sabrent Rocket 4.0 1TB
Graphics: Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)
Audio: AMD Baffin HDMI/DP
Monitor: ASUS VP28U
Network: Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723
OS: Ubuntu 19.10
Kernel: 5.4.0-nvme-hwmon (x86_64)
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 4.5 Mesa 19.2.1 (LLVM 9.0.0)
Compilers: GCC 10.0.0 20191208, Clang 10.0.0, GCC 9.2.1 20191008, Clang 9.0.0
File-System: ext4
Screen Resolution: 3840x2160

System Logs

- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- GCC 10.0.0 20191208: --disable-multilib --enable-checking=release
- LLVM Clang 10.0 20191222: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2
- GCC 9.2.1: --disable-multilib --enable-checking=release
- NONE / errors=remount-ro,relatime,rw
- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025
- Python 2.7.17rc1 + Python 3.7.5
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected

Logarithmic Result Overview (chart)
Runs compared: GCC 10.0.0 20191208, LLVM Clang 10.0 20191222, GCC 9.2.1, LLVM Clang 9.0.0
Tests shown: John The Ripper, Timed PHP Compilation, C-Ray, Timed ImageMagick Compilation, PostgreSQL pgbench, OpenSSL, LAME MP3 Encoding, AOBench, TSCP, FLAC Audio Encoding, libgav1, Timed MrBayes Analysis, Himeno Benchmark, AOM AV1, SQLite Speedtest, dav1d, VP9 libvpx Encoding, x265, Zstd Compression, Tungsten Renderer, CppPerformanceBenchmarks, XZ Compression, NGINX Benchmark, x264, Apache Benchmark, SQLite

Result table: Threadripper EOY2019 Clang vs. GCC
Columns: GCC 10.0.0 20191208, LLVM Clang 10.0 20191222, GCC 9.2.1, LLVM Clang 9.0.0
Rows (test: configuration):
john-the-ripper: Blowfish
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping
mt-dgemm: Sustained Floating-Point Rate
build-php: Time To Compile
c-ray: Total Time - 4K, 16 Rays Per Pixel
build-imagemagick: Time To Compile
openssl: RSA 4096-bit Performance
dav1d: Chimera 1080p 10-bit
mkl-dnn: Recurrent Neural Network Training - f32
parboil: OpenMP MRI Gridding
cpp-perf-bench: Rand Numbers
tungsten: Non-Exponential
qmcpack
fftw: Float + SSE - 2D FFT Size 4096
encode-mp3: WAV To MP3
cpp-perf-bench: Stepanov Vector
aobench: 2048 x 2048 - Total Time
tscp: AI Chess Performance
libgav1: Summer Nature 4K
encode-flac: WAV To FLAC
libgav1: Chimera 1080p
libgav1: Summer Nature 1080p
tungsten: Hair
mkl-dnn: IP Batch 1D - f32
cpp-perf-bench: Ctype
tungsten: Volumetric Caustic
libgav1: Chimera 1080p 10-bit
cpp-perf-bench: Stepanov Abstraction
askap: tConvolve OpenMP - Degridding
rocksdb: Rand Fill
mrbayes: Primate Phylogeny Analysis
himeno: Poisson Pressure Solver
fftw: Stock - 2D FFT Size 4096
aom-av1: AV1 Video Encoding
cpp-perf-bench: Function Objects
sqlite-speedtest: Timed Time - Size 1,000
dav1d: Chimera 1080p
tungsten: Water Caustic
vpxenc: vpxenc VP9 1080p Video Encode
stockfish: Total Time
rocksdb: Seq Fill
lczero: Rand
rodinia: OpenMP CFD Solver
mkl-dnn: Convolution Batch conv_alexnet - f32
askap: tConvolve OpenMP - Gridding
cpp-perf-bench: Math Library
x265: H.265 1080p Video Encoding
minife: Small
pgbench: Buffer Test - Normal Load - Read Only
compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19
dav1d: Summer Nature 1080p
dav1d: Summer Nature 4K
mkl-dnn: Convolution Batch conv_googlenet_v3 - f32
rocksdb: Rand Read
byte: Dhrystone 2
compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
nginx: Static Web Page Serving
x264: H.264 Video Encoding
cpp-perf-bench: Atol
n-queens: Elapsed Time
askap: tConvolve MT - Degridding
crafty: Elapsed Time
askap: tConvolve MT - Gridding
apache: Static Web Page Serving
rocksdb: Rand Fill Sync
sqlite: 1
parboil: OpenMP Stencil
rodinia: OpenMP LavaMD
rocksdb: Read While Writing
gromacs: Water Benchmark
parboil: OpenMP CUTCP
pgbench: Buffer Test - Normal Load - Read Write
smallpt: Global Illumination Renderer; 128 Samples
rodinia: OpenMP Streamcluster
lczero: BLAS

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1, Test: Blowfish (Real C/S, More Is Better)

LLVM Clang 9.0.0: 60586 (SE +/- 246.24, N = 3; Min: 60321 / Avg: 60586 / Max: 61078)
GCC 10.0.0 20191208: 41784 (SE +/- 170.74, N = 3; Min: 41501 / Avg: 41784 / Max: 42091)
GCC 9.2.1: 41482 (SE +/- 184.04, N = 3; Min: 41184 / Avg: 41481.67 / Max: 41818)
LLVM Clang 10.0 20191222: 1932 (SE +/- 3.28, N = 3; Min: 1927 / Avg: 1931.67 / Max: 1938)

Additional option on three of the four runs: -fopenmp
1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better)

GCC 10.0.0 20191208: 961.97 (SE +/- 1.33, N = 3; Min: 959.33 / Avg: 961.97 / Max: 963.48)
GCC 9.2.1: 935.12 (SE +/- 2.99, N = 3; Min: 930.15 / Avg: 935.12 / Max: 940.47)
LLVM Clang 9.0.0: 69.85 (SE +/- 0.09, N = 3; Min: 69.75 / Avg: 69.85 / Max: 70.03)

1. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)

GCC 10.0.0 20191208: 8.863183 (SE +/- 0.059593, N = 3; Min: 8.78 / Avg: 8.86 / Max: 8.98)
GCC 9.2.1: 8.385972 (SE +/- 0.037368, N = 3; Min: 8.31 / Avg: 8.39 / Max: 8.44)
LLVM Clang 9.0.0: 0.791403 (SE +/- 0.007958, N = 3; Min: 0.78 / Avg: 0.79 / Max: 0.81)

1. (CC) gcc options: -O3 -march=native -fopenmp

Timed PHP Compilation

This test times how long it takes to build PHP 7.1.9 with the Zend engine. Learn more via the OpenBenchmarking.org test page.

Timed PHP Compilation 7.1.9, Time To Compile (Seconds, Fewer Is Better)

GCC 9.2.1: 44.94 (SE +/- 0.08, N = 3; Min: 44.83 / Avg: 44.94 / Max: 45.1)
GCC 10.0.0 20191208: 50.24 (SE +/- 0.08, N = 3; Min: 50.09 / Avg: 50.24 / Max: 50.34)
LLVM Clang 9.0.0: 64.76 (SE +/- 0.24, N = 3; Min: 64.3 / Avg: 64.76 / Max: 65.11)
LLVM Clang 10.0 20191222: 78.18 (SE +/- 0.12, N = 3; Min: 77.96 / Avg: 78.18 / Max: 78.37)

1. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
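
To read the compile times above as ratios, each average can be divided by the fastest one. The following is a minimal Python sketch; `slowdown_vs_best` is a hypothetical helper written for this illustration (it is not part of the Phoronix Test Suite), and the averages are copied from the Timed PHP Compilation result.

```python
def slowdown_vs_best(results, higher_is_better=False):
    """Map each run to how many times slower it is than the best run (best == 1.0)."""
    if higher_is_better:
        best = max(results.values())
        return {run: best / value for run, value in results.items()}
    best = min(results.values())
    return {run: value / best for run, value in results.items()}


# Average seconds from the Timed PHP Compilation 7.1.9 result (fewer is better).
php_build_seconds = {
    "GCC 9.2.1": 44.94,
    "GCC 10.0.0 20191208": 50.24,
    "LLVM Clang 9.0.0": 64.76,
    "LLVM Clang 10.0 20191222": 78.18,
}

php_slowdown = slowdown_vs_best(php_build_seconds)

# Print the runs from fastest to slowest with their ratio to the fastest run.
for run, ratio in sorted(php_slowdown.items(), key=lambda item: item[1]):
    print(f"{run}: {ratio:.2f}x")
```

For results where more is better, such as the John The Ripper Blowfish test, the same helper can be called with `higher_is_better=True`.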

C-Ray

This is a test of C-Ray, a simple raytracer designed to test floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 16 rays per pixel for anti-aliasing, and will generate a 4K image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 19.13 (SE +/- 0.05, N = 3; Min: 19.03 / Avg: 19.13 / Max: 19.21)
GCC 9.2.1: 19.16 (SE +/- 0.05, N = 3; Min: 19.08 / Avg: 19.16 / Max: 19.25)
LLVM Clang 10.0 20191222: 30.84 (SE +/- 0.13, N = 3; Min: 30.59 / Avg: 30.84 / Max: 31.03)
LLVM Clang 9.0.0: 30.97 (SE +/- 0.11, N = 3; Min: 30.76 / Avg: 30.97 / Max: 31.12)

1. (CC) gcc options: -lm -lpthread -O3 -march=native

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

Timed ImageMagick Compilation 6.9.0, Time To Compile (Seconds, Fewer Is Better)

GCC 9.2.1: 13.51 (SE +/- 0.08, N = 3; Min: 13.42 / Avg: 13.51 / Max: 13.67)
LLVM Clang 9.0.0: 13.53 (SE +/- 0.07, N = 3; Min: 13.4 / Avg: 13.53 / Max: 13.6)
GCC 10.0.0 20191208: 15.06 (SE +/- 0.04, N = 3; Min: 15.02 / Avg: 15.06 / Max: 15.13)
LLVM Clang 10.0 20191222: 20.91 (SE +/- 0.08, N = 3; Min: 20.75 / Avg: 20.91 / Max: 21.01)
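
The SE +/- figures appear consistent with a standard error of the mean, assuming (the result file does not state this) that SE is the sample standard deviation divided by the square root of N. With N = 3, the middle sample can be recovered from the reported minimum, average, and maximum, so the GCC 9.2.1 figure can be checked approximately. The following is a minimal Python sketch under that assumption; the recovered sample is only as precise as the rounded average.

```python
import math
import statistics


def standard_error(samples):
    """Sample standard deviation divided by sqrt(N) (assumed definition of SE)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Reported values for GCC 9.2.1 in the Timed ImageMagick Compilation result.
n = 3
minimum, average, maximum = 13.42, 13.51, 13.67

# With three samples, the sum is n * average, so the remaining sample follows.
middle = n * average - minimum - maximum
samples = [minimum, middle, maximum]

se = standard_error(samples)
print(f"recovered middle sample: {middle:.2f}")
print(f"standard error: {se:.2f}")
```

This only works directly for N = 3; for runs with N = 5 or more, the individual samples cannot be recovered from the minimum, average, and maximum alone.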

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)

LLVM Clang 9.0.0: 7191.6 (SE +/- 29.40, N = 3; Min: 7150.1 / Avg: 7191.57 / Max: 7248.4)
GCC 9.2.1: 7178.8 (SE +/- 21.05, N = 3; Min: 7150.1 / Avg: 7178.77 / Max: 7219.8)
GCC 10.0.0 20191208: 7173.7 (SE +/- 20.75, N = 3; Min: 7144.2 / Avg: 7173.67 / Max: 7213.7)
LLVM Clang 10.0 20191222: 5189.1 (SE +/- 21.60, N = 3; Min: 5157 / Avg: 5189.13 / Max: 5230.2)

Additional option on two of the four runs: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.5.0, Video Input: Chimera 1080p 10-bit (FPS, More Is Better)

GCC 9.2.1: 99.72 (SE +/- 0.24, N = 3; Min: 99.26 / Avg: 99.72 / Max: 100.07; reported MIN: 60.51 / MAX: 199.48)
GCC 10.0.0 20191208: 94.47 (SE +/- 0.13, N = 3; Min: 94.27 / Avg: 94.47 / Max: 94.72; reported MIN: 56.43 / MAX: 192.99)
LLVM Clang 10.0 20191222: 74.45 (SE +/- 0.12, N = 3; Min: 74.22 / Avg: 74.45 / Max: 74.6; reported MIN: 45.96 / MAX: 154.32)
LLVM Clang 9.0.0: 73.36 (SE +/- 0.07, N = 3; Min: 73.23 / Avg: 73.36 / Max: 73.44; reported MIN: 45.63 / MAX: 149.13)

1. (CC) gcc options: -O3 -march=native -pthread

MKL-DNN DNNL

This is a test of Intel MKL-DNN (DNNL / Deep Neural Network Library), an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

MKL-DNN DNNL 1.1, Harness: Recurrent Neural Network Training - Data Type: f32 (ms, Fewer Is Better)

LLVM Clang 9.0.0: 145.85 (SE +/- 0.38, N = 3; Min: 145.22 / Avg: 145.85 / Max: 146.54; -fopenmp=libomp - MIN: 143.95)
GCC 10.0.0 20191208: 194.08 (SE +/- 0.39, N = 3; Min: 193.42 / Avg: 194.08 / Max: 194.78; -fopenmp - MIN: 192.29)
GCC 9.2.1: 194.77 (SE +/- 0.16, N = 3; Min: 194.45 / Avg: 194.77 / Max: 194.93; -fopenmp - MIN: 192.97)

1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5, Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)

GCC 9.2.1: 49.00 (SE +/- 0.15, N = 3; Min: 48.71 / Avg: 49 / Max: 49.22)
GCC 10.0.0 20191208: 62.94 (SE +/- 0.24, N = 3; Min: 62.47 / Avg: 62.94 / Max: 63.21)

1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Random Numbers (Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 1061.48 (SE +/- 10.04, N = 3; Min: 1051.14 / Avg: 1061.48 / Max: 1081.55)
GCC 9.2.1: 1069.48 (SE +/- 2.69, N = 3; Min: 1064.11 / Avg: 1069.48 / Max: 1072.4)
LLVM Clang 10.0 20191222: 1282.03 (SE +/- 12.43, N = 3; Min: 1257.17 / Avg: 1282.03 / Max: 1294.48)
LLVM Clang 9.0.0: 1296.98 (SE +/- 0.16, N = 3; Min: 1296.66 / Avg: 1296.98 / Max: 1297.18)

1. (CXX) g++ options: -O3 -march=native -std=c++11

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Non-Exponential (Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 3.36927 (SE +/- 0.01371, N = 3; Min: 3.35 / Avg: 3.37 / Max: 3.4)
GCC 9.2.1: 3.37419 (SE +/- 0.01280, N = 3; Min: 3.36 / Avg: 3.37 / Max: 3.4)
LLVM Clang 10.0 20191222: 4.05176 (SE +/- 0.01369, N = 3; Min: 4.03 / Avg: 4.05 / Max: 4.08)
LLVM Clang 9.0.0: 4.07043 (SE +/- 0.00501, N = 3; Min: 4.06 / Avg: 4.07 / Max: 4.08)

Additional option on two of the four runs: -fstrict-aliasing
1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.8 (Total Execution Time - Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 1878.1
GCC 9.2.1: 1893.6
LLVM Clang 9.0.0: 2262.8

Additional options on two of the three runs: -finline-limit=1000 -funroll-all-loops
1. (CXX) g++ options: -O3 -march=native -fopenmp -fomit-frame-pointer -fstrict-aliasing -ffast-math -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)

GCC 9.2.1: 24184 (SE +/- 309.06, N = 4; Min: 23323 / Avg: 24183.5 / Max: 24767)
LLVM Clang 9.0.0: 23559 (SE +/- 404.65, N = 3; Min: 22817 / Avg: 23558.67 / Max: 24210)
GCC 10.0.0 20191208: 20287 (SE +/- 239.62, N = 3; Min: 19909 / Avg: 20286.67 / Max: 20731)

1. (CC) gcc options: -pthread -O3 -march=native -lm

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 6.701 (SE +/- 0.007, N = 3; Min: 6.69 / Avg: 6.7 / Max: 6.72)
GCC 9.2.1: 6.729 (SE +/- 0.012, N = 3; Min: 6.71 / Avg: 6.73 / Max: 6.75)
LLVM Clang 10.0 20191222: 7.788 (SE +/- 0.008, N = 3; Min: 7.78 / Avg: 7.79 / Max: 7.8)
LLVM Clang 9.0.0: 7.951 (SE +/- 0.001, N = 6; Min: 7.95 / Avg: 7.95 / Max: 7.96)

Additional options on two of the four runs: -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr
Additional option on one run: -lncurses
1. (CC) gcc options: -O3 -pipe -march=native -lm

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Stepanov Vector (Seconds, Fewer Is Better)

LLVM Clang 10.0 20191222: 66.78 (SE +/- 0.04, N = 3; Min: 66.72 / Avg: 66.78 / Max: 66.85)
LLVM Clang 9.0.0: 67.11 (SE +/- 0.01, N = 3; Min: 67.1 / Avg: 67.11 / Max: 67.12)
GCC 10.0.0 20191208: 76.26 (SE +/- 0.05, N = 3; Min: 76.19 / Avg: 76.26 / Max: 76.35)
GCC 9.2.1: 77.50 (SE +/- 0.03, N = 3; Min: 77.44 / Avg: 77.5 / Max: 77.54)

1. (CXX) g++ options: -O3 -march=native -std=c++11

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)

GCC 10.0.0 20191208: 27.39 (SE +/- 0.04, N = 3; Min: 27.34 / Avg: 27.39 / Max: 27.46)
GCC 9.2.1: 28.62 (SE +/- 0.30, N = 3; Min: 28.02 / Avg: 28.62 / Max: 28.93)
LLVM Clang 9.0.0: 31.59 (SE +/- 0.01, N = 3; Min: 31.57 / Avg: 31.58 / Max: 31.6)
LLVM Clang 10.0 20191222: 31.75 (SE +/- 0.45, N = 15; Min: 30.58 / Avg: 31.74 / Max: 37.93)

1. (CC) gcc options: -lm -O3 -march=native

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)

LLVM Clang 10.0 20191222: 1528248 (SE +/- 1690.40, N = 5; Min: 1521486 / Avg: 1528247.6 / Max: 1529938)
LLVM Clang 9.0.0: 1487001 (SE +/- 2711.28, N = 5; Min: 1476616 / Avg: 1487001.2 / Max: 1492623)
GCC 9.2.1: 1373089 (SE +/- 9469.71, N = 5; Min: 1349946 / Avg: 1373089 / Max: 1390853)
GCC 10.0.0 20191208: 1345991 (SE +/- 1231.70, N = 5; Min: 1343360 / Avg: 1345991.2 / Max: 1349946)

1. (CC) gcc options: -O3 -march=native

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

libgav1 2019-10-05, Video Input: Summer Nature 4K (FPS, More Is Better)

LLVM Clang 9.0.0: 23.94 (SE +/- 0.02, N = 3; Min: 23.9 / Avg: 23.94 / Max: 23.97)
GCC 10.0.0 20191208: 23.89 (SE +/- 0.02, N = 3; Min: 23.86 / Avg: 23.89 / Max: 23.93)
GCC 9.2.1: 23.74 (SE +/- 0.02, N = 3; Min: 23.72 / Avg: 23.74 / Max: 23.78)
LLVM Clang 10.0 20191222: 21.17 (SE +/- 0.05, N = 3; Min: 21.07 / Avg: 21.17 / Max: 21.23)

1. (CXX) g++ options: -O3 -march=native -lpthread

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better)

LLVM Clang 10.0 20191222: 7.145 (SE +/- 0.003, N = 5; Min: 7.14 / Avg: 7.15 / Max: 7.15)
LLVM Clang 9.0.0: 7.192 (SE +/- 0.005, N = 5; Min: 7.18 / Avg: 7.19 / Max: 7.21)
GCC 10.0.0 20191208: 8.042 (SE +/- 0.006, N = 5; Min: 8.02 / Avg: 8.04 / Max: 8.06)
GCC 9.2.1: 8.073 (SE +/- 0.007, N = 5; Min: 8.05 / Avg: 8.07 / Max: 8.09)

Additional option on two of the four runs: -fvisibility=hidden
1. (CXX) g++ options: -O3 -march=native -logg -lm

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

libgav1 2019-10-05, Video Input: Chimera 1080p (FPS, More Is Better)

LLVM Clang 9.0.0: 52.47 (SE +/- 0.11, N = 3; Min: 52.25 / Avg: 52.47 / Max: 52.61)
GCC 9.2.1: 51.84 (SE +/- 0.11, N = 3; Min: 51.62 / Avg: 51.84 / Max: 52)
GCC 10.0.0 20191208: 51.70 (SE +/- 0.08, N = 3; Min: 51.56 / Avg: 51.7 / Max: 51.84)
LLVM Clang 10.0 20191222: 46.69 (SE +/- 0.03, N = 3; Min: 46.64 / Avg: 46.69 / Max: 46.75)

1. (CXX) g++ options: -O3 -march=native -lpthread

libgav1 2019-10-05, Video Input: Summer Nature 1080p (FPS, More Is Better)

LLVM Clang 9.0.0: 78.57 (SE +/- 0.22, N = 3; Min: 78.14 / Avg: 78.57 / Max: 78.8)
GCC 10.0.0 20191208: 77.92 (SE +/- 0.09, N = 3; Min: 77.82 / Avg: 77.92 / Max: 78.09)
GCC 9.2.1: 77.11 (SE +/- 0.04, N = 3; Min: 77.06 / Avg: 77.11 / Max: 77.19)
LLVM Clang 10.0 20191222: 70.28 (SE +/- 0.10, N = 3; Min: 70.09 / Avg: 70.28 / Max: 70.41)

1. (CXX) g++ options: -O3 -march=native -lpthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Hair (Seconds, Fewer Is Better)

LLVM Clang 9.0.0: 8.70024 (SE +/- 0.01343, N = 3; Min: 8.68 / Avg: 8.7 / Max: 8.72)
LLVM Clang 10.0 20191222: 8.72625 (SE +/- 0.01435, N = 3; Min: 8.7 / Avg: 8.73 / Max: 8.75)
GCC 10.0.0 20191208: 9.44960 (SE +/- 0.03984, N = 3; Min: 9.38 / Avg: 9.45 / Max: 9.52)
GCC 9.2.1: 9.70952 (SE +/- 0.02093, N = 3; Min: 9.67 / Avg: 9.71 / Max: 9.73)

Additional option on two of the four runs: -fstrict-aliasing
1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

MKL-DNN DNNL

This is a test of Intel MKL-DNN (DNNL / Deep Neural Network Library), an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

MKL-DNN DNNL 1.1, Harness: IP Batch 1D - Data Type: f32 (ms, Fewer Is Better)

LLVM Clang 9.0.0: 1.67315 (SE +/- 0.00428, N = 3; Min: 1.67 / Avg: 1.67 / Max: 1.68; -fopenmp=libomp - MIN: 1.62)
GCC 9.2.1: 1.73710 (SE +/- 0.00269, N = 3; Min: 1.73 / Avg: 1.74 / Max: 1.74; -fopenmp - MIN: 1.67)
GCC 10.0.0 20191208: 1.86131 (SE +/- 0.00636, N = 3; Min: 1.85 / Avg: 1.86 / Max: 1.87; -fopenmp - MIN: 1.81)

1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Ctype (Seconds, Fewer Is Better)

LLVM Clang 9.0.0: 29.09 (SE +/- 0.01, N = 3; Min: 29.06 / Avg: 29.09 / Max: 29.1)
LLVM Clang 10.0 20191222: 29.80 (SE +/- 0.30, N = 3; Min: 29.2 / Avg: 29.8 / Max: 30.1)
GCC 9.2.1: 32.22 (SE +/- 0.00, N = 3; Min: 32.22 / Avg: 32.22 / Max: 32.22)
GCC 10.0.0 20191208: 32.31 (SE +/- 0.04, N = 3; Min: 32.26 / Avg: 32.31 / Max: 32.39)

1. (CXX) g++ options: -O3 -march=native -std=c++11

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Volumetric Caustic (Seconds, Fewer Is Better)

LLVM Clang 9.0.0: 3.68186 (SE +/- 0.00668, N = 3; Min: 3.67 / Avg: 3.68 / Max: 3.69)
LLVM Clang 10.0 20191222: 3.72488 (SE +/- 0.00780, N = 3; Min: 3.71 / Avg: 3.72 / Max: 3.74)
GCC 10.0.0 20191208: 3.96697 (SE +/- 0.04581, N = 3; Min: 3.88 / Avg: 3.97 / Max: 4.02)
GCC 9.2.1: 4.07902 (SE +/- 0.03613, N = 3; Min: 4.04 / Avg: 4.08 / Max: 4.15)

Additional option on two of the four runs: -fstrict-aliasing
1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

libgav1

Libgav1 is an AV1 decoder developed by Google for AV1 profile 0/1 compliance. Learn more via the OpenBenchmarking.org test page.

libgav1 2019-10-05, Video Input: Chimera 1080p 10-bit (FPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 24.19 (SE +/- 0.09, N = 3; Min: 24.08 / Avg: 24.19 / Max: 24.36)
LLVM Clang 10.0 20191222: 23.11 (SE +/- 0.07, N = 3; Min: 22.98 / Avg: 23.11 / Max: 23.19)
GCC 9.2.1: 22.01 (SE +/- 0.04, N = 3; Min: 21.96 / Avg: 22.01 / Max: 22.09)
GCC 10.0.0 20191208: 21.93 (SE +/- 0.04, N = 3; Min: 21.86 / Avg: 21.93 / Max: 22.01)
1. (CXX) g++ options: -O3 -march=native -lpthread

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Stepanov Abstraction (Seconds, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 25.86 (SE +/- 0.01, N = 3; Min: 25.85 / Avg: 25.86 / Max: 25.89)
LLVM Clang 9.0.0: 26.03 (SE +/- 0.01, N = 3; Min: 26.02 / Avg: 26.03 / Max: 26.05)
GCC 10.0.0 20191208: 28.30 (SE +/- 0.02, N = 3; Min: 28.28 / Avg: 28.3 / Max: 28.33)
GCC 9.2.1: 28.52 (SE +/- 0.05, N = 3; Min: 28.45 / Avg: 28.52 / Max: 28.6)
1. (CXX) g++ options: -O3 -march=native -std=c++11

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 4096.25 (SE +/- 0.00, N = 3; Min: 4096.25 / Avg: 4096.25 / Max: 4096.25)
GCC 10.0.0 20191208: 3716.33 (SE +/- 46.05, N = 3; Min: 3647.34 / Avg: 3716.33 / Max: 3803.66)
1. (CXX) g++ options: -lpthread

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6, Test: Random Fill (Op/s, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 1023335 (SE +/- 4040.85, N = 3; Min: 1017053 / Avg: 1023334.67 / Max: 1030879)
GCC 10.0.0 20191208: 930897 (SE +/- 13468.75, N = 3; Min: 916670 / Avg: 930897 / Max: 957820)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 64.85 (SE +/- 0.36, N = 3; Min: 64.14 / Avg: 64.85 / Max: 65.26)
GCC 10.0.0 20191208: 69.79 (SE +/- 0.32, N = 3; Min: 69.18 / Avg: 69.79 / Max: 70.25)
GCC 9.2.1: 70.33 (SE +/- 0.27, N = 3; Min: 69.8 / Avg: 70.33 / Max: 70.71)
LLVM Clang 10.0 20191222: 70.71 (SE +/- 0.32, N = 3; Min: 70.09 / Avg: 70.71 / Max: 71.15)
Additional per-result option shown on the chart for two results: -mabm (the extract does not indicate which two).
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 4983.28 (SE +/- 84.96, N = 3; Min: 4824.83 / Avg: 4983.28 / Max: 5115.68)
GCC 10.0.0 20191208: 4898.17 (SE +/- 51.56, N = 7; Min: 4716.42 / Avg: 4898.17 / Max: 5043.49)
LLVM Clang 10.0 20191222: 4866.90 (SE +/- 77.03, N = 3; Min: 4757.83 / Avg: 4866.9 / Max: 5015.67)
GCC 9.2.1: 4583.35 (SE +/- 61.37, N = 3; Min: 4521.83 / Avg: 4583.35 / Max: 4706.08)
1. (CC) gcc options: -O3 -march=native -mavx2
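
For readers unfamiliar with the point-Jacobi method named in the Himeno description, the following is a minimal sketch of the idea on a one-dimensional Poisson problem. It is an illustration only: the function name `jacobi_poisson_1d`, the grid size, and the zero boundary values are assumptions made here, and the benchmark's own kernel and grid are not reproduced.

```python
def jacobi_poisson_1d(f, h, iterations):
    """Approximate -u'' = f on a uniform grid with u = 0 at both ends.

    Each sweep computes every interior point from the previous sweep's
    neighbours only, which is what makes this a point-Jacobi iteration.
    """
    n = len(f)
    u = [0.0] * n
    for _ in range(iterations):
        new_u = u[:]  # boundary entries stay at 0.0
        for i in range(1, n - 1):
            new_u[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * f[i])
        u = new_u
    return u


# Hypothetical example: f = 2 everywhere on [0, 1], whose exact solution is x * (1 - x).
grid_points = 11
spacing = 1.0 / (grid_points - 1)
solution = jacobi_poisson_1d([2.0] * grid_points, spacing, 2000)
print(solution[5])
```

Because every update reads only the previous sweep, the inner loop has no loop-carried dependency, which is one reason kernels of this shape are sensitive to how well a compiler vectorizes them.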

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 8135.2 (SE +/- 76.26, N = 3; Min: 7982.9 / Avg: 8135.17 / Max: 8218.9)
GCC 9.2.1: 8111.6 (SE +/- 26.44, N = 3; Min: 8062.6 / Avg: 8111.63 / Max: 8153.3)
LLVM Clang 9.0.0: 7531.2 (SE +/- 24.98, N = 3; Min: 7482 / Avg: 7531.2 / Max: 7563.3)
1. (CC) gcc options: -pthread -O3 -march=native -lm
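
As a reminder of what the discrete Fourier transform computes, here is a direct evaluation of the one-dimensional DFT sum. This is a sketch for orientation only: `naive_dft` is a name chosen here, it runs in quadratic time, and it is not how FFTW computes transforms (FFTW uses fast algorithms, and the test above is two-dimensional).

```python
import cmath


def naive_dft(samples):
    """Evaluate X[k] = sum_t x[t] * exp(-2*pi*i*k*t/n) directly for every k."""
    n = len(samples)
    return [
        sum(samples[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


# A unit impulse transforms to a flat spectrum; a constant signal to a single peak.
impulse_spectrum = naive_dft([1, 0, 0, 0])
constant_spectrum = naive_dft([1, 1, 1, 1])
print(impulse_spectrum)
print(constant_spectrum)
```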

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

AOM AV1 2019-09-16, AV1 Video Encoding (Frames Per Second, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 0.15 (SE +/- 0.00, N = 3; Min: 0.15 / Avg: 0.15 / Max: 0.15)
LLVM Clang 10.0 20191222: 0.15 (SE +/- 0.00, N = 3; Min: 0.15 / Avg: 0.15 / Max: 0.15)
GCC 10.0.0 20191208: 0.15 (SE +/- 0.00, N = 3; Min: 0.15 / Avg: 0.15 / Max: 0.15)
GCC 9.2.1: 0.14 (SE +/- 0.00, N = 3; Min: 0.14 / Avg: 0.14 / Max: 0.14)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Function Objects (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 13.84 (SE +/- 0.01, N = 3; Min: 13.83 / Avg: 13.84 / Max: 13.86)
LLVM Clang 9.0.0: 14.70 (SE +/- 0.01, N = 3; Min: 14.69 / Avg: 14.7 / Max: 14.71)
LLVM Clang 10.0 20191222: 14.76 (SE +/- 0.00, N = 3; Min: 14.76 / Avg: 14.76 / Max: 14.77)
GCC 9.2.1: 14.82 (SE +/- 0.15, N = 3; Min: 14.52 / Avg: 14.82 / Max: 14.98)
1. (CXX) g++ options: -O3 -march=native -std=c++11

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30, Timed Time - Size 1,000 (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 9.2.1: 56.17 (SE +/- 0.46, N = 3; Min: 55.26 / Avg: 56.17 / Max: 56.66)
GCC 10.0.0 20191208: 57.12 (SE +/- 0.10, N = 3; Min: 56.93 / Avg: 57.12 / Max: 57.25)
LLVM Clang 9.0.0: 58.33 (SE +/- 0.44, N = 3; Min: 57.46 / Avg: 58.33 / Max: 58.87)
LLVM Clang 10.0 20191222: 59.91 (SE +/- 0.13, N = 3; Min: 59.72 / Avg: 59.91 / Max: 60.15)
1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.5.0, Video Input: Chimera 1080p (FPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 625.11 (SE +/- 1.62, N = 3; MIN: 468.04 / MAX: 781; runs Min: 622.14 / Avg: 625.11 / Max: 627.71)
LLVM Clang 9.0.0: 623.05 (SE +/- 2.49, N = 3; MIN: 475.37 / MAX: 782.95; runs Min: 620.3 / Avg: 623.05 / Max: 628.03)
GCC 10.0.0 20191208: 612.94 (SE +/- 4.43, N = 3; MIN: 452.17 / MAX: 769.81; runs Min: 605.45 / Avg: 612.94 / Max: 620.79)
GCC 9.2.1: 587.24 (SE +/- 2.39, N = 3; MIN: 439.81 / MAX: 722.49; runs Min: 582.96 / Avg: 587.24 / Max: 591.23)
1. (CC) gcc options: -O3 -march=native -pthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2, Scene: Water Caustic (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 18.22 (SE +/- 0.03, N = 3; Min: 18.17 / Avg: 18.22 / Max: 18.26)
LLVM Clang 10.0 20191222: 19.16 (SE +/- 0.04, N = 3; Min: 19.11 / Avg: 19.16 / Max: 19.24)
GCC 9.2.1: 19.20 (SE +/- 0.02, N = 3; Min: 19.16 / Avg: 19.2 / Max: 19.23)
LLVM Clang 9.0.0: 19.35 (SE +/- 0.02, N = 3; Min: 19.31 / Avg: 19.35 / Max: 19.38)
Additional per-result option shown on the chart for two results: -fstrict-aliasing (the extract does not indicate which two).
1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding 1.8.1, vpxenc VP9 1080p Video Encode (Frames Per Second, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 207.78 (SE +/- 1.20, N = 3; Min: 206.3 / Avg: 207.78 / Max: 210.15)
LLVM Clang 10.0 20191222: 207.68 (SE +/- 1.73, N = 15; Min: 192.3 / Avg: 207.68 / Max: 216.43)
GCC 9.2.1: 197.37 (SE +/- 1.49, N = 3; Min: 194.59 / Avg: 197.37 / Max: 199.67)
GCC 10.0.0 20191208: 195.63 (SE +/- 1.29, N = 3; Min: 194.07 / Avg: 195.63 / Max: 198.19)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 9, Total Time (Nodes Per Second, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 81336054 (SE +/- 194228.81, N = 3; Min: 80958759 / Avg: 81336053.67 / Max: 81604770)
GCC 10.0.0 20191208: 80909082 (SE +/- 1221575.84, N = 3; Min: 78627926 / Avg: 80909082 / Max: 82807280)
GCC 9.2.1: 76737462 (SE +/- 74252.03, N = 3; Min: 76593541 / Avg: 76737462.33 / Max: 76841126)
1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6, Test: Sequential Fill (Op/s, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 1083649 (SE +/- 10427.62, N = 3; Min: 1065816 / Avg: 1083649 / Max: 1101930)
GCC 10.0.0 20191208: 1024733 (SE +/- 4276.33, N = 3; Min: 1019461 / Avg: 1024732.67 / Max: 1033201)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.22.0, Backend: Random (Nodes Per Second, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 105996.0 (SE +/- 354.64, N = 3; Min: 105360 / Avg: 105995.67 / Max: 106586)
GCC 10.0.0 20191208: 105805.0 (SE +/- 275.83, N = 3; Min: 105529 / Avg: 105805.33 / Max: 106357)
LLVM Clang 9.0.0: 100236.8 (SE +/- 553.66, N = 3; Min: 99175.5 / Avg: 100236.83 / Max: 101041)
1. (CXX) g++ options: -O3 -march=native -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4, Test: OpenMP CFD Solver (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 9.2.1: 9.151 (SE +/- 0.052, N = 3; Min: 9.05 / Avg: 9.15 / Max: 9.22)
GCC 10.0.0 20191208: 9.185 (SE +/- 0.029, N = 3; Min: 9.13 / Avg: 9.18 / Max: 9.23)
LLVM Clang 9.0.0: 9.633 (SE +/- 0.133, N = 3; Min: 9.4 / Avg: 9.63 / Max: 9.86)
Per-result options, in chart order: -O2 -lOpenCL; -O2 -lOpenCL; -O3 -fopenmp
1. (CXX) g++ options:

MKL-DNN DNNL

This is a test of the Intel MKL-DNN (DNNL / Deep Neural Network Library) as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

MKL-DNN DNNL 1.1, Harness: Convolution Batch conv_alexnet - Data Type: f32 (ms, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 118.99 (SE +/- 0.15, N = 3; -fopenmp=libomp - MIN: 118.28; runs Min: 118.68 / Avg: 118.99 / Max: 119.16)
GCC 10.0.0 20191208: 124.28 (SE +/- 1.45, N = 3; -fopenmp - MIN: 122.25; runs Min: 122.81 / Avg: 124.28 / Max: 127.19)
GCC 9.2.1: 125.01 (SE +/- 1.55, N = 3; -fopenmp - MIN: 122.48; runs Min: 123.18 / Avg: 125.01 / Max: 128.1)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 5509.30 (SE +/- 37.73, N = 3; Min: 5433.8 / Avg: 5509.27 / Max: 5547)
GCC 10.0.0 20191208: 5255.51 (SE +/- 34.80, N = 3; Min: 5220.71 / Avg: 5255.51 / Max: 5325.12)
1. (CXX) g++ options: -lpthread

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Math Library (Seconds, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 258.40 (SE +/- 2.46, N = 3; Min: 253.49 / Avg: 258.4 / Max: 261.16)
LLVM Clang 9.0.0: 260.64 (SE +/- 0.41, N = 3; Min: 259.84 / Avg: 260.64 / Max: 261.23)
GCC 10.0.0 20191208: 262.97 (SE +/- 0.29, N = 3; Min: 262.62 / Avg: 262.97 / Max: 263.56)
GCC 9.2.1: 269.48 (SE +/- 0.45, N = 3; Min: 268.6 / Avg: 269.48 / Max: 270.12)
1. (CXX) g++ options: -O3 -march=native -std=c++11

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

x265 3.1.2, H.265 1080p Video Encoding (Frames Per Second, More Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 67.11 (SE +/- 0.39, N = 3; Min: 66.37 / Avg: 67.11 / Max: 67.69)
LLVM Clang 9.0.0: 66.27 (SE +/- 0.07, N = 3; Min: 66.13 / Avg: 66.27 / Max: 66.38)
GCC 10.0.0 20191208: 65.74 (SE +/- 0.29, N = 3; Min: 65.2 / Avg: 65.74 / Max: 66.21)
GCC 9.2.1: 64.69 (SE +/- 0.10, N = 3; Min: 64.52 / Avg: 64.69 / Max: 64.87)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2, Problem Size: Small (CG Mflops, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 8037.33 (SE +/- 3.20, N = 3; Min: 8033.18 / Avg: 8037.33 / Max: 8043.63)
GCC 9.2.1: 7777.50 (SE +/- 9.20, N = 3; Min: 7765.66 / Avg: 7777.5 / Max: 7795.61)
GCC 10.0.0 20191208: 7767.98 (SE +/- 4.74, N = 3; Min: 7761.61 / Avg: 7767.98 / Max: 7777.24)
1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 12.0, Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 675821.10 (SE +/- 2145.94, N = 3; Min: 672374.47 / Avg: 675821.1 / Max: 679759.31)
GCC 10.0.0 20191208: 673644.96 (SE +/- 876.37, N = 3; Min: 672360.52 / Avg: 673644.96 / Max: 675320.01)
LLVM Clang 10.0 20191222: 668836.61 (SE +/- 1018.99, N = 3; Min: 666856.43 / Avg: 668836.61 / Max: 670244.03)
GCC 9.2.1: 654500.68 (SE +/- 1703.60, N = 3; Min: 651323.06 / Avg: 654500.68 / Max: 657154.3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 9.817 (SE +/- 0.100, N = 3; Min: 9.68 / Avg: 9.82 / Max: 10.01)
GCC 9.2.1: 10.040 (SE +/- 0.125, N = 3; Min: 9.8 / Avg: 10.04 / Max: 10.21)
GCC 10.0.0 20191208: 10.052 (SE +/- 0.089, N = 3; Min: 9.92 / Avg: 10.05 / Max: 10.22)
LLVM Clang 10.0 20191222: 10.107 (SE +/- 0.033, N = 3; Min: 10.06 / Avg: 10.11 / Max: 10.17)
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.5.0, Video Input: Summer Nature 1080p (FPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 685.40 (SE +/- 2.78, N = 3; MIN: 400.43 / MAX: 750.08; runs Min: 681.79 / Avg: 685.4 / Max: 690.87)
GCC 10.0.0 20191208: 676.37 (SE +/- 0.42, N = 3; MIN: 396.97 / MAX: 738.73; runs Min: 675.83 / Avg: 676.37 / Max: 677.2)
LLVM Clang 9.0.0: 674.39 (SE +/- 1.73, N = 3; MIN: 376.28 / MAX: 738.32; runs Min: 671.34 / Avg: 674.39 / Max: 677.33)
GCC 9.2.1: 667.33 (SE +/- 1.97, N = 3; MIN: 387.89 / MAX: 728.98; runs Min: 663.62 / Avg: 667.33 / Max: 670.36)
1. (CC) gcc options: -O3 -march=native -pthread

dav1d 0.5.0, Video Input: Summer Nature 4K (FPS, More Is Better; OpenBenchmarking.org)
LLVM Clang 10.0 20191222: 291.76 (SE +/- 0.34, N = 3; MIN: 174.15 / MAX: 309.6; runs Min: 291.11 / Avg: 291.76 / Max: 292.29)
GCC 10.0.0 20191208: 289.29 (SE +/- 0.64, N = 3; MIN: 172.47 / MAX: 306.73; runs Min: 288.01 / Avg: 289.29 / Max: 290.05)
LLVM Clang 9.0.0: 288.60 (SE +/- 0.37, N = 3; MIN: 170.73 / MAX: 306.52; runs Min: 287.89 / Avg: 288.6 / Max: 289.15)
GCC 9.2.1: 285.28 (SE +/- 1.13, N = 3; MIN: 168.88 / MAX: 304.29; runs Min: 283.2 / Avg: 285.28 / Max: 287.09)
1. (CC) gcc options: -O3 -march=native -pthread

MKL-DNN DNNL

This is a test of the Intel MKL-DNN (DNNL / Deep Neural Network Library) as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

MKL-DNN DNNL 1.1, Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32 (ms, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 51.33 (SE +/- 0.11, N = 3; -fopenmp=libomp - MIN: 50.48; runs Min: 51.13 / Avg: 51.33 / Max: 51.49)
GCC 9.2.1: 52.28 (SE +/- 0.18, N = 3; -fopenmp - MIN: 51.18; runs Min: 51.93 / Avg: 52.28 / Max: 52.53)
GCC 10.0.0 20191208: 52.47 (SE +/- 0.24, N = 3; -fopenmp - MIN: 51.43; runs Min: 52.11 / Avg: 52.47 / Max: 52.93)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6, Test: Random Read (Op/s, More Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 145368967 (SE +/- 467553.12, N = 3; Min: 144762532 / Avg: 145368967 / Max: 146288622)
GCC 9.2.1: 142287766 (SE +/- 234281.58, N = 3; Min: 141996237 / Avg: 142287765.67 / Max: 142751212)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

BYTE Unix Benchmark 3.6, Computational Test: Dhrystone 2 (LPS, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 48969411.2 (SE +/- 70447.90, N = 3; Min: 48828697.3 / Avg: 48969411.23 / Max: 49045965.9)
GCC 10.0.0 20191208: 48002215.1 (SE +/- 638551.10, N = 4; Min: 46087504.6 / Avg: 48002215.1 / Max: 48686844)
1. (CC) gcc options: -O3 -march=native

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

XZ Compression 5.2.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 (Seconds, Fewer Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 19.69 (SE +/- 0.02, N = 3; Min: 19.66 / Avg: 19.69 / Max: 19.73)
LLVM Clang 10.0 20191222: 19.69 (SE +/- 0.02, N = 3; Min: 19.66 / Avg: 19.69 / Max: 19.71)
GCC 9.2.1: 19.75 (SE +/- 0.01, N = 3; Min: 19.74 / Avg: 19.75 / Max: 19.77)
GCC 10.0.0 20191208: 20.04 (SE +/- 0.16, N = 3; Min: 19.88 / Avg: 20.04 / Max: 20.36)
1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native
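
The kind of measurement this test takes (wall-clock time for one compression pass at level 9) can be sketched with Python's standard `lzma` module. This is an illustration under assumptions: `time_xz_compress` and the sample payload are made up here, and the timing reflects Python's liblzma binding rather than the xz build produced by the compilers under test.

```python
import lzma
import time


def time_xz_compress(data, preset=9):
    """Return (elapsed_seconds, compressed_bytes) for one XZ compression pass."""
    start = time.perf_counter()
    compressed = lzma.compress(data, format=lzma.FORMAT_XZ, preset=preset)
    elapsed = time.perf_counter() - start
    return elapsed, compressed


# Hypothetical stand-in payload; the real test compresses an Ubuntu file-system image.
sample = b"phoronix test suite " * 50000
elapsed, compressed = time_xz_compress(sample)
print(f"{len(sample)} -> {len(compressed)} bytes in {elapsed:.3f} s")
```

Repeating the call several times and reporting the mean with its spread would mirror the N = 3 runs and SE figures listed in the results.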

NGINX Benchmark

This is a test of ab, which is the Apache Benchmark program running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests with 500 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.

NGINX Benchmark 1.9.9, Static Web Page Serving (Requests Per Second, More Is Better; OpenBenchmarking.org)
LLVM Clang 9.0.0: 43783.30 (SE +/- 326.86, N = 3; Min: 43264.85 / Avg: 43783.3 / Max: 44387.38)
LLVM Clang 10.0 20191222: 43580.60 (SE +/- 238.62, N = 3; Min: 43221.77 / Avg: 43580.6 / Max: 44032.51)
GCC 9.2.1: 43426.72 (SE +/- 457.53, N = 3; Min: 42537.29 / Avg: 43426.72 / Max: 44057.71)
GCC 10.0.0 20191208: 43035.46 (SE +/- 490.20, N = 3; Min: 42230.49 / Avg: 43035.46 / Max: 43922.62)
1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 199.22 (SE +/- 2.04, N = 8; Min: 185.55 / Avg: 199.22 / Max: 203.41)
LLVM Clang 9.0.0: 198.29 (SE +/- 1.58, N = 12; Min: 181.53 / Avg: 198.29 / Max: 202.56)
GCC 9.2.1: 197.93 (SE +/- 0.53, N = 3; Min: 197.01 / Avg: 197.93 / Max: 198.84)
LLVM Clang 10.0 20191222: 196.10 (SE +/- 0.76, N = 3; Min: 194.77 / Avg: 196.1 / Max: 197.39)
Additional per-result option shown on the chart for two results: -mstack-alignment=64 (the extract does not indicate which two).
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks 9, Test: Atol (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 9.2.1: 57.47 (SE +/- 0.20, N = 3; Min: 57.2 / Avg: 57.47 / Max: 57.86)
LLVM Clang 10.0 20191222: 57.84 (SE +/- 0.26, N = 3; Min: 57.57 / Avg: 57.84 / Max: 58.36)
LLVM Clang 9.0.0: 58.00 (SE +/- 0.25, N = 3; Min: 57.71 / Avg: 57.99 / Max: 58.48)
GCC 10.0.0 20191208: 58.28 (SE +/- 0.27, N = 3; Min: 57.8 / Avg: 58.28 / Max: 58.73)
1. (CXX) g++ options: -O3 -march=native -std=c++11

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.

N-Queens 1.0, Elapsed Time (Seconds, Fewer Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 4.378 (SE +/- 0.004, N = 3; Min: 4.37 / Avg: 4.38 / Max: 4.39)
GCC 9.2.1: 4.429 (SE +/- 0.007, N = 3; Min: 4.42 / Avg: 4.43 / Max: 4.44)
1. (CC) gcc options: -static -fopenmp -O3 -march=native
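
For context on the workload, the N-queens problem asks in how many ways N queens can be placed on an N x N board so that none attacks another. The sketch below counts solutions with a serial bitmask backtracking search. It is an assumption-laden illustration: `count_n_queens` is a name chosen here, and the benchmarked program is a separate OpenMP implementation whose source is not shown on this page.

```python
def count_n_queens(n):
    """Count placements of n mutually non-attacking queens on an n x n board."""
    full = (1 << n) - 1  # one bit per column

    def place(cols, diag1, diag2):
        # cols, diag1, diag2 mark squares in the current row that are attacked.
        if cols == full:
            return 1  # every column holds a queen, so every row is filled
        total = 0
        free = full & ~(cols | diag1 | diag2)
        while free:
            bit = free & -free  # lowest free square in this row
            free -= bit
            total += place(
                cols | bit,
                ((diag1 | bit) << 1) & full,
                (diag2 | bit) >> 1,
            )
        return total

    return place(0, 0, 0)


print(count_n_queens(8))
```

The search visits one row per recursion level; a parallel version would typically hand the first-row choices to separate threads, though whether the benchmarked code does so is not stated here.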

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 3376.27 (SE +/- 2.38, N = 3; Min: 3373.89 / Avg: 3376.27 / Max: 3381.03)
GCC 10.0.0 20191208: 3339.33 (SE +/- 13.81, N = 3; Min: 3323.01 / Avg: 3339.33 / Max: 3366.78)
1. (CXX) g++ options: -lpthread

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

Crafty 25.2, Elapsed Time (Nodes Per Second, More Is Better; OpenBenchmarking.org)
GCC 10.0.0 20191208: 9027835 (SE +/- 15301.93, N = 3; Min: 9005307 / Avg: 9027834.67 / Max: 9057038)
GCC 9.2.1: 8959937 (SE +/- 11113.29, N = 3; Min: 8941640 / Avg: 8959937.33 / Max: 8980014)
1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better; OpenBenchmarking.org)
GCC 9.2.1: 1955.97 (SE +/- 1.51, N = 3; Min: 1954.17 / Avg: 1955.97 / Max: 1958.97)
GCC 10.0.0 20191208: 1943.58 (SE +/- 10.21, N = 3; Min: 1928.81 / Avg: 1943.58 / Max: 1963.18)
1. (CXX) g++ options: -lpthread

Apache Benchmark

This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.

Apache Benchmark 2.4.29 - Static Web Page Serving (Requests Per Second, More Is Better)
LLVM Clang 10.0 20191222: 34327.90 (SE +/- 71.82, N = 3; Min: 34236.76 / Avg: 34327.9 / Max: 34469.61)
GCC 9.2.1: 34228.50 (SE +/- 27.04, N = 3; Min: 34188.65 / Avg: 34228.5 / Max: 34280.09)
GCC 10.0.0 20191208: 34157.76 (SE +/- 54.98, N = 3; Min: 34079.71 / Avg: 34157.76 / Max: 34263.87)
LLVM Clang 9.0.0: 34144.39 (SE +/- 32.67, N = 3; Min: 34094.83 / Avg: 34144.39 / Max: 34206.04)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6 - Test: Random Fill Sync (Op/s, More Is Better)
GCC 9.2.1: 24368 (SE +/- 29.29, N = 3; Min: 24320 / Avg: 24367.67 / Max: 24421)
GCC 10.0.0 20191208: 24277 (SE +/- 40.43, N = 3; Min: 24208 / Avg: 24277 / Max: 24348)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.
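
To illustrate the kind of measurement described above, here is a minimal sketch that times a fixed number of insertions into an indexed table using Python's standard `sqlite3` module. It is not the test profile's actual workload: the table layout, the row count, the in-memory database, and the function name `time_indexed_inserts` are all assumptions made for illustration.

```python
import sqlite3
import time


def time_indexed_inserts(n_rows=10_000):
    """Time n_rows single-row inserts into an indexed in-memory table.

    Returns (elapsed_seconds, rows_in_table).
    """
    conn = sqlite3.connect(":memory:")  # assumption: in-memory database
    conn.execute("CREATE TABLE items (id INTEGER, label TEXT)")
    conn.execute("CREATE INDEX idx_items_id ON items (id)")

    start = time.perf_counter()
    with conn:  # runs the inserts in one transaction, committed on exit
        for i in range(n_rows):
            conn.execute("INSERT INTO items VALUES (?, ?)", (i, f"row-{i}"))
    elapsed = time.perf_counter() - start

    count = conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]
    conn.close()
    return elapsed, count


elapsed, count = time_indexed_inserts()
print(f"{count} insertions in {elapsed:.3f} s")
```

Because the sketch uses an in-memory database, it avoids disk I/O entirely, so its timings are not comparable to the results reported here.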

SQLite 3.30.1 - Threads / Copies: 1 (Seconds, Fewer Is Better)
LLVM Clang 9.0.0: 14.20 (SE +/- 0.02, N = 3; Min: 14.16 / Avg: 14.2 / Max: 14.22)
GCC 10.0.0 20191208: 14.24 (SE +/- 0.03, N = 3; Min: 14.18 / Avg: 14.24 / Max: 14.27)
LLVM Clang 10.0 20191222: 14.25 (SE +/- 0.01, N = 3; Min: 14.24 / Avg: 14.25 / Max: 14.27)
GCC 9.2.1: 14.25 (SE +/- 0.04, N = 3; Min: 14.18 / Avg: 14.25 / Max: 14.29)
1. (CC) gcc options: -O3 -march=native -lz -lm -ldl -lpthread

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP Stencil (Seconds, Fewer Is Better)
GCC 10.0.0 20191208: 7.551706 (SE +/- 0.036635, N = 3; Min: 7.51 / Avg: 7.55 / Max: 7.62)
GCC 9.2.1: 7.576396 (SE +/- 0.021046, N = 3; Min: 7.55 / Avg: 7.58 / Max: 7.62)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
GCC 10.0.0 20191208: 10.09 (SE +/- 0.01, N = 3; Min: 10.07 / Avg: 10.09 / Max: 10.1)
GCC 9.2.1: 10.10 (SE +/- 0.02, N = 3; Min: 10.06 / Avg: 10.1 / Max: 10.13)
1. (CXX) g++ options: -O2 -lOpenCL

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6 - Test: Read While Writing (Op/s, More Is Better)
GCC 9.2.1: 4950890 (SE +/- 73789.66, N = 4; Min: 4836365 / Avg: 4950890 / Max: 5162932)
GCC 10.0.0 20191208: 4945916 (SE +/- 38218.40, N = 15; Min: 4826397 / Avg: 4945915.53 / Max: 5385175)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

GROMACS

This test runs the GROMACS molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2019.4 - Water Benchmark (Ns Per Day, More Is Better)
GCC 9.2.1: 2.516 (SE +/- 0.001, N = 3; Min: 2.51 / Avg: 2.52 / Max: 2.52)
GCC 10.0.0 20191208: 2.515 (SE +/- 0.005, N = 3; Min: 2.51 / Avg: 2.52 / Max: 2.52)
1. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5 - Test: OpenMP CUTCP (Seconds, Fewer Is Better)
GCC 10.0.0 20191208: 1.257410 (SE +/- 0.007993, N = 3; Min: 1.25 / Avg: 1.26 / Max: 1.27)
GCC 9.2.1: 1.257716 (SE +/- 0.002849, N = 3; Min: 1.25 / Avg: 1.26 / Max: 1.26)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 12.0 - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, More Is Better)
LLVM Clang 9.0.0: 31491.38 (SE +/- 127.72, N = 3; Min: 31305.7 / Avg: 31491.38 / Max: 31736.15)
GCC 9.2.1: 30910.99 (SE +/- 77.46, N = 3; Min: 30759.56 / Avg: 30910.99 / Max: 31014.97)
GCC 10.0.0 20191208: 15513.44 (SE +/- 2238.03, N = 12; Min: 11190.63 / Avg: 15513.44 / Max: 31475.92)
LLVM Clang 10.0 20191222: 14868.68 (SE +/- 1738.68, N = 15; Min: 11139.59 / Avg: 14868.68 / Max: 31166.57)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

Smallpt 1.0 - Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better)
GCC 9.2.1: 3.492 (SE +/- 0.006, N = 3; Min: 3.49 / Avg: 3.49 / Max: 3.5)
GCC 10.0.0 20191208: 4.775 (SE +/- 0.107, N = 15; Min: 4.41 / Avg: 4.78 / Max: 5.29)
1. (CXX) g++ options: -fopenmp -O3 -march=native
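
The Smallpt averages are among the widest gaps in this section, and the relative difference is easy to work out by hand. The sketch below shows the arithmetic using the two reported averages (3.492 s and 4.775 s); the helper name `percent_change` is hypothetical, and the choice of GCC 9.2.1 as the baseline is an assumption made for illustration.

```python
def percent_change(baseline, candidate):
    """Relative change of candidate versus baseline, in percent."""
    return (candidate - baseline) / baseline * 100.0


# Reported Smallpt averages in seconds (fewer is better).
gcc_9_2_1 = 3.492
gcc_10_0_0 = 4.775

change = percent_change(gcc_9_2_1, gcc_10_0_0)
print(f"GCC 10.0.0 20191208 took {change:.1f}% longer than GCC 9.2.1")
```

For "More Is Better" metrics such as TPS or Op/s, the same formula applies, but a positive change then indicates an improvement rather than a regression.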

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
LLVM Clang 9.0.0: 18.82 (SE +/- 0.06, N = 3; Min: 18.74 / Avg: 18.82 / Max: 18.94)
GCC 10.0.0 20191208: 19.07 (SE +/- 0.02, N = 3; Min: 19.04 / Avg: 19.07 / Max: 19.09)
GCC 9.2.1: 19.76 (SE +/- 0.54, N = 15; Min: 18.97 / Avg: 19.76 / Max: 27.02)
1. (CXX) g++ options, per result: LLVM Clang 9.0.0: -O3 -fopenmp; GCC 10.0.0 20191208: -O2 -lOpenCL; GCC 9.2.1: -O2 -lOpenCL

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.22.0 - Backend: BLAS (Nodes Per Second, More Is Better)
LLVM Clang 9.0.0: 33.30 (SE +/- 0.60, N = 15; Min: 30.64 / Avg: 33.3 / Max: 39.06)
GCC 9.2.1: 32.76 (SE +/- 0.56, N = 12; Min: 29.53 / Avg: 32.76 / Max: 35.2)
GCC 10.0.0 20191208: 29.33 (SE +/- 0.41, N = 3; Min: 28.6 / Avg: 29.33 / Max: 30.01)
1. (CXX) g++ options: -O3 -march=native -lpthread