Threadripper EOY2019 Clang vs. GCC

AMD Ryzen Threadripper 3960X GCC vs. LLVM Clang compiler benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1912235-PTS-THREADRI74&sro&grs.

Threadripper EOY2019 Clang vs. GCCProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC 10.0.0 20191208LLVM Clang 10.0 20191222GCC 9.2.1LLVM Clang 9.0.0AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS)AMD Starship/Matisse32768MB1000GB Sabrent Rocket 4.0 1TBGigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)AMD Baffin HDMI/DPASUS VP28UAquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723Ubuntu 19.105.4.0-nvme-hwmon (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.1 (LLVM 9.0.0)GCC 10.0.0 20191208ext43840x2160Clang 10.0.0GCC 9.2.1 20191008Clang 9.0.0OpenBenchmarking.orgEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- GCC 10.0.0 20191208: --disable-multilib --enable-checking=release- LLVM Clang 10.0 20191222: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2- GCC 9.2.1: --disable-multilib --enable-checking=releaseDisk Details- NONE / errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025Python Details- Python 2.7.17rc1 + Python 3.7.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected

Threadripper EOY2019 Clang vs. GCCjohn-the-ripper: Blowfishttsiod-renderer: Phong Rendering With Soft-Shadow Mappingmt-dgemm: Sustained Floating-Point Ratebuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelbuild-imagemagick: Time To Compileopenssl: RSA 4096-bit Performancedav1d: Chimera 1080p 10-bitmkl-dnn: Recurrent Neural Network Training - f32parboil: OpenMP MRI Griddingcpp-perf-bench: Rand Numberstungsten: Non-Exponentialqmcpack: fftw: Float + SSE - 2D FFT Size 4096encode-mp3: WAV To MP3cpp-perf-bench: Stepanov Vectoraobench: 2048 x 2048 - Total Timetscp: AI Chess Performancelibgav1: Summer Nature 4Kencode-flac: WAV To FLAClibgav1: Chimera 1080plibgav1: Summer Nature 1080ptungsten: Hairmkl-dnn: IP Batch 1D - f32cpp-perf-bench: Ctypetungsten: Volumetric Causticlibgav1: Chimera 1080p 10-bitcpp-perf-bench: Stepanov Abstractionaskap: tConvolve OpenMP - Degriddingrocksdb: Rand Fillmrbayes: Primate Phylogeny Analysishimeno: Poisson Pressure Solverfftw: Stock - 2D FFT Size 4096aom-av1: AV1 Video Encodingcpp-perf-bench: Function Objectssqlite-speedtest: Timed Time - Size 1,000dav1d: Chimera 1080ptungsten: Water Causticvpxenc: vpxenc VP9 1080p Video Encodestockfish: Total Timerocksdb: Seq Filllczero: Randrodinia: OpenMP CFD Solvermkl-dnn: Convolution Batch conv_alexnet - f32askap: tConvolve OpenMP - Griddingcpp-perf-bench: Math Libraryx265: H.265 1080p Video Encodingminife: Smallpgbench: Buffer Test - Normal Load - Read Onlycompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19dav1d: Summer Nature 1080pdav1d: Summer Nature 4Kmkl-dnn: Convolution Batch conv_googlenet_v3 - f32rocksdb: Rand Readbyte: Dhrystone 2compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9nginx: Static Web Page Servingx264: H.264 Video Encodingcpp-perf-bench: Atoln-queens: Elapsed Timeaskap: tConvolve MT - Degriddingcrafty: Elapsed Timeaskap: tConvolve MT - Griddingapache: Static Web Page Servingrocksdb: Rand Fill Syncsqlite: 1parboil: OpenMP Stencilrodinia: OpenMP LavaMDrocksdb: Read While Writinggromacs: Water Benchmarkparboil: OpenMP CUTCPpgbench: Buffer Test - Normal Load - Read Writesmallpt: Global Illumination Renderer; 128 Samplesrodinia: OpenMP Streamclusterlczero: BLASGCC 10.0.0 20191208LLVM Clang 10.0 20191222GCC 9.2.1LLVM Clang 9.0.041784961.9748.86318350.24219.13415.0647173.794.47194.07762.9416571061.4803.369271878.1202876.70176.26327.389134599123.898.04251.7077.929.449601.8613132.3113.9669721.9328.2953716.3393089769.7864898.1708218135.20.1513.83857.116612.9418.2157195.638090908210247331058059.185124.2825255.51262.96865.747767.98673644.95828510.052676.37289.2952.469114536896748002215.120.04243035.46199.2258.2784.3783339.3390278351943.5834157.762427714.2357.55170610.08549459162.5151.25741015513.4404234.77519.06929.3289193278.17830.83920.9055189.174.451282.0334.051767.78866.78231.745152824821.177.14546.6970.288.7262529.7953.7248823.1125.86470.7064866.9034270.1514.76459.906625.1119.1580207.68258.40467.11668836.60545310.107685.40291.7619.68743580.60196.1057.84334327.9014.24814868.67884941482935.1238.38597244.93919.15513.5057178.899.72194.76649.0009651069.4833.374191893.6241846.72977.50128.620137308923.748.07351.8477.119.709521.7371032.2204.0790222.0128.5234096.25102333570.3304583.3462128111.60.1414.81656.169587.2419.1975197.377673746210836491059969.151125.0085509.3269.48464.697777.50654500.67635010.040667.33285.2852.282014228776648969411.219.75343426.72197.9357.4734.4293376.2789599371955.9734228.502436814.2527.57639610.10149508902.5161.25771630910.9917853.49219.76132.75986058669.85220.79140364.76330.97213.5317191.673.36145.8501296.9844.070432262.8235597.95167.10831.585148700123.947.19252.4778.578.700241.6731529.0853.6818624.1926.03364.8544983.2817867531.20.1514.69858.332623.0519.3485207.7881336054100236.89.633118.989260.63866.278037.33675821.0958339.817674.39288.6051.328419.68543783.30198.2957.99534144.3914.20031491.37719018.81833.3033OpenBenchmarking.org

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.013K26K39K52K65KSE +/- 170.74, N = 3SE +/- 184.04, N = 3SE +/- 3.28, N = 3SE +/- 246.24, N = 34178441482193260586-fopenmp-fopenmp-fopenmp1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.02004006008001000SE +/- 1.33, N = 3SE +/- 2.99, N = 3SE +/- 0.09, N = 3961.97935.1269.851. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.0246810SE +/- 0.059593, N = 3SE +/- 0.037368, N = 3SE +/- 0.007958, N = 38.8631838.3859720.7914031. (CC) gcc options: -O3 -march=native -fopenmp

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.020406080100SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.24, N = 350.2444.9478.1864.761. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0714212835SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 319.1319.1630.8430.971. (CC) gcc options: -lm -lpthread -O3 -march=native

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0510152025SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 315.0613.5120.9113.53

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.015003000450060007500SE +/- 20.75, N = 3SE +/- 21.05, N = 3SE +/- 21.60, N = 3SE +/- 29.40, N = 37173.77178.85189.17191.6-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080p 10-bitGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.020406080100SE +/- 0.13, N = 3SE +/- 0.24, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 394.4799.7274.4573.36MIN: 56.43 / MAX: 192.99MIN: 60.51 / MAX: 199.48MIN: 45.96 / MAX: 154.32MIN: 45.63 / MAX: 149.131. (CC) gcc options: -O3 -march=native -pthread

MKL-DNN DNNL

Harness: Recurrent Neural Network Training - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Recurrent Neural Network Training - Data Type: f32GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.04080120160200SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.38, N = 3194.08194.77145.85-fopenmp - MIN: 192.29-fopenmp - MIN: 192.97-fopenmp=libomp - MIN: 143.951. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingGCC 10.0.0 20191208GCC 9.2.11428425670SE +/- 0.24, N = 3SE +/- 0.15, N = 362.9449.001. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.030060090012001500SE +/- 10.04, N = 3SE +/- 2.69, N = 3SE +/- 12.43, N = 3SE +/- 0.16, N = 31061.481069.481282.031296.981. (CXX) g++ options: -O3 -march=native -std=c++11

Tungsten Renderer

Scene: Non-Exponential

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.00.91581.83162.74743.66324.579SE +/- 0.01371, N = 3SE +/- 0.01280, N = 3SE +/- 0.01369, N = 3SE +/- 0.00501, N = 33.369273.374194.051764.07043-fstrict-aliasing-fstrict-aliasing1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

QMCPACK

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.8GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.050010001500200025001878.11893.62262.8-finline-limit=1000 -funroll-all-loops-finline-limit=1000 -funroll-all-loops1. (CXX) g++ options: -O3 -march=native -fopenmp -fomit-frame-pointer -fstrict-aliasing -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.05K10K15K20K25KSE +/- 239.62, N = 3SE +/- 309.06, N = 4SE +/- 404.65, N = 32028724184235591. (CC) gcc options: -pthread -O3 -march=native -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0246810SE +/- 0.007, N = 3SE +/- 0.012, N = 3SE +/- 0.008, N = 3SE +/- 0.001, N = 66.7016.7297.7887.951-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-lncurses1. (CC) gcc options: -O3 -pipe -march=native -lm

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.020406080100SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 376.2677.5066.7867.111. (CXX) g++ options: -O3 -march=native -std=c++11

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0714212835SE +/- 0.04, N = 3SE +/- 0.30, N = 3SE +/- 0.45, N = 15SE +/- 0.01, N = 327.3928.6231.7531.591. (CC) gcc options: -lm -O3 -march=native

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0300K600K900K1200K1500KSE +/- 1231.70, N = 5SE +/- 9469.71, N = 5SE +/- 1690.40, N = 5SE +/- 2711.28, N = 513459911373089152824814870011. (CC) gcc options: -O3 -march=native

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 4KGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 323.8923.7421.1723.941. (CXX) g++ options: -O3 -march=native -lpthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0246810SE +/- 0.006, N = 5SE +/- 0.007, N = 5SE +/- 0.003, N = 5SE +/- 0.005, N = 58.0428.0737.1457.192-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

libgav1

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080pGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.01224364860SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 351.7051.8446.6952.471. (CXX) g++ options: -O3 -march=native -lpthread

libgav1

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Summer Nature 1080pGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.020406080100SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.22, N = 377.9277.1170.2878.571. (CXX) g++ options: -O3 -march=native -lpthread

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.03691215SE +/- 0.03984, N = 3SE +/- 0.02093, N = 3SE +/- 0.01435, N = 3SE +/- 0.01343, N = 39.449609.709528.726258.70024-fstrict-aliasing-fstrict-aliasing1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

MKL-DNN DNNL

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: IP Batch 1D - Data Type: f32GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.00.41880.83761.25641.67522.094SE +/- 0.00636, N = 3SE +/- 0.00269, N = 3SE +/- 0.00428, N = 31.861311.737101.67315-fopenmp - MIN: 1.81-fopenmp - MIN: 1.67-fopenmp=libomp - MIN: 1.621. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0816243240SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.30, N = 3SE +/- 0.01, N = 332.3132.2229.8029.091. (CXX) g++ options: -O3 -march=native -std=c++11

Tungsten Renderer

Scene: Volumetric Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.00.91781.83562.75343.67124.589SE +/- 0.04581, N = 3SE +/- 0.03613, N = 3SE +/- 0.00780, N = 3SE +/- 0.00668, N = 33.966974.079023.724883.68186-fstrict-aliasing-fstrict-aliasing1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 2019-10-05Video Input: Chimera 1080p 10-bitGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0612182430SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 321.9322.0123.1124.191. (CXX) g++ options: -O3 -march=native -lpthread

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0714212835SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 328.3028.5225.8626.031. (CXX) g++ options: -O3 -march=native -std=c++11

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - DegriddingGCC 10.0.0 20191208GCC 9.2.19001800270036004500SE +/- 46.05, N = 3SE +/- 0.00, N = 33716.334096.251. (CXX) g++ options: -lpthread

Facebook RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random FillGCC 10.0.0 20191208GCC 9.2.1200K400K600K800K1000KSE +/- 13468.75, N = 3SE +/- 4040.85, N = 393089710233351. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.01632486480SE +/- 0.32, N = 3SE +/- 0.27, N = 3SE +/- 0.32, N = 3SE +/- 0.36, N = 369.7970.3370.7164.85-mabm-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.011002200330044005500SE +/- 51.56, N = 7SE +/- 61.37, N = 3SE +/- 77.03, N = 3SE +/- 84.96, N = 34898.174583.354866.904983.281. (CC) gcc options: -O3 -march=native -mavx2

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.02K4K6K8K10KSE +/- 76.26, N = 3SE +/- 26.44, N = 3SE +/- 24.98, N = 38135.28111.67531.21. (CC) gcc options: -pthread -O3 -march=native -lm

AOM AV1

AV1 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2019-09-16AV1 Video EncodingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.00.03380.06760.10140.13520.169SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.150.140.150.151. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.048121620SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 313.8414.8214.7614.701. (CXX) g++ options: -O3 -march=native -std=c++11

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.01326395265SE +/- 0.10, N = 3SE +/- 0.46, N = 3SE +/- 0.13, N = 3SE +/- 0.44, N = 357.1256.1759.9158.331. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Chimera 1080pGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0130260390520650SE +/- 4.43, N = 3SE +/- 2.39, N = 3SE +/- 1.62, N = 3SE +/- 2.49, N = 3612.94587.24625.11623.05MIN: 452.17 / MAX: 769.81MIN: 439.81 / MAX: 722.49MIN: 468.04 / MAX: 781MIN: 475.37 / MAX: 782.951. (CC) gcc options: -O3 -march=native -pthread

Tungsten Renderer

Scene: Water Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0510152025SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 318.2219.2019.1619.35-fstrict-aliasing-fstrict-aliasing1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.1vpxenc VP9 1080p Video EncodeGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.050100150200250SE +/- 1.29, N = 3SE +/- 1.49, N = 3SE +/- 1.73, N = 15SE +/- 1.20, N = 3195.63197.37207.68207.781. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=c++11

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.020M40M60M80M100MSE +/- 1221575.84, N = 3SE +/- 74252.03, N = 3SE +/- 194228.81, N = 38090908276737462813360541. (CXX) g++ options: -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Facebook RocksDB

Test: Sequential Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Sequential FillGCC 10.0.0 20191208GCC 9.2.1200K400K600K800K1000KSE +/- 4276.33, N = 3SE +/- 10427.62, N = 3102473310836491. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

LeelaChessZero

Backend: Random

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.22.0Backend: RandomGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.020K40K60K80K100KSE +/- 275.83, N = 3SE +/- 354.64, N = 3SE +/- 553.66, N = 3105805.0105996.0100236.81. (CXX) g++ options: -O3 -march=native -lpthread

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.03691215SE +/- 0.029, N = 3SE +/- 0.052, N = 3SE +/- 0.133, N = 39.1859.1519.633-O2 -lOpenCL-O2 -lOpenCL-O3 -fopenmp1. (CXX) g++ options:

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: f32GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.0306090120150SE +/- 1.45, N = 3SE +/- 1.55, N = 3SE +/- 0.15, N = 3124.28125.01118.99-fopenmp - MIN: 122.25-fopenmp - MIN: 122.48-fopenmp=libomp - MIN: 118.281. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - GriddingGCC 10.0.0 20191208GCC 9.2.112002400360048006000SE +/- 34.80, N = 3SE +/- 37.73, N = 35255.515509.301. (CXX) g++ options: -lpthread

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.060120180240300SE +/- 0.29, N = 3SE +/- 0.45, N = 3SE +/- 2.46, N = 3SE +/- 0.41, N = 3262.97269.48258.40260.641. (CXX) g++ options: -O3 -march=native -std=c++11

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.1.2H.265 1080p Video EncodingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.01530456075SE +/- 0.29, N = 3SE +/- 0.10, N = 3SE +/- 0.39, N = 3SE +/- 0.07, N = 365.7464.6967.1166.271. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.02K4K6K8K10KSE +/- 4.74, N = 3SE +/- 9.20, N = 3SE +/- 3.20, N = 37767.987777.508037.331. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0140K280K420K560K700KSE +/- 876.37, N = 3SE +/- 1703.60, N = 3SE +/- 1018.99, N = 3SE +/- 2145.94, N = 3673644.96654500.68668836.61675821.101. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.03691215SE +/- 0.089, N = 3SE +/- 0.125, N = 3SE +/- 0.033, N = 3SE +/- 0.100, N = 310.05210.04010.1079.8171. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 1080pGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0150300450600750SE +/- 0.42, N = 3SE +/- 1.97, N = 3SE +/- 2.78, N = 3SE +/- 1.73, N = 3676.37667.33685.40674.39MIN: 396.97 / MAX: 738.73MIN: 387.89 / MAX: 728.98MIN: 400.43 / MAX: 750.08MIN: 376.28 / MAX: 738.321. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.5.0Video Input: Summer Nature 4KGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.060120180240300SE +/- 0.64, N = 3SE +/- 1.13, N = 3SE +/- 0.34, N = 3SE +/- 0.37, N = 3289.29285.28291.76288.60MIN: 172.47 / MAX: 306.73MIN: 168.88 / MAX: 304.29MIN: 174.15 / MAX: 309.6MIN: 170.73 / MAX: 306.521. (CC) gcc options: -O3 -march=native -pthread

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.01224364860SE +/- 0.24, N = 3SE +/- 0.18, N = 3SE +/- 0.11, N = 352.4752.2851.33-fopenmp - MIN: 51.43-fopenmp - MIN: 51.18-fopenmp=libomp - MIN: 50.481. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random ReadGCC 10.0.0 20191208GCC 9.2.130M60M90M120M150MSE +/- 467553.12, N = 3SE +/- 234281.58, N = 31453689671422877661. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2GCC 10.0.0 20191208GCC 9.2.110M20M30M40M50MSE +/- 638551.10, N = 4SE +/- 70447.90, N = 348002215.148969411.21. (CC) gcc options: -O3 -march=native

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.0510152025SE +/- 0.16, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.0419.7519.6919.691. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.09K18K27K36K45KSE +/- 490.20, N = 3SE +/- 457.53, N = 3SE +/- 238.62, N = 3SE +/- 326.86, N = 343035.4643426.7243580.6043783.301. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.04080120160200SE +/- 2.04, N = 8SE +/- 0.53, N = 3SE +/- 0.76, N = 3SE +/- 1.58, N = 12199.22197.93196.10198.29-mstack-alignment=64-mstack-alignment=641. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.01326395265SE +/- 0.27, N = 3SE +/- 0.20, N = 3SE +/- 0.26, N = 3SE +/- 0.25, N = 358.2857.4757.8458.001. (CXX) g++ options: -O3 -march=native -std=c++11

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeGCC 10.0.0 20191208GCC 9.2.10.99651.9932.98953.9864.9825SE +/- 0.004, N = 3SE +/- 0.007, N = 34.3784.4291. (CC) gcc options: -static -fopenmp -O3 -march=native

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - DegriddingGCC 10.0.0 20191208GCC 9.2.17001400210028003500SE +/- 13.81, N = 3SE +/- 2.38, N = 33339.333376.271. (CXX) g++ options: -lpthread

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeGCC 10.0.0 20191208GCC 9.2.12M4M6M8M10MSE +/- 15301.93, N = 3SE +/- 11113.29, N = 3902783589599371. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - GriddingGCC 10.0.0 20191208GCC 9.2.1400800120016002000SE +/- 10.21, N = 3SE +/- 1.51, N = 31943.581955.971. (CXX) g++ options: -lpthread

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.07K14K21K28K35KSE +/- 54.98, N = 3SE +/- 27.04, N = 3SE +/- 71.82, N = 3SE +/- 32.67, N = 334157.7634228.5034327.9034144.391. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Facebook RocksDB

Test: Random Fill Sync

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random Fill SyncGCC 10.0.0 20191208GCC 9.2.15K10K15K20K25KSE +/- 40.43, N = 3SE +/- 29.29, N = 324277243681. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.30.1Threads / Copies: 1GCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.048121620SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 314.2414.2514.2514.201. (CC) gcc options: -O3 -march=native -lz -lm -ldl -lpthread

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilGCC 10.0.0 20191208GCC 9.2.1246810SE +/- 0.036635, N = 3SE +/- 0.021046, N = 37.5517067.5763961. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDGCC 10.0.0 20191208GCC 9.2.13691215SE +/- 0.01, N = 3SE +/- 0.02, N = 310.0910.101. (CXX) g++ options: -O2 -lOpenCL

Facebook RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Read While WritingGCC 10.0.0 20191208GCC 9.2.11.1M2.2M3.3M4.4M5.5MSE +/- 38218.40, N = 15SE +/- 73789.66, N = 4494591649508901. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2019.4Water BenchmarkGCC 10.0.0 20191208GCC 9.2.10.56611.13221.69832.26442.8305SE +/- 0.005, N = 3SE +/- 0.001, N = 32.5152.5161. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPGCC 10.0.0 20191208GCC 9.2.10.2830.5660.8491.1321.415SE +/- 0.007993, N = 3SE +/- 0.002849, N = 31.2574101.2577161. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 10.0 20191222LLVM Clang 9.0.07K14K21K28K35KSE +/- 2238.03, N = 12SE +/- 77.46, N = 3SE +/- 1738.68, N = 15SE +/- 127.72, N = 315513.4430910.9914868.6831491.381. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 10.0.0 20191208GCC 9.2.11.07442.14883.22324.29765.372SE +/- 0.107, N = 15SE +/- 0.006, N = 34.7753.4921. (CXX) g++ options: -fopenmp -O3 -march=native

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.0510152025SE +/- 0.02, N = 3SE +/- 0.54, N = 15SE +/- 0.06, N = 319.0719.7618.82-O2 -lOpenCL-O2 -lOpenCL-O3 -fopenmp1. (CXX) g++ options:

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.22.0Backend: BLASGCC 10.0.0 20191208GCC 9.2.1LLVM Clang 9.0.0816243240SE +/- 0.41, N = 3SE +/- 0.56, N = 12SE +/- 0.60, N = 1529.3332.7633.301. (CXX) g++ options: -O3 -march=native -lpthread


Phoronix Test Suite v10.8.5