EPYC 7763 LLVM Clang Compiler Tests

AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104129-IB-EPYC7763L05&sro&gru.

EPYC 7763 LLVM Clang Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 12.0Clang 11.0Clang 12.0 LTOAMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)Supermicro H12SSL-i v1.01 (2.0 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEED2 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 20.045.12.0-051200rc6daily20210408-generic (x86_64) 20210407GNOME Shell 3.36.4X Server 1.20.8Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73ext41024x768Clang 11.0.0-2~ubuntu20.04.1Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Clang 12.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 11.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12.0 LTO: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa001119Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7763 LLVM Clang Compiler Testsdav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitaom-av1: Speed 0 Two-Pass - Bosphorus 4Kaom-av1: Speed 4 Two-Pass - Bosphorus 4Kaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 0 Two-Pass - Bosphorus 1080paom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 6 Realtime - Bosphorus 1080paom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 8 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 1080psvt-av1: Enc Mode 0 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080psvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px265: Bosphorus 4Kx265: Bosphorus 1080psimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTonnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacecoremark: CoreMark Size 666 - Iterations Per Secondsecuremark: SecureMark-TLScompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedquantlib: fftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptjpegxl: PNG - 5jpegxl: PNG - 7jpegxl: PNG - 8jpegxl: JPEG - 5jpegxl: JPEG - 7jpegxl: JPEG - 8libraw: Post-Processing Benchmarketcpak: DXT1etcpak: ETC1etcpak: ETC2tscp: AI Chess Performanceliquid-dsp: 1 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 128 - 256 - 57pgbench: 100 - 1 - Read Onlypgbench: 100 - 1 - Read Writepgbench: 100 - 100 - Read Onlypgbench: 100 - 250 - Read Onlypgbench: 100 - 100 - Read Writepgbench: 100 - 250 - Read Writewebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressiontoybrot: TBBtoybrot: OpenMPtoybrot: C++ Taskstoybrot: C++ Threadsonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUfinancebench: Repo OpenMPfinancebench: Bonds OpenMPpgbench: 100 - 1 - Read Only - Average Latencypgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 250 - Read Write - Average Latencymrbayes: Primate Phylogeny Analysisavifenc: 0avifenc: 2avifenc: 6avifenc: 10avifenc: 6, Losslessavifenc: 10, Losslessc-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-opus: WAV To Opus Encodegcrypt: ngspice: C2670ngspice: C7552tachyon: Total Timewebp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionastcenc: Mediumastcenc: Thoroughastcenc: ExhaustiveClang 12.0Clang 11.0Clang 12.0 LTO1198.22541.561244.11308.320.214.8717.228.9933.3938.110.537.1026.8522.1388.78103.170.18311.474118.06741.09345.30643.58487.43488.23372.4930.3274.002.750.844.604.6247135743460487881969.162648.665.751.973.0333498112990444561993712614107621364576051785466.28396926520452.0713911.548.5013926.52653.81333310805104679862.09088.37789.96744.1156495035051254454283623931935227973190.62675.13363.854280.228848.401785.5082.64484.2294659.3384682.455315.409321.190380.054351.284132.820133.048850.496843.40474.2712.150.8266.6666.3828.1341.782718.525284.642202.0851570966556630001564833333307063333336437666672431032811069022107120962319566841.3312.19919.0166.30938.44967807507743772201.077013.285071.075070.7101241.221321.444252.367972.036060.4919400.7797761302.70593.9721307.49590.1820.3136891305.10597.4811.1725833246.83723951596.8671870.0410.3050.0940.2341.6074.43189.11647.88425.1759.5103.36125.2205.74615.8709.2967.8548.2567.567236.924118.87095.95616.04682.739109.525207.0086.690374.0354.00586.764718.99361190.41543.431251.25184.190.214.9517.139.1433.1437.280.537.2026.6122.0086.09100.550.18111.821117.39241.01346.89652.74481.05482.02373.9929.9473.362.680.814.414.414954124621877104393351.267783.679.388.384.0346471108979745231915665613106820344636161790837.01000026011952.3513840.349.0113927.92640.2133241056410004.29438.68809.67878.56823.8145905074050084466763618131741229133319.34674.86399.164590.379146.881785.4279.14980.2214901.1274895.558299.214302.405319.234351.075128.586127.740848.236840.63778.4112.010.865.5865.4327.2438.711872.759205.065168.8191638265563070001578400000305136666735965333332494333121069367106550661616544881.3362.24018.5736.24337.72762477029683663951.080113.527871.075770.5947290.8411691.457572.318591.605400.4892780.7791011276.04563.2001277.62562.9700.3155221271.91563.2471.1514033178.49869851900.4348960.0400.3020.0940.2351.6264.60388.62047.89425.4729.5363.42926.0345.87915.5999.4087.9798.2507.392240.205103.82690.52716.40992.743109.636203.6347.366392.8493.98376.767419.025550.9313715.048.4713698.72657.82719.985284.763202.10170857367714393.633OpenBenchmarking.org

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pClang 11.0Clang 12.030060090012001500SE +/- 6.69, N = 3SE +/- 2.95, N = 31190.411198.22-lm - MIN: 685.16 / MAX: 1496.36MIN: 700.24 / MAX: 1494.161. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KClang 11.0Clang 12.0120240360480600SE +/- 1.43, N = 3SE +/- 1.79, N = 3543.43541.56-lm - MIN: 256.75 / MAX: 593.99MIN: 252.01 / MAX: 587.531. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pClang 11.0Clang 12.030060090012001500SE +/- 2.13, N = 3SE +/- 7.87, N = 31251.251244.11-lm - MIN: 556.46 / MAX: 1394.06MIN: 549.81 / MAX: 1390.031. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitClang 11.0Clang 12.070140210280350SE +/- 0.48, N = 3SE +/- 0.93, N = 3184.19308.32-lm - MIN: 114.52 / MAX: 310.5MIN: 220.53 / MAX: 490.511. (CC) gcc options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.00.04730.09460.14190.18920.2365SE +/- 0.00, N = 3SE +/- 0.00, N = 30.210.211. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.01.11382.22763.34144.45525.569SE +/- 0.07, N = 3SE +/- 0.04, N = 34.954.871. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.048121620SE +/- 0.11, N = 3SE +/- 0.11, N = 317.1317.221. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.03691215SE +/- 0.03, N = 3SE +/- 0.10, N = 39.148.991. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.0816243240SE +/- 0.22, N = 3SE +/- 0.48, N = 333.1433.391. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.0918273645SE +/- 0.31, N = 3SE +/- 0.43, N = 337.2838.111. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.00.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 30.530.531. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0246810SE +/- 0.01, N = 3SE +/- 0.04, N = 37.207.101. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.0612182430SE +/- 0.13, N = 3SE +/- 0.27, N = 326.6126.851. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0510152025SE +/- 0.15, N = 3SE +/- 0.05, N = 322.0022.131. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.020406080100SE +/- 0.51, N = 3SE +/- 1.07, N = 386.0988.781. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.020406080100SE +/- 0.53, N = 3SE +/- 0.31, N = 3100.55103.171. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pClang 11.0Clang 12.00.04120.08240.12360.16480.206SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1810.1831. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pClang 11.0Clang 12.03691215SE +/- 0.16, N = 4SE +/- 0.17, N = 311.8211.471. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pClang 11.0Clang 12.0306090120150SE +/- 0.46, N = 3SE +/- 0.10, N = 3117.39118.071. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pClang 11.0Clang 12.0918273645SE +/- 0.09, N = 3SE +/- 0.17, N = 341.0141.091. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 11.0Clang 12.080160240320400SE +/- 3.43, N = 3SE +/- 1.56, N = 3346.89345.301. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 11.0Clang 12.0140280420560700SE +/- 5.55, N = 3SE +/- 3.01, N = 3652.74643.581. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pClang 11.0Clang 12.0110220330440550SE +/- 0.23, N = 3SE +/- 1.37, N = 3481.05487.431. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 11.0Clang 12.0110220330440550SE +/- 1.76, N = 3SE +/- 0.73, N = 3482.02488.231. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 11.0Clang 12.080160240320400SE +/- 1.91, N = 3SE +/- 1.11, N = 3373.99372.491. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KClang 11.0Clang 12.0714212835SE +/- 0.25, N = 3SE +/- 0.23, N = 329.9430.321. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pClang 11.0Clang 12.01632486480SE +/- 0.49, N = 3SE +/- 0.49, N = 373.3674.001. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaClang 11.0Clang 12.00.61881.23761.85642.47523.094SE +/- 0.00, N = 3SE +/- 0.01, N = 32.682.751. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomClang 11.0Clang 12.00.1890.3780.5670.7560.945SE +/- 0.00, N = 3SE +/- 0.00, N = 30.810.841. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsClang 11.0Clang 12.01.0352.073.1054.145.175SE +/- 0.01, N = 3SE +/- 0.01, N = 34.414.601. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDClang 11.0Clang 12.01.03952.0793.11854.1585.1975SE +/- 0.00, N = 3SE +/- 0.00, N = 34.414.621. (CXX) g++ options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYClang 11.0Clang 12.0110220330440550SE +/- 36.50, N = 15SE +/- 15.30, N = 124954711. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYClang 11.0Clang 12.090180270360450SE +/- 34.43, N = 15SE +/- 15.69, N = 124123571. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTClang 11.0Clang 12.0100200300400500SE +/- 38.96, N = 15SE +/- 35.24, N = 124624341. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYClang 11.0Clang 12.0400800120016002000SE +/- 8.32, N = 15SE +/- 15.32, N = 1118776041. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYClang 11.0Clang 12.02004006008001000SE +/- 1.59, N = 15SE +/- 20.06, N = 1210438781. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTClang 11.0Clang 12.02004006008001000SE +/- 1.49, N = 15SE +/- 17.06, N = 129338191. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NClang 11.0Clang 12.01530456075SE +/- 3.65, N = 15SE +/- 2.22, N = 1251.269.11. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TClang 11.0Clang 12.0150300450600750SE +/- 1.41, N = 14SE +/- 4.04, N = 126776261. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNClang 11.0Clang 12.020406080100SE +/- 0.06, N = 15SE +/- 0.05, N = 1283.648.61. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTClang 11.0Clang 12.020406080100SE +/- 0.03, N = 15SE +/- 0.56, N = 1279.365.71. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNClang 11.0Clang 12.020406080100SE +/- 0.02, N = 15SE +/- 0.09, N = 1288.351.91. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTClang 11.0Clang 12.020406080100SE +/- 0.02, N = 14SE +/- 0.07, N = 1284.073.01. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -rdynamic -lOpenCL

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUClang 11.0Clang 12.080160240320400SE +/- 1.42, N = 3SE +/- 4.15, N = 43463331. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUClang 11.0Clang 12.0110220330440550SE +/- 5.55, N = 3SE +/- 10.30, N = 124714981. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUClang 11.0Clang 12.0306090120150SE +/- 0.29, N = 3SE +/- 0.50, N = 31081121. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUClang 11.0Clang 12.02K4K6K8K10KSE +/- 102.76, N = 8SE +/- 88.25, N = 12979799041. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUClang 11.0Clang 12.010002000300040005000SE +/- 169.87, N = 9SE +/- 126.29, N = 12452344561. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -ffunction-sections -fdata-sections -ldl -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlClang 11.0Clang 12.0400800120016002000SE +/- 12.41, N = 3SE +/- 6.57, N = 3191519931. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 11.0Clang 12.0150300450600750SE +/- 1.33, N = 3SE +/- 2.60, N = 36657121. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClang 11.0Clang 12.01302603905206506136141. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClang 11.0Clang 12.02004006008001000SE +/- 1.86, N = 3106810761. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 11.0Clang 12.05001000150020002500SE +/- 27.29, N = 3SE +/- 41.63, N = 12203421361. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianClang 11.0Clang 12.0100200300400500SE +/- 1.00, N = 34634571. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceClang 11.0Clang 12.0130260390520650SE +/- 0.88, N = 3SE +/- 0.67, N = 36166051. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClang 11.0Clang 12.0400K800K1200K1600K2000KSE +/- 971.31, N = 3SE +/- 984.68, N = 31790837.011785466.281. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 11.0Clang 12.060K120K180K240K300KSE +/- 407.86, N = 3SE +/- 1778.47, N = 32601192652041. (CC) gcc options: -pedantic -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedClang 11.0Clang 12.0Clang 12.0 LTO1224364860SE +/- 0.33, N = 3SE +/- 0.80, N = 3SE +/- 0.02, N = 352.3552.0750.931. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedClang 11.0Clang 12.0Clang 12.0 LTO3K6K9K12K15KSE +/- 15.91, N = 3SE +/- 71.01, N = 3SE +/- 60.82, N = 313840.313911.513715.01. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedClang 11.0Clang 12.0Clang 12.0 LTO1122334455SE +/- 0.46, N = 3SE +/- 0.42, N = 3SE +/- 0.74, N = 349.0148.5048.471. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedClang 11.0Clang 12.0Clang 12.0 LTO3K6K9K12K15KSE +/- 23.21, N = 3SE +/- 65.90, N = 3SE +/- 46.50, N = 313927.913926.513698.71. (CC) gcc options: -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Clang 11.0Clang 12.0Clang 12.0 LTO6001200180024003000SE +/- 1.01, N = 3SE +/- 1.92, N = 3SE +/- 1.62, N = 32640.22653.82657.81. (CXX) g++ options: -O3 -march=native -rdynamic

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Clang 11.0Clang 12.03K6K9K12K15KSE +/- 20.33, N = 3SE +/- 24.25, N = 313324133331. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024Clang 11.0Clang 12.02K4K6K8K10KSE +/- 35.53, N = 3SE +/- 27.10, N = 310564108051. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048Clang 11.0Clang 12.02K4K6K8K10KSE +/- 28.76, N = 3SE +/- 7.75, N = 310004.210467.01. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Clang 11.0Clang 12.02K4K6K8K10KSE +/- 15.16, N = 3SE +/- 101.36, N = 39438.69862.01. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024Clang 11.0Clang 12.02K4K6K8K10KSE +/- 45.95, N = 3SE +/- 48.25, N = 38809.69088.31. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048Clang 11.0Clang 12.02K4K6K8K10KSE +/- 27.38, N = 3SE +/- 65.76, N = 37878.57789.91. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Clang 11.0Clang 12.015003000450060007500SE +/- 60.67, N = 3SE +/- 35.20, N = 36823.86744.11. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Clang 11.0Clang 12.03K6K9K12K15KSE +/- 129.55, N = 3SE +/- 48.79, N = 314590156491. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024Clang 11.0Clang 12.011K22K33K44K55KSE +/- 585.78, N = 3SE +/- 952.64, N = 1250740503501. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048Clang 11.0Clang 12.011K22K33K44K55KSE +/- 582.34, N = 3SE +/- 439.50, N = 350084512541. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Clang 11.0Clang 12.010K20K30K40K50KSE +/- 413.24, N = 15SE +/- 671.66, N = 1546676454281. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024Clang 11.0Clang 12.08K16K24K32K40KSE +/- 530.09, N = 4SE +/- 165.99, N = 336181362391. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048Clang 11.0Clang 12.07K14K21K28K35KSE +/- 146.10, N = 3SE +/- 77.17, N = 331741319351. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Clang 11.0Clang 12.05K10K15K20K25KSE +/- 220.77, N = 3SE +/- 348.10, N = 922913227971. (CC) gcc options: -pthread -O3 -march=native -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeClang 11.0Clang 12.07001400210028003500SE +/- 15.12, N = 3SE +/- 1.11, N = 33319.343190.621. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloClang 11.0Clang 12.0150300450600750SE +/- 0.40, N = 3SE +/- 0.40, N = 3674.86675.131. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformClang 11.0Clang 12.090180270360450SE +/- 0.67, N = 3SE +/- 0.46, N = 3399.16363.851. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyClang 11.0Clang 12.010002000300040005000SE +/- 3.87, N = 3SE +/- 10.41, N = 34590.374280.221. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationClang 11.0Clang 12.02K4K6K8K10KSE +/- 77.81, N = 3SE +/- 7.16, N = 39146.888848.401. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClang 11.0Clang 12.0400800120016002000SE +/- 0.12, N = 3SE +/- 0.08, N = 31785.421785.501. (CC) gcc options: -O3 -march=native -lm

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIClang 11.0Clang 12.020406080100SE +/- 0.06, N = 3SE +/- 0.01, N = 379.1582.641. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClang 11.0Clang 12.020406080100SE +/- 0.04, N = 3SE +/- 0.06, N = 380.2284.231. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256Clang 11.0Clang 12.011002200330044005500SE +/- 2.16, N = 3SE +/- 2.14, N = 34901.134659.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptClang 11.0Clang 12.010002000300040005000SE +/- 1.35, N = 3SE +/- 4.78, N = 34895.564682.461. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishClang 11.0Clang 12.070140210280350SE +/- 0.09, N = 3SE +/- 0.13, N = 3299.21315.411. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptClang 11.0Clang 12.070140210280350SE +/- 0.15, N = 3SE +/- 0.16, N = 3302.41321.191. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishClang 11.0Clang 12.080160240320400SE +/- 1.73, N = 3SE +/- 0.05, N = 3319.23380.051. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptClang 11.0Clang 12.080160240320400SE +/- 2.03, N = 3SE +/- 0.04, N = 3351.08351.281. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256Clang 11.0Clang 12.0306090120150SE +/- 0.02, N = 3SE +/- 0.02, N = 3128.59132.821. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClang 11.0Clang 12.0306090120150SE +/- 0.01, N = 3SE +/- 0.01, N = 3127.74133.051. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305Clang 11.0Clang 12.02004006008001000SE +/- 0.62, N = 3SE +/- 4.85, N = 3848.24850.501. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClang 11.0Clang 12.02004006008001000SE +/- 0.16, N = 3SE +/- 4.64, N = 3840.64843.401. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5Clang 11.0Clang 12.020406080100SE +/- 0.24, N = 3SE +/- 0.17, N = 378.4174.271. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7Clang 11.0Clang 12.03691215SE +/- 0.02, N = 3SE +/- 0.05, N = 312.0112.151. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 8Clang 11.0Clang 12.00.18450.3690.55350.7380.9225SE +/- 0.00, N = 3SE +/- 0.00, N = 30.800.821. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 5Clang 11.0Clang 12.01530456075SE +/- 0.20, N = 3SE +/- 0.14, N = 365.5866.661. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 7Clang 11.0Clang 12.01530456075SE +/- 0.08, N = 3SE +/- 0.16, N = 365.4366.381. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8Clang 11.0Clang 12.0714212835SE +/- 0.01, N = 3SE +/- 0.03, N = 327.2428.131. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkClang 11.0Clang 12.01020304050SE +/- 0.33, N = 3SE +/- 0.12, N = 338.7141.781. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1Clang 11.0Clang 12.0Clang 12.0 LTO6001200180024003000SE +/- 1.69, N = 3SE +/- 2.64, N = 3SE +/- 6.09, N = 31872.762718.532719.991. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Clang 11.0Clang 12.0Clang 12.0 LTO60120180240300SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3205.07284.64284.761. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Clang 11.0Clang 12.0Clang 12.0 LTO4080120160200SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3168.82202.09202.101. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 11.0Clang 12.0400K800K1200K1600K2000KSE +/- 2852.59, N = 5SE +/- 1798.40, N = 5163826515709661. (CC) gcc options: -O3 -march=native

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57Clang 11.0Clang 12.012M24M36M48M60MSE +/- 40360.87, N = 3SE +/- 790005.27, N = 356307000556630001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57Clang 11.0Clang 12.0300M600M900M1200M1500MSE +/- 1331665.62, N = 3SE +/- 2255610.29, N = 3157840000015648333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57Clang 11.0Clang 12.0700M1400M2100M2800M3500MSE +/- 2452436.43, N = 3SE +/- 6045475.81, N = 3305136666730706333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57Clang 11.0Clang 12.0800M1600M2400M3200M4000MSE +/- 1559202.08, N = 3SE +/- 883804.91, N = 3359653333336437666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read OnlyClang 11.0Clang 12.05K10K15K20K25KSE +/- 289.16, N = 3SE +/- 303.43, N = 324943243101. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read WriteClang 11.0Clang 12.07001400210028003500SE +/- 14.62, N = 3SE +/- 3.48, N = 3331232811. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyClang 11.0Clang 12.0200K400K600K800K1000KSE +/- 1740.88, N = 3SE +/- 720.87, N = 3106936710690221. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyClang 11.0Clang 12.0200K400K600K800K1000KSE +/- 13844.42, N = 3SE +/- 6289.60, N = 3106550610712091. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteClang 11.0Clang 12.013K26K39K52K65KSE +/- 400.92, N = 3SE +/- 162.92, N = 361616623191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 11.0Clang 12.012K24K36K48K60KSE +/- 883.12, N = 3SE +/- 702.52, N = 1554488566841. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultClang 11.0Clang 12.00.30060.60120.90181.20241.503SE +/- 0.001, N = 3SE +/- 0.001, N = 31.3361.3311. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Clang 11.0Clang 12.00.5041.0081.5122.0162.52SE +/- 0.000, N = 3SE +/- 0.001, N = 32.2402.1991. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessClang 11.0Clang 12.0510152025SE +/- 0.13, N = 3SE +/- 0.02, N = 318.5719.021. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionClang 11.0Clang 12.0246810SE +/- 0.018, N = 3SE +/- 0.004, N = 36.2436.3091. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionClang 11.0Clang 12.0918273645SE +/- 0.08, N = 3SE +/- 0.07, N = 337.7338.451. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBClang 11.0Clang 12.0Clang 12.0 LTO15003000450060007500SE +/- 67.11, N = 7SE +/- 87.21, N = 3SE +/- 86.43, N = 3624767807085-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPClang 11.0Clang 12.016003200480064008000SE +/- 20.42, N = 3SE +/- 14.89, N = 3702975071. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ TasksClang 11.0Clang 12.0Clang 12.0 LTO16003200480064008000SE +/- 7.31, N = 3SE +/- 33.67, N = 3SE +/- 17.21, N = 3683674377367-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ ThreadsClang 11.0Clang 12.0Clang 12.0 LTO15003000450060007500SE +/- 25.04, N = 3SE +/- 30.90, N = 3SE +/- 15.06, N = 3639572207143-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto1. (CXX) g++ options: -O3 -march=native -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.2430.4860.7290.9721.215SE +/- 0.00127, N = 3SE +/- 0.00199, N = 31.080111.07701MIN: 1.03MIN: 1.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.79381.58762.38143.17523.969SE +/- 0.04735, N = 3SE +/- 0.01639, N = 33.527873.28507MIN: 3.29MIN: 3.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.2420.4840.7260.9681.21SE +/- 0.00395, N = 3SE +/- 0.00286, N = 31.075771.07507MIN: 0.86MIN: 0.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.15980.31960.47940.63920.799SE +/- 0.008914, N = 3SE +/- 0.011383, N = 30.5947290.710124MIN: 0.53MIN: 0.641. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.27480.54960.82441.09921.374SE +/- 0.000480, N = 3SE +/- 0.018279, N = 40.8411691.221320MIN: 0.82MIN: 1.131. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.3280.6560.9841.3121.64SE +/- 0.00568, N = 3SE +/- 0.00123, N = 31.457571.44425MIN: 1.35MIN: 1.341. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.53281.06561.59842.13122.664SE +/- 0.02389, N = 3SE +/- 0.02100, N = 32.318592.36797MIN: 1.92MIN: 2.011. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.45810.91621.37431.83242.2905SE +/- 0.00118, N = 3SE +/- 0.01922, N = 121.605402.03606MIN: 1.55MIN: 1.811. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.11070.22140.33210.44280.5535SE +/- 0.001652, N = 3SE +/- 0.002843, N = 30.4892780.491940MIN: 0.46MIN: 0.471. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.17540.35080.52620.70160.877SE +/- 0.001200, N = 3SE +/- 0.004246, N = 30.7791010.779776MIN: 0.73MIN: 0.731. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUClang 11.0Clang 12.030060090012001500SE +/- 9.46, N = 3SE +/- 3.92, N = 31276.041302.70MIN: 1249.65MIN: 1289.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUClang 11.0Clang 12.0130260390520650SE +/- 0.83, N = 3SE +/- 9.50, N = 3563.20593.97MIN: 550.23MIN: 570.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.030060090012001500SE +/- 7.11, N = 3SE +/- 3.61, N = 31277.621307.49MIN: 1252.39MIN: 1293.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.0130260390520650SE +/- 0.25, N = 3SE +/- 1.89, N = 3562.97590.18MIN: 551.49MIN: 575.411. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUClang 11.0Clang 12.00.0710.1420.2130.2840.355SE +/- 0.000247, N = 3SE +/- 0.000321, N = 30.3155220.313689MIN: 0.3MIN: 0.31. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 11.0Clang 12.030060090012001500SE +/- 9.75, N = 3SE +/- 1.78, N = 31271.911305.10MIN: 1252.33MIN: 1294.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 11.0Clang 12.0130260390520650SE +/- 0.10, N = 3SE +/- 3.02, N = 3563.25597.48MIN: 551.31MIN: 580.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUClang 11.0Clang 12.00.26380.52760.79141.05521.319SE +/- 0.00653, N = 3SE +/- 0.00458, N = 31.151401.17258MIN: 1.09MIN: 1.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPClang 11.0Clang 12.07K14K21K28K35KSE +/- 0.81, N = 3SE +/- 64.93, N = 333178.5033246.841. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPClang 11.0Clang 12.011K22K33K44K55KSE +/- 4.51, N = 3SE +/- 10.95, N = 351900.4351596.871. (CXX) g++ options: -O3 -march=native -fopenmp

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average LatencyClang 11.0Clang 12.00.00920.01840.02760.03680.046SE +/- 0.001, N = 3SE +/- 0.001, N = 30.0400.0411. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average LatencyClang 11.0Clang 12.00.06860.13720.20580.27440.343SE +/- 0.002, N = 3SE +/- 0.000, N = 30.3020.3051. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyClang 11.0Clang 12.00.02120.04240.06360.08480.106SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0940.0941. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyClang 11.0Clang 12.00.05290.10580.15870.21160.2645SE +/- 0.003, N = 3SE +/- 0.001, N = 30.2350.2341. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyClang 11.0Clang 12.00.36590.73181.09771.46361.8295SE +/- 0.011, N = 3SE +/- 0.004, N = 31.6261.6071. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 11.0Clang 12.01.03572.07143.10714.14285.1785SE +/- 0.074, N = 3SE +/- 0.054, N = 154.6034.4311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisClang 11.0Clang 12.0Clang 12.0 LTO20406080100SE +/- 0.98, N = 3SE +/- 0.98, N = 3SE +/- 1.09, N = 388.6289.1293.63-flto1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0Clang 11.0Clang 12.01122334455SE +/- 0.07, N = 3SE +/- 0.04, N = 347.8947.881. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2Clang 11.0Clang 12.0612182430SE +/- 0.06, N = 3SE +/- 0.06, N = 325.4725.181. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6Clang 11.0Clang 12.03691215SE +/- 0.022, N = 3SE +/- 0.014, N = 39.5369.5101. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10Clang 11.0Clang 12.00.77151.5432.31453.0863.8575SE +/- 0.010, N = 3SE +/- 0.014, N = 33.4293.3611. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessClang 11.0Clang 12.0612182430SE +/- 0.22, N = 3SE +/- 0.04, N = 326.0325.221. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessClang 11.0Clang 12.01.32282.64563.96845.29126.614SE +/- 0.011, N = 3SE +/- 0.013, N = 35.8795.7461. (CXX) g++ options: -O3 -fPIC -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClang 11.0Clang 12.048121620SE +/- 0.01, N = 3SE +/- 0.02, N = 315.6015.871. (CC) gcc options: -lm -lpthread -O3 -march=native

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeClang 11.0Clang 12.03691215SE +/- 0.032, N = 3SE +/- 0.041, N = 39.4089.2961. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACClang 11.0Clang 12.0246810SE +/- 0.006, N = 5SE +/- 0.007, N = 57.9797.8541. (CXX) g++ options: -O3 -march=native -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3Clang 11.0Clang 12.0246810SE +/- 0.021, N = 3SE +/- 0.003, N = 38.2508.2561. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 11.0Clang 12.0246810SE +/- 0.002, N = 5SE +/- 0.013, N = 57.3927.5671. (CXX) g++ options: -O3 -march=native -logg -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9Clang 11.0Clang 12.050100150200250SE +/- 0.28, N = 3SE +/- 0.44, N = 3240.21236.921. (CC) gcc options: -O3 -march=native -fvisibility=hidden

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670Clang 11.0Clang 12.0306090120150SE +/- 0.06, N = 3SE +/- 0.53, N = 3103.83118.871. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552Clang 11.0Clang 12.020406080100SE +/- 1.37, N = 3SE +/- 1.11, N = 690.5395.961. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeClang 11.0Clang 12.048121620SE +/- 0.02, N = 3SE +/- 0.06, N = 316.4116.051. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultClang 11.0Clang 12.00.61721.23441.85162.46883.086SE +/- 0.031, N = 3SE +/- 0.027, N = 32.7432.7391. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7Clang 11.0Clang 12.020406080100SE +/- 0.10, N = 3SE +/- 0.10, N = 3109.64109.531. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7Clang 11.0Clang 12.050100150200250SE +/- 0.66, N = 3SE +/- 0.07, N = 3203.63207.011. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5Clang 11.0Clang 12.0246810SE +/- 0.022, N = 3SE +/- 0.006, N = 37.3666.6901. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionClang 11.0Clang 12.090180270360450SE +/- 0.17, N = 3SE +/- 0.49, N = 3392.85374.041. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumClang 11.0Clang 12.00.90131.80262.70393.60524.5065SE +/- 0.0013, N = 3SE +/- 0.0116, N = 33.98374.00581. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughClang 11.0Clang 12.0246810SE +/- 0.0026, N = 3SE +/- 0.0028, N = 36.76746.76471. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveClang 11.0Clang 12.0510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 319.0318.991. (CXX) g++ options: -O3 -march=native -flto -pthread


Phoronix Test Suite v10.8.4