LLVM Clang 13 Bencmarking Intel Xeon Ice Lake

LLVM Clang benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2108236-TJ-LLVMCLANG38&rdt&grr.

LLVM Clang 13 Bencmarking Intel Xeon Ice LakeProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 13Clang 12Clang 112 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Device 0998504GB7682GB INTEL SSDPF2KX076TZASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 21.045.14.0-rc1-folio (x86_64) 20210715GNOME Shell 3.38.4X Server 1.20.11Clang 13.0.0-++20210820072921+23ba3732246a-1~exp1~20210820174536.53ext41920x1080Clang 12.0.0-3ubuntu1~21.04.1Clang 11.0.1-2ubuntu4OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0Python Details- Clang 13, Clang 12: Python 3.9.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 13 Bencmarking Intel Xeon Ice Lakemysqlslap: 2048mysqlslap: 4096tnn: CPU - DenseNetfftw: Float + SSE - 2D FFT Size 4096graphics-magick: Resizingsecuremark: SecureMark-TLSpgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Onlyapache: 1000ncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU - mobilenetapache: 500onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19 - Compression Speedgraphics-magick: HWB Color Spaceblosc: blosclzvpxenc: Speed 0 - Bosphorus 4Kpgbench: 100 - 250 - Read Write - Average Latencypgbench: 100 - 250 - Read Writevpxenc: Speed 5 - Bosphorus 4Knginx: 1nginx: 1000nginx: 500compress-zstd: 8 - Compression Speednginx: 100nginx: 200nginx: 20onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUpjsip: INVITEfinancebench: Bonds OpenMPsqlite-speedtest: Timed Time - Size 1,000pjsip: OPTIONS, Statefuljohn-the-ripper: MD5graphics-magick: Noise-Gaussiangraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Rotategraphics-magick: Swirlpjsip: OPTIONS, Statelessfinancebench: Repo OpenMPtnn: CPU - MobileNet v2svt-av1: Preset 4 - Bosphorus 4Kcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 3 - Compression Speedcompress-zstd: 3, Long Mode - Compression Speedaobench: 2048 x 2048 - Total Timebotan: AES-256 - Decryptbotan: AES-256tnn: CPU - SqueezeNet v1.1libraw: Post-Processing Benchmarkaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kbotan: ChaCha20Poly1305 - Decryptbotan: ChaCha20Poly1305quantlib: john-the-ripper: Blowfishbotan: Blowfish - Decryptbotan: Blowfishbotan: Twofish - Decryptbotan: Twofishaircrack-ng: botan: CAST-256 - Decryptbotan: CAST-256botan: KASUMI - Decryptbotan: KASUMIsynthmark: VoiceMark_100coremark: CoreMark Size 666 - Iterations Per Secondonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUencode-opus: WAV To Opus Encodesvt-av1: Preset 8 - Bosphorus 4Konednn: IP Shapes 1D - bf16bf16bf16 - CPUc-ray: Total Time - 4K, 16 Rays Per Pixeltachyon: Total Timedav1d: Chimera 1080p 10-bitaom-av1: Speed 9 Realtime - Bosphorus 4Konednn: IP Shapes 3D - bf16bf16bf16 - CPUdraco: Church Facadedraco: Liondav1d: Summer Nature 4Ktnn: CPU - SqueezeNet v2onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUsvt-vp9: Visual Quality Optimized - Bosphorus 1080ponednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUsvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080ptscp: AI Chess Performancesvt-hevc: 10 - Bosphorus 1080pClang 13Clang 12Clang 117123214370.099184894852404470.257973432112600.0223.9819.2127.9421.719.6417.874.8612.668.797.8417.38145589.94598.02047.083.686624671.55.712.9488485415.3575270.65205977.68204880.022775.9208590.36196023.59254133.55365.727527158281.78645864.6351014010430000600877117472818744187138153.404297539.6464.967828.06697.9863.136.0005753.6375761.303400.26139.9919.6045.30849.956855.7642657.3112407333.278327.933290.469288.807212825.677114.500116.10376.24877.094513.0102185330.0367743.281149.63258.9822.7891012.59413.4699843.9656.651.7177072025967533.0284.6102.07168374.913.61489476.72353.721491816626.336983194353.389184434522508490.255981525112287.1737.1119.2628.3521.4811.0421.246.9413.7511.0111.6217.65121643.80611.92645.780.485224249.35.622.9738413214.1175015.29205740.39204058.282560.6206297.92194567.38251352.56368.449529158502.85416765.2941018510275333600758110472318644036437760.885417541.9274.849833.66321.0865.636.4285700.3265735.246402.25536.6519.3944.84874.911882.1652606.3111335326.798329.534288.611293.730210157.651114.829115.32776.24376.916503.9302146447.3800823.058219.90857.9172.7965815.22413.9533844.7854.471.7255870575870532.5183.8602.06918365.793.57952456.18340.211481388604.457003196912.120181354732520890.260967214120827.4536.2619.7226.7922.369.9721.596.0715.3210.9011.4617.30122576.13608.60445.584.885223751.95.602.9388516214.3274606.79205224.87204519.002729.0205660.13196465.21254323.92368.248528958558.25520864.415101958880000623745110472818604130537221.003906773.4674.895798.86433.3826.736.1675714.9165743.347657.10034.5519.3244.06866.095869.3522589.5113200325.704326.769279.963286.034211799.583115.648114.75473.56174.395491.0272103884.8925213.053159.42357.8412.7864214.80114.2482842.9554.441.7164070005827528.63105.3972.08089362.673.59463460.20344.591477410616.45OpenBenchmarking.org

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 2048Clang 13Clang 12Clang 11150300450600750SE +/- 1.71, N = 3SE +/- 6.96, N = 9SE +/- 7.30, N = 9712698700-lpthread -lm -lstdc++-lpthread -lm -lstdc++-lbz2 -lsnappy1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 4096Clang 13Clang 12Clang 1170140210280350SE +/- 1.31, N = 3SE +/- 1.07, N = 3SE +/- 1.96, N = 3321319319-lpthread -lm -lstdc++-lpthread -lm -lstdc++-lbz2 -lsnappy1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetClang 13Clang 12Clang 1115003000450060007500SE +/- 2.30, N = 3SE +/- 1.66, N = 3SE +/- 3.50, N = 34370.104353.396912.12MIN: 4351.72 / MAX: 4450.96MIN: 4332.87 / MAX: 4442MIN: 6888.33 / MAX: 6947.751. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Clang 13Clang 12Clang 114K8K12K16K20KSE +/- 43.73, N = 3SE +/- 34.47, N = 3SE +/- 187.54, N = 41848918443181351. (CC) gcc options: -pthread -O3 -march=native -lm

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 13Clang 12Clang 11100200300400500SE +/- 8.56, N = 15SE +/- 9.74, N = 15SE +/- 14.35, N = 124854524731. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 13Clang 12Clang 1150K100K150K200K250KSE +/- 40.95, N = 3SE +/- 339.82, N = 3SE +/- 87.48, N = 32404472508492520891. (CC) gcc options: -pedantic -O3

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyClang 13Clang 12Clang 110.05850.1170.17550.2340.2925SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 150.2570.2550.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyClang 13Clang 12Clang 11200K400K600K800K1000KSE +/- 10672.94, N = 3SE +/- 3346.94, N = 3SE +/- 16009.81, N = 159734329815259672141. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000Clang 13Clang 12Clang 1130K60K90K120K150KSE +/- 485.15, N = 3SE +/- 534.15, N = 3SE +/- 924.52, N = 15112600.02112287.17120827.451. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mClang 13Clang 12Clang 11918273645SE +/- 1.77, N = 3SE +/- 3.84, N = 3SE +/- 2.48, N = 1423.9837.1136.26MIN: 20.12 / MAX: 66.47MIN: 19.88 / MAX: 96.76MIN: 18.78 / MAX: 124.571. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdClang 13Clang 12Clang 11510152025SE +/- 0.23, N = 3SE +/- 0.22, N = 3SE +/- 0.53, N = 1419.2119.2619.72MIN: 18.04 / MAX: 38.36MIN: 18.05 / MAX: 35.83MIN: 17.67 / MAX: 58.391. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyClang 13Clang 12Clang 11714212835SE +/- 0.54, N = 3SE +/- 0.55, N = 3SE +/- 0.26, N = 1427.9428.3526.79MIN: 25.71 / MAX: 39.58MIN: 25.63 / MAX: 60.64MIN: 23.92 / MAX: 56.11. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Clang 13Clang 12Clang 11510152025SE +/- 0.29, N = 3SE +/- 0.14, N = 3SE +/- 0.61, N = 1421.7121.4822.36MIN: 19.54 / MAX: 44.52MIN: 19.9 / MAX: 40.42MIN: 18.98 / MAX: 71.341. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetClang 13Clang 12Clang 113691215SE +/- 0.17, N = 3SE +/- 1.31, N = 3SE +/- 0.16, N = 149.6411.049.97MIN: 8.9 / MAX: 11.05MIN: 9.24 / MAX: 14.73MIN: 8.85 / MAX: 29.231. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetClang 13Clang 12Clang 11510152025SE +/- 0.31, N = 3SE +/- 1.87, N = 3SE +/- 1.02, N = 1417.8721.2421.59MIN: 16.58 / MAX: 37.88MIN: 17.28 / MAX: 39.77MIN: 16.19 / MAX: 50.351. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceClang 13Clang 12Clang 11246810SE +/- 0.01, N = 3SE +/- 1.08, N = 3SE +/- 0.47, N = 144.866.946.07MIN: 4.57 / MAX: 9.45MIN: 4.49 / MAX: 19.88MIN: 4.35 / MAX: 27.781. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Clang 13Clang 12Clang 1148121620SE +/- 2.13, N = 3SE +/- 2.00, N = 3SE +/- 1.01, N = 1412.6613.7515.32MIN: 8.73 / MAX: 27.07MIN: 9.12 / MAX: 41.76MIN: 8.59 / MAX: 45.641. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetClang 13Clang 12Clang 113691215SE +/- 1.33, N = 3SE +/- 1.31, N = 3SE +/- 0.66, N = 148.7911.0110.90MIN: 6.69 / MAX: 26.58MIN: 7.09 / MAX: 20.3MIN: 6.9 / MAX: 32.841. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Clang 13Clang 12Clang 113691215SE +/- 0.01, N = 3SE +/- 1.05, N = 3SE +/- 0.67, N = 147.8411.6211.46MIN: 7.49 / MAX: 13.76MIN: 7.71 / MAX: 25.56MIN: 7.56 / MAX: 31.761. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetClang 13Clang 12Clang 1148121620SE +/- 0.11, N = 3SE +/- 0.16, N = 3SE +/- 0.27, N = 1417.3817.6517.30MIN: 16.17 / MAX: 40.05MIN: 16.51 / MAX: 19.27MIN: 15.79 / MAX: 37.291. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500Clang 13Clang 12Clang 1130K60K90K120K150KSE +/- 843.46, N = 3SE +/- 1145.12, N = 15SE +/- 798.53, N = 3145589.94121643.80122576.131. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 11130260390520650SE +/- 0.92, N = 3SE +/- 6.83, N = 4SE +/- 5.71, N = 13598.02611.93608.60MIN: 575.43MIN: 569.44MIN: 570.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedClang 13Clang 12Clang 111122334455SE +/- 0.53, N = 15SE +/- 0.55, N = 15SE +/- 0.51, N = 447.045.745.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedClang 13Clang 12Clang 1120406080100SE +/- 0.64, N = 10SE +/- 0.45, N = 3SE +/- 0.85, N = 1583.680.484.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceClang 13Clang 12Clang 112004006008001000SE +/- 8.76, N = 3SE +/- 5.60, N = 15SE +/- 6.84, N = 38668528521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

C-Blosc

Compressor: blosclz

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0Compressor: blosclzClang 13Clang 12Clang 115K10K15K20K25KSE +/- 33.59, N = 3SE +/- 34.61, N = 3SE +/- 34.48, N = 324671.524249.323751.91. (CXX) g++ options: -O3 -march=native

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KClang 13Clang 12Clang 111.28482.56963.85445.13926.424SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.715.625.601. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 13Clang 12Clang 110.66891.33782.00672.67563.3445SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.027, N = 32.9482.9732.9381. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 13Clang 12Clang 1120K40K60K80K100KSE +/- 119.61, N = 3SE +/- 85.43, N = 3SE +/- 786.14, N = 38485484132851621. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KClang 13Clang 12Clang 1148121620SE +/- 0.14, N = 15SE +/- 0.09, N = 3SE +/- 0.13, N = 315.3514.1114.321. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

nginx

Concurrent Requests: 1

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1Clang 13Clang 12Clang 1116K32K48K64K80KSE +/- 160.67, N = 3SE +/- 102.54, N = 3SE +/- 131.50, N = 375270.6575015.2974606.791. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 1305.06, N = 3SE +/- 632.41, N = 3SE +/- 947.34, N = 3205977.68205740.39205224.871. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 936.49, N = 3SE +/- 63.72, N = 3SE +/- 554.68, N = 3204880.02204058.28204519.001. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedClang 13Clang 12Clang 116001200180024003000SE +/- 7.16, N = 3SE +/- 34.31, N = 15SE +/- 32.58, N = 32775.92560.62729.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 987.48, N = 3SE +/- 431.14, N = 3SE +/- 1506.43, N = 3208590.36206297.92205660.131. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 911.83, N = 3SE +/- 312.30, N = 3SE +/- 2823.39, N = 3196023.59194567.38196465.211. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 20

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 20Clang 13Clang 12Clang 1150K100K150K200K250KSE +/- 1379.02, N = 3SE +/- 1222.93, N = 3SE +/- 2398.86, N = 3254133.55251352.56254323.921. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 1180160240320400SE +/- 0.40, N = 3SE +/- 1.79, N = 3SE +/- 2.59, N = 3365.73368.45368.25MIN: 355.4MIN: 347.27MIN: 349.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEClang 13Clang 12Clang 1111002200330044005500SE +/- 11.10, N = 3SE +/- 10.68, N = 3SE +/- 8.50, N = 35271529152891. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPClang 13Clang 12Clang 1113K26K39K52K65KSE +/- 17.71, N = 3SE +/- 82.28, N = 3SE +/- 38.93, N = 358281.7958502.8558558.261. (CXX) g++ options: -O3 -march=native -fopenmp

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Clang 13Clang 12Clang 111530456075SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 364.6465.2964.421. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulClang 13Clang 12Clang 112K4K6K8K10KSE +/- 43.11, N = 3SE +/- 11.92, N = 3SE +/- 4.48, N = 31014010185101951. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Clang 13Clang 12Clang 112M4M6M8M10MSE +/- 3785.94, N = 3SE +/- 16973.84, N = 3SE +/- 8504.90, N = 3104300001027533388800001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianClang 13Clang 12Clang 11130260390520650SE +/- 3.06, N = 3SE +/- 4.10, N = 3SE +/- 2.33, N = 36006006231. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClang 13Clang 12Clang 112004006008001000SE +/- 5.24, N = 3SE +/- 0.67, N = 3SE +/- 3.18, N = 38777587451. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClang 13Clang 12Clang 1130060090012001500SE +/- 5.78, N = 3SE +/- 5.49, N = 3SE +/- 0.67, N = 31174110411041. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 13Clang 12Clang 11160320480640800SE +/- 7.80, N = 3SE +/- 2.65, N = 3SE +/- 2.40, N = 37287237281. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlClang 13Clang 12Clang 11400800120016002000SE +/- 8.69, N = 3SE +/- 8.51, N = 3SE +/- 13.09, N = 31874186418601. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessClang 13Clang 12Clang 119K18K27K36K45KSE +/- 380.52, N = 3SE +/- 510.13, N = 3SE +/- 575.37, N = 34187140364413051. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPClang 13Clang 12Clang 118K16K24K32K40KSE +/- 450.24, N = 4SE +/- 290.04, N = 3SE +/- 19.15, N = 338153.4037760.8937221.001. (CXX) g++ options: -O3 -march=native -fopenmp

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Clang 13Clang 12Clang 11170340510680850SE +/- 0.66, N = 3SE +/- 0.70, N = 3SE +/- 0.69, N = 3539.65541.93773.47MIN: 536.06 / MAX: 559.33MIN: 536.56 / MAX: 554.17MIN: 690.55 / MAX: 820.361. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 13Clang 12Clang 111.11762.23523.35284.47045.588SE +/- 0.027, N = 3SE +/- 0.004, N = 3SE +/- 0.020, N = 34.9674.8494.8951. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedClang 13Clang 12Clang 112004006008001000SE +/- 5.79, N = 3SE +/- 1.68, N = 3SE +/- 0.29, N = 3828.0833.6798.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression SpeedClang 13Clang 12Clang 1114002800420056007000SE +/- 47.01, N = 3SE +/- 60.99, N = 3SE +/- 78.33, N = 36697.96321.06433.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression SpeedClang 13Clang 12Clang 112004006008001000SE +/- 1.29, N = 3SE +/- 2.11, N = 3SE +/- 1.30, N = 3863.1865.6826.71. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeClang 13Clang 12Clang 11816243240SE +/- 0.02, N = 3SE +/- 0.24, N = 3SE +/- 0.03, N = 336.0036.4336.171. (CC) gcc options: -lm -O3 -march=native

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptClang 13Clang 12Clang 1112002400360048006000SE +/- 0.20, N = 3SE +/- 0.79, N = 3SE +/- 0.51, N = 35753.645700.335714.921. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256Clang 13Clang 12Clang 1112002400360048006000SE +/- 0.33, N = 3SE +/- 5.15, N = 3SE +/- 0.30, N = 35761.305735.255743.351. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Clang 13Clang 12Clang 11140280420560700SE +/- 0.27, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3400.26402.26657.10MIN: 399.7 / MAX: 401.33MIN: 401.13 / MAX: 403.41MIN: 656.47 / MAX: 657.891. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkClang 13Clang 12Clang 11918273645SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 339.9936.6534.551. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 11510152025SE +/- 0.24, N = 3SE +/- 0.28, N = 3SE +/- 0.23, N = 319.6019.3919.321. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111020304050SE +/- 0.42, N = 7SE +/- 0.40, N = 3SE +/- 0.36, N = 945.3044.8444.061. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClang 13Clang 12Clang 112004006008001000SE +/- 1.46, N = 3SE +/- 0.95, N = 3SE +/- 0.04, N = 3849.96874.91866.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305Clang 13Clang 12Clang 112004006008001000SE +/- 4.73, N = 3SE +/- 0.87, N = 3SE +/- 0.40, N = 3855.76882.17869.351. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Clang 13Clang 12Clang 116001200180024003000SE +/- 8.72, N = 3SE +/- 4.59, N = 3SE +/- 5.39, N = 32657.32606.32589.51. (CXX) g++ options: -O3 -march=native -rdynamic

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishClang 13Clang 12Clang 1120K40K60K80K100KSE +/- 41.83, N = 3SE +/- 138.68, N = 3SE +/- 249.93, N = 31124071113351132001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptClang 13Clang 12Clang 1170140210280350SE +/- 0.04, N = 3SE +/- 3.35, N = 3SE +/- 0.09, N = 3333.28326.80325.701. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishClang 13Clang 12Clang 1170140210280350SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3327.93329.53326.771. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptClang 13Clang 12Clang 1160120180240300SE +/- 0.27, N = 3SE +/- 0.24, N = 3SE +/- 0.36, N = 3290.47288.61279.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishClang 13Clang 12Clang 1160120180240300SE +/- 0.22, N = 3SE +/- 0.97, N = 3SE +/- 0.21, N = 3288.81293.73286.031. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Aircrack-ng

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.5.2Clang 13Clang 12Clang 1150K100K150K200K250KSE +/- 467.06, N = 3SE +/- 504.25, N = 3SE +/- 607.21, N = 3212825.68210157.65211799.581. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lsqlite3 -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClang 13Clang 12Clang 11306090120150SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3114.50114.83115.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256Clang 13Clang 12Clang 11306090120150SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3116.10115.33114.751. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 376.2576.2473.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 377.0976.9274.401. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Clang 13Clang 12Clang 11110220330440550SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3513.01503.93491.031. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClang 13Clang 12Clang 11500K1000K1500K2000K2500KSE +/- 1510.09, N = 3SE +/- 3711.92, N = 3SE +/- 2482.75, N = 32185330.042146447.382103884.891. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.73831.47662.21492.95323.6915SE +/- 0.00367, N = 3SE +/- 0.00384, N = 3SE +/- 0.00886, N = 33.281143.058213.05315MIN: 3.08MIN: 2.86MIN: 2.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 13Clang 12Clang 113691215SE +/- 0.069, N = 5SE +/- 0.006, N = 5SE +/- 0.013, N = 59.6329.9089.4231. (CXX) g++ options: -O3 -march=native -logg -lm

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.38, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 358.9857.9257.841. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.62921.25841.88762.51683.146SE +/- 0.00435, N = 3SE +/- 0.00591, N = 3SE +/- 0.00451, N = 32.789102.796582.78642MIN: 2.65MIN: 2.65MIN: 2.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClang 13Clang 12Clang 1148121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.5915.2214.801. (CC) gcc options: -lm -lpthread -O3 -march=native

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeClang 13Clang 12Clang 1148121620SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 313.4713.9514.251. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Chimera 1080p 10-bitClang 13Clang 12Clang 112004006008001000SE +/- 2.33, N = 3SE +/- 1.52, N = 3SE +/- 0.82, N = 3843.96844.78842.95MIN: 503.8 / MAX: 1131.86MIN: 517.09 / MAX: 1121.33MIN: 515.35 / MAX: 1115.741. (CC) gcc options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.16, N = 356.6554.4754.441. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.38830.77661.16491.55321.9415SE +/- 0.00225, N = 3SE +/- 0.00427, N = 3SE +/- 0.00619, N = 31.717701.725581.71640MIN: 1.55MIN: 1.57MIN: 1.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Church FacadeClang 13Clang 12Clang 1115003000450060007500SE +/- 4.26, N = 3SE +/- 3.18, N = 37202705770001. (CXX) g++ options: -O3 -march=native

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: LionClang 13Clang 12Clang 1113002600390052006500SE +/- 1.73, N = 35967587058271. (CXX) g++ options: -O3 -march=native

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Summer Nature 4KClang 13Clang 12Clang 11120240360480600SE +/- 0.44, N = 3SE +/- 0.77, N = 3SE +/- 2.65, N = 3533.02532.51528.63MIN: 186.74 / MAX: 587.2MIN: 186.64 / MAX: 587.12MIN: 176.65 / MAX: 587.681. (CC) gcc options: -O3 -march=native -pthread

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Clang 13Clang 12Clang 1120406080100SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 384.6183.86105.40MIN: 84.23 / MAX: 85MIN: 83.28 / MAX: 84.63MIN: 104.83 / MAX: 106.11. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.46820.93641.40461.87282.341SE +/- 0.00196, N = 3SE +/- 0.00197, N = 3SE +/- 0.00072, N = 32.071682.069182.08089MIN: 1.99MIN: 1.99MIN: 1.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 13Clang 12Clang 1180160240320400SE +/- 4.27, N = 3SE +/- 2.53, N = 3SE +/- 1.62, N = 3374.91365.79362.671. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.81341.62682.44023.25364.067SE +/- 0.00211, N = 3SE +/- 0.00151, N = 3SE +/- 0.00881, N = 33.614893.579523.59463MIN: 3.52MIN: 3.5MIN: 3.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 13Clang 12Clang 11100200300400500SE +/- 5.26, N = 3SE +/- 4.57, N = 3SE +/- 4.56, N = 3476.72456.18460.201. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 13Clang 12Clang 1180160240320400SE +/- 3.85, N = 3SE +/- 1.14, N = 3SE +/- 2.06, N = 3353.72340.21344.591. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 13Clang 12Clang 11300K600K900K1200K1500KSE +/- 806.80, N = 5SE +/- 1489.72, N = 5SE +/- 794.00, N = 51491816148138814774101. (CC) gcc options: -O3 -march=native

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 13Clang 12Clang 11140280420560700SE +/- 3.02, N = 3SE +/- 2.53, N = 3SE +/- 2.02, N = 3626.33604.45616.451. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.4