LLVM Clang 13 Bencmarking Intel Xeon Ice Lake

LLVM Clang benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2108236-TJ-LLVMCLANG38&sor.

LLVM Clang 13 Bencmarking Intel Xeon Ice LakeProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 11Clang 12Clang 132 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Device 0998504GB7682GB INTEL SSDPF2KX076TZASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 21.045.14.0-rc1-folio (x86_64) 20210715GNOME Shell 3.38.4X Server 1.20.11Clang 11.0.1-2ubuntu4ext41920x1080Clang 12.0.0-3ubuntu1~21.04.1Clang 13.0.0-++20210820072921+23ba3732246a-1~exp1~20210820174536.53OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Python Details- Clang 12, Clang 13: Python 3.9.5

LLVM Clang 13 Bencmarking Intel Xeon Ice Lakeblosc: blosclzquantlib: fftw: Float + SSE - 2D FFT Size 4096compress-zstd: 3 - Compression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Compression Speedbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptlibraw: Post-Processing Benchmarktscp: AI Chess Performancejohn-the-ripper: Blowfishjohn-the-ripper: MD5graphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacedav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Ksvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pvpxenc: Speed 0 - Bosphorus 4Kvpxenc: Speed 5 - Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondpjsip: INVITEpjsip: OPTIONS, Statefulpjsip: OPTIONS, Statelessc-ray: Total Time - 4K, 16 Rays Per Pixelonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUaobench: 2048 x 2048 - Total Timeencode-opus: WAV To Opus Encodetachyon: Total Timesynthmark: VoiceMark_100aircrack-ng: securemark: SecureMark-TLSfinancebench: Repo OpenMPfinancebench: Bonds OpenMPmysqlslap: 2048mysqlslap: 4096pgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Writepgbench: 100 - 250 - Read Write - Average Latencysqlite-speedtest: Timed Time - Size 1,000draco: Liondraco: Church Facadencnn: CPU - mobilenetncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1nginx: 1nginx: 20nginx: 100nginx: 200nginx: 500nginx: 1000apache: 500apache: 1000Clang 11Clang 12Clang 1323751.92589.5181356433.32729.084.8826.7798.845.574.39573.5615743.3475714.916286.034279.963326.769325.704114.754115.648869.352866.09534.551477410113200888000018607287451104473623852528.63842.9519.3244.0654.444.89557.841344.59616.45460.20362.675.6014.322103884.8925215289101954130514.8012.786421.716402.080893.053153.59463608.604368.24836.1679.42314.2482491.027211799.58325208937221.00390658558.2552087003199672140.260851622.93864.4155827700017.3011.4610.9015.326.0721.599.9722.3626.7919.7236.266912.120773.467105.397657.10074606.79254323.92205660.13196465.21204519.00205224.87122576.13120827.4524249.32606.3184436321.02560.680.4865.6833.645.776.91676.2435735.2465700.326293.730288.611329.534326.798115.327114.829882.165874.91136.6514813881113351027533318647237581104452600852532.51844.7819.3944.8454.474.84957.917340.21604.45456.18365.795.6214.112146447.3800825291101854036415.2242.796581.725582.069183.058213.57952611.926368.44936.4289.90813.9533503.930210157.65125084937760.88541758502.8541676983199815250.255841322.97365.2945870705717.6511.6211.0113.756.9421.2411.0421.4828.3519.2637.114353.389541.92783.860402.25575015.29251352.56206297.92194567.38204058.28205740.39121643.80112287.1724671.52657.3184896697.92775.983.6863.1828.047.077.09476.2485761.3035753.637288.807290.469327.933333.278116.103114.500855.764849.95639.9914918161124071043000018747288771174485600866533.02843.9619.6045.3056.654.96758.982353.72626.33476.72374.915.7115.352185330.0367745271101404187112.5942.789101.717702.071683.281143.61489598.020365.72736.0009.63213.4699513.010212825.67724044738153.40429758281.7864587123219734320.257848542.94864.6355967720217.387.848.7912.664.8617.879.6421.7127.9419.2123.984370.099539.64684.610400.26175270.65254133.55208590.36196023.59204880.02205977.68145589.94112600.02OpenBenchmarking.org

C-Blosc

Compressor: blosclz

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0Compressor: blosclzClang 13Clang 12Clang 115K10K15K20K25KSE +/- 33.59, N = 3SE +/- 34.61, N = 3SE +/- 34.48, N = 324671.524249.323751.91. (CXX) g++ options: -O3 -march=native

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Clang 13Clang 12Clang 116001200180024003000SE +/- 8.72, N = 3SE +/- 4.59, N = 3SE +/- 5.39, N = 32657.32606.32589.51. (CXX) g++ options: -O3 -march=native -rdynamic

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Clang 13Clang 12Clang 114K8K12K16K20KSE +/- 43.73, N = 3SE +/- 34.47, N = 3SE +/- 187.54, N = 41848918443181351. (CC) gcc options: -pthread -O3 -march=native -lm

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression SpeedClang 13Clang 11Clang 1214002800420056007000SE +/- 47.01, N = 3SE +/- 78.33, N = 3SE +/- 60.99, N = 36697.96433.36321.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedClang 13Clang 11Clang 126001200180024003000SE +/- 7.16, N = 3SE +/- 32.58, N = 3SE +/- 34.31, N = 152775.92729.02560.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedClang 11Clang 13Clang 1220406080100SE +/- 0.85, N = 15SE +/- 0.64, N = 10SE +/- 0.45, N = 384.883.680.41. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression SpeedClang 12Clang 13Clang 112004006008001000SE +/- 2.11, N = 3SE +/- 1.29, N = 3SE +/- 1.30, N = 3865.6863.1826.71. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedClang 12Clang 13Clang 112004006008001000SE +/- 1.68, N = 3SE +/- 5.79, N = 3SE +/- 0.29, N = 3833.6828.0798.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedClang 13Clang 12Clang 111122334455SE +/- 0.53, N = 15SE +/- 0.55, N = 15SE +/- 0.51, N = 447.045.745.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 377.0976.9274.401. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 376.2576.2473.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256Clang 13Clang 11Clang 1212002400360048006000SE +/- 0.33, N = 3SE +/- 0.30, N = 3SE +/- 5.15, N = 35761.305743.355735.251. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptClang 13Clang 11Clang 1212002400360048006000SE +/- 0.20, N = 3SE +/- 0.51, N = 3SE +/- 0.79, N = 35753.645714.925700.331. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishClang 12Clang 13Clang 1160120180240300SE +/- 0.97, N = 3SE +/- 0.22, N = 3SE +/- 0.21, N = 3293.73288.81286.031. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptClang 13Clang 12Clang 1160120180240300SE +/- 0.27, N = 3SE +/- 0.24, N = 3SE +/- 0.36, N = 3290.47288.61279.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishClang 12Clang 13Clang 1170140210280350SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3329.53327.93326.771. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptClang 13Clang 12Clang 1170140210280350SE +/- 0.04, N = 3SE +/- 3.35, N = 3SE +/- 0.09, N = 3333.28326.80325.701. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256Clang 13Clang 12Clang 11306090120150SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3116.10115.33114.751. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClang 11Clang 12Clang 13306090120150SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3115.65114.83114.501. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305Clang 12Clang 11Clang 132004006008001000SE +/- 0.87, N = 3SE +/- 0.40, N = 3SE +/- 4.73, N = 3882.17869.35855.761. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClang 12Clang 11Clang 132004006008001000SE +/- 0.95, N = 3SE +/- 0.04, N = 3SE +/- 1.46, N = 3874.91866.10849.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkClang 13Clang 12Clang 11918273645SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 339.9936.6534.551. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 13Clang 12Clang 11300K600K900K1200K1500KSE +/- 806.80, N = 5SE +/- 1489.72, N = 5SE +/- 794.00, N = 51491816148138814774101. (CC) gcc options: -O3 -march=native

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishClang 11Clang 13Clang 1220K40K60K80K100KSE +/- 249.93, N = 3SE +/- 41.83, N = 3SE +/- 138.68, N = 31132001124071113351. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Clang 13Clang 12Clang 112M4M6M8M10MSE +/- 3785.94, N = 3SE +/- 16973.84, N = 3SE +/- 8504.90, N = 3104300001027533388800001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlClang 13Clang 12Clang 11400800120016002000SE +/- 8.69, N = 3SE +/- 8.51, N = 3SE +/- 13.09, N = 31874186418601. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 13Clang 11Clang 12160320480640800SE +/- 7.80, N = 3SE +/- 2.40, N = 3SE +/- 2.65, N = 37287287231. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClang 13Clang 12Clang 112004006008001000SE +/- 5.24, N = 3SE +/- 0.67, N = 3SE +/- 3.18, N = 38777587451. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClang 13Clang 12Clang 1130060090012001500SE +/- 5.78, N = 3SE +/- 5.49, N = 3SE +/- 0.67, N = 31174110411041. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 13Clang 11Clang 12100200300400500SE +/- 8.56, N = 15SE +/- 14.35, N = 12SE +/- 9.74, N = 154854734521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianClang 11Clang 13Clang 12130260390520650SE +/- 2.33, N = 3SE +/- 3.06, N = 3SE +/- 4.10, N = 36236006001. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceClang 13Clang 12Clang 112004006008001000SE +/- 8.76, N = 3SE +/- 5.60, N = 15SE +/- 6.84, N = 38668528521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Summer Nature 4KClang 13Clang 12Clang 11120240360480600SE +/- 0.44, N = 3SE +/- 0.77, N = 3SE +/- 2.65, N = 3533.02532.51528.63MIN: 186.74 / MAX: 587.2MIN: 186.64 / MAX: 587.12MIN: 176.65 / MAX: 587.681. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Chimera 1080p 10-bitClang 12Clang 13Clang 112004006008001000SE +/- 1.52, N = 3SE +/- 2.33, N = 3SE +/- 0.82, N = 3844.78843.96842.95MIN: 517.09 / MAX: 1121.33MIN: 503.8 / MAX: 1131.86MIN: 515.35 / MAX: 1115.741. (CC) gcc options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 11510152025SE +/- 0.24, N = 3SE +/- 0.28, N = 3SE +/- 0.23, N = 319.6019.3919.321. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111020304050SE +/- 0.42, N = 7SE +/- 0.40, N = 3SE +/- 0.36, N = 945.3044.8444.061. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.16, N = 356.6554.4754.441. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 13Clang 11Clang 121.11762.23523.35284.47045.588SE +/- 0.027, N = 3SE +/- 0.020, N = 3SE +/- 0.004, N = 34.9674.8954.8491. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.38, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 358.9857.9257.841. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 13Clang 11Clang 1280160240320400SE +/- 3.85, N = 3SE +/- 2.06, N = 3SE +/- 1.14, N = 3353.72344.59340.211. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 13Clang 11Clang 12140280420560700SE +/- 3.02, N = 3SE +/- 2.02, N = 3SE +/- 2.53, N = 3626.33616.45604.451. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 13Clang 11Clang 12100200300400500SE +/- 5.26, N = 3SE +/- 4.56, N = 3SE +/- 4.57, N = 3476.72460.20456.181. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 13Clang 12Clang 1180160240320400SE +/- 4.27, N = 3SE +/- 2.53, N = 3SE +/- 1.62, N = 3374.91365.79362.671. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KClang 13Clang 12Clang 111.28482.56963.85445.13926.424SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.715.625.601. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KClang 13Clang 11Clang 1248121620SE +/- 0.14, N = 15SE +/- 0.13, N = 3SE +/- 0.09, N = 315.3514.3214.111. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClang 13Clang 12Clang 11500K1000K1500K2000K2500KSE +/- 1510.09, N = 3SE +/- 3711.92, N = 3SE +/- 2482.75, N = 32185330.042146447.382103884.891. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEClang 12Clang 11Clang 1311002200330044005500SE +/- 10.68, N = 3SE +/- 8.50, N = 3SE +/- 11.10, N = 35291528952711. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulClang 11Clang 12Clang 132K4K6K8K10KSE +/- 4.48, N = 3SE +/- 11.92, N = 3SE +/- 43.11, N = 31019510185101401. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessClang 13Clang 11Clang 129K18K27K36K45KSE +/- 380.52, N = 3SE +/- 575.37, N = 3SE +/- 510.13, N = 34187141305403641. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClang 13Clang 11Clang 1248121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 312.5914.8015.221. (CC) gcc options: -lm -lpthread -O3 -march=native

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUClang 11Clang 13Clang 120.62921.25841.88762.51683.146SE +/- 0.00451, N = 3SE +/- 0.00435, N = 3SE +/- 0.00591, N = 32.786422.789102.79658MIN: 2.65MIN: 2.65MIN: 2.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUClang 11Clang 13Clang 120.38830.77661.16491.55321.9415SE +/- 0.00619, N = 3SE +/- 0.00225, N = 3SE +/- 0.00427, N = 31.716401.717701.72558MIN: 1.53MIN: 1.55MIN: 1.571. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUClang 12Clang 13Clang 110.46820.93641.40461.87282.341SE +/- 0.00197, N = 3SE +/- 0.00196, N = 3SE +/- 0.00072, N = 32.069182.071682.08089MIN: 1.99MIN: 1.99MIN: 1.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUClang 11Clang 12Clang 130.73831.47662.21492.95323.6915SE +/- 0.00886, N = 3SE +/- 0.00384, N = 3SE +/- 0.00367, N = 33.053153.058213.28114MIN: 2.87MIN: 2.86MIN: 3.081. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUClang 12Clang 11Clang 130.81341.62682.44023.25364.067SE +/- 0.00151, N = 3SE +/- 0.00881, N = 3SE +/- 0.00211, N = 33.579523.594633.61489MIN: 3.5MIN: 3.5MIN: 3.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 11Clang 12130260390520650SE +/- 0.92, N = 3SE +/- 5.71, N = 13SE +/- 6.83, N = 4598.02608.60611.93MIN: 575.43MIN: 570.79MIN: 569.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 11Clang 1280160240320400SE +/- 0.40, N = 3SE +/- 2.59, N = 3SE +/- 1.79, N = 3365.73368.25368.45MIN: 355.4MIN: 349.99MIN: 347.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeClang 13Clang 11Clang 12816243240SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.24, N = 336.0036.1736.431. (CC) gcc options: -lm -O3 -march=native

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 11Clang 13Clang 123691215SE +/- 0.013, N = 5SE +/- 0.069, N = 5SE +/- 0.006, N = 59.4239.6329.9081. (CXX) g++ options: -O3 -march=native -logg -lm

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeClang 13Clang 12Clang 1148121620SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 313.4713.9514.251. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Clang 13Clang 12Clang 11110220330440550SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3513.01503.93491.031. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Aircrack-ng

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.5.2Clang 13Clang 11Clang 1250K100K150K200K250KSE +/- 467.06, N = 3SE +/- 607.21, N = 3SE +/- 504.25, N = 3212825.68211799.58210157.651. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lsqlite3 -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 11Clang 12Clang 1350K100K150K200K250KSE +/- 87.48, N = 3SE +/- 339.82, N = 3SE +/- 40.95, N = 32520892508492404471. (CC) gcc options: -pedantic -O3

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPClang 11Clang 12Clang 138K16K24K32K40KSE +/- 19.15, N = 3SE +/- 290.04, N = 3SE +/- 450.24, N = 437221.0037760.8938153.401. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPClang 13Clang 12Clang 1113K26K39K52K65KSE +/- 17.71, N = 3SE +/- 82.28, N = 3SE +/- 38.93, N = 358281.7958502.8558558.261. (CXX) g++ options: -O3 -march=native -fopenmp

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 2048Clang 13Clang 11Clang 12150300450600750SE +/- 1.71, N = 3SE +/- 7.30, N = 9SE +/- 6.96, N = 9712700698-lpthread -lm -lstdc++-lbz2 -lsnappy-lpthread -lm -lstdc++1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 4096Clang 13Clang 12Clang 1170140210280350SE +/- 1.31, N = 3SE +/- 1.07, N = 3SE +/- 1.96, N = 3321319319-lpthread -lm -lstdc++-lpthread -lm -lstdc++-lbz2 -lsnappy1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyClang 12Clang 13Clang 11200K400K600K800K1000KSE +/- 3346.94, N = 3SE +/- 10672.94, N = 3SE +/- 16009.81, N = 159815259734329672141. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyClang 12Clang 13Clang 110.05850.1170.17550.2340.2925SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 150.2550.2570.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 11Clang 13Clang 1220K40K60K80K100KSE +/- 786.14, N = 3SE +/- 119.61, N = 3SE +/- 85.43, N = 38516284854841321. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 11Clang 13Clang 120.66891.33782.00672.67563.3445SE +/- 0.027, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 32.9382.9482.9731. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Clang 11Clang 13Clang 121530456075SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 364.4264.6465.291. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: LionClang 11Clang 12Clang 1313002600390052006500SE +/- 1.73, N = 35827587059671. (CXX) g++ options: -O3 -march=native

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Church FacadeClang 11Clang 12Clang 1315003000450060007500SE +/- 3.18, N = 3SE +/- 4.26, N = 37000705772021. (CXX) g++ options: -O3 -march=native

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetClang 11Clang 13Clang 1248121620SE +/- 0.27, N = 14SE +/- 0.11, N = 3SE +/- 0.16, N = 317.3017.3817.65MIN: 15.79 / MAX: 37.29MIN: 16.17 / MAX: 40.05MIN: 16.51 / MAX: 19.271. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Clang 13Clang 11Clang 123691215SE +/- 0.01, N = 3SE +/- 0.67, N = 14SE +/- 1.05, N = 37.8411.4611.62MIN: 7.49 / MAX: 13.76MIN: 7.56 / MAX: 31.76MIN: 7.71 / MAX: 25.561. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetClang 13Clang 11Clang 123691215SE +/- 1.33, N = 3SE +/- 0.66, N = 14SE +/- 1.31, N = 38.7910.9011.01MIN: 6.69 / MAX: 26.58MIN: 6.9 / MAX: 32.84MIN: 7.09 / MAX: 20.31. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Clang 13Clang 12Clang 1148121620SE +/- 2.13, N = 3SE +/- 2.00, N = 3SE +/- 1.01, N = 1412.6613.7515.32MIN: 8.73 / MAX: 27.07MIN: 9.12 / MAX: 41.76MIN: 8.59 / MAX: 45.641. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceClang 13Clang 11Clang 12246810SE +/- 0.01, N = 3SE +/- 0.47, N = 14SE +/- 1.08, N = 34.866.076.94MIN: 4.57 / MAX: 9.45MIN: 4.35 / MAX: 27.78MIN: 4.49 / MAX: 19.881. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetClang 13Clang 12Clang 11510152025SE +/- 0.31, N = 3SE +/- 1.87, N = 3SE +/- 1.02, N = 1417.8721.2421.59MIN: 16.58 / MAX: 37.88MIN: 17.28 / MAX: 39.77MIN: 16.19 / MAX: 50.351. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetClang 13Clang 11Clang 123691215SE +/- 0.17, N = 3SE +/- 0.16, N = 14SE +/- 1.31, N = 39.649.9711.04MIN: 8.9 / MAX: 11.05MIN: 8.85 / MAX: 29.23MIN: 9.24 / MAX: 14.731. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Clang 12Clang 13Clang 11510152025SE +/- 0.14, N = 3SE +/- 0.29, N = 3SE +/- 0.61, N = 1421.4821.7122.36MIN: 19.9 / MAX: 40.42MIN: 19.54 / MAX: 44.52MIN: 18.98 / MAX: 71.341. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyClang 11Clang 13Clang 12714212835SE +/- 0.26, N = 14SE +/- 0.54, N = 3SE +/- 0.55, N = 326.7927.9428.35MIN: 23.92 / MAX: 56.1MIN: 25.71 / MAX: 39.58MIN: 25.63 / MAX: 60.641. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdClang 13Clang 12Clang 11510152025SE +/- 0.23, N = 3SE +/- 0.22, N = 3SE +/- 0.53, N = 1419.2119.2619.72MIN: 18.04 / MAX: 38.36MIN: 18.05 / MAX: 35.83MIN: 17.67 / MAX: 58.391. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mClang 13Clang 11Clang 12918273645SE +/- 1.77, N = 3SE +/- 2.48, N = 14SE +/- 3.84, N = 323.9836.2637.11MIN: 20.12 / MAX: 66.47MIN: 18.78 / MAX: 124.57MIN: 19.88 / MAX: 96.761. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetClang 12Clang 13Clang 1115003000450060007500SE +/- 1.66, N = 3SE +/- 2.30, N = 3SE +/- 3.50, N = 34353.394370.106912.12MIN: 4332.87 / MAX: 4442MIN: 4351.72 / MAX: 4450.96MIN: 6888.33 / MAX: 6947.751. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Clang 13Clang 12Clang 11170340510680850SE +/- 0.66, N = 3SE +/- 0.70, N = 3SE +/- 0.69, N = 3539.65541.93773.47MIN: 536.06 / MAX: 559.33MIN: 536.56 / MAX: 554.17MIN: 690.55 / MAX: 820.361. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Clang 12Clang 13Clang 1120406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.15, N = 383.8684.61105.40MIN: 83.28 / MAX: 84.63MIN: 84.23 / MAX: 85MIN: 104.83 / MAX: 106.11. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Clang 13Clang 12Clang 11140280420560700SE +/- 0.27, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3400.26402.26657.10MIN: 399.7 / MAX: 401.33MIN: 401.13 / MAX: 403.41MIN: 656.47 / MAX: 657.891. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

nginx

Concurrent Requests: 1

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1Clang 13Clang 12Clang 1116K32K48K64K80KSE +/- 160.67, N = 3SE +/- 102.54, N = 3SE +/- 131.50, N = 375270.6575015.2974606.791. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 20

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 20Clang 11Clang 13Clang 1250K100K150K200K250KSE +/- 2398.86, N = 3SE +/- 1379.02, N = 3SE +/- 1222.93, N = 3254323.92254133.55251352.561. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 987.48, N = 3SE +/- 431.14, N = 3SE +/- 1506.43, N = 3208590.36206297.92205660.131. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200Clang 11Clang 13Clang 1240K80K120K160K200KSE +/- 2823.39, N = 3SE +/- 911.83, N = 3SE +/- 312.30, N = 3196465.21196023.59194567.381. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500Clang 13Clang 11Clang 1240K80K120K160K200KSE +/- 936.49, N = 3SE +/- 554.68, N = 3SE +/- 63.72, N = 3204880.02204519.00204058.281. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 1305.06, N = 3SE +/- 632.41, N = 3SE +/- 947.34, N = 3205977.68205740.39205224.871. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500Clang 13Clang 11Clang 1230K60K90K120K150KSE +/- 843.46, N = 3SE +/- 798.53, N = 3SE +/- 1145.12, N = 15145589.94122576.13121643.801. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000Clang 11Clang 13Clang 1230K60K90K120K150KSE +/- 924.52, N = 15SE +/- 485.15, N = 3SE +/- 534.15, N = 3120827.45112600.02112287.171. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native


Phoronix Test Suite v10.8.4