LLVM Clang 13 Bencmarking Intel Xeon Ice Lake

LLVM Clang benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2108236-TJ-LLVMCLANG38&rdt.

LLVM Clang 13 Bencmarking Intel Xeon Ice LakeProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 13Clang 12Clang 112 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Device 0998504GB7682GB INTEL SSDPF2KX076TZASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 21.045.14.0-rc1-folio (x86_64) 20210715GNOME Shell 3.38.4X Server 1.20.11Clang 13.0.0-++20210820072921+23ba3732246a-1~exp1~20210820174536.53ext41920x1080Clang 12.0.0-3ubuntu1~21.04.1Clang 11.0.1-2ubuntu4OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0Python Details- Clang 13, Clang 12: Python 3.9.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 13 Bencmarking Intel Xeon Ice Lakeblosc: blosclzquantlib: fftw: Float + SSE - 2D FFT Size 4096compress-zstd: 3 - Compression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Compression Speedbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptlibraw: Post-Processing Benchmarktscp: AI Chess Performancejohn-the-ripper: Blowfishjohn-the-ripper: MD5graphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacedav1d: Summer Nature 4Kdav1d: Chimera 1080p 10-bitaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Ksvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pvpxenc: Speed 0 - Bosphorus 4Kvpxenc: Speed 5 - Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondpjsip: INVITEpjsip: OPTIONS, Statefulpjsip: OPTIONS, Statelessc-ray: Total Time - 4K, 16 Rays Per Pixelonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUaobench: 2048 x 2048 - Total Timeencode-opus: WAV To Opus Encodetachyon: Total Timesynthmark: VoiceMark_100aircrack-ng: securemark: SecureMark-TLSfinancebench: Repo OpenMPfinancebench: Bonds OpenMPmysqlslap: 2048mysqlslap: 4096pgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Writepgbench: 100 - 250 - Read Write - Average Latencysqlite-speedtest: Timed Time - Size 1,000draco: Liondraco: Church Facadencnn: CPU - mobilenetncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1nginx: 1nginx: 20nginx: 100nginx: 200nginx: 500nginx: 1000apache: 500apache: 1000Clang 13Clang 12Clang 1124671.52657.3184896697.92775.983.6863.1828.047.077.09476.2485761.3035753.637288.807290.469327.933333.278116.103114.500855.764849.95639.9914918161124071043000018747288771174485600866533.02843.9619.6045.3056.654.96758.982353.72626.33476.72374.915.7115.352185330.0367745271101404187112.5942.789101.717702.071683.281143.61489598.020365.72736.0009.63213.4699513.010212825.67724044738153.40429758281.7864587123219734320.257848542.94864.6355967720217.387.848.7912.664.8617.879.6421.7127.9419.2123.984370.099539.64684.610400.26175270.65254133.55208590.36196023.59204880.02205977.68145589.94112600.0224249.32606.3184436321.02560.680.4865.6833.645.776.91676.2435735.2465700.326293.730288.611329.534326.798115.327114.829882.165874.91136.6514813881113351027533318647237581104452600852532.51844.7819.3944.8454.474.84957.917340.21604.45456.18365.795.6214.112146447.3800825291101854036415.2242.796581.725582.069183.058213.57952611.926368.44936.4289.90813.9533503.930210157.65125084937760.88541758502.8541676983199815250.255841322.97365.2945870705717.6511.6211.0113.756.9421.2411.0421.4828.3519.2637.114353.389541.92783.860402.25575015.29251352.56206297.92194567.38204058.28205740.39121643.80112287.1723751.92589.5181356433.32729.084.8826.7798.845.574.39573.5615743.3475714.916286.034279.963326.769325.704114.754115.648869.352866.09534.551477410113200888000018607287451104473623852528.63842.9519.3244.0654.444.89557.841344.59616.45460.20362.675.6014.322103884.8925215289101954130514.8012.786421.716402.080893.053153.59463608.604368.24836.1679.42314.2482491.027211799.58325208937221.00390658558.2552087003199672140.260851622.93864.4155827700017.3011.4610.9015.326.0721.599.9722.3626.7919.7236.266912.120773.467105.397657.10074606.79254323.92205660.13196465.21204519.00205224.87122576.13120827.45OpenBenchmarking.org

C-Blosc

Compressor: blosclz

OpenBenchmarking.orgMB/s, More Is BetterC-Blosc 2.0Compressor: blosclzClang 13Clang 12Clang 115K10K15K20K25KSE +/- 33.59, N = 3SE +/- 34.61, N = 3SE +/- 34.48, N = 324671.524249.323751.91. (CXX) g++ options: -O3 -march=native

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Clang 13Clang 12Clang 116001200180024003000SE +/- 8.72, N = 3SE +/- 4.59, N = 3SE +/- 5.39, N = 32657.32606.32589.51. (CXX) g++ options: -O3 -march=native -rdynamic

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Clang 13Clang 12Clang 114K8K12K16K20KSE +/- 43.73, N = 3SE +/- 34.47, N = 3SE +/- 187.54, N = 41848918443181351. (CC) gcc options: -pthread -O3 -march=native -lm

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression SpeedClang 13Clang 12Clang 1114002800420056007000SE +/- 47.01, N = 3SE +/- 60.99, N = 3SE +/- 78.33, N = 36697.96321.06433.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedClang 13Clang 12Clang 116001200180024003000SE +/- 7.16, N = 3SE +/- 34.31, N = 15SE +/- 32.58, N = 32775.92560.62729.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedClang 13Clang 12Clang 1120406080100SE +/- 0.64, N = 10SE +/- 0.45, N = 3SE +/- 0.85, N = 1583.680.484.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression SpeedClang 13Clang 12Clang 112004006008001000SE +/- 1.29, N = 3SE +/- 2.11, N = 3SE +/- 1.30, N = 3863.1865.6826.71. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedClang 13Clang 12Clang 112004006008001000SE +/- 5.79, N = 3SE +/- 1.68, N = 3SE +/- 0.29, N = 3828.0833.6798.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedClang 13Clang 12Clang 111122334455SE +/- 0.53, N = 15SE +/- 0.55, N = 15SE +/- 0.51, N = 447.045.745.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 377.0976.9274.401. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClang 13Clang 12Clang 1120406080100SE +/- 0.02, N = 3SE +/- 0.23, N = 3SE +/- 0.02, N = 376.2576.2473.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256Clang 13Clang 12Clang 1112002400360048006000SE +/- 0.33, N = 3SE +/- 5.15, N = 3SE +/- 0.30, N = 35761.305735.255743.351. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptClang 13Clang 12Clang 1112002400360048006000SE +/- 0.20, N = 3SE +/- 0.79, N = 3SE +/- 0.51, N = 35753.645700.335714.921. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishClang 13Clang 12Clang 1160120180240300SE +/- 0.22, N = 3SE +/- 0.97, N = 3SE +/- 0.21, N = 3288.81293.73286.031. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptClang 13Clang 12Clang 1160120180240300SE +/- 0.27, N = 3SE +/- 0.24, N = 3SE +/- 0.36, N = 3290.47288.61279.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishClang 13Clang 12Clang 1170140210280350SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3327.93329.53326.771. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptClang 13Clang 12Clang 1170140210280350SE +/- 0.04, N = 3SE +/- 3.35, N = 3SE +/- 0.09, N = 3333.28326.80325.701. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256Clang 13Clang 12Clang 11306090120150SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3116.10115.33114.751. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClang 13Clang 12Clang 11306090120150SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3114.50114.83115.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305Clang 13Clang 12Clang 112004006008001000SE +/- 4.73, N = 3SE +/- 0.87, N = 3SE +/- 0.40, N = 3855.76882.17869.351. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClang 13Clang 12Clang 112004006008001000SE +/- 1.46, N = 3SE +/- 0.95, N = 3SE +/- 0.04, N = 3849.96874.91866.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkClang 13Clang 12Clang 11918273645SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 339.9936.6534.551. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 13Clang 12Clang 11300K600K900K1200K1500KSE +/- 806.80, N = 5SE +/- 1489.72, N = 5SE +/- 794.00, N = 51491816148138814774101. (CC) gcc options: -O3 -march=native

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishClang 13Clang 12Clang 1120K40K60K80K100KSE +/- 41.83, N = 3SE +/- 138.68, N = 3SE +/- 249.93, N = 31124071113351132001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5Clang 13Clang 12Clang 112M4M6M8M10MSE +/- 3785.94, N = 3SE +/- 16973.84, N = 3SE +/- 8504.90, N = 3104300001027533388800001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlClang 13Clang 12Clang 11400800120016002000SE +/- 8.69, N = 3SE +/- 8.51, N = 3SE +/- 13.09, N = 31874186418601. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 13Clang 12Clang 11160320480640800SE +/- 7.80, N = 3SE +/- 2.65, N = 3SE +/- 2.40, N = 37287237281. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClang 13Clang 12Clang 112004006008001000SE +/- 5.24, N = 3SE +/- 0.67, N = 3SE +/- 3.18, N = 38777587451. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClang 13Clang 12Clang 1130060090012001500SE +/- 5.78, N = 3SE +/- 5.49, N = 3SE +/- 0.67, N = 31174110411041. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 13Clang 12Clang 11100200300400500SE +/- 8.56, N = 15SE +/- 9.74, N = 15SE +/- 14.35, N = 124854524731. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianClang 13Clang 12Clang 11130260390520650SE +/- 3.06, N = 3SE +/- 4.10, N = 3SE +/- 2.33, N = 36006006231. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceClang 13Clang 12Clang 112004006008001000SE +/- 8.76, N = 3SE +/- 5.60, N = 15SE +/- 6.84, N = 38668528521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Summer Nature 4KClang 13Clang 12Clang 11120240360480600SE +/- 0.44, N = 3SE +/- 0.77, N = 3SE +/- 2.65, N = 3533.02532.51528.63MIN: 186.74 / MAX: 587.2MIN: 186.64 / MAX: 587.12MIN: 176.65 / MAX: 587.681. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.1Video Input: Chimera 1080p 10-bitClang 13Clang 12Clang 112004006008001000SE +/- 2.33, N = 3SE +/- 1.52, N = 3SE +/- 0.82, N = 3843.96844.78842.95MIN: 503.8 / MAX: 1131.86MIN: 517.09 / MAX: 1121.33MIN: 515.35 / MAX: 1115.741. (CC) gcc options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 11510152025SE +/- 0.24, N = 3SE +/- 0.28, N = 3SE +/- 0.23, N = 319.6019.3919.321. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111020304050SE +/- 0.42, N = 7SE +/- 0.40, N = 3SE +/- 0.36, N = 945.3044.8444.061. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.1Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.03, N = 3SE +/- 0.12, N = 3SE +/- 0.16, N = 356.6554.4754.441. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 13Clang 12Clang 111.11762.23523.35284.47045.588SE +/- 0.027, N = 3SE +/- 0.004, N = 3SE +/- 0.020, N = 34.9674.8494.8951. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KClang 13Clang 12Clang 111326395265SE +/- 0.38, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 358.9857.9257.841. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 13Clang 12Clang 1180160240320400SE +/- 3.85, N = 3SE +/- 1.14, N = 3SE +/- 2.06, N = 3353.72340.21344.591. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 13Clang 12Clang 11140280420560700SE +/- 3.02, N = 3SE +/- 2.53, N = 3SE +/- 2.02, N = 3626.33604.45616.451. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 13Clang 12Clang 11100200300400500SE +/- 5.26, N = 3SE +/- 4.57, N = 3SE +/- 4.56, N = 3476.72456.18460.201. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 13Clang 12Clang 1180160240320400SE +/- 4.27, N = 3SE +/- 2.53, N = 3SE +/- 1.62, N = 3374.91365.79362.671. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

VP9 libvpx Encoding

Speed: Speed 0 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 4KClang 13Clang 12Clang 111.28482.56963.85445.13926.424SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.715.625.601. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4KClang 13Clang 12Clang 1148121620SE +/- 0.14, N = 15SE +/- 0.09, N = 3SE +/- 0.13, N = 315.3514.1114.321. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClang 13Clang 12Clang 11500K1000K1500K2000K2500KSE +/- 1510.09, N = 3SE +/- 3711.92, N = 3SE +/- 2482.75, N = 32185330.042146447.382103884.891. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEClang 13Clang 12Clang 1111002200330044005500SE +/- 11.10, N = 3SE +/- 10.68, N = 3SE +/- 8.50, N = 35271529152891. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulClang 13Clang 12Clang 112K4K6K8K10KSE +/- 43.11, N = 3SE +/- 11.92, N = 3SE +/- 4.48, N = 31014010185101951. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessClang 13Clang 12Clang 119K18K27K36K45KSE +/- 380.52, N = 3SE +/- 510.13, N = 3SE +/- 575.37, N = 34187140364413051. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClang 13Clang 12Clang 1148121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.5915.2214.801. (CC) gcc options: -lm -lpthread -O3 -march=native

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.62921.25841.88762.51683.146SE +/- 0.00435, N = 3SE +/- 0.00591, N = 3SE +/- 0.00451, N = 32.789102.796582.78642MIN: 2.65MIN: 2.65MIN: 2.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.38830.77661.16491.55321.9415SE +/- 0.00225, N = 3SE +/- 0.00427, N = 3SE +/- 0.00619, N = 31.717701.725581.71640MIN: 1.55MIN: 1.57MIN: 1.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.46820.93641.40461.87282.341SE +/- 0.00196, N = 3SE +/- 0.00197, N = 3SE +/- 0.00072, N = 32.071682.069182.08089MIN: 1.99MIN: 1.99MIN: 1.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.73831.47662.21492.95323.6915SE +/- 0.00367, N = 3SE +/- 0.00384, N = 3SE +/- 0.00886, N = 33.281143.058213.05315MIN: 3.08MIN: 2.86MIN: 2.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 110.81341.62682.44023.25364.067SE +/- 0.00211, N = 3SE +/- 0.00151, N = 3SE +/- 0.00881, N = 33.614893.579523.59463MIN: 3.52MIN: 3.5MIN: 3.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 11130260390520650SE +/- 0.92, N = 3SE +/- 6.83, N = 4SE +/- 5.71, N = 13598.02611.93608.60MIN: 575.43MIN: 569.44MIN: 570.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUClang 13Clang 12Clang 1180160240320400SE +/- 0.40, N = 3SE +/- 1.79, N = 3SE +/- 2.59, N = 3365.73368.45368.25MIN: 355.4MIN: 347.27MIN: 349.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeClang 13Clang 12Clang 11816243240SE +/- 0.02, N = 3SE +/- 0.24, N = 3SE +/- 0.03, N = 336.0036.4336.171. (CC) gcc options: -lm -O3 -march=native

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 13Clang 12Clang 113691215SE +/- 0.069, N = 5SE +/- 0.006, N = 5SE +/- 0.013, N = 59.6329.9089.4231. (CXX) g++ options: -O3 -march=native -logg -lm

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeClang 13Clang 12Clang 1148121620SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 313.4713.9514.251. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Clang 13Clang 12Clang 11110220330440550SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3513.01503.93491.031. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Aircrack-ng

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.5.2Clang 13Clang 12Clang 1150K100K150K200K250KSE +/- 467.06, N = 3SE +/- 504.25, N = 3SE +/- 607.21, N = 3212825.68210157.65211799.581. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lsqlite3 -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 13Clang 12Clang 1150K100K150K200K250KSE +/- 40.95, N = 3SE +/- 339.82, N = 3SE +/- 87.48, N = 32404472508492520891. (CC) gcc options: -pedantic -O3

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPClang 13Clang 12Clang 118K16K24K32K40KSE +/- 450.24, N = 4SE +/- 290.04, N = 3SE +/- 19.15, N = 338153.4037760.8937221.001. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPClang 13Clang 12Clang 1113K26K39K52K65KSE +/- 17.71, N = 3SE +/- 82.28, N = 3SE +/- 38.93, N = 358281.7958502.8558558.261. (CXX) g++ options: -O3 -march=native -fopenmp

MariaDB

Clients: 2048

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 2048Clang 13Clang 12Clang 11150300450600750SE +/- 1.71, N = 3SE +/- 6.96, N = 9SE +/- 7.30, N = 9712698700-lpthread -lm -lstdc++-lpthread -lm -lstdc++-lbz2 -lsnappy1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

MariaDB

Clients: 4096

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.6.4Clients: 4096Clang 13Clang 12Clang 1170140210280350SE +/- 1.31, N = 3SE +/- 1.07, N = 3SE +/- 1.96, N = 3321319319-lpthread -lm -lstdc++-lpthread -lm -lstdc++-lbz2 -lsnappy1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -lrt -lz -ldl

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyClang 13Clang 12Clang 11200K400K600K800K1000KSE +/- 10672.94, N = 3SE +/- 3346.94, N = 3SE +/- 16009.81, N = 159734329815259672141. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyClang 13Clang 12Clang 110.05850.1170.17550.2340.2925SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 150.2570.2550.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 13Clang 12Clang 1120K40K60K80K100KSE +/- 119.61, N = 3SE +/- 85.43, N = 3SE +/- 786.14, N = 38485484132851621. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 13Clang 12Clang 110.66891.33782.00672.67563.3445SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.027, N = 32.9482.9732.9381. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Clang 13Clang 12Clang 111530456075SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 364.6465.2964.421. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: LionClang 13Clang 12Clang 1113002600390052006500SE +/- 1.73, N = 35967587058271. (CXX) g++ options: -O3 -march=native

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Church FacadeClang 13Clang 12Clang 1115003000450060007500SE +/- 4.26, N = 3SE +/- 3.18, N = 37202705770001. (CXX) g++ options: -O3 -march=native

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetClang 13Clang 12Clang 1148121620SE +/- 0.11, N = 3SE +/- 0.16, N = 3SE +/- 0.27, N = 1417.3817.6517.30MIN: 16.17 / MAX: 40.05MIN: 16.51 / MAX: 19.27MIN: 15.79 / MAX: 37.291. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Clang 13Clang 12Clang 113691215SE +/- 0.01, N = 3SE +/- 1.05, N = 3SE +/- 0.67, N = 147.8411.6211.46MIN: 7.49 / MAX: 13.76MIN: 7.71 / MAX: 25.56MIN: 7.56 / MAX: 31.761. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetClang 13Clang 12Clang 113691215SE +/- 1.33, N = 3SE +/- 1.31, N = 3SE +/- 0.66, N = 148.7911.0110.90MIN: 6.69 / MAX: 26.58MIN: 7.09 / MAX: 20.3MIN: 6.9 / MAX: 32.841. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Clang 13Clang 12Clang 1148121620SE +/- 2.13, N = 3SE +/- 2.00, N = 3SE +/- 1.01, N = 1412.6613.7515.32MIN: 8.73 / MAX: 27.07MIN: 9.12 / MAX: 41.76MIN: 8.59 / MAX: 45.641. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceClang 13Clang 12Clang 11246810SE +/- 0.01, N = 3SE +/- 1.08, N = 3SE +/- 0.47, N = 144.866.946.07MIN: 4.57 / MAX: 9.45MIN: 4.49 / MAX: 19.88MIN: 4.35 / MAX: 27.781. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetClang 13Clang 12Clang 11510152025SE +/- 0.31, N = 3SE +/- 1.87, N = 3SE +/- 1.02, N = 1417.8721.2421.59MIN: 16.58 / MAX: 37.88MIN: 17.28 / MAX: 39.77MIN: 16.19 / MAX: 50.351. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetClang 13Clang 12Clang 113691215SE +/- 0.17, N = 3SE +/- 1.31, N = 3SE +/- 0.16, N = 149.6411.049.97MIN: 8.9 / MAX: 11.05MIN: 9.24 / MAX: 14.73MIN: 8.85 / MAX: 29.231. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Clang 13Clang 12Clang 11510152025SE +/- 0.29, N = 3SE +/- 0.14, N = 3SE +/- 0.61, N = 1421.7121.4822.36MIN: 19.54 / MAX: 44.52MIN: 19.9 / MAX: 40.42MIN: 18.98 / MAX: 71.341. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyClang 13Clang 12Clang 11714212835SE +/- 0.54, N = 3SE +/- 0.55, N = 3SE +/- 0.26, N = 1427.9428.3526.79MIN: 25.71 / MAX: 39.58MIN: 25.63 / MAX: 60.64MIN: 23.92 / MAX: 56.11. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdClang 13Clang 12Clang 11510152025SE +/- 0.23, N = 3SE +/- 0.22, N = 3SE +/- 0.53, N = 1419.2119.2619.72MIN: 18.04 / MAX: 38.36MIN: 18.05 / MAX: 35.83MIN: 17.67 / MAX: 58.391. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mClang 13Clang 12Clang 11918273645SE +/- 1.77, N = 3SE +/- 3.84, N = 3SE +/- 2.48, N = 1423.9837.1136.26MIN: 20.12 / MAX: 66.47MIN: 19.88 / MAX: 96.76MIN: 18.78 / MAX: 124.571. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetClang 13Clang 12Clang 1115003000450060007500SE +/- 2.30, N = 3SE +/- 1.66, N = 3SE +/- 3.50, N = 34370.104353.396912.12MIN: 4351.72 / MAX: 4450.96MIN: 4332.87 / MAX: 4442MIN: 6888.33 / MAX: 6947.751. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Clang 13Clang 12Clang 11170340510680850SE +/- 0.66, N = 3SE +/- 0.70, N = 3SE +/- 0.69, N = 3539.65541.93773.47MIN: 536.06 / MAX: 559.33MIN: 536.56 / MAX: 554.17MIN: 690.55 / MAX: 820.361. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Clang 13Clang 12Clang 1120406080100SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 384.6183.86105.40MIN: 84.23 / MAX: 85MIN: 83.28 / MAX: 84.63MIN: 104.83 / MAX: 106.11. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Clang 13Clang 12Clang 11140280420560700SE +/- 0.27, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3400.26402.26657.10MIN: 399.7 / MAX: 401.33MIN: 401.13 / MAX: 403.41MIN: 656.47 / MAX: 657.891. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

nginx

Concurrent Requests: 1

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1Clang 13Clang 12Clang 1116K32K48K64K80KSE +/- 160.67, N = 3SE +/- 102.54, N = 3SE +/- 131.50, N = 375270.6575015.2974606.791. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 20

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 20Clang 13Clang 12Clang 1150K100K150K200K250KSE +/- 1379.02, N = 3SE +/- 1222.93, N = 3SE +/- 2398.86, N = 3254133.55251352.56254323.921. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 987.48, N = 3SE +/- 431.14, N = 3SE +/- 1506.43, N = 3208590.36206297.92205660.131. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 911.83, N = 3SE +/- 312.30, N = 3SE +/- 2823.39, N = 3196023.59194567.38196465.211. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 936.49, N = 3SE +/- 63.72, N = 3SE +/- 554.68, N = 3204880.02204058.28204519.001. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000Clang 13Clang 12Clang 1140K80K120K160K200KSE +/- 1305.06, N = 3SE +/- 632.41, N = 3SE +/- 947.34, N = 3205977.68205740.39205224.871. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500Clang 13Clang 12Clang 1130K60K90K120K150KSE +/- 843.46, N = 3SE +/- 1145.12, N = 15SE +/- 798.53, N = 3145589.94121643.80122576.131. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000Clang 13Clang 12Clang 1130K60K90K120K150KSE +/- 485.15, N = 3SE +/- 534.15, N = 3SE +/- 924.52, N = 15112600.02112287.17120827.451. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native


Phoronix Test Suite v10.8.4