apple m2 compilers

Apple M2 compiler benchmarks for a future article by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2208168-NE-APPLEM2CO23.

apple m2 compilersProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionClangGCCApple M2 @ 2.42GHz (4 Cores / 8 Threads)Apple MacBook Air (13 h M2 2022)8GB251GB APPLE SSD AP0256Z + 2 x 0GB APPLE SSD AP0256ZllvmpipeBroadcom Device 4433 + Broadcom Device 5f71Arch rolling5.19.0-rc7-asahi-2-1-ARCH (aarch64)KDE Plasma 5.25.4X Server 1.21.1.44.5 Mesa 22.1.6 (LLVM 14.0.6 128 bits)Clang 14.0.6ext42560x1600GCC 12.1.0 + Clang 14.0.6OpenBenchmarking.orgEnvironment Details- CFLAGS=-O3Processor Details- Scaling Governor: apple-cpufreq schedutilPython Details- Python 3.10.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected Compiler Details- GCC: --build=aarch64-unknown-linux-gnu --disable-libssp --disable-libstdcxx-pch --disable-multilib --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-fix-cortex-a53-835769 --enable-fix-cortex-a53-843419 --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,fortran,go,lto,objc,obj-c++ --enable-lto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-unknown-linux-gnu --mandir=/usr/share/man --with-arch=armv8-a --with-linker-hash-style=gnu

apple m2 compilerssockperf: Throughputcryptopp: Keyed Algorithmscryptopp: Unkeyed Algorithmscryptopp: Integer + Elliptic Curve Public Key Algorithmsmrbayes: Primate Phylogeny Analysislammps: Rhodopsin Proteinlibgav1: Chimera 1080plibgav1: Summer Nature 4Klibgav1: Summer Nature 1080plibgav1: Chimera 1080p 10-bitsimdjson: Kostyasimdjson: TopTweetsimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDxmrig: Monero - 1Mxmrig: Wownero - 1Mcompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedjpegxl: PNG - 5jpegxl: PNG - 7jpegxl: JPEG - 5jpegxl: JPEG - 7jpegxl: JPEG - 8luajit: Compositeluajit: Monte Carloluajit: Fast Fourier Transformluajit: Sparse Matrix Multiplyluajit: Dense LU Matrix Factorizationluajit: Jacobi Successive Over-Relaxationscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationbotan: KASUMIbotan: KASUMI - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacedav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitcoremark: CoreMark Size 666 - Iterations Per Secondhimeno: Poisson Pressure Solverc-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeprimesieve: 1e12aobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-opus: WAV To Opus Encodeespeak: Text-To-Speech Synthesisgcrypt: ngspice: C2670ngspice: C7552openjpeg: NASA Curiosity Panorama M34openssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 1 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 8 - 256 - 57tjbench: Decompression Throughputbasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3sqlite-speedtest: Timed Time - Size 1,000draco: Liondraco: Church Facadencnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - mnasnettnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1encode-wavpack: WAV To WavPackgnupg: 2.7GB Sample File Encryptionncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDetClangGCC793185404.764881360.1759632161.313491192.7533.440161.8455.66133.1681.343.034.431.004.354.452520.62676.23525.24234.8880.24385.725.93932.4272.44647.8691.94854.721.04146.635.5813.12114.37114.9838.651387.75424.27560.451902.443157.76893.812870.78450.23500.914508.826907.381986.5693.79491.826351.494347.604436.062436.816136.758136.985578.630570.921311157197155613121950376.77105.09527.87283.70175753.7642258507.40742195.54288.92640.36629.47541.5197.57314.66520.297255.91376.92656.0974872490582297201529.5107954.23846333376895333153386667190544667214.69863826.1636.54835.88170.15645.4533433504415.033.473.313.136.552.2811.0329.596.1611.1515.6920.1812.6410.383.637411.322426.55756.613330.08917.97843.85013.073.623.253.003.535.562.0910.8334.445.9011.8028.7120.2412.0910.17544.384.05847339591.620305590.7156971947.622786240.5663.315156.3255.13131.3577.303.054.251.044.184.192329.92541.93526.64821.4866.14986.026.84513.1265.45241.1716.95478.321.94543.420.789.78101.26102.1338.161386.16429.71560.211890.853156.17893.842454.22463.86563.134132.955124.151987.0384.29784.114341.749348.144415.066416.060136.749137.213558.134547.81732215861021715391641361375.87115.13534.01280.98199947.6892067634.59103479.02791.45639.09525.75137.2515.67814.29816.933258.393100.08466.0614882985348118071520.3106464.43031800060534667121106667167353333222.66638024.1306.00334.12370.00144.6353476508011.942.202.061.923.821.8612.5933.707.8412.0815.3614.6714.405.282.235229.211306.61453.284321.82518.42043.89111.992.212.201.902.283.261.9012.7232.807.0412.3915.2414.519.285.19501.171.87OpenBenchmarking.org

Sockperf

Test: Throughput

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.7Test: ThroughputClangGCC200K400K600K800K1000KSE +/- 6642.87, N = 5SE +/- 3804.35, N = 57931858473391. (CXX) g++ options: --param -O3 -rdynamic

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsClangGCC130260390520650SE +/- 0.04, N = 3SE +/- 0.03, N = 3404.76591.621. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsClangGCC130260390520650SE +/- 0.02, N = 3SE +/- 0.08, N = 3360.18590.721. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key AlgorithmsClangGCC5001000150020002500SE +/- 0.40, N = 3SE +/- 0.86, N = 32161.311947.621. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisClangGCC50100150200250SE +/- 1.35, N = 12SE +/- 3.68, N = 9192.75240.571. (CC) gcc options: -O3 -std=c99 -pedantic -lm -lreadline

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinClangGCC0.7741.5482.3223.0963.87SE +/- 0.035, N = 3SE +/- 0.038, N = 43.4403.3151. (CXX) g++ options: -O3 -ldl

libgav1

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.17Video Input: Chimera 1080pClangGCC4080120160200SE +/- 0.29, N = 3SE +/- 1.36, N = 3161.84156.321. (CXX) g++ options: -O3 -lrt

libgav1

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.17Video Input: Summer Nature 4KClangGCC1326395265SE +/- 0.16, N = 3SE +/- 0.21, N = 355.6655.131. (CXX) g++ options: -O3 -lrt

libgav1

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.17Video Input: Summer Nature 1080pClangGCC306090120150SE +/- 1.30, N = 3SE +/- 0.29, N = 3133.16131.351. (CXX) g++ options: -O3 -lrt

libgav1

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterlibgav1 0.17Video Input: Chimera 1080p 10-bitClangGCC20406080100SE +/- 0.49, N = 3SE +/- 1.03, N = 381.3477.301. (CXX) g++ options: -O3 -lrt

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaClangGCC0.68631.37262.05892.74523.4315SE +/- 0.00, N = 3SE +/- 0.00, N = 33.033.051. (CXX) g++ options: -O3

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetClangGCC0.99681.99362.99043.98724.984SE +/- 0.00, N = 3SE +/- 0.00, N = 34.434.251. (CXX) g++ options: -O3

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomClangGCC0.2340.4680.7020.9361.17SE +/- 0.00, N = 3SE +/- 0.00, N = 31.001.041. (CXX) g++ options: -O3

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsClangGCC0.97881.95762.93643.91524.894SE +/- 0.00, N = 3SE +/- 0.00, N = 34.354.181. (CXX) g++ options: -O3

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDClangGCC1.00132.00263.00394.00525.0065SE +/- 0.00, N = 3SE +/- 0.00, N = 34.454.191. (CXX) g++ options: -O3

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MClangGCC5001000150020002500SE +/- 20.43, N = 3SE +/- 44.51, N = 72520.62329.9-funroll-loops-static-libgcc -static-libstdc++1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MClangGCC6001200180024003000SE +/- 38.43, N = 3SE +/- 20.64, N = 32676.22541.9-funroll-loops-static-libgcc -static-libstdc++1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression SpeedClangGCC8001600240032004000SE +/- 29.67, N = 3SE +/- 7.65, N = 33525.23526.61. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression SpeedClangGCC10002000300040005000SE +/- 3.94, N = 3SE +/- 0.42, N = 34234.84821.41. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression SpeedClangGCC2004006008001000SE +/- 10.66, N = 3SE +/- 6.29, N = 3880.2866.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Decompression SpeedClangGCC11002200330044005500SE +/- 0.60, N = 3SE +/- 14.08, N = 34385.74986.01. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedClangGCC612182430SE +/- 0.20, N = 3SE +/- 0.17, N = 325.926.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedClangGCC10002000300040005000SE +/- 3.23, N = 3SE +/- 6.68, N = 33932.44513.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression SpeedClangGCC60120180240300SE +/- 3.71, N = 3SE +/- 2.09, N = 3272.4265.41. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Decompression SpeedClangGCC11002200330044005500SE +/- 0.98, N = 3SE +/- 5.93, N = 34647.85241.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression SpeedClangGCC150300450600750SE +/- 10.06, N = 15SE +/- 4.40, N = 15691.9716.91. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression SpeedClangGCC12002400360048006000SE +/- 1.01, N = 15SE +/- 1.43, N = 154854.75478.31. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression SpeedClangGCC510152025SE +/- 0.26, N = 3SE +/- 0.29, N = 321.021.91. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression SpeedClangGCC10002000300040005000SE +/- 4.49, N = 3SE +/- 2.75, N = 34146.64543.41. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

JPEG XL libjxl

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 5ClangGCC816243240SE +/- 0.36, N = 15SE +/- 0.20, N = 1535.5820.78-Xclang -mrelax-all1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie

JPEG XL libjxl

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 7ClangGCC3691215SE +/- 0.03, N = 3SE +/- 0.02, N = 313.129.78-Xclang -mrelax-all1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 5ClangGCC306090120150SE +/- 0.68, N = 3SE +/- 0.28, N = 3114.37101.26-Xclang -mrelax-all1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 7ClangGCC306090120150SE +/- 0.49, N = 3SE +/- 0.55, N = 3114.98102.13-Xclang -mrelax-all1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 8ClangGCC918273645SE +/- 0.24, N = 3SE +/- 0.19, N = 338.6538.16-Xclang -mrelax-all1. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie

LuaJIT

Test: Composite

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: CompositeClangGCC30060090012001500SE +/- 0.49, N = 3SE +/- 3.74, N = 31387.751386.161. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Monte CarloClangGCC90180270360450SE +/- 0.17, N = 3SE +/- 2.66, N = 3424.27429.711. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Fast Fourier TransformClangGCC120240360480600SE +/- 0.91, N = 3SE +/- 0.16, N = 3560.45560.211. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Sparse Matrix MultiplyClangGCC400800120016002000SE +/- 1.66, N = 3SE +/- 13.70, N = 31902.441890.851. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Dense LU Matrix FactorizationClangGCC7001400210028003500SE +/- 0.85, N = 3SE +/- 7.46, N = 33157.763156.171. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

LuaJIT

Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterLuaJIT 2.1-gitTest: Jacobi Successive Over-RelaxationClangGCC2004006008001000SE +/- 0.04, N = 3SE +/- 0.02, N = 3893.81893.841. (CC) gcc options: -lm -ldl -O2 -fomit-frame-pointer -O3 -U_FORTIFY_SOURCE -fno-stack-protector

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeClangGCC6001200180024003000SE +/- 3.27, N = 3SE +/- 3.41, N = 32870.782454.221. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloClangGCC100200300400500SE +/- 0.00, N = 3SE +/- 0.00, N = 3450.23463.861. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformClangGCC120240360480600SE +/- 0.71, N = 3SE +/- 0.57, N = 3500.91563.131. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyClangGCC10002000300040005000SE +/- 16.69, N = 3SE +/- 23.87, N = 34508.824132.951. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationClangGCC15003000450060007500SE +/- 8.30, N = 3SE +/- 6.68, N = 36907.385124.151. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClangGCC400800120016002000SE +/- 0.12, N = 3SE +/- 0.43, N = 31986.561987.031. (CC) gcc options: -O3 -lm

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIClangGCC20406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 393.7984.301. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClangGCC20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 391.8384.111. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishClangGCC80160240320400SE +/- 0.04, N = 3SE +/- 0.11, N = 3351.49341.751. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptClangGCC80160240320400SE +/- 0.00, N = 3SE +/- 0.03, N = 3347.60348.141. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishClangGCC90180270360450SE +/- 0.13, N = 3SE +/- 0.06, N = 3436.06415.071. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptClangGCC90180270360450SE +/- 0.01, N = 3SE +/- 0.06, N = 3436.82416.061. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256ClangGCC306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3136.76136.751. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClangGCC306090120150SE +/- 0.01, N = 3SE +/- 0.02, N = 3136.99137.211. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305ClangGCC130260390520650SE +/- 0.13, N = 3SE +/- 0.08, N = 3578.63558.131. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClangGCC120240360480600SE +/- 0.16, N = 3SE +/- 0.29, N = 3570.92547.821. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlClangGCC70140210280350SE +/- 6.11, N = 15SE +/- 4.72, N = 153113221. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClangGCC30060090012001500SE +/- 11.02, N = 3SE +/- 2.60, N = 3157115861. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClangGCC20406080100SE +/- 1.14, N = 15SE +/- 1.24, N = 15971021. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClangGCC4080120160200SE +/- 1.66, N = 15SE +/- 1.86, N = 31551711. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClangGCC130260390520650SE +/- 6.24, N = 3SE +/- 4.56, N = 156135391. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianClangGCC4080120160200SE +/- 1.33, N = 15SE +/- 2.09, N = 151211641. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceClangGCC30060090012001500SE +/- 12.49, N = 15SE +/- 18.02, N = 395013611. (CC) gcc options: -fopenmp -O3 -lwebp -lwebpmux -llcms2 -ltiff -lfreetype -ljasper -ljpeg -lwmflite -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080pClangGCC80160240320400SE +/- 5.38, N = 3SE +/- 4.00, N = 3376.77375.871. (CC) gcc options: -O3 -pthread -lm

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 4KClangGCC306090120150SE +/- 2.69, N = 15SE +/- 1.12, N = 3105.09115.131. (CC) gcc options: -O3 -pthread -lm

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Summer Nature 1080pClangGCC120240360480600SE +/- 2.72, N = 3SE +/- 0.68, N = 3527.87534.011. (CC) gcc options: -O3 -pthread -lm

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.0Video Input: Chimera 1080p 10-bitClangGCC60120180240300SE +/- 4.07, N = 3SE +/- 2.29, N = 9283.70280.981. (CC) gcc options: -O3 -pthread -lm

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClangGCC40K80K120K160K200KSE +/- 188.51, N = 3SE +/- 212.74, N = 3175753.76199947.691. (CC) gcc options: -O2 -O3 -lrt" -lrt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverClangGCC2K4K6K8K10KSE +/- 3.70, N = 3SE +/- 2.90, N = 38507.417634.591. (CC) gcc options: -O3

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClangGCC20406080100SE +/- 1.77, N = 15SE +/- 0.60, N = 1595.5479.031. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeClangGCC20406080100SE +/- 0.91, N = 4SE +/- 1.04, N = 488.9391.46-R/usr/lib1. (CXX) g++ options: -pipe -O3 -ffast-math -lSDL -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

Length: 1e12

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 8.0Length: 1e12ClangGCC918273645SE +/- 0.79, N = 15SE +/- 0.78, N = 1540.3739.101. (CXX) g++ options: -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeClangGCC714212835SE +/- 0.04, N = 3SE +/- 0.03, N = 329.4825.751. (CC) gcc options: -lm -O3

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.3WAV To FLACClangGCC918273645SE +/- 0.06, N = 5SE +/- 0.05, N = 541.5237.25-fvisibility=hidden1. (CXX) g++ options: -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3ClangGCC246810SE +/- 0.007, N = 3SE +/- 0.013, N = 37.5735.678-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -lncurses -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClangGCC48121620SE +/- 0.01, N = 5SE +/- 0.02, N = 514.6714.30-fvisibility=hidden1. (CXX) g++ options: -logg -lm

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisClangGCC510152025SE +/- 0.02, N = 4SE +/- 0.03, N = 420.3016.931. (CC) gcc options: -O3 -std=c99 -lpthread -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9ClangGCC60120180240300SE +/- 0.53, N = 3SE +/- 0.56, N = 3255.91258.391. (CC) gcc options: -O3 -fvisibility=hidden -lgpg-error

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670ClangGCC20406080100SE +/- 0.28, N = 3SE +/- 3.54, N = 1576.93100.081. (CC) gcc options: -O3 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552ClangGCC1530456075SE +/- 0.24, N = 3SE +/- 1.02, N = 1556.1066.061. (CC) gcc options: -O3 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34ClangGCC10K20K30K40K50KSE +/- 90.00, N = 3SE +/- 145.89, N = 348724488291. (CXX) g++ options: -rdynamic

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256ClangGCC2000M4000M6000M8000M10000MSE +/- 10759857.48, N = 3SE +/- 13180414.49, N = 390582297208534811807-Qunused-arguments1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096ClangGCC30060090012001500SE +/- 10.91, N = 3SE +/- 15.18, N = 31529.51520.3-Qunused-arguments1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096ClangGCC20K40K60K80K100KSE +/- 712.36, N = 3SE +/- 1045.44, N = 3107954.2106464.4-Qunused-arguments1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57ClangGCC8M16M24M32M40MSE +/- 4977.73, N = 3SE +/- 3511.88, N = 338463333303180001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57ClangGCC16M32M48M64M80MSE +/- 2403.70, N = 3SE +/- 4910.31, N = 376895333605346671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57ClangGCC30M60M90M120M150MSE +/- 133832.40, N = 3SE +/- 8819.17, N = 31533866671211066671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57ClangGCC40M80M120M160M200MSE +/- 1293649.64, N = 15SE +/- 521674.65, N = 31905446671673533331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputClangGCC50100150200250SE +/- 0.04, N = 3SE +/- 0.03, N = 3214.70222.67-lm1. (CC) gcc options: -O3 -rdynamic

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1SClangGCC612182430SE +/- 0.20, N = 3SE +/- 0.11, N = 326.1624.131. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0ClangGCC246810SE +/- 0.012, N = 3SE +/- 0.007, N = 36.5486.0031. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2ClangGCC816243240SE +/- 0.24, N = 3SE +/- 0.14, N = 335.8834.121. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3ClangGCC1632486480SE +/- 0.68, N = 3SE +/- 0.74, N = 370.1670.001. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000ClangGCC1020304050SE +/- 0.05, N = 3SE +/- 0.07, N = 345.4544.641. (CC) gcc options: -O3 -lz

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: LionClangGCC7001400210028003500SE +/- 2.67, N = 3SE +/- 8.25, N = 3343334761. (CXX) g++ options: -O3

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: Church FacadeClangGCC11002200330044005500SE +/- 3.71, N = 3SE +/- 4.58, N = 3504450801. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetClangGCC48121620SE +/- 1.12, N = 13SE +/- 0.15, N = 415.0311.94-lomp - MIN: 10.47 / MAX: 34.45-lgomp - MIN: 8.2 / MAX: 27.451. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v2-v2 - Model: mobilenet-v2ClangGCC0.78081.56162.34243.12323.904SE +/- 0.06, N = 13SE +/- 0.01, N = 43.472.20-lomp - MIN: 3.08 / MAX: 5.31-lgomp - MIN: 2.17 / MAX: 2.41. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v3-v3 - Model: mobilenet-v3ClangGCC0.74481.48962.23442.97923.724SE +/- 0.08, N = 13SE +/- 0.01, N = 43.312.06-lomp - MIN: 2.56 / MAX: 7.12-lgomp - MIN: 2.03 / MAX: 2.171. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2ClangGCC0.70431.40862.11292.81723.5215SE +/- 0.07, N = 13SE +/- 0.00, N = 43.131.92-lomp - MIN: 2.48 / MAX: 4.66-lgomp - MIN: 1.9 / MAX: 2.281. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0ClangGCC246810SE +/- 0.15, N = 13SE +/- 0.20, N = 46.553.82-lomp - MIN: 5.13 / MAX: 7.99-lgomp - MIN: 3.55 / MAX: 25.761. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceClangGCC0.5131.0261.5392.0522.565SE +/- 0.12, N = 13SE +/- 0.04, N = 42.281.86-lomp - MIN: 1.38 / MAX: 5.9-lgomp - MIN: 1.02 / MAX: 8.51. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetClangGCC3691215SE +/- 0.26, N = 13SE +/- 0.06, N = 411.0312.59-lomp - MIN: 9.1 / MAX: 21.81-lgomp - MIN: 8.3 / MAX: 21.521. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: vgg16ClangGCC816243240SE +/- 0.34, N = 13SE +/- 0.10, N = 429.5933.70-lomp - MIN: 27.88 / MAX: 48.18-lgomp - MIN: 28.28 / MAX: 51.281. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet18ClangGCC246810SE +/- 0.07, N = 13SE +/- 0.12, N = 46.167.84-lomp - MIN: 5.6 / MAX: 7.49-lgomp - MIN: 5.48 / MAX: 21.031. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetClangGCC3691215SE +/- 0.31, N = 13SE +/- 0.11, N = 411.1512.08-lomp - MIN: 10.07 / MAX: 22.61-lgomp - MIN: 9.11 / MAX: 22.771. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50ClangGCC48121620SE +/- 0.48, N = 13SE +/- 0.12, N = 415.6915.36-lomp - MIN: 13.48 / MAX: 29.95-lgomp - MIN: 13.43 / MAX: 24.891. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyClangGCC510152025SE +/- 0.57, N = 13SE +/- 0.31, N = 420.1814.67-lomp - MIN: 15.44 / MAX: 33.23-lgomp - MIN: 12.77 / MAX: 26.021. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdClangGCC48121620SE +/- 0.17, N = 13SE +/- 0.06, N = 412.6414.40-lomp - MIN: 9.67 / MAX: 27.34-lgomp - MIN: 9.79 / MAX: 29.221. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mClangGCC3691215SE +/- 0.26, N = 13SE +/- 0.04, N = 410.385.28-lomp - MIN: 8.02 / MAX: 15.57-lgomp - MIN: 5.16 / MAX: 5.431. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetClangGCC0.81681.63362.45043.26724.084SE +/- 0.08, N = 133.632.23-lomp - MIN: 2.94 / MAX: 4.76-lgomp - MIN: 2.22 / MAX: 2.331. (CXX) g++ options: -O3 -rdynamic -lpthread

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetClangGCC16003200480064008000SE +/- 63.12, N = 8SE +/- 2.99, N = 37411.325229.21-fopenmp=libomp - MIN: 5728.98 / MAX: 13889.4-fopenmp - MIN: 5076.63 / MAX: 5318.221. (CXX) g++ options: -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2ClangGCC90180270360450SE +/- 3.88, N = 7SE +/- 0.23, N = 3426.56306.61-fopenmp=libomp - MIN: 350.24 / MAX: 452.86-fopenmp - MIN: 296.56 / MAX: 311.251. (CXX) g++ options: -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2ClangGCC1326395265SE +/- 0.03, N = 3SE +/- 0.00, N = 356.6153.28-fopenmp=libomp - MIN: 56.51 / MAX: 56.7-fopenmp - MIN: 53.24 / MAX: 53.391. (CXX) g++ options: -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1ClangGCC70140210280350SE +/- 0.02, N = 3SE +/- 0.06, N = 3330.09321.83-fopenmp=libomp - MIN: 329.96 / MAX: 330.29-fopenmp - MIN: 321.48 / MAX: 322.341. (CXX) g++ options: -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackClangGCC510152025SE +/- 0.00, N = 5SE +/- 0.00, N = 517.9818.421. (CXX) g++ options: -rdynamic

GnuPG

2.7GB Sample File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File EncryptionClangGCC1020304050SE +/- 0.20, N = 3SE +/- 0.26, N = 343.8543.891. (CC) gcc options: -O3

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenetClangGCC3691215SE +/- 0.34, N = 12SE +/- 0.15, N = 313.0711.99-lomp - MIN: 9.68 / MAX: 25.4-lgomp - MIN: 8.33 / MAX: 23.041. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v2ClangGCC0.81451.6292.44353.2584.0725SE +/- 0.27, N = 12SE +/- 0.08, N = 33.622.21-lomp - MIN: 2.9 / MAX: 8.87-lgomp - MIN: 2.09 / MAX: 5.321. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v3ClangGCC0.73131.46262.19392.92523.6565SE +/- 0.11, N = 12SE +/- 0.03, N = 33.252.20-lomp - MIN: 2.49 / MAX: 9.1-lgomp - MIN: 2.08 / MAX: 5.171. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v2ClangGCC0.6751.352.0252.73.375SE +/- 0.07, N = 12SE +/- 0.05, N = 33.001.90-lomp - MIN: 2.11 / MAX: 12.45-lgomp - MIN: 1.82 / MAX: 5.071. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetClangGCC0.79431.58862.38293.17723.9715SE +/- 0.07, N = 12SE +/- 0.09, N = 33.532.28-lomp - MIN: 2.72 / MAX: 7.36-lgomp - MIN: 2.11 / MAX: 5.511. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0ClangGCC1.2512.5023.7535.0046.255SE +/- 0.11, N = 12SE +/- 0.02, N = 35.563.26-lomp - MIN: 4.41 / MAX: 15.78-lgomp - MIN: 3.21 / MAX: 12.851. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceClangGCC0.47030.94061.41091.88122.3515SE +/- 0.06, N = 12SE +/- 0.05, N = 32.091.90-lomp - MIN: 1.37 / MAX: 3.21-lgomp - MIN: 0.98 / MAX: 7.721. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetClangGCC3691215SE +/- 0.19, N = 12SE +/- 0.07, N = 310.8312.72-lomp - MIN: 8.82 / MAX: 22.43-lgomp - MIN: 8.11 / MAX: 27.991. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg16ClangGCC816243240SE +/- 1.89, N = 12SE +/- 0.02, N = 334.4432.80-lomp - MIN: 27.08 / MAX: 74.43-lgomp - MIN: 27.43 / MAX: 42.651. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet18ClangGCC246810SE +/- 0.05, N = 12SE +/- 0.18, N = 35.907.04-lomp - MIN: 5.32 / MAX: 7.13-lgomp - MIN: 5.13 / MAX: 19.311. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnetClangGCC3691215SE +/- 0.48, N = 12SE +/- 0.12, N = 311.8012.39-lomp - MIN: 9.79 / MAX: 24.49-lgomp - MIN: 8.84 / MAX: 22.791. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50ClangGCC714212835SE +/- 3.28, N = 12SE +/- 0.03, N = 328.7115.24-lomp - MIN: 13.32 / MAX: 216.5-lgomp - MIN: 13.38 / MAX: 25.591. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tinyClangGCC510152025SE +/- 1.95, N = 12SE +/- 0.07, N = 320.2414.51-lomp - MIN: 13.87 / MAX: 70.08-lgomp - MIN: 12.49 / MAX: 23.481. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssdClangGCC3691215SE +/- 1.64, N = 12SE +/- 0.10, N = 312.099.28-lomp - MIN: 7.83 / MAX: 31.07-lgomp - MIN: 7.07 / MAX: 20.521. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mClangGCC3691215SE +/- 0.41, N = 12SE +/- 0.03, N = 310.175.19-lomp - MIN: 7.32 / MAX: 18.84-lgomp - MIN: 5.13 / MAX: 9.151. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerClangGCC120240360480600SE +/- 10.55, N = 12SE +/- 0.71, N = 3544.38501.17-lomp - MIN: 375.2 / MAX: 1425.41-lgomp - MIN: 475.83 / MAX: 544.851. (CXX) g++ options: -O3 -rdynamic -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetClangGCC0.91131.82262.73393.64524.5565SE +/- 0.40, N = 11SE +/- 0.11, N = 34.051.87-lomp - MIN: 2.43 / MAX: 8.08-lgomp - MIN: 1.69 / MAX: 12.731. (CXX) g++ options: -O3 -rdynamic -lpthread


Phoronix Test Suite v10.8.4