Apple M1 Compilers

Apple M1 compiler testing for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2204107-NE-APPLEM1CO68

Run Management

  Result Identifier   Date Run        Test Duration
  GCC 11.2.0          April 08 2022   5 Hours, 11 Minutes
  Clang 13.0.1        April 09 2022   5 Hours, 35 Minutes
  Average                             5 Hours, 23 Minutes


Apple M1 Compilers (OpenBenchmarking.org, Phoronix Test Suite)

  Processor: Apple M1 @ 2.06GHz (4 Cores / 8 Threads)
  Motherboard: Apple Mac mini (M1 2020)
  Memory: 8GB
  Disk: 251GB APPLE SSD AP0256Q + 2 x 0GB APPLE SSD AP0256Q
  Graphics: llvmpipe
  Network: Broadcom NetXtreme BCM57762 PCIe + Broadcom BRCM4378 + Broadcom Device 5f69
  OS: Arch Linux ARM
  Kernel: 5.17.0-rc7-asahi-next-20220310-5-2-ARCH (aarch64)
  Desktop: KDE Plasma 5.24.4
  Display Server: X Server 1.21.1.3
  OpenGL: 4.5 Mesa 22.0.1 (LLVM 13.0.1 128 bits)
  Compilers: GCC 11.2.0 + Clang 13.0.1 / Clang 13.0.1
  File-System: ext4
  Screen Resolution: 1920x1080

System Logs

  - CXXFLAGS="-O3 -flto" CFLAGS="-O3 -flto"
  - GCC 11.2.0: --build=aarch64-unknown-linux-gnu --disable-libssp --disable-libstdcxx-pch --disable-multilib --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-fix-cortex-a53-835769 --enable-fix-cortex-a53-843419 --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,fortran,go,lto,objc,obj-c++,d --enable-lto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-unknown-linux-gnu --mandir=/usr/share/man --with-arch=armv8-a --with-isl --with-linker-hash-style=gnu
  - MQ-DEADLINE / relatime,rw / Block Size: 4096
  - Scaling Governor: apple-cpufreq schedutil
  - Python 3.10.4
  - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 11.2.0 vs. Clang 13.0.1 Comparison (Phoronix Test Suite chart: per-test percentage difference between the two compilers relative to the baseline, with axis marks at +43.2%, +86.4%, and +129.6%)
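The comparison above expresses each test as a percentage difference between the two compilers. As a minimal sketch of one way such a figure could be derived from a pair of results, the helper below (the function name `compare` is hypothetical, and the formula is an assumption rather than the Phoronix Test Suite's documented method) reports which run leads and by how much, taking into account whether more or fewer is better.

```python
def compare(gcc: float, clang: float, higher_is_better: bool):
    """Return (leading run, lead in percent) for one pair of results.

    Assumed formula: the larger value divided by the smaller one, minus 1.
    """
    if gcc <= 0 or clang <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        winner = "GCC 11.2.0" if gcc >= clang else "Clang 13.0.1"
    else:
        winner = "GCC 11.2.0" if gcc <= clang else "Clang 13.0.1"
    lead = (max(gcc, clang) / min(gcc, clang) - 1.0) * 100.0
    return winner, round(lead, 1)


# Values taken from the results on this page.
print(compare(954.96, 823.15, higher_is_better=True))   # Crypto++ All Algorithms, MiB/second
print(compare(70.65, 59.07, higher_is_better=False))    # FLAC encoding, seconds
```

Dividing the larger value by the smaller keeps the figure symmetric for "more is better" and "fewer is better" tests; only the choice of the leading run depends on the direction.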


Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Crypto++ 8.2 - Test: All Algorithms (MiB/second, More Is Better)
  GCC 11.2.0: 954.96 (SE +/- 0.23, N = 3)
  Clang 13.0.1: 823.15 (SE +/- 0.17, N = 3)
  (CXX) g++ options: -O3 -flto -fPIC -pthread -pipe
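Each result is reported with a standard error (SE) and a run count (N). The sketch below uses the conventional definition, the sample standard deviation divided by the square root of N; whether the Phoronix Test Suite computes it in exactly this way is an assumption, and the individual run values are hypothetical because the page only lists the aggregate.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run throughputs in MiB/second (not taken from the result file).
runs = [954.6, 955.0, 955.3]
print(f"{statistics.mean(runs):.2f} (SE +/- {standard_error(runs):.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs agreed closely, which is worth checking before reading much into a difference of only a few percent between the compilers.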

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.12.1 - Variant: Monero - Hash Count: 1M (H/s, More Is Better)
  GCC 11.2.0: 2247.2 (SE +/- 9.05, N = 3)
  Clang 13.0.1: 2209.7 (SE +/- 7.70, N = 3)
  Additional per-run options reported: -static-libgcc -static-libstdc++; -funroll-loops
  (CXX) g++ options: -O3 -flto -fexceptions -fno-rtti -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Crypto++ 8.2 - Test: Keyed Algorithms (MiB/second, More Is Better)
  GCC 11.2.0: 508.84 (SE +/- 0.07, N = 3)
  Clang 13.0.1: 374.90 (SE +/- 1.08, N = 3)
  (CXX) g++ options: -O3 -flto -fPIC -pthread -pipe

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28 - Backend: Eigen (Nodes Per Second, More Is Better)
  GCC 11.2.0: 1263 (SE +/- 10.69, N = 3)
  Clang 13.0.1: 1297 (SE +/- 18.26, N = 3)
  (CXX) g++ options: -flto -O3 -pthread

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.12.1 - Variant: Wownero - Hash Count: 1M (H/s, More Is Better)
  GCC 11.2.0: 2798.2 (SE +/- 1.83, N = 3)
  Clang 13.0.1: 2804.8 (SE +/- 1.95, N = 3)
  Additional per-run options reported: -static-libgcc -static-libstdc++; -funroll-loops
  (CXX) g++ options: -O3 -flto -fexceptions -fno-rtti -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 0 (Seconds, Fewer Is Better)
  GCC 11.2.0: 287.40 (SE +/- 0.92, N = 3)
  Clang 13.0.1: 303.55 (SE +/- 0.16, N = 3)
  (CXX) g++ options: -O3 -fPIC -flto -lm

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Crypto++ 8.2 - Test: Integer + Elliptic Curve Public Key Algorithms (MiB/second, More Is Better)
  GCC 11.2.0: 1766.99 (SE +/- 0.67, N = 3)
  Clang 13.0.1: 1875.52 (SE +/- 1.78, N = 3)
  (CXX) g++ options: -O3 -flto -fPIC -pthread -pipe

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0 - Algorithm: SHA256 (byte/s, More Is Better)
  GCC 11.2.0: 8059691050 (SE +/- 12283962.01, N = 3)
  Clang 13.0.1: 8474527350 (SE +/- 3887401.32, N = 3)
  Additional per-run options reported: -Qunused-arguments
  (CC) gcc options: -pthread -O3 -flto -lssl -lcrypto -ldl

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 2 (Seconds, Fewer Is Better)
  GCC 11.2.0: 143.44 (SE +/- 0.32, N = 3)
  Clang 13.0.1: 161.61 (SE +/- 0.72, N = 3)
  (CXX) g++ options: -O3 -fPIC -flto -lm

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0 - Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 4221.1 (SE +/- 0.25, N = 15)
  Clang 13.0.1: 4356.6 (SE +/- 0.40, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 3, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 240.0 (SE +/- 2.00, N = 15)
  Clang 13.0.1: 253.7 (SE +/- 3.51, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format ten times. Learn more via the OpenBenchmarking.org test page.

FLAC Audio Encoding 1.3.3 - WAV To FLAC (Seconds, Fewer Is Better)
  GCC 11.2.0: 70.65 (SE +/- 0.01, N = 5)
  Clang 13.0.1: 59.07 (SE +/- 0.01, N = 5)
  Additional per-run options reported: -fvisibility=hidden
  (CXX) g++ options: -O3 -flto -logg -lm

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
  GCC 11.2.0: 72.02 (SE +/- 0.85, N = 4)
  Clang 13.0.1: 62.42 (SE +/- 0.64, N = 5)
  Additional per-run options reported: -R/usr/lib
  (CXX) g++ options: -pipe -O3 -ffast-math -flto -lSDL -lpthread -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.13.02 - Test: Vector Math (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 23954.10 (SE +/- 195.44, N = 15)
  Clang 13.0.1: 41899.94 (SE +/- 2.19, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 3765.4 (SE +/- 0.92, N = 3)
  Clang 13.0.1: 3887.3 (SE +/- 0.69, N = 4)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 18.8 (SE +/- 0.13, N = 3)
  Clang 13.0.1: 18.8 (SE +/- 0.21, N = 4)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20210720 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
  GCC 11.2.0: 5.88 (SE +/- 0.03, N = 3; MIN: 5.78 / MAX: 8.62; -lgomp -lpthread)
  Clang 13.0.1: 8.08 (SE +/- 0.00, N = 3; MIN: 8.05 / MAX: 8.15)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
  GCC 11.2.0: 14.26 (SE +/- 0.17, N = 3; MIN: 9.6 / MAX: 28.57; -lgomp -lpthread)
  Clang 13.0.1: 20.15 (SE +/- 0.00, N = 3; MIN: 20.08 / MAX: 20.21)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
  GCC 11.2.0: 17.20 (SE +/- 0.07, N = 3; MIN: 14.01 / MAX: 27.33; -lgomp -lpthread)
  Clang 13.0.1: 32.77 (SE +/- 0.00, N = 3; MIN: 32.68 / MAX: 32.88)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  GCC 11.2.0: 17.16 (SE +/- 0.08, N = 3; MIN: 15.54 / MAX: 27.86; -lgomp -lpthread)
  Clang 13.0.1: 46.79 (SE +/- 0.01, N = 3; MIN: 46.7 / MAX: 46.9)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
  GCC 11.2.0: 11.81 (SE +/- 0.10, N = 3; MIN: 9.48 / MAX: 21.58; -lgomp -lpthread)
  Clang 13.0.1: 31.66 (SE +/- 0.00, N = 3; MIN: 31.62 / MAX: 33.42)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
  GCC 11.2.0: 7.31 (SE +/- 0.04, N = 3; MIN: 6.17 / MAX: 16.92; -lgomp -lpthread)
  Clang 13.0.1: 18.18 (SE +/- 0.01, N = 3; MIN: 18.14 / MAX: 18.23)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
  GCC 11.2.0: 33.78 (SE +/- 0.14, N = 3; MIN: 30.68 / MAX: 45.72; -lgomp -lpthread)
  Clang 13.0.1: 80.44 (SE +/- 0.01, N = 3; MIN: 80.22 / MAX: 80.95)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
  GCC 11.2.0: 13.32 (SE +/- 0.10, N = 3; MIN: 9.14 / MAX: 21.97; -lgomp -lpthread)
  Clang 13.0.1: 25.11 (SE +/- 0.01, N = 3; MIN: 25.07 / MAX: 25.16)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  GCC 11.2.0: 4.18 (SE +/- 0.02, N = 3; MIN: 4.13 / MAX: 8.1; -lgomp -lpthread)
  Clang 13.0.1: 9.51 (SE +/- 0.00, N = 3; MIN: 9.47 / MAX: 9.67)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
  GCC 11.2.0: 2.52 (SE +/- 0.01, N = 3; MIN: 2.48 / MAX: 2.84; -lgomp -lpthread)
  Clang 13.0.1: 5.86 (SE +/- 0.01, N = 2; MIN: 5.84 / MAX: 5.87)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  GCC 11.2.0: 2.17 (SE +/- 0.01, N = 3; MIN: 2.15 / MAX: 2.48; -lgomp -lpthread)
  Clang 13.0.1: 3.76 (SE +/- 0.00, N = 3; MAX: 3.85)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  GCC 11.2.0: 2.34 (SE +/- 0.01, N = 3; MIN: 2.32 / MAX: 2.49; -lgomp -lpthread)
  Clang 13.0.1: 4.82 (SE +/- 0.01, N = 3; MIN: 4.8 / MAX: 4.85)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  GCC 11.2.0: 2.61 (SE +/- 0.05, N = 3; MIN: 2.48 / MAX: 12.2; -lgomp -lpthread)
  Clang 13.0.1: 5.84 (SE +/- 0.01, N = 3; MIN: 5.81 / MAX: 5.87)
  (CXX) g++ options: -O3 -flto -rdynamic

NCNN 20210720 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  GCC 11.2.0: 14.40 (SE +/- 0.17, N = 3; MIN: 9.21 / MAX: 25.2; -lgomp -lpthread)
  Clang 13.0.1: 22.21 (SE +/- 0.01, N = 3; MIN: 22.15 / MAX: 22.25)
  (CXX) g++ options: -O3 -flto -rdynamic

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
  GCC 11.2.0: 64.44 (SE +/- 0.04, N = 3)
  Clang 13.0.1: 87.82 (SE +/- 0.05, N = 3)
  (CC) gcc options: -lm -lpthread -O3 -flto

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Crypto++ 8.2 - Test: Unkeyed Algorithms (MiB/second, More Is Better)
  GCC 11.2.0: 539.28 (SE +/- 0.04, N = 3)
  Clang 13.0.1: 338.41 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O3 -flto -fPIC -pthread -pipe

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0 - Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 3546.2 (SE +/- 0.15, N = 3)
  Clang 13.0.1: 3684.3 (SE +/- 1.62, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 22.7 (SE +/- 0.07, N = 3)
  Clang 13.0.1: 23.2 (SE +/- 0.17, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 17478.5 (SE +/- 1.03, N = 3)
  Clang 13.0.1: 16863.3 (SE +/- 3.18, N = 3)
  (CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 48.94 (SE +/- 0.09, N = 3)
  Clang 13.0.1: 49.89 (SE +/- 0.12, N = 3)
  (CC) gcc options: -O3

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0 - Algorithm: RSA4096 (verify/s, More Is Better)
  GCC 11.2.0: 99370.5 (SE +/- 18.59, N = 3)
  Clang 13.0.1: 99445.4 (SE +/- 16.80, N = 3)
  Additional per-run options reported: -Qunused-arguments
  (CC) gcc options: -pthread -O3 -flto -lssl -lcrypto -ldl

OpenSSL 3.0 - Algorithm: RSA4096 (sign/s, More Is Better)
  GCC 11.2.0: 1408.5 (SE +/- 0.78, N = 3)
  Clang 13.0.1: 1391.4 (SE +/- 0.15, N = 3)
  Additional per-run options reported: -Qunused-arguments
  (CC) gcc options: -pthread -O3 -flto -lssl -lcrypto -ldl

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 17490.9 (SE +/- 0.40, N = 3)
  Clang 13.0.1: 16877.4 (SE +/- 3.46, N = 3)
  (CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 51.99 (SE +/- 0.05, N = 3)
  Clang 13.0.1: 51.32 (SE +/- 0.09, N = 3)
  (CC) gcc options: -O3

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  GCC 11.2.0: 51.37 (SE +/- 0.04, N = 3)
  Clang 13.0.1: 52.90 (SE +/- 0.07, N = 3)
  (CC) gcc options: -O3 -flto -lz

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0 - Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 4416.3 (SE +/- 1.55, N = 3)
  Clang 13.0.1: 4553.4 (SE +/- 3.13, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 693.0 (SE +/- 2.35, N = 3)
  Clang 13.0.1: 703.4 (SE +/- 2.38, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 4016.4 (SE +/- 1.95, N = 3)
  Clang 13.0.1: 4141.0 (SE +/- 3.02, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 8 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 721.5 (SE +/- 3.70, N = 3)
  Clang 13.0.1: 699.6 (SE +/- 4.97, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 3850.2 (SE +/- 0.87, N = 3)
  Clang 13.0.1: 3977.7 (SE +/- 0.75, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

Zstd Compression 1.5.0 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 3341.2 (SE +/- 6.19, N = 3)
  Clang 13.0.1: 3301.1 (SE +/- 39.46, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lz -llzma -llz4

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
  GCC 11.2.0: 22.29 (SE +/- 0.03, N = 4)
  Clang 13.0.1: 23.43 (SE +/- 0.03, N = 4)
  (CC) gcc options: -O3 -flto -std=c99 -lpthread -lm

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

WavPack Audio Encoding 5.3 - WAV To WavPack (Seconds, Fewer Is Better)
  GCC 11.2.0: 17.21 (SE +/- 0.00, N = 5)
  Clang 13.0.1: 19.32 (SE +/- 0.00, N = 5)
  (CXX) g++ options: -O3 -flto -rdynamic

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

AOBench - Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
  GCC 11.2.0: 27.46 (SE +/- 0.03, N = 3)
  Clang 13.0.1: 33.40 (SE +/- 0.01, N = 3)
  (CC) gcc options: -lm -O3 -flto

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.13.02 - Test: Crypto (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 1511.75 (SE +/- 16.47, N = 3)
  Clang 13.0.1: 1527.17 (SE +/- 0.94, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Stress-NG 0.13.02 - Test: Memory Copying (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 2763.25 (SE +/- 6.71, N = 3)
  Clang 13.0.1: 3741.17 (SE +/- 15.21, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Stress-NG 0.13.02 - Test: IO_uring (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 144281.67 (SE +/- 28.54, N = 3)
  Clang 13.0.1: 147040.98 (SE +/- 271.95, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Stress-NG 0.13.02 - Test: Matrix Math (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 23588.96 (SE +/- 332.61, N = 3)
  Clang 13.0.1: 30254.21 (SE +/- 0.69, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Stress-NG 0.13.02 - Test: Socket Activity (Bogo Ops/s, More Is Better)
  GCC 11.2.0: 4331.71 (SE +/- 4.58, N = 3)
  Clang 13.0.1: 4313.48 (SE +/- 13.20, N = 3)
  (CC) gcc options: -O3 -flto -O2 -std=gnu99 -lm -laio -lbsd -lcrypt -lrt -lz -ldl -pthread -lkmod -lc -latomic

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.
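For readers unfamiliar with the technique, here is a minimal, unoptimized sieve of Eratosthenes. It is only an illustration of the basic idea and bears no relation to Primesieve's actual implementation, which adds segmentation and cache-aware optimizations; the function name `primes_up_to` is hypothetical.

```python
def primes_up_to(limit: int) -> list[int]:
    """Return all primes <= limit using a plain sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag per integer 0..limit; 1 means "still considered prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross out every multiple of p, starting at p*p
            # (smaller multiples were already removed by smaller primes).
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytearray(count)
        p += 1
    return [n for n in range(limit + 1) if is_prime[n]]


print(primes_up_to(30))
```

The whole flag array is touched repeatedly with a growing stride, which is why a tuned sieve is sensitive to cache size and why the benchmark is described in terms of L1/L2 cache performance.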

Primesieve 7.7 - 1e12 Prime Number Generation (Seconds, Fewer Is Better)
  GCC 11.2.0: 29.12 (SE +/- 0.01, N = 3)
  Clang 13.0.1: 29.63 (SE +/- 0.03, N = 3)
  (CXX) g++ options: -O3 -flto

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
  GCC 11.2.0: 27018.5 (SE +/- 1.47, N = 3)
  Clang 13.0.1: 26736.4 (SE +/- 8.86, N = 3)
  (CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
  GCC 11.2.0: 21909.45 (SE +/- 5.50, N = 3)
  Clang 13.0.1: 21875.54 (SE +/- 3.05, N = 3)
  (CC) gcc options: -O3

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  GCC 11.2.0: 179896.60 (SE +/- 59.94, N = 3)
  Clang 13.0.1: 148361.36 (SE +/- 146.27, N = 3)
  (CC) gcc options: -O2 -O3 -flto -lrt" -lrt

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 6 (Seconds, Fewer Is Better)
  GCC 11.2.0: 14.09 (SE +/- 0.14, N = 6)
  Clang 13.0.1: 13.52 (SE +/- 0.15, N = 5)
  (CXX) g++ options: -O3 -fPIC -flto -lm

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.

libjpeg-turbo tjbench 2.1.0 - Test: Decompression Throughput (Megapixels/sec, More Is Better)
  GCC 11.2.0: 206.18 (SE +/- 0.03, N = 3)
  Clang 13.0.1: 197.95 (SE +/- 2.27, N = 3)
  Additional per-run options reported: -lm
  (CC) gcc options: -O3 -flto -rdynamic

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  GCC 11.2.0: 151120000 (SE +/- 0.00, N = 3)
  Clang 13.0.1: 196510000 (SE +/- 0.00, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  GCC 11.2.0: 115230000 (SE +/- 0.00, N = 3)
  Clang 13.0.1: 151820000 (SE +/- 0.00, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 2 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  GCC 11.2.0: 57611000 (SE +/- 2081.67, N = 3)
  Clang 13.0.1: 75898667 (SE +/- 1763.83, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  GCC 11.2.0: 28778667 (SE +/- 3527.67, N = 3)
  Clang 13.0.1: 37897333 (SE +/- 2905.93, N = 3)
  (CC) gcc options: -O3 -flto -pthread -lm -lc -lliquid

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
  GCC 11.2.0: 15.65 (SE +/- 0.18, N = 3)
  Clang 13.0.1: 14.93 (SE +/- 0.21, N = 3)
  (CXX) g++ options: -O3 -fPIC -flto -lm

Himeno Benchmark

The Himeno benchmark is a linear solver for the pressure Poisson equation using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.
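To illustrate what a point-Jacobi iteration does, the sketch below solves a one-dimensional Poisson problem, -u'' = f on (0, 1) with fixed boundary values. This is a deliberately reduced example under stated assumptions: the real Himeno benchmark works on a three-dimensional grid with its own stencil, and the function name `jacobi_poisson_1d` and its parameters are hypothetical.

```python
def jacobi_poisson_1d(f, left, right, tol=1e-12, max_iters=100_000):
    """Solve -u'' = f on (0, 1) with u(0)=left, u(1)=right by point-Jacobi.

    f holds the right-hand side at the n interior grid points.
    Returns the solution at all n + 2 grid points.
    """
    n = len(f)
    h = 1.0 / (n + 1)                      # uniform grid spacing
    u = [left] + [0.0] * n + [right]       # initial guess: zero inside
    for _ in range(max_iters):
        new = u[:]
        for i in range(1, n + 1):
            # Each point is updated only from the previous sweep's neighbours.
            new[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * f[i - 1])
        change = max(abs(a - b) for a, b in zip(new, u))
        u = new
        if change < tol:
            break
    return u


# With f = 0 the solution is the straight line between the boundary values.
solution = jacobi_poisson_1d([0.0] * 9, left=0.0, right=1.0)
print([round(v, 6) for v in solution])
```

Because every update reads only values from the previous sweep, the inner loop has no dependencies between grid points, which is what makes this kind of kernel a useful probe of how well a compiler vectorizes and schedules floating-point code.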

Himeno Benchmark 3.0 - Poisson Pressure Solver (MFLOPS, More Is Better)
  GCC 11.2.0: 7577.32 (SE +/- 2.64, N = 3)
  Clang 13.0.1: 7158.97 (SE +/- 3.44, N = 3)
  (CC) gcc options: -O3 -flto

OpenJPEG

OpenJPEG is an open-source JPEG 2000 codec written in the C programming language. The default input for this test profile is the NASA/JPL-Caltech/MSSS Curiosity panorama, a 717MB TIFF image file that is converted to JPEG 2000 format. Learn more via the OpenBenchmarking.org test page.

OpenJPEG 2.4 - Encode: NASA Curiosity Panorama M34 (ms, Fewer Is Better)
  GCC 11.2.0: 53890 (SE +/- 92.73, N = 3)
  Clang 13.0.1: 52024 (SE +/- 161.48, N = 3)
  (CXX) g++ options: -O3 -flto -rdynamic

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

LAME MP3 Encoding 3.100 - WAV To MP3 (Seconds, Fewer Is Better)
  GCC 11.2.0: 7.239 (SE +/- 0.011, N = 3)
  Clang 13.0.1: 8.124 (SE +/- 0.005, N = 3)
  Additional per-run options reported: -pipe
  (CC) gcc options: -O3 -flto -lncurses -lm

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Google Draco 1.5.0 - Model: Church Facade (ms, Fewer Is Better)
  GCC 11.2.0: 5649 (SE +/- 7.21, N = 3)
  Clang 13.0.1: 5722 (SE +/- 3.79, N = 3)
  (CXX) g++ options: -O3 -flto

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
  GCC 11.2.0: 6.070 (SE +/- 0.049, N = 3)
  Clang 13.0.1: 5.887 (SE +/- 0.047, N = 3)
  (CXX) g++ options: -O3 -fPIC -flto -lm

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Google Draco 1.5.0 - Model: Lion (ms, Fewer Is Better)
  GCC 11.2.0: 3747 (SE +/- 2.73, N = 3)
  Clang 13.0.1: 3772 (SE +/- 0.58, N = 3)
  (CXX) g++ options: -O3 -flto

72 Results Shown
