Intel Core i9 11900K Compiler Benchmarks

GCC 11.1 versus LLVM Clang 12 on Intel Core i9 11900K Rocket Lake. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2105176-IB-11900KCOM25.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionGCC 11.1Clang 12 -O2 -O3 -march=native -O3 -march=native -flto -O2 -O3 -march=native -O3 -march=native -fltoIntel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)ASUS ROG MAXIMUS XIII HERO (0707 BIOS)Intel Tiger Lake-H32GB500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0AMD Radeon VII 16GB (1801/1000MHz)Intel Tiger Lake-H HD AudioASUS MG28U2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Fedora 345.11.20-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.0.3 (LLVM 12.0.0)GCC 11.1.1 20210428btrfs3840x2160Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- Clang 12: -O2: CXXFLAGS=-O2 CFLAGS=-O2- Clang 12: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 11.1: -O2, GCC 11.1: -O3 -march=native, GCC 11.1: -O3 -march=native -flto: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.1Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

mrbayes: Primate Phylogeny Analysishmmer: Pfam Database Searchlammps: Rhodopsin Proteinwebp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressiongraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingsvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px265: Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondhimeno: Poisson Pressure Solverpjsip: INVITEpjsip: OPTIONS, Statefulpjsip: OPTIONS, Statelessc-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-opus: WAV To Opus Encodeliquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57tjbench: Decompression Throughputastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivesqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1GCC 11.1Clang 12 -O2 -O3 -march=native -O3 -march=native -flto -O2 -O3 -march=native -O3 -march=native -flto87.297103.2918.02313.7635.36027.84110661642191091136.31273.60191.83198.01160.6515.64430127.4981896305.48185050019381239792106.52224.4586.0867.3046.467635506667711343333261.0347855.248112.094991.379943.61615.154.203.183.115.2311.1154.8011.309.6322.0716.159.61243.416236.05086.696100.7378.06712.9015.12727.26411411952701198139.13278.72195.87201.70164.7715.81432583.9643526878.5076864959938924143947.34521.5445.9315.4795.587686530000722893333273.1000465.182011.384685.415744.08511.833.242.552.304.3810.2054.5011.089.6318.2315.538.62230.019227.66384.92999.9718.32812.7065.10327.07410721952691229141.83278.59195.07201.10166.0515.40435901.4439597079.8838705058939523989247.61321.5775.9365.3765.575684356667722393333272.6007585.170511.395285.420743.77713.343.252.522.274.3210.2754.1311.399.7018.4315.928.91247.889242.55085.572101.5038.14013.0174.88528.25710511622171044138.07271.50193.85199.37163.9715.59377278.1314906204.0522474965936224131282.65924.9987.5937.0346.206742070000813766667273.2984343.883210.508885.560946.26012.603.452.642.394.4610.8655.6611.5010.0119.0615.349.55308.514239.53282.55499.6108.16413.0544.76028.07510801632531070142.10276.29195.08199.59164.9515.52366868.6319516291.5764565024938224142684.05422.9975.9566.4615.952712080000768436667282.7129663.78139.555974.779846.54312.053.362.562.334.3610.5854.4011.229.9118.3315.359.20336.911259.56783.55899.0088.23912.8884.73127.08110921632541195146.01283.29199.49203.83167.9515.93373279.5814116434.83467085.08022.8995.9586.2055.870699980000753703333282.6666843.77109.565374.804046.37412.133.282.462.234.3910.5354.3811.199.8618.3115.468.99342.856258.985OpenBenchmarking.org

Timed MrBayes Analysis

Primate Phylogeny Analysis

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.53, N = 3SE +/- 0.06, N = 3SE +/- 0.32, N = 3SE +/- 0.56, N = 3SE +/- 0.04, N = 3SE +/- 0.18, N = 387.3086.7084.9385.5782.5583.561. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -O3 -std=c99 -pedantic -lm

Timed HMMer Search

Pfam Database Search

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3103.29100.7499.97101.5099.6199.011. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

GCC 11.1Clang 12OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.106, N = 3SE +/- 0.063, N = 15SE +/- 0.055, N = 15SE +/- 0.093, N = 4SE +/- 0.034, N = 3SE +/- 0.030, N = 38.0238.0678.3288.1408.1648.2391. (CXX) g++ options: -O2 -pthread -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless

GCC 11.1Clang 12OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 313.7612.9012.7113.0213.0512.891. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

GCC 11.1Clang 12OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression-O2-O3 -march=native-O3 -march=native -flto1.2062.4123.6184.8246.03SE +/- 0.005, N = 3SE +/- 0.014, N = 3SE +/- 0.008, N = 3SE +/- 0.007, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 35.3605.1275.1034.8854.7604.7311. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

GCC 11.1Clang 12OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression-O2-O3 -march=native-O3 -march=native -flto714212835SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 327.8427.2627.0728.2628.0827.081. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

GraphicsMagick

Operation: Rotate

GCC 11.1Clang 12OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O2-O3 -march=native-O3 -march=native -flto2004006008001000SE +/- 0.67, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 3SE +/- 2.85, N = 3SE +/- 1.67, N = 31066114110721051108010921. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GCC 11.1Clang 12OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 31641951951621631631. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GCC 11.1Clang 12OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 32192702692172532541. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GCC 11.1Clang 12OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O2-O3 -march=native-O3 -march=native -flto30060090012001500SE +/- 6.89, N = 3SE +/- 1.20, N = 3SE +/- 0.67, N = 3SE +/- 1.73, N = 3SE +/- 5.24, N = 31091119812291044107011951. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto306090120150SE +/- 1.53, N = 4SE +/- 1.58, N = 4SE +/- 1.44, N = 5SE +/- 1.53, N = 5SE +/- 0.16, N = 3SE +/- 0.02, N = 3136.31139.13141.83138.07142.10146.011. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.52, N = 3SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.56, N = 3SE +/- 0.62, N = 3SE +/- 0.47, N = 3273.60278.72278.59271.50276.29283.291. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 1.51, N = 10SE +/- 1.48, N = 10SE +/- 1.49, N = 10SE +/- 1.33, N = 3SE +/- 0.47, N = 3SE +/- 0.25, N = 3191.83195.87195.07193.85195.08199.491. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 0.29, N = 3SE +/- 0.20, N = 3SE +/- 0.42, N = 3SE +/- 0.21, N = 3198.01201.70201.10199.37199.59203.831. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto4080120160200SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.31, N = 3SE +/- 0.06, N = 3SE +/- 0.29, N = 3SE +/- 0.26, N = 3160.65164.77166.05163.97164.95167.951. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

GCC 11.1Clang 12OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.21, N = 3SE +/- 0.13, N = 15SE +/- 0.15, N = 6SE +/- 0.04, N = 3SE +/- 0.13, N = 8SE +/- 0.13, N = 315.6415.8115.4015.5915.5215.931. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Coremark

CoreMark Size 666 - Iterations Per Second

GCC 11.1Clang 12OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-O2-O3 -march=native-O3 -march=native -flto90K180K270K360K450KSE +/- 1236.61, N = 3SE +/- 1364.82, N = 3SE +/- 166.46, N = 3SE +/- 476.82, N = 3SE +/- 494.48, N = 3SE +/- 176.14, N = 3430127.50432583.96435901.44377278.13366868.63373279.581. (CC) gcc options: -O2 -lrt" -lrt

Himeno Benchmark

Poisson Pressure Solver

GCC 11.1Clang 12OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O2-O3 -march=native-O3 -march=native -flto15003000450060007500SE +/- 0.74, N = 3SE +/- 6.62, N = 3SE +/- 3.24, N = 3SE +/- 4.35, N = 3SE +/- 3.97, N = 3SE +/- 16.27, N = 36305.486878.517079.886204.056291.586434.831. (CC) gcc options: -O3 -mavx2

PJSIP

Method: INVITE

GCC 11.1Clang 12OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O2-O3 -march=native-O3 -march=native -flto11002200330044005500SE +/- 32.83, N = 3SE +/- 41.25, N = 3SE +/- 3.18, N = 3SE +/- 34.53, N = 3SE +/- 13.67, N = 3500149595058496550241. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateful

GCC 11.1Clang 12OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O2-O3 -march=native-O3 -march=native -flto2K4K6K8K10KSE +/- 1.67, N = 3SE +/- 6.96, N = 3SE +/- 4.58, N = 3SE +/- 2.33, N = 3938193899395936293821. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateless

GCC 11.1Clang 12OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O2-O3 -march=native-O3 -march=native -flto50K100K150K200K250KSE +/- 504.43, N = 3SE +/- 1015.58, N = 3SE +/- 101.47, N = 3SE +/- 468.55, N = 3SE +/- 238.95, N = 32397922414392398922413122414261. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.18, N = 3SE +/- 0.13, N = 3SE +/- 0.31, N = 3106.5247.3547.6182.6684.0585.081. (CC) gcc options: -lm -lpthread -O3

AOBench

Size: 2048 x 2048 - Total Time

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O2-O3 -march=native-O3 -march=native -flto612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 324.4621.5421.5825.0023.0022.901. (CC) gcc options: -lm -O3

FLAC Audio Encoding

WAV To FLAC

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.003, N = 5SE +/- 0.004, N = 5SE +/- 0.003, N = 5SE +/- 0.004, N = 5SE +/- 0.008, N = 5SE +/- 0.004, N = 56.0865.9315.9367.5935.9565.9581. (CXX) g++ options: -logg -lm

LAME MP3 Encoding

WAV To MP3

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.048, N = 3SE +/- 0.010, N = 3SE +/- 0.003, N = 3SE +/- 0.029, N = 3SE +/- 0.019, N = 3SE +/- 0.018, N = 37.3045.4795.3767.0346.4616.2051. (CC) gcc options: -O3 -pipe -lm

Opus Codec Encoding

WAV To Opus Encode

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.030, N = 5SE +/- 0.007, N = 5SE +/- 0.033, N = 5SE +/- 0.030, N = 5SE +/- 0.036, N = 5SE +/- 0.014, N = 56.4675.5875.5756.2065.9525.8701. (CXX) g++ options: -logg -lm

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

GCC 11.1Clang 12OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto160M320M480M640M800MSE +/- 766753.62, N = 3SE +/- 2160717.47, N = 3SE +/- 2050604.25, N = 3SE +/- 4115754.28, N = 3SE +/- 3025001.38, N = 3SE +/- 3850155.84, N = 36355066676865300006843566677420700007120800006999800001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

GCC 11.1Clang 12OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto200M400M600M800M1000MSE +/- 189414.30, N = 3SE +/- 209549.78, N = 3SE +/- 322714.18, N = 3SE +/- 49103.07, N = 3SE +/- 317612.62, N = 3SE +/- 391975.06, N = 37113433337228933337223933338137666677684366677537033331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

libjpeg-turbo tjbench

Test: Decompression Throughput

GCC 11.1Clang 12OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.16, N = 3SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.86, N = 3SE +/- 0.87, N = 3SE +/- 0.50, N = 3261.03273.10272.60273.30282.71282.671. (CC) gcc options: -O3 -rdynamic

ASTC Encoder

Preset: Medium

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium-O2-O3 -march=native-O3 -march=native -flto1.18082.36163.54244.72325.904SE +/- 0.0027, N = 3SE +/- 0.0013, N = 3SE +/- 0.0065, N = 3SE +/- 0.0014, N = 3SE +/- 0.0028, N = 3SE +/- 0.0020, N = 35.24815.18205.17053.88323.78133.77101. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Thorough

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.0131, N = 3SE +/- 0.0205, N = 3SE +/- 0.0120, N = 3SE +/- 0.0121, N = 3SE +/- 0.0123, N = 3SE +/- 0.0155, N = 312.094911.384611.395210.50889.55599.56531. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Exhaustive

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 391.3885.4285.4285.5674.7874.801. (CXX) g++ options: -O2 -flto -pthread

SQLite Speedtest

Timed Time - Size 1,000

GCC 11.1Clang 12OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O2-O3 -march=native-O3 -march=native -flto1122334455SE +/- 0.15, N = 3SE +/- 0.30, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.28, N = 3SE +/- 0.09, N = 343.6244.0943.7846.2646.5446.371. (CC) gcc options: -ldl -lz -lpthread

NCNN

Target: CPU - Model: mobilenet

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.14, N = 15SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 315.1511.8313.3412.6012.0512.131. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O2-O3 -march=native-O3 -march=native -flto0.9451.892.8353.784.725SE +/- 0.02, N = 15SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 34.203.243.253.453.363.281. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O2-O3 -march=native-O3 -march=native -flto0.71551.4312.14652.8623.5775SE +/- 0.01, N = 15SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 33.182.552.522.642.562.461. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: mnasnet

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O2-O3 -march=native-O3 -march=native -flto0.69981.39962.09942.79923.499SE +/- 0.01, N = 14SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 33.112.302.272.392.332.231. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O2-O3 -march=native-O3 -march=native -flto1.17682.35363.53044.70725.884SE +/- 0.02, N = 15SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 35.234.384.324.464.364.391. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: googlenet

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.08, N = 15SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 311.1110.2010.2710.8610.5810.531. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: vgg16

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O2-O3 -march=native-O3 -march=native -flto1326395265SE +/- 0.05, N = 15SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.17, N = 354.8054.5054.1355.6654.4054.381. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: resnet18

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.06, N = 14SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 311.3011.0811.3911.5011.2211.191. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: alexnet

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.01, N = 15SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 39.639.639.7010.019.919.861. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: resnet50

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O2-O3 -march=native-O3 -march=native -flto510152025SE +/- 0.08, N = 15SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 322.0718.2318.4319.0618.3318.311. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.01, N = 15SE +/- 0.12, N = 3SE +/- 0.28, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 316.1515.5315.9215.3415.3515.461. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.02, N = 12SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 39.618.628.919.559.208.991. (CXX) g++ options: -O2 -rdynamic -lomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O2-O3 -march=native-O3 -march=native -flto70140210280350SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.25, N = 3SE +/- 0.21, N = 3SE +/- 0.06, N = 3243.42230.02247.89308.51336.91342.861. (CXX) g++ options: -O2 -fopenmp=libomp -pthread -fvisibility=hidden -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

GCC 11.1Clang 12OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3236.05227.66242.55239.53259.57258.991. (CXX) g++ options: -O2 -fopenmp=libomp -pthread -fvisibility=hidden -rdynamic -ldl


Phoronix Test Suite v10.8.5