Ryzen 9 5950X Clang 12 vs. GCC 11 Benchmarks

GCC 11.1 versus LLVM Clang 12 on AMD Ryzen 9 5950X. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2105198-IB-11900KCOM08&sgm=1&imw=1&rdt&grr.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionClang 12GCC 11.1 -O2 -O3 -march=native -O3 -march=native -flto -O3 -march=native -flto -O3 -march=native -O2AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3302 BIOS)AMD Starship/Matisse32GB500GB Western Digital WDS500G3X0C-00SJG0AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Fedora 345.11.20-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.0.3 (LLVM 12.0.0)Clang 12.0.0btrfs3840x2160GCC 11.1.1 20210428 + Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Clang 12: -O2: CXXFLAGS=-O2 CFLAGS=-O2- Clang 12: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected Compiler Details- GCC 11.1: -O3 -march=native -flto, GCC 11.1: -O3 -march=native, GCC 11.1: -O2: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

himeno: Poisson Pressure Solverncnn: CPU - yolov4-tinyncnn: CPU - blazefacencnn: CPU - shufflenet-v2ncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetpjsip: INVITEhmmer: Pfam Database Searchmrbayes: Primate Phylogeny Analysispjsip: OPTIONS, Statefulgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Rotateastcenc: Exhaustivesqlite-speedtest: Timed Time - Size 1,000c-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timewebp: Quality 100, Lossless, Highest Compressioncoremark: CoreMark Size 666 - Iterations Per Secondtjbench: Decompression Throughputx265: Bosphorus 4Kliquid-dsp: 16 - 256 - 57liquid-dsp: 8 - 256 - 57tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1webp: Quality 100, Losslesspjsip: OPTIONS, Statelessastcenc: Thoroughencode-flac: WAV To FLACencode-opus: WAV To Opus Encodeencode-mp3: WAV To MP3lammps: Rhodopsin Proteinwebp: Quality 100, Highest Compressionastcenc: Mediumsvt-hevc: 7 - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080pClang 12GCC 11.1 -O2 -O3 -march=native -O3 -march=native -flto -O3 -march=native -flto -O3 -march=native -O24877.30641021.971.543.8112.9312.6122.9911.0914.0557.2611.884.523.263.363.5611.58457596.09293.5297942206411169392152.204847.79249.27533.77829.119712758.357826267.55306827.771056400000576660000271.028208.55513.6472211079.39507.6875.8746.56513.0874.7783.3367225.15225.57231.35235.64368.555192.19004221.731.523.7412.5912.3523.2511.1613.9056.9311.774.513.263.073.5711.46470895.21490.7857958235452176798151.823047.63045.01030.23428.640722694.006275272.51355927.751061900000576050000355.925215.78114.1052225729.35495.6665.5086.09813.2994.7033.3179229.27227.73234.71239.88375.955289.91757121.581.463.6912.1412.4322.9011.2513.8656.1111.624.303.082.933.4211.0593.31192.405237451176296151.897448.58444.74229.73027.400714933.858983266.22982928.471073466667587800000261.836207.92513.6369.44325.7565.5585.81013.3424.7223.3728236.05227.66232.66238.13375.085445.63522123.942.556.0118.5314.5025.8411.1314.8057.8413.265.363.923.864.4613.74461697.71493.3377916382449208596456.190046.47925.48226.31227.949849671.959442270.90915026.171093266667619393333215.341202.57213.2542225817.52996.1955.4755.40312.8135.1844.6995227.04222.33230.43234.70368.555314.73390421.001.804.3917.1813.9325.3310.9914.3357.6112.855.373.953.844.4312.70467197.74492.97379823704492120103656.641146.91725.40525.47928.327808580.862650268.11345625.911085000000611560000227.768214.27713.5502217597.57186.2375.3875.50312.7495.2004.6277221.71223.11231.16235.59369.545120.94026221.061.854.3317.2414.6126.1811.1614.5056.8213.155.413.974.154.5412.95481599.72296.7857860226422180398557.144547.40460.93131.73227.424830811.532116277.49680226.241041200000594850000219.130208.78013.1812305997.68415.8746.5156.80212.8555.3554.6296217.32222.63230.53235.27365.79OpenBenchmarking.org

Himeno Benchmark

Poisson Pressure Solver

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverClang 12GCC 11.112002400360048006000SE +/- 62.44, N = 15SE +/- 83.12, N = 15SE +/- 66.92, N = 15SE +/- 90.91, N = 15SE +/- 96.57, N = 13SE +/- 68.97, N = 154877.315120.945192.195314.735289.925445.641. (CC) gcc options: -O3 -mavx2

NCNN

Target: CPU - Model: yolov4-tiny

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tinyClang 12GCC 11.1612182430SE +/- 0.18, N = 3SE +/- 0.11, N = 15SE +/- 0.12, N = 3SE +/- 0.14, N = 15SE +/- 0.10, N = 3SE +/- 0.41, N = 321.9721.0621.7321.0021.5823.941. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: blazeface

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazefaceClang 12GCC 11.10.57381.14761.72142.29522.869SE +/- 0.04, N = 3SE +/- 0.01, N = 15SE +/- 0.04, N = 3SE +/- 0.01, N = 15SE +/- 0.05, N = 3SE +/- 0.01, N = 31.541.851.521.801.462.551. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2Clang 12GCC 11.1246810SE +/- 0.05, N = 3SE +/- 0.01, N = 14SE +/- 0.05, N = 3SE +/- 0.15, N = 15SE +/- 0.02, N = 3SE +/- 0.02, N = 33.814.333.744.393.696.011. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: regnety_400m

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400mClang 12GCC 11.1510152025SE +/- 0.20, N = 3SE +/- 0.06, N = 15SE +/- 0.16, N = 3SE +/- 0.05, N = 15SE +/- 0.14, N = 3SE +/- 0.09, N = 312.9317.2412.5917.1812.1418.531. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssdClang 12GCC 11.148121620SE +/- 0.31, N = 3SE +/- 0.10, N = 15SE +/- 0.20, N = 3SE +/- 0.09, N = 15SE +/- 0.20, N = 3SE +/- 0.06, N = 312.6114.6112.3513.9312.4314.501. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet50

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50Clang 12GCC 11.1612182430SE +/- 0.11, N = 3SE +/- 0.18, N = 15SE +/- 0.06, N = 3SE +/- 0.10, N = 15SE +/- 0.09, N = 3SE +/- 0.21, N = 322.9926.1823.2525.3322.9025.841. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: alexnet

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnetClang 12GCC 11.13691215SE +/- 0.03, N = 3SE +/- 0.06, N = 15SE +/- 0.05, N = 3SE +/- 0.04, N = 15SE +/- 0.10, N = 3SE +/- 0.06, N = 311.0911.1611.1610.9911.2511.131. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet18

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18Clang 12GCC 11.148121620SE +/- 0.23, N = 3SE +/- 0.11, N = 15SE +/- 0.13, N = 3SE +/- 0.09, N = 15SE +/- 0.13, N = 3SE +/- 0.22, N = 314.0514.5013.9014.3313.8614.801. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: vgg16

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16Clang 12GCC 11.11326395265SE +/- 0.08, N = 3SE +/- 0.08, N = 15SE +/- 0.53, N = 3SE +/- 0.08, N = 15SE +/- 0.45, N = 3SE +/- 0.23, N = 357.2656.8256.9357.6156.1157.841. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: googlenet

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenetClang 12GCC 11.13691215SE +/- 0.13, N = 3SE +/- 0.22, N = 15SE +/- 0.16, N = 3SE +/- 0.12, N = 15SE +/- 0.09, N = 3SE +/- 0.44, N = 311.8813.1511.7712.8511.6213.261. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0Clang 12GCC 11.11.21732.43463.65194.86926.0865SE +/- 0.06, N = 3SE +/- 0.05, N = 15SE +/- 0.01, N = 3SE +/- 0.03, N = 15SE +/- 0.03, N = 3SE +/- 0.02, N = 34.525.414.515.374.305.361. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: mnasnet

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnetClang 12GCC 11.10.89331.78662.67993.57324.4665SE +/- 0.08, N = 3SE +/- 0.04, N = 15SE +/- 0.05, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 3SE +/- 0.02, N = 33.263.973.263.953.083.921. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3Clang 12GCC 11.10.93381.86762.80143.73524.669SE +/- 0.07, N = 3SE +/- 0.03, N = 15SE +/- 0.05, N = 3SE +/- 0.02, N = 15SE +/- 0.01, N = 3SE +/- 0.03, N = 33.364.153.073.842.933.861. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2Clang 12GCC 11.11.02152.0433.06454.0865.1075SE +/- 0.11, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 3SE +/- 0.01, N = 15SE +/- 0.08, N = 3SE +/- 0.01, N = 33.564.543.574.433.424.461. (CXX) g++ options: -O2 -rdynamic -lpthread

NCNN

Target: CPU - Model: mobilenet

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenetClang 12GCC 11.148121620SE +/- 0.03, N = 3SE +/- 0.11, N = 15SE +/- 0.11, N = 3SE +/- 0.13, N = 15SE +/- 0.12, N = 3SE +/- 0.17, N = 311.5812.9511.4612.7011.0513.741. (CXX) g++ options: -O2 -rdynamic -lpthread

PJSIP

Method: INVITE

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITEClang 12GCC 11.110002000300040005000SE +/- 42.78, N = 3SE +/- 16.33, N = 3SE +/- 66.36, N = 3SE +/- 32.67, N = 15SE +/- 65.77, N = 3457548154708467146161. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Timed HMMer Search

Pfam Database Search

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database SearchClang 12GCC 11.120406080100SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 396.0999.7295.2197.7493.3197.711. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Timed MrBayes Analysis

Primate Phylogeny Analysis

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisClang 12GCC 11.120406080100SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 393.5396.7990.7992.9792.4193.341. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -lm

PJSIP

Method: OPTIONS, Stateful

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatefulClang 12GCC 11.12K4K6K8K10KSE +/- 27.78, N = 3SE +/- 72.34, N = 3SE +/- 47.54, N = 3SE +/- 21.79, N = 3SE +/- 56.75, N = 3794278607958798279161. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

GraphicsMagick

Operation: Sharpen

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenClang 12GCC 11.180160240320400SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 32062262353702373821. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedClang 12GCC 11.1100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 34114224524494514491. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 12GCC 11.15001000150020002500SE +/- 4.48, N = 3SE +/- 0.88, N = 3SE +/- 3.61, N = 3SE +/- 1.53, N = 3SE +/- 2.85, N = 3SE +/- 3.71, N = 31693180317672120176220851. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 12GCC 11.12004006008001000SE +/- 5.55, N = 3SE +/- 11.70, N = 3SE +/- 0.58, N = 3SE +/- 5.57, N = 3SE +/- 4.37, N = 3SE +/- 3.48, N = 392198598110369619641. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

ASTC Encoder

Preset: Exhaustive

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveClang 12GCC 11.11326395265SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 352.2057.1451.8256.6451.9056.191. (CXX) g++ options: -flto -O2 -pthread

SQLite Speedtest

Timed Time - Size 1,000

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Clang 12GCC 11.11122334455SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.50, N = 3SE +/- 0.16, N = 3SE +/- 0.18, N = 3SE +/- 0.54, N = 347.7947.4047.6346.9248.5846.481. (CC) gcc options: -ldl -lz -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelClang 12GCC 11.11428425670SE +/- 0.16, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 3SE +/- 0.08, N = 349.2860.9345.0125.4144.7425.481. (CC) gcc options: -lm -lpthread -O3

AOBench

Size: 2048 x 2048 - Total Time

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeClang 12GCC 11.1816243240SE +/- 0.04, N = 3SE +/- 0.14, N = 3SE +/- 0.43, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 333.7831.7330.2325.4829.7326.311. (CC) gcc options: -lm -O3

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionClang 12GCC 11.1714212835SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 329.1227.4228.6428.3327.4027.951. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Coremark

CoreMark Size 666 - Iterations Per Second

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondClang 12GCC 11.1200K400K600K800K1000KSE +/- 789.22, N = 3SE +/- 989.68, N = 3SE +/- 845.83, N = 3SE +/- 2230.96, N = 3SE +/- 581.76, N = 3SE +/- 836.68, N = 3712758.36830811.53722694.01808580.86714933.86849671.961. (CC) gcc options: -O2 -lrt" -lrt

libjpeg-turbo tjbench

Test: Decompression Throughput

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputClang 12GCC 11.160120180240300SE +/- 1.65, N = 3SE +/- 1.92, N = 3SE +/- 0.59, N = 3SE +/- 1.27, N = 3SE +/- 0.54, N = 3SE +/- 0.45, N = 3267.55277.50272.51268.11266.23270.911. (CC) gcc options: -O3 -rdynamic

x265

Video Input: Bosphorus 4K

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KClang 12GCC 11.1714212835SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 327.7726.2427.7525.9128.4726.171. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57Clang 12GCC 11.1200M400M600M800M1000MSE +/- 2690724.81, N = 3SE +/- 4650806.38, N = 3SE +/- 493288.29, N = 3SE +/- 1252996.41, N = 3SE +/- 3219903.38, N = 3SE +/- 783865.07, N = 31056400000104120000010619000001085000000107346666710932666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57Clang 12GCC 11.1130M260M390M520M650MSE +/- 1824399.44, N = 3SE +/- 185202.59, N = 3SE +/- 4072558.57, N = 3SE +/- 4455270.29, N = 3SE +/- 1855020.22, N = 3SE +/- 1343755.10, N = 35766600005948500005760500006115600005878000006193933331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

TNN

Target: CPU - Model: MobileNet v2

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Clang 12GCC 11.180160240320400SE +/- 0.41, N = 3SE +/- 1.57, N = 3SE +/- 0.48, N = 3SE +/- 0.86, N = 3SE +/- 0.55, N = 3SE +/- 1.27, N = 3271.03219.13355.93227.77261.84215.341. (CXX) g++ options: -pthread -fvisibility=hidden -O2 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Clang 12GCC 11.150100150200250SE +/- 0.23, N = 3SE +/- 0.08, N = 3SE +/- 0.69, N = 3SE +/- 0.50, N = 3SE +/- 0.41, N = 3SE +/- 0.11, N = 3208.56208.78215.78214.28207.93202.571. (CXX) g++ options: -pthread -fvisibility=hidden -O2 -rdynamic -ldl

WebP Image Encode

Encode Settings: Quality 100, Lossless

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessClang 12GCC 11.148121620SE +/- 0.19, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 4SE +/- 0.17, N = 313.6513.1814.1113.5513.6413.251. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

PJSIP

Method: OPTIONS, Stateless

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, StatelessClang 12GCC 11.150K100K150K200K250KSE +/- 1827.48, N = 3SE +/- 586.86, N = 3SE +/- 962.04, N = 3SE +/- 1749.48, N = 3SE +/- 705.50, N = 32211072305992225722217592225811. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

ASTC Encoder

Preset: Thorough

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughClang 12GCC 11.13691215SE +/- 0.0279, N = 3SE +/- 0.0109, N = 3SE +/- 0.0407, N = 3SE +/- 0.0351, N = 3SE +/- 0.0114, N = 3SE +/- 0.0109, N = 39.39507.68419.35497.57189.44327.52991. (CXX) g++ options: -flto -O2 -pthread

FLAC Audio Encoding

WAV To FLAC

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACClang 12GCC 11.1246810SE +/- 0.024, N = 5SE +/- 0.045, N = 5SE +/- 0.005, N = 5SE +/- 0.020, N = 5SE +/- 0.030, N = 5SE +/- 0.036, N = 57.6875.8745.6666.2375.7566.1951. (CXX) g++ options: -logg -lm

Opus Codec Encoding

WAV To Opus Encode

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 12GCC 11.1246810SE +/- 0.052, N = 5SE +/- 0.035, N = 5SE +/- 0.045, N = 5SE +/- 0.048, N = 5SE +/- 0.016, N = 5SE +/- 0.023, N = 55.8746.5155.5085.3875.5585.4751. (CXX) g++ options: -logg -lm

LAME MP3 Encoding

WAV To MP3

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3Clang 12GCC 11.1246810SE +/- 0.068, N = 4SE +/- 0.084, N = 3SE +/- 0.045, N = 3SE +/- 0.071, N = 3SE +/- 0.056, N = 3SE +/- 0.010, N = 36.5656.8026.0985.5035.8105.4031. (CC) gcc options: -O3 -pipe -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinClang 12GCC 11.13691215SE +/- 0.08, N = 3SE +/- 0.11, N = 8SE +/- 0.02, N = 3SE +/- 0.14, N = 4SE +/- 0.01, N = 3SE +/- 0.09, N = 1513.0912.8613.3012.7513.3412.811. (CXX) g++ options: -O2 -pthread -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionClang 12GCC 11.11.20492.40983.61474.81966.0245SE +/- 0.059, N = 4SE +/- 0.021, N = 3SE +/- 0.026, N = 3SE +/- 0.023, N = 3SE +/- 0.027, N = 3SE +/- 0.036, N = 34.7785.3554.7035.2004.7225.1841. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

ASTC Encoder

Preset: Medium

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumClang 12GCC 11.11.05742.11483.17224.22965.287SE +/- 0.0156, N = 3SE +/- 0.0337, N = 3SE +/- 0.0243, N = 3SE +/- 0.0292, N = 3SE +/- 0.0302, N = 3SE +/- 0.0045, N = 33.33674.62963.31794.62773.37284.69951. (CXX) g++ options: -flto -O2 -pthread

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 12GCC 11.150100150200250SE +/- 0.95, N = 3SE +/- 0.49, N = 3SE +/- 0.32, N = 3SE +/- 0.83, N = 3SE +/- 1.15, N = 3SE +/- 0.03, N = 3225.15217.32229.27221.71236.05227.041. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 12GCC 11.150100150200250SE +/- 1.28, N = 3SE +/- 0.82, N = 3SE +/- 0.75, N = 3SE +/- 0.87, N = 3SE +/- 0.89, N = 3SE +/- 0.97, N = 3225.57222.63227.73223.11227.66222.331. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pClang 12GCC 11.150100150200250SE +/- 0.33, N = 3SE +/- 0.09, N = 3SE +/- 0.32, N = 3SE +/- 0.46, N = 3SE +/- 0.70, N = 3SE +/- 0.18, N = 3231.35230.53234.71231.16232.66230.431. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 12GCC 11.150100150200250SE +/- 0.18, N = 3SE +/- 0.37, N = 3SE +/- 0.40, N = 3SE +/- 0.23, N = 3SE +/- 0.48, N = 3SE +/- 0.33, N = 3235.64235.27239.88235.59238.13234.701. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 12GCC 11.180160240320400SE +/- 0.23, N = 3SE +/- 1.24, N = 3SE +/- 1.66, N = 3SE +/- 1.07, N = 3SE +/- 0.90, N = 3SE +/- 0.57, N = 3368.55365.79375.95369.54375.08368.551. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Geometric Mean Of All Test Results

Result Composite - Ryzen 9 5950X Clang 12 vs. GCC 11 Benchmarks

-O2-O3 -march=native-O3 -march=native -fltoOpenBenchmarking.orgGeometric Mean, More Is BetterGeometric Mean Of All Test ResultsResult Composite - Ryzen 9 5950X Clang 12 vs. GCC 11 BenchmarksClang 12GCC 11.1122436486052.5150.7153.7853.5154.7352.51


Phoronix Test Suite v10.8.4