Ryzen 9 5950X Clang 12 vs. GCC 11 Benchmarks

GCC 11.1 versus LLVM Clang 12 on AMD Ryzen 9 5950X. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2105198-IB-11900KCOM08&sro&grw.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionGCC 11.1Clang 12 -O2 -O3 -march=native -O3 -march=native -flto -O2 -O3 -march=native -O3 -march=native -fltoAMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3302 BIOS)AMD Starship/Matisse32GB500GB Western Digital WDS500G3X0C-00SJG0AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Fedora 345.11.20-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.0.3 (LLVM 12.0.0)GCC 11.1.1 20210428 + Clang 12.0.0btrfs3840x2160Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- GCC 11.1: -O2: CXXFLAGS=-O2 CFLAGS=-O2- GCC 11.1: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.1: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- Clang 12: -O2: CXXFLAGS=-O2 CFLAGS=-O2- Clang 12: -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12: -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 11.1: -O2, GCC 11.1: -O3 -march=native, GCC 11.1: -O3 -march=native -flto: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

encode-flac: WAV To FLACencode-mp3: WAV To MP3tjbench: Decompression Throughputencode-opus: WAV To Opus Encodeastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivewebp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionhmmer: Pfam Database Searchmrbayes: Primate Phylogeny Analysishimeno: Poisson Pressure Solvertnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - shufflenet-v2ncnn: CPU - blazefacencnn: CPU - yolov4-tinylammps: Rhodopsin Proteincoremark: CoreMark Size 666 - Iterations Per Secondaobench: 2048 x 2048 - Total Timegraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px265: Bosphorus 4Kc-ray: Total Time - 4K, 16 Rays Per Pixelsvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080pliquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57sqlite-speedtest: Timed Time - Size 1,000pjsip: INVITEpjsip: OPTIONS, Statefulpjsip: OPTIONS, StatelessGCC 11.1Clang 12 -O2 -O3 -march=native -O3 -march=native -flto -O2 -O3 -march=native -O3 -march=native -flto5.8746.802277.4968026.5154.62967.684157.144513.1815.35527.42499.72296.7855120.940262219.130208.78012.954.544.153.975.4113.1556.8214.5011.1626.1814.6117.244.331.8521.0612.855830811.53211631.7329852264221803230.53235.27222.6326.2460.931217.32365.79594850000104120000047.404481578602305996.2375.503268.1134565.3874.62777.571856.641113.5505.20028.32797.74492.9735314.733904227.768214.27712.704.433.843.955.3712.8557.6114.3310.9925.3313.9317.184.391.8021.0012.749808580.86265025.47910363704492120231.16235.59223.1125.9125.405221.71369.54611560000108500000046.917467179822217596.1955.403270.9091505.4754.69957.529956.190013.2545.18427.94997.71493.3375445.635221215.341202.57213.744.463.863.925.3613.2657.8414.8011.1325.8414.5018.536.012.5523.9412.813849671.95944226.3129643824492085230.43234.70222.3326.1725.482227.04368.55619393333109326666746.479461679162225817.6876.565267.5530685.8743.33679.395052.204813.6474.77829.11996.09293.5294877.306410271.028208.55511.583.563.363.264.5211.8857.2614.0511.0922.9912.6112.933.811.5421.9713.087712758.35782633.7789212064111693231.35235.64225.5727.7749.275225.15368.55576660000105640000047.792457579422211075.6666.098272.5135595.5083.31799.354951.823014.1054.70328.64095.21490.7855192.190042355.925215.78111.463.573.073.264.5111.7756.9313.9011.1623.2512.3512.593.741.5221.7313.299722694.00627530.2349812354521767234.71239.88227.7327.7545.010229.27375.95576050000106190000047.630470879582225725.7565.810266.2298295.5583.37289.443251.897413.6364.72227.40093.31192.4055289.917571261.836207.92511.053.422.933.084.3011.6256.1113.8611.2522.9012.4312.143.691.4621.5813.342714933.85898329.7309612374511762232.66238.13227.6628.4744.742236.05375.08587800000107346666748.584OpenBenchmarking.org

FLAC Audio Encoding

WAV To FLAC

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.024, N = 5SE +/- 0.005, N = 5SE +/- 0.030, N = 5SE +/- 0.045, N = 5SE +/- 0.020, N = 5SE +/- 0.036, N = 57.6875.6665.7565.8746.2376.1951. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.068, N = 4SE +/- 0.045, N = 3SE +/- 0.056, N = 3SE +/- 0.084, N = 3SE +/- 0.071, N = 3SE +/- 0.010, N = 36.5656.0985.8106.8025.5035.4031. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

Clang 12GCC 11.1OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-O2-O3 -march=native-O3 -march=native -flto60120180240300SE +/- 1.65, N = 3SE +/- 0.59, N = 3SE +/- 0.54, N = 3SE +/- 1.92, N = 3SE +/- 1.27, N = 3SE +/- 0.45, N = 3267.55272.51266.23277.50268.11270.911. (CC) gcc options: -O3 -rdynamic

Opus Codec Encoding

WAV To Opus Encode

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.052, N = 5SE +/- 0.045, N = 5SE +/- 0.016, N = 5SE +/- 0.035, N = 5SE +/- 0.048, N = 5SE +/- 0.023, N = 55.8745.5085.5586.5155.3875.4751. (CXX) g++ options: -fvisibility=hidden -logg -lm

ASTC Encoder

Preset: Medium

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium-O2-O3 -march=native-O3 -march=native -flto1.05742.11483.17224.22965.287SE +/- 0.0156, N = 3SE +/- 0.0243, N = 3SE +/- 0.0302, N = 3SE +/- 0.0337, N = 3SE +/- 0.0292, N = 3SE +/- 0.0045, N = 33.33673.31793.37284.62964.62774.69951. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Thorough

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.0279, N = 3SE +/- 0.0407, N = 3SE +/- 0.0114, N = 3SE +/- 0.0109, N = 3SE +/- 0.0351, N = 3SE +/- 0.0109, N = 39.39509.35499.44327.68417.57187.52991. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Exhaustive

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive-O2-O3 -march=native-O3 -march=native -flto1326395265SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 352.2051.8251.9057.1456.6456.191. (CXX) g++ options: -O2 -flto -pthread

WebP Image Encode

Encode Settings: Quality 100, Lossless

Clang 12GCC 11.1OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.19, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 4SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 313.6514.1113.6413.1813.5513.251. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

Clang 12GCC 11.1OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression-O2-O3 -march=native-O3 -march=native -flto1.20492.40983.61474.81966.0245SE +/- 0.059, N = 4SE +/- 0.026, N = 3SE +/- 0.027, N = 3SE +/- 0.021, N = 3SE +/- 0.023, N = 3SE +/- 0.036, N = 34.7784.7034.7225.3555.2005.1841. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

Clang 12GCC 11.1OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression-O2-O3 -march=native-O3 -march=native -flto714212835SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 329.1228.6427.4027.4228.3327.951. (CC) gcc options: -fvisibility=hidden -pthread -lm -ljpeg

Timed HMMer Search

Pfam Database Search

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 396.0995.2193.3199.7297.7497.711. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Timed MrBayes Analysis

Primate Phylogeny Analysis

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O2-O3 -march=native-O3 -march=native -flto20406080100SE +/- 0.21, N = 3SE +/- 0.18, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.29, N = 3SE +/- 0.13, N = 393.5390.7992.4196.7992.9793.341. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Himeno Benchmark

Poisson Pressure Solver

Clang 12GCC 11.1OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O2-O3 -march=native-O3 -march=native -flto12002400360048006000SE +/- 62.44, N = 15SE +/- 66.92, N = 15SE +/- 96.57, N = 13SE +/- 83.12, N = 15SE +/- 90.91, N = 15SE +/- 68.97, N = 154877.315192.195289.925120.945314.735445.641. (CC) gcc options: -O3 -mavx2

TNN

Target: CPU - Model: MobileNet v2

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O2-O3 -march=native-O3 -march=native -flto80160240320400SE +/- 0.41, N = 3SE +/- 0.48, N = 3SE +/- 0.55, N = 3SE +/- 1.57, N = 3SE +/- 0.86, N = 3SE +/- 1.27, N = 3271.03355.93261.84219.13227.77215.341. (CXX) g++ options: -O2 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.23, N = 3SE +/- 0.69, N = 3SE +/- 0.41, N = 3SE +/- 0.08, N = 3SE +/- 0.50, N = 3SE +/- 0.11, N = 3208.56215.78207.93208.78214.28202.571. (CXX) g++ options: -O2 -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

NCNN

Target: CPU - Model: mobilenet

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 15SE +/- 0.13, N = 15SE +/- 0.17, N = 311.5811.4611.0512.9512.7013.741. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O2-O3 -march=native-O3 -march=native -flto1.02152.0433.06454.0865.1075SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 15SE +/- 0.01, N = 15SE +/- 0.01, N = 33.563.573.424.544.434.461. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O2-O3 -march=native-O3 -march=native -flto0.93381.86762.80143.73524.669SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 15SE +/- 0.02, N = 15SE +/- 0.03, N = 33.363.072.934.153.843.861. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O2-O3 -march=native-O3 -march=native -flto0.89331.78662.67993.57324.4665SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.02, N = 33.263.263.083.973.953.921. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O2-O3 -march=native-O3 -march=native -flto1.21732.43463.65194.86926.0865SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 15SE +/- 0.03, N = 15SE +/- 0.02, N = 34.524.514.305.415.375.361. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.13, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.22, N = 15SE +/- 0.12, N = 15SE +/- 0.44, N = 311.8811.7711.6213.1512.8513.261. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O2-O3 -march=native-O3 -march=native -flto1326395265SE +/- 0.08, N = 3SE +/- 0.53, N = 3SE +/- 0.45, N = 3SE +/- 0.08, N = 15SE +/- 0.08, N = 15SE +/- 0.23, N = 357.2656.9356.1156.8257.6157.841. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 15SE +/- 0.09, N = 15SE +/- 0.22, N = 314.0513.9013.8614.5014.3314.801. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 15SE +/- 0.04, N = 15SE +/- 0.06, N = 311.0911.1611.2511.1610.9911.131. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O2-O3 -march=native-O3 -march=native -flto612182430SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.18, N = 15SE +/- 0.10, N = 15SE +/- 0.21, N = 322.9923.2522.9026.1825.3325.841. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O2-O3 -march=native-O3 -march=native -flto48121620SE +/- 0.31, N = 3SE +/- 0.20, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 15SE +/- 0.09, N = 15SE +/- 0.06, N = 312.6112.3512.4314.6113.9314.501. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O2-O3 -march=native-O3 -march=native -flto510152025SE +/- 0.20, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.06, N = 15SE +/- 0.05, N = 15SE +/- 0.09, N = 312.9312.5912.1417.2417.1818.531. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2-O2-O3 -march=native-O3 -march=native -flto246810SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 14SE +/- 0.15, N = 15SE +/- 0.02, N = 33.813.743.694.334.396.011. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface-O2-O3 -march=native-O3 -march=native -flto0.57381.14761.72142.29522.869SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 15SE +/- 0.01, N = 15SE +/- 0.01, N = 31.541.521.461.851.802.551. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

Clang 12GCC 11.1OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny-O2-O3 -march=native-O3 -march=native -flto612182430SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 15SE +/- 0.14, N = 15SE +/- 0.41, N = 321.9721.7321.5821.0621.0023.941. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

Clang 12GCC 11.1OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O2-O3 -march=native-O3 -march=native -flto3691215SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 8SE +/- 0.14, N = 4SE +/- 0.09, N = 1513.0913.3013.3412.8612.7512.811. (CXX) g++ options: -O2 -pthread -lm

Coremark

CoreMark Size 666 - Iterations Per Second

Clang 12GCC 11.1OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-O2-O3 -march=native-O3 -march=native -flto200K400K600K800K1000KSE +/- 789.22, N = 3SE +/- 845.83, N = 3SE +/- 581.76, N = 3SE +/- 989.68, N = 3SE +/- 2230.96, N = 3SE +/- 836.68, N = 3712758.36722694.01714933.86830811.53808580.86849671.961. (CC) gcc options: -O2 -lrt" -lrt

AOBench

Size: 2048 x 2048 - Total Time

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O2-O3 -march=native-O3 -march=native -flto816243240SE +/- 0.04, N = 3SE +/- 0.43, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 333.7830.2329.7331.7325.4826.311. (CC) gcc options: -lm -O3

GraphicsMagick

Operation: Rotate

Clang 12GCC 11.1OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O2-O3 -march=native-O3 -march=native -flto2004006008001000SE +/- 5.55, N = 3SE +/- 0.58, N = 3SE +/- 4.37, N = 3SE +/- 11.70, N = 3SE +/- 5.57, N = 3SE +/- 3.48, N = 392198196198510369641. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

Clang 12GCC 11.1OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O2-O3 -march=native-O3 -march=native -flto80160240320400SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 32062352372263703821. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

Clang 12GCC 11.1OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O2-O3 -march=native-O3 -march=native -flto100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 34114524514224494491. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

Clang 12GCC 11.1OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O2-O3 -march=native-O3 -march=native -flto5001000150020002500SE +/- 4.48, N = 3SE +/- 3.61, N = 3SE +/- 2.85, N = 3SE +/- 0.88, N = 3SE +/- 1.53, N = 3SE +/- 3.71, N = 31693176717621803212020851. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.33, N = 3SE +/- 0.32, N = 3SE +/- 0.70, N = 3SE +/- 0.09, N = 3SE +/- 0.46, N = 3SE +/- 0.18, N = 3231.35234.71232.66230.53231.16230.431. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.18, N = 3SE +/- 0.40, N = 3SE +/- 0.48, N = 3SE +/- 0.37, N = 3SE +/- 0.23, N = 3SE +/- 0.33, N = 3235.64239.88238.13235.27235.59234.701. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 1.28, N = 3SE +/- 0.75, N = 3SE +/- 0.89, N = 3SE +/- 0.82, N = 3SE +/- 0.87, N = 3SE +/- 0.97, N = 3225.57227.73227.66222.63223.11222.331. (CC) gcc options: -O3 -fcommon -O2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O2-O3 -march=native-O3 -march=native -flto714212835SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 327.7727.7528.4726.2425.9126.171. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

C-Ray

Total Time - 4K, 16 Rays Per Pixel

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O2-O3 -march=native-O3 -march=native -flto1428425670SE +/- 0.16, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 349.2845.0144.7460.9325.4125.481. (CC) gcc options: -lm -lpthread -O3

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto50100150200250SE +/- 0.95, N = 3SE +/- 0.32, N = 3SE +/- 1.15, N = 3SE +/- 0.49, N = 3SE +/- 0.83, N = 3SE +/- 0.03, N = 3225.15229.27236.05217.32221.71227.041. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

Clang 12GCC 11.1OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O2-O3 -march=native-O3 -march=native -flto80160240320400SE +/- 0.23, N = 3SE +/- 1.66, N = 3SE +/- 0.90, N = 3SE +/- 1.24, N = 3SE +/- 1.07, N = 3SE +/- 0.57, N = 3368.55375.95375.08365.79369.54368.551. (CC) gcc options: -O2 -fPIE -fPIC -O3 -pie -rdynamic -lpthread -lrt

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Clang 12GCC 11.1OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto130M260M390M520M650MSE +/- 1824399.44, N = 3SE +/- 4072558.57, N = 3SE +/- 1855020.22, N = 3SE +/- 185202.59, N = 3SE +/- 4455270.29, N = 3SE +/- 1343755.10, N = 35766600005760500005878000005948500006115600006193933331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Clang 12GCC 11.1OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O2-O3 -march=native-O3 -march=native -flto200M400M600M800M1000MSE +/- 2690724.81, N = 3SE +/- 493288.29, N = 3SE +/- 3219903.38, N = 3SE +/- 4650806.38, N = 3SE +/- 1252996.41, N = 3SE +/- 783865.07, N = 31056400000106190000010734666671041200000108500000010932666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SQLite Speedtest

Timed Time - Size 1,000

Clang 12GCC 11.1OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O2-O3 -march=native-O3 -march=native -flto1122334455SE +/- 0.13, N = 3SE +/- 0.50, N = 3SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.54, N = 347.7947.6348.5847.4046.9246.481. (CC) gcc options: -ldl -lz -lpthread

PJSIP

Method: INVITE

Clang 12GCC 11.1OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O2-O3 -march=native-O3 -march=native -flto10002000300040005000SE +/- 42.78, N = 3SE +/- 66.36, N = 3SE +/- 16.33, N = 3SE +/- 32.67, N = 15SE +/- 65.77, N = 3457547084815467146161. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateful

Clang 12GCC 11.1OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O2-O3 -march=native-O3 -march=native -flto2K4K6K8K10KSE +/- 27.78, N = 3SE +/- 47.54, N = 3SE +/- 72.34, N = 3SE +/- 21.79, N = 3SE +/- 56.75, N = 3794279587860798279161. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateless

Clang 12GCC 11.1OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O2-O3 -march=native-O3 -march=native -flto50K100K150K200K250KSE +/- 1827.48, N = 3SE +/- 962.04, N = 3SE +/- 586.86, N = 3SE +/- 1749.48, N = 3SE +/- 705.50, N = 32211072225722305992217592225811. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread


Phoronix Test Suite v10.8.5