Ryzen 9 5950X AOCC 3.0 Compiler Benchmarking

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2103167-PTS-RYZEN95988&rdt&grw.

Ryzen 9 5950X AOCC 3.0 Compiler BenchmarkingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen ResolutionGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS)AMD Starship/Matisse32GB2000GB Corsair Force MP600 + 2000GBAMD NAVY_FLOUNDER 12GB (2855/1000MHz)AMD Device ab28ASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.105.11.6-051106-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.94.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1)1.2.168GCC 10.2.0ext43840x2160Clang 11.0.0Clang 12.0.0-++rc3-1~exp1~oibaf~gClang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- GCC 10.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - AMD AOCC 2.3: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Ryzen 9 5950X AOCC 3.0 Compiler Benchmarkingtscp: AI Chess Performancecryptopp: Unkeyed Algorithmscompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedcrafty: Elapsed Timebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3ngspice: C2670ngspice: C7552encode-opus: WAV To Opus Encodeencode-wavpack: WAV To WavPackastcenc: Mediumastcenc: Thoroughastcenc: Exhaustiveetcpak: DXT1etcpak: ETC1etcpak: ETC2jpegxl: PNG - 5jpegxl: PNG - 7jpegxl: PNG - 8jpegxl: JPEG - 5jpegxl: JPEG - 7jpegxl: JPEG - 8jpegxl-decode: 1jpegxl-decode: Alllibraw: Post-Processing Benchmarkwebp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionencode-ogg: WAV To Oggsynthmark: VoiceMark_100gcrypt: quantlib: mrbayes: Primate Phylogeny Analysisrnnoise: mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3onnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400monednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUopenfoam: Motorbike 30Mbuild-llvm: Time To Compilecompress-zstd: 8 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedsysbench: CPUaom-av1: Speed 0 Two-Passaom-av1: Speed 4 Two-Passaom-av1: Speed 6 Realtimeaom-av1: Speed 6 Two-Passaom-av1: Speed 8 Realtimegraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacetachyon: Total Timesvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px264: H.264 Video Encodingdav1d: Summer Nature 4Kdav1d: Summer Nature 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080px265: Bosphorus 4Kx265: Bosphorus 1080pc-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeavifenc: 0avifenc: 2avifenc: 6avifenc: 10avifenc: 6, Losslessavifenc: 10, Losslessbuild-godot: Time To Compilesmallpt: Global Illumination Renderer; 128 Samplesgnuradio: Five Back to Back FIR Filtersgnuradio: Signal Source (Cosine)gnuradio: FIR Filtergnuradio: IIR Filtergnuradio: FM Deemphasis Filtergnuradio: Hilbert Transformliquid-dsp: 1 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57redis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETsqlite-speedtest: Timed Time - Size 1,000simdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01965773545.91460912330.5613771.172.3613400.171.1313397.71173124919.8965.15715.90228.12671.60362.8165.48410.1494.05246.992252.92681546.299386.561245.04174.1211.201.1487.3587.0738.1356.53210.9978.662.274111.802203.8116.414367.3711.0421.65213.9905.24228.81313.578966.298171.1863196.959.86914.1975.08125.0653.2402.35132.34443361499150496721216.281211.56712.424.433.854.233.935.321.8312.7657.8914.1110.8225.6720.7713.7717.613.959799.2596717.29054.467773.554672757.521773.670.63866497.75370.5711057.44617.151.64251.71425.94737.11122.64886.236.64350.991743.720.379.2035.1329.43121.13116610563754392165454111544.3941235.04228.96208.93243.69971.796.13751.77427.8389.8025.08924.09343.61523.5388.9272.93430.9774.87579.5234.674920.84715.41063.5843.11055.0515.881844000111120000011649666673549910.503041527.372222217.523470419.902640316.1742.5993.721.225.645.732314225550.45706312456.1713595.372.2113212.668.8013188.421.2165.45315.98428.14972.78264.89410.2933.28999.201250.81402986.706285.289236.46674.6111.311.0489.5189.3235.8764.34213.6750.372.144106.383193.6006.288357.8970.9791.67313.6404.60928.43316.564807.373173.3143710.459.06814.039456646103171056067252.449203.63610.963.523.063.793.164.531.5711.9458.1114.0811.0123.3121.6612.7412.19576.1871117.64468.250.74097.31166.44586.21025.44805.636.74024.7210804861.921131928241461182440284846.1321238.11230.19210.72244.15976.936.85965.75427.4989.7444.53122.53540.74821.8348.3842.93327.6224.83279.943931.64661.61080.3853.31061.0534.875031667106726666713323333333589202.592961165.672351340.563658044.772719539.8344.1103.531.115.926.122148154552.38395412227.4713305.368.3013082.464.4313129.421.3775.52215.90728.22272.46164.5195.58910.3113.50769.499651.66293669.185383.472272.99474.7711.411.0685.6785.8136.4462.27196.3454.142.134103.011188.3106.789349.0220.9771.63813.9104.67428.67313.372795.808172.9023538.559.23414.331426634104149725937270.790206.36011.533.793.334.043.454.801.7312.5357.5113.9111.1423.5421.7012.5017.064.119309.5944217.33802.465613.644852757.751792.270.641231100.16302.6981078.14352.451.34000.91191.64456.61034.536.83957.92445437.630.429.7437.530.03118.2211081016237457178939884445.0731223.50219.12213.57244.37979.346.82365.31128.0292.1644.88722.12241.07822.0738.3422.95227.8544.80780.211911.24769.81060.1835.41054.9522.877794333106766666713352333333649832.582954866.82212779.003624414.372762047.543.2613.711.146.186.262283512538.87620412124.8113144.872.3013010.768.4012981.821.3685.63916.05528.16673.27964.91110.3433.40409.349351.45453583.008286.927242.03373.6411.171.0483.5483.6334.3459.92191.9152.682.165105.721193.4266.364356.8151.0071.66913.8414.93728.19216.540789.224175.6943646.457.99014.471465649102154745976260.663204.60311.283.533.073.873.184.501.5611.9658.9613.8611.1123.2921.9312.4012.304.096639.5719417.36302.468503.583642760.941760.570.643093610.9611096.34463.150.53608.81186.04543.81024.536.43978.2210533984.511083867240452172039280544.9981225.17221.59210.35229.03959.296.91764.27926.9688.7044.33222.53741.02822.0058.3092.94127.9144.83779.867929.14704.71065.4838.31055.8523.678734000108603333313349000003766645.922948093.682345671.033545388.922719036.23.611.126.046.23OpenBenchmarking.org

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0500K1000K1500K2000K2500KSE +/- 7442.75, N = 5SE +/- 4348.49, N = 5SE +/- 4267.44, N = 5SE +/- 3546.09, N = 519657732314225214815422835121. (CC) gcc options: -O3 -march=native

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0120240360480600SE +/- 3.29, N = 15SE +/- 1.69, N = 3SE +/- 1.73, N = 3SE +/- 2.13, N = 3545.91550.46552.38538.881. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03K6K9K12K15KSE +/- 76.55, N = 3SE +/- 57.72, N = 3SE +/- 66.76, N = 3SE +/- 49.14, N = 312330.5612456.1712227.4712124.811. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03K6K9K12K15KSE +/- 38.97, N = 3SE +/- 59.50, N = 3SE +/- 93.71, N = 3SE +/- 106.53, N = 313771.113595.313305.313144.81. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01632486480SE +/- 0.86, N = 3SE +/- 0.77, N = 5SE +/- 0.19, N = 3SE +/- 0.21, N = 372.3672.2168.3072.301. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03K6K9K12K15KSE +/- 48.22, N = 3SE +/- 15.17, N = 5SE +/- 46.92, N = 3SE +/- 30.75, N = 313400.113212.613082.413010.71. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01632486480SE +/- 0.68, N = 6SE +/- 0.51, N = 3SE +/- 0.14, N = 3SE +/- 0.68, N = 671.1368.8064.4368.401. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03K6K9K12K15KSE +/- 35.65, N = 6SE +/- 53.22, N = 3SE +/- 21.95, N = 3SE +/- 24.92, N = 613397.713188.413129.412981.81. (CC) gcc options: -O3

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeGCC 10.23M6M9M12M15MSE +/- 26371.45, N = 3117312491. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1SGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0510152025SE +/- 0.04, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 3SE +/- 0.21, N = 319.9021.2221.3821.371. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01.26882.53763.80645.07526.344SE +/- 0.023, N = 3SE +/- 0.009, N = 3SE +/- 0.015, N = 3SE +/- 0.015, N = 35.1575.4535.5225.6391. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 315.9015.9815.9116.061. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0714212835SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 328.1328.1528.2228.171. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01632486480SE +/- 0.21, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 371.6072.7872.4673.281. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01428425670SE +/- 0.15, N = 3SE +/- 0.54, N = 3SE +/- 0.20, N = 3SE +/- 0.08, N = 362.8264.8964.5264.911. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeGCC 10.2LLVM Clang 121.25752.5153.77255.036.2875SE +/- 0.031, N = 5SE +/- 0.037, N = 55.4845.589-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.10, N = 5SE +/- 0.03, N = 5SE +/- 0.11, N = 5SE +/- 0.01, N = 510.1510.2910.3110.341. (CXX) g++ options: -O3 -march=native -rdynamic

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.91181.82362.73543.64724.559SE +/- 0.0178, N = 3SE +/- 0.0273, N = 3SE +/- 0.0018, N = 3SE +/- 0.0017, N = 34.05243.28993.50763.40401. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.0057, N = 3SE +/- 0.0090, N = 3SE +/- 0.0075, N = 3SE +/- 0.0148, N = 36.99229.20129.49969.34931. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01224364860SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 352.9350.8151.6651.451. (CXX) g++ options: -O3 -march=native -flto -pthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.08001600240032004000SE +/- 2.21, N = 3SE +/- 5.75, N = 3SE +/- 26.84, N = 3SE +/- 7.06, N = 31546.302986.713669.193583.011. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.080160240320400SE +/- 0.37, N = 3SE +/- 1.16, N = 3SE +/- 0.14, N = 3SE +/- 0.06, N = 3386.56285.29383.47286.931. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.060120180240300SE +/- 1.65, N = 3SE +/- 2.43, N = 3SE +/- 0.09, N = 3SE +/- 0.45, N = 3245.04236.47272.99242.031. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 374.1274.6174.7773.64-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 311.2011.3111.4111.17-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 8GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.25650.5130.76951.0261.2825SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 31.141.041.061.04-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 5GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.14, N = 3SE +/- 0.25, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 387.3589.5185.6783.54-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 7GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.19, N = 3SE +/- 0.25, N = 3SE +/- 0.20, N = 3SE +/- 0.01, N = 387.0789.3285.8183.63-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0918273645SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 338.1335.8736.4434.34-Xclang -mrelax-all-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: 1GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01428425670SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 356.5364.3462.2759.92

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: AllGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 0.29, N = 3SE +/- 0.40, N = 3SE +/- 0.05, N = 3SE +/- 0.21, N = 3210.99213.67196.34191.91

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.16, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 378.6650.3754.1452.681. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.51171.02341.53512.04682.5585SE +/- 0.005, N = 3SE +/- 0.024, N = 3SE +/- 0.011, N = 3SE +/- 0.025, N = 32.2742.1442.1342.1651. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0306090120150SE +/- 1.06, N = 3SE +/- 0.42, N = 3SE +/- 0.95, N = 3SE +/- 0.88, N = 3111.80106.38103.01105.721. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.04080120160200SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.56, N = 3203.81193.60188.31193.431. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0246810SE +/- 0.011, N = 3SE +/- 0.015, N = 3SE +/- 0.010, N = 3SE +/- 0.010, N = 36.4146.2886.7896.3641. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.080160240320400SE +/- 0.42, N = 3SE +/- 1.28, N = 3SE +/- 0.58, N = 3SE +/- 1.38, N = 3367.37357.90349.02356.821. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.23450.4690.70350.9381.1725SE +/- 0.008, N = 3SE +/- 0.006, N = 3SE +/- 0.014, N = 3SE +/- 0.005, N = 31.0420.9790.9771.0071. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.37640.75281.12921.50561.882SE +/- 0.018, N = 4SE +/- 0.006, N = 3SE +/- 0.003, N = 3SE +/- 0.011, N = 31.6521.6731.6381.6691. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 313.9913.6413.9113.841. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01.17952.3593.53854.7185.8975SE +/- 0.018, N = 3SE +/- 0.016, N = 3SE +/- 0.055, N = 3SE +/- 0.020, N = 35.2424.6094.6744.9371. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0714212835SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 328.8128.4328.6728.191. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To OggGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 313.5816.5613.3716.541. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 1.26, N = 3SE +/- 5.04, N = 3SE +/- 4.01, N = 3SE +/- 5.41, N = 3966.30807.37795.81789.221. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.04080120160200SE +/- 0.29, N = 3SE +/- 1.67, N = 3SE +/- 1.10, N = 3SE +/- 0.17, N = 3171.19173.31172.90175.691. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.08001600240032004000SE +/- 33.41, N = 5SE +/- 28.46, N = 10SE +/- 49.56, N = 3SE +/- 27.64, N = 103196.93710.43538.53646.41. (CXX) g++ options: -O3 -march=native -rdynamic

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01326395265SE +/- 0.15, N = 3SE +/- 0.10, N = 3SE +/- 0.81, N = 3SE +/- 0.10, N = 359.8759.0759.2357.99-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm -lreadline

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.04, N = 3SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.16, N = 314.2014.0414.3314.471. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0GCC 10.21.14322.28643.42964.57285.716SE +/- 0.010, N = 35.081MIN: 4.92 / MAX: 14.741. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50GCC 10.2612182430SE +/- 0.02, N = 325.07MIN: 23.97 / MAX: 39.951. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224GCC 10.20.7291.4582.1872.9163.645SE +/- 0.049, N = 33.240MIN: 3.12 / MAX: 11.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0GCC 10.20.5291.0581.5872.1162.645SE +/- 0.027, N = 32.351MIN: 2.27 / MAX: 7.491. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3GCC 10.2816243240SE +/- 0.09, N = 332.34MIN: 31.33 / MAX: 42.611. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0100200300400500SE +/- 1.96, N = 3SE +/- 2.95, N = 3SE +/- 2.13, N = 3SE +/- 1.36, N = 3433456426465-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0140280420560700SE +/- 6.71, N = 3SE +/- 6.07, N = 12SE +/- 5.80, N = 3SE +/- 5.59, N = 12614646634649-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.17, N = 3SE +/- 0.44, N = 3SE +/- 0.29, N = 3SE +/- 0.44, N = 399103104102-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.04K8K12K16K20KSE +/- 134.84, N = 3SE +/- 193.85, N = 4SE +/- 123.46, N = 12SE +/- 177.70, N = 315049171051497215474-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.014002800420056007000SE +/- 215.50, N = 12SE +/- 34.74, N = 3SE +/- 55.09, N = 12SE +/- 38.28, N = 36721606759375976-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.060120180240300SE +/- 0.56, N = 3SE +/- 0.35, N = 3SE +/- 0.58, N = 3SE +/- 0.69, N = 3216.28252.45270.79260.66-fopenmp - MIN: 215.1 / MAX: 218.26-fopenmp=libomp - MIN: 250.25 / MAX: 255.53-fopenmp=libomp - MIN: 268.42 / MAX: 272.22-fopenmp=libomp - MIN: 257.51 / MAX: 262.881. (CXX) g++ options: -O3 -march=native -pthread -fvisibility=hidden -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 0.57, N = 3SE +/- 0.85, N = 3SE +/- 1.02, N = 3SE +/- 0.48, N = 3211.57203.64206.36204.60-fopenmp - MIN: 206.88 / MAX: 212.83-fopenmp=libomp - MIN: 201.91 / MAX: 206.13-fopenmp=libomp - MIN: 204.24 / MAX: 209.24-fopenmp=libomp - MIN: 203.72 / MAX: 206.331. (CXX) g++ options: -O3 -march=native -pthread -fvisibility=hidden -rdynamic -ldl

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenetGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.16, N = 15SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 412.4210.9611.5311.28-lgomp - MIN: 11.7 / MAX: 20.08-lomp - MIN: 10.51 / MAX: 16.79-lomp - MIN: 11.09 / MAX: 12.2-lomp - MIN: 10.61 / MAX: 20.991. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.99681.99362.99043.98724.984SE +/- 0.01, N = 15SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 44.433.523.793.53-lgomp - MIN: 4.19 / MAX: 11.09-lomp - MIN: 3.34 / MAX: 4.84-lomp - MIN: 3.63 / MAX: 5.2-lomp - MIN: 3.27 / MAX: 4.751. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.86631.73262.59893.46524.3315SE +/- 0.02, N = 15SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 43.853.063.333.07-lgomp - MIN: 3.74 / MAX: 10.85-lomp - MIN: 2.98 / MAX: 4.3-lomp - MIN: 3.19 / MAX: 5.6-lomp - MIN: 2.9 / MAX: 4.411. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.95181.90362.85543.80724.759SE +/- 0.01, N = 15SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 44.233.794.043.87-lgomp - MIN: 4.15 / MAX: 9.05-lomp - MIN: 3.64 / MAX: 4.86-lomp - MIN: 3.88 / MAX: 5.03-lomp - MIN: 3.67 / MAX: 12.941. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnetGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.88431.76862.65293.53724.4215SE +/- 0.02, N = 15SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 43.933.163.453.18-lgomp - MIN: 3.71 / MAX: 6.06-lomp - MIN: 3.06 / MAX: 4.05-lomp - MIN: 3.37 / MAX: 4.6-lomp - MIN: 3.04 / MAX: 4.481. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01.1972.3943.5914.7885.985SE +/- 0.02, N = 15SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 45.324.534.804.50-lgomp - MIN: 5.15 / MAX: 13.83-lomp - MIN: 4.35 / MAX: 6.86-lomp - MIN: 4.71 / MAX: 6.61-lomp - MIN: 4.34 / MAX: 5.71. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazefaceGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.41180.82361.23541.64722.059SE +/- 0.01, N = 15SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 41.831.571.731.56-lgomp - MIN: 1.77 / MAX: 3.9-lomp - MIN: 1.54 / MAX: 1.75-lomp - MIN: 1.68 / MAX: 1.79-lomp - MIN: 1.46 / MAX: 6.91. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenetGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.06, N = 15SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 412.7611.9412.5311.96-lgomp - MIN: 12.19 / MAX: 19.36-lomp - MIN: 11.62 / MAX: 12.42-lomp - MIN: 12.12 / MAX: 17.12-lomp - MIN: 11.46 / MAX: 13.271. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01326395265SE +/- 0.12, N = 15SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.19, N = 457.8958.1157.5158.96-lgomp - MIN: 55.89 / MAX: 80.86-lomp - MIN: 56.81 / MAX: 67.53-lomp - MIN: 56.17 / MAX: 62.53-lomp - MIN: 57.64 / MAX: 66.681. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.05, N = 15SE +/- 0.21, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 414.1114.0813.9113.86-lgomp - MIN: 13.84 / MAX: 23.15-lomp - MIN: 13.56 / MAX: 21.13-lomp - MIN: 13.66 / MAX: 14.54-lomp - MIN: 13.44 / MAX: 16.351. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnetGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.09, N = 15SE +/- 0.00, N = 2SE +/- 0.04, N = 3SE +/- 0.04, N = 410.8211.0111.1411.11-lgomp - MIN: 10.41 / MAX: 17.59-lomp - MIN: 10.84 / MAX: 12.26-lomp - MIN: 10.96 / MAX: 13.28-lomp - MIN: 10.92 / MAX: 15.761. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0612182430SE +/- 0.21, N = 15SE +/- 0.09, N = 3SE +/- 0.20, N = 3SE +/- 0.20, N = 425.6723.3123.5423.29-lgomp - MIN: 24.52 / MAX: 35.96-lomp - MIN: 22.75 / MAX: 33.51-lomp - MIN: 22.92 / MAX: 26.57-lomp - MIN: 22.43 / MAX: 25.171. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tinyGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0510152025SE +/- 0.17, N = 15SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 420.7721.6621.7021.93-lgomp - MIN: 19.69 / MAX: 43.19-lomp - MIN: 21.17 / MAX: 27.18-lomp - MIN: 21.21 / MAX: 30.18-lomp - MIN: 21.28 / MAX: 24.811. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssdGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.06, N = 15SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.24, N = 413.7712.7412.5012.40-lgomp - MIN: 13.25 / MAX: 23.45-lomp - MIN: 12 / MAX: 19.89-lomp - MIN: 12.23 / MAX: 16.6-lomp - MIN: 11.72 / MAX: 15.691. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400mGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.06, N = 15SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 317.6112.1917.0612.30-lgomp - MIN: 16.94 / MAX: 25.97-lomp - MIN: 11.89 / MAX: 13.6-lomp - MIN: 16.85 / MAX: 20.53-lomp - MIN: 11.96 / MAX: 17.611. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.00.92681.85362.78043.70724.634SE +/- 0.00506, N = 3SE +/- 0.01294, N = 3SE +/- 0.01273, N = 33.959794.119304.09663-fopenmp - MIN: 3.76-fopenmp=libomp - MIN: 3.88-fopenmp=libomp - MIN: 3.91. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.03691215SE +/- 0.01340, N = 3SE +/- 0.01452, N = 3SE +/- 0.01936, N = 39.259679.594429.57194-fopenmp - MIN: 9.1-fopenmp=libomp - MIN: 9.47-fopenmp=libomp - MIN: 9.461. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.048121620SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 317.2917.3417.36-fopenmp - MIN: 16.58-fopenmp=libomp - MIN: 16.81-fopenmp=libomp - MIN: 16.831. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.01.00522.01043.01564.02085.026SE +/- 0.30276, N = 15SE +/- 0.00451, N = 3SE +/- 0.00340, N = 34.467772.465612.46850-fopenmp - MIN: 2.86-fopenmp=libomp - MIN: 2.33-fopenmp=libomp - MIN: 2.361. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.00.82011.64022.46033.28044.1005SE +/- 0.00753, N = 3SE +/- 0.01444, N = 3SE +/- 0.00604, N = 33.554673.644853.58364-fopenmp - MIN: 3.46-fopenmp=libomp - MIN: 3.5-fopenmp=libomp - MIN: 3.441. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.06001200180024003000SE +/- 2.01, N = 3SE +/- 5.95, N = 3SE +/- 17.19, N = 32757.522757.752760.94-fopenmp - MIN: 2719.35-fopenmp=libomp - MIN: 2734.73-fopenmp=libomp - MIN: 2717.591. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.0400800120016002000SE +/- 5.00, N = 3SE +/- 9.12, N = 3SE +/- 3.74, N = 31773.671792.271760.57-fopenmp - MIN: 1750.26-fopenmp=libomp - MIN: 1766.32-fopenmp=libomp - MIN: 1745.871. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUGCC 10.2LLVM Clang 12AMD AOCC 3.00.14470.28940.43410.57880.7235SE +/- 0.000722, N = 3SE +/- 0.000908, N = 3SE +/- 0.004823, N = 30.6386640.6412310.643093-fopenmp - MIN: 0.61-fopenmp=libomp - MIN: 0.61-fopenmp=libomp - MIN: 0.611. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MGCC 10.2LLVM Clang 1220406080100SE +/- 0.08, N = 3SE +/- 0.06, N = 397.75100.161. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0130260390520650SE +/- 2.79, N = 3SE +/- 5.39, N = 3SE +/- 1.23, N = 3SE +/- 5.16, N = 3370.57576.19302.70610.96

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression SpeedLLVM Clang 12GCC 10.2AMD AOCC 2.3AMD AOCC 3.02004006008001000SE +/- 11.61, N = 3SE +/- 3.93, N = 3SE +/- 9.29, N = 3SE +/- 9.53, N = 151043.21057.41117.61096.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.010002000300040005000SE +/- 26.73, N = 3SE +/- 37.90, N = 2SE +/- 8.16, N = 114617.14468.24352.44463.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01224364860SE +/- 0.20, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 351.650.751.350.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.09001800270036004500SE +/- 6.53, N = 3SE +/- 12.18, N = 3SE +/- 50.52, N = 3SE +/- 417.40, N = 34251.74097.34000.93608.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.030060090012001500SE +/- 2.43, N = 3SE +/- 4.71, N = 3SE +/- 1.19, N = 3SE +/- 2.80, N = 31425.91166.41191.61186.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.010002000300040005000SE +/- 46.74, N = 3SE +/- 31.63, N = 3SE +/- 2.17, N = 3SE +/- 33.94, N = 34737.14586.24456.64543.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression SpeedLLVM Clang 12AMD AOCC 2.3AMD AOCC 3.0GCC 10.22004006008001000SE +/- 5.17, N = 3SE +/- 4.53, N = 3SE +/- 5.99, N = 3SE +/- 2.15, N = 31039.11026.11023.51122.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression SpeedGCC 10.2AMD AOCC 2.310002000300040005000SE +/- 29.99, N = 34886.24805.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 336.636.736.836.41. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression SpeedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.09001800270036004500SE +/- 72.38, N = 3SE +/- 16.46, N = 3SE +/- 25.47, N = 3SE +/- 39.89, N = 34350.94024.73957.93978.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050M100M150M200M250MSE +/- 115.96, N = 3SE +/- 301436.60, N = 3SE +/- 5355.39, N = 3SE +/- 204338.54, N = 391743.72210804861.922445437.63210533984.511. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 0 Two-PassGCC 10.2LLVM Clang 120.09450.1890.28350.3780.4725SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.421. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 4 Two-PassGCC 10.2LLVM Clang 123691215SE +/- 0.02, N = 3SE +/- 0.05, N = 39.209.741. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 RealtimeGCC 10.2LLVM Clang 12918273645SE +/- 0.16, N = 3SE +/- 0.31, N = 335.1337.501. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Two-PassGCC 10.2LLVM Clang 12714212835SE +/- 0.26, N = 3SE +/- 0.13, N = 329.4330.031. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 8 RealtimeGCC 10.2LLVM Clang 12306090120150SE +/- 0.75, N = 3SE +/- 1.02, N = 3121.13118.221. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.030060090012001500SE +/- 3.67, N = 3SE +/- 3.71, N = 3SE +/- 4.48, N = 3SE +/- 3.84, N = 311661131110810831. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 3.51, N = 3SE +/- 8.67, N = 3SE +/- 1.86, N = 3SE +/- 2.03, N = 3105692810168671. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.080160240320400SE +/- 1.00, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 33752412372401. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 34394614574521. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.05001000150020002500SE +/- 1.45, N = 3SE +/- 1.15, N = 3SE +/- 2.65, N = 321651824178917201. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0100200300400500SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 34544023983921. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 1.33, N = 3SE +/- 1.00, N = 3SE +/- 3.06, N = 311158488448051. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01020304050SE +/- 0.13, N = 3SE +/- 0.20, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 344.3946.1345.0745.001. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 2.40, N = 12SE +/- 2.24, N = 13SE +/- 2.40, N = 12SE +/- 2.47, N = 12235.04238.11223.50225.171. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 0.68, N = 3SE +/- 0.90, N = 3SE +/- 0.82, N = 3SE +/- 0.39, N = 3228.96230.19219.12221.591. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 1.66, N = 9SE +/- 1.86, N = 8SE +/- 1.62, N = 12SE +/- 1.75, N = 9208.93210.72213.57210.35-mstack-alignment=64-mstack-alignment=64-mstack-alignment=641. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.050100150200250SE +/- 0.47, N = 3SE +/- 0.32, N = 3SE +/- 0.04, N = 3SE +/- 0.43, N = 3243.69244.15244.37229.03-lm - MIN: 181.29 / MAX: 252.3MIN: 180.82 / MAX: 252.96MIN: 182.08 / MAX: 252.22MIN: 171.52 / MAX: 237.171. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 1.38, N = 3SE +/- 9.00, N = 3SE +/- 2.97, N = 3SE +/- 1.29, N = 3971.79976.93979.34959.29-lm - MIN: 732.02 / MAX: 1055.82MIN: 633.01 / MAX: 1069.88MIN: 717.55 / MAX: 1062.34MIN: 714.89 / MAX: 1039.771. (CC) gcc options: -O3 -march=native -pthread

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0246810SE +/- 0.014, N = 3SE +/- 0.004, N = 3SE +/- 0.033, N = 3SE +/- 0.055, N = 36.1376.8596.8236.9171. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01530456075SE +/- 0.24, N = 3SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.66, N = 351.7765.7565.3164.281. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0714212835SE +/- 0.16, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 327.8327.4928.0226.961. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.13, N = 3SE +/- 0.35, N = 389.8089.7492.1688.701. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01020304050SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 325.0944.5344.8944.331. (CC) gcc options: -lm -lpthread -O3 -march=native

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0612182430SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 324.0922.5422.1222.541. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01020304050SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.18, N = 3SE +/- 0.05, N = 343.6240.7541.0841.031. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0612182430SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 323.5421.8322.0722.011. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0246810SE +/- 0.048, N = 3SE +/- 0.027, N = 3SE +/- 0.055, N = 3SE +/- 0.012, N = 38.9278.3848.3428.3091. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.66421.32841.99262.65683.321SE +/- 0.014, N = 3SE +/- 0.035, N = 3SE +/- 0.016, N = 3SE +/- 0.006, N = 32.9342.9332.9522.9411. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0714212835SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 330.9827.6227.8527.911. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.01.09692.19383.29074.38765.4845SE +/- 0.022, N = 3SE +/- 0.041, N = 3SE +/- 0.038, N = 3SE +/- 0.015, N = 34.8754.8324.8074.8371. (CXX) g++ options: -O3 -fPIC -lm

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020406080100SE +/- 0.19, N = 3SE +/- 0.28, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 379.5279.9480.2179.87

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesGCC 10.21.05172.10343.15514.20685.2585SE +/- 0.015, N = 34.6741. (CXX) g++ options: -fopenmp -O3 -march=native

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR FiltersGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 19.67, N = 9SE +/- 20.04, N = 9SE +/- 17.91, N = 9SE +/- 20.61, N = 8920.8931.6911.2929.11. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.010002000300040005000SE +/- 16.39, N = 9SE +/- 22.36, N = 9SE +/- 10.26, N = 9SE +/- 60.32, N = 84715.44661.64769.84704.71. 3.8.1.0

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR FilterGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 2.09, N = 9SE +/- 4.68, N = 9SE +/- 3.22, N = 9SE +/- 3.65, N = 81063.51080.31060.11065.41. 3.8.1.0

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR FilterGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 1.32, N = 9SE +/- 2.72, N = 9SE +/- 1.22, N = 9SE +/- 2.72, N = 8843.1853.3835.4838.31. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis FilterGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.02004006008001000SE +/- 0.78, N = 9SE +/- 15.16, N = 9SE +/- 0.98, N = 9SE +/- 3.09, N = 81055.01061.01054.91055.81. 3.8.1.0

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert TransformGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0120240360480600SE +/- 0.63, N = 9SE +/- 1.95, N = 9SE +/- 0.58, N = 9SE +/- 1.23, N = 8515.8534.8522.8523.61. 3.8.1.0

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.020M40M60M80M100MSE +/- 828458.69, N = 5SE +/- 78876.13, N = 3SE +/- 171803.51, N = 3SE +/- 601612.28, N = 3818440007503166777794333787340001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0200M400M600M800M1000MSE +/- 5768882.04, N = 3SE +/- 3628743.28, N = 3SE +/- 3699249.17, N = 3SE +/- 3939684.14, N = 311112000001067266667106766666710860333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57GCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0300M600M900M1200M1500MSE +/- 497772.82, N = 3SE +/- 1125956.38, N = 3SE +/- 240370.09, N = 3SE +/- 1422439.22, N = 311649666671332333333133523333313349000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOPGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0800K1600K2400K3200K4000KSE +/- 26197.04, N = 3SE +/- 30792.29, N = 8SE +/- 31635.54, N = 3SE +/- 45854.80, N = 153549910.503589202.593649832.583766645.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0700K1400K2100K2800K3500KSE +/- 39730.96, N = 15SE +/- 40118.44, N = 3SE +/- 29853.73, N = 3SE +/- 27502.49, N = 153041527.372961165.672954866.802948093.681. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0500K1000K1500K2000K2500KSE +/- 23396.73, N = 15SE +/- 27675.97, N = 4SE +/- 30760.12, N = 3SE +/- 35143.82, N = 152222217.522351340.562212779.002345671.031. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0800K1600K2400K3200K4000KSE +/- 36718.95, N = 15SE +/- 58906.79, N = 15SE +/- 47796.61, N = 15SE +/- 11517.61, N = 33470419.903658044.773624414.373545388.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0600K1200K1800K2400K3000KSE +/- 26145.63, N = 15SE +/- 23132.25, N = 3SE +/- 28596.87, N = 3SE +/- 14014.88, N = 32640316.172719539.832762047.502719036.201. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000GCC 10.2AMD AOCC 2.3LLVM Clang 121020304050SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 342.6044.1143.261. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.8371.6742.5113.3484.185SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 33.723.533.713.611. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.00.27450.5490.82351.0981.3725SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.221.111.141.121. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0246810SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 35.645.926.186.041. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDGCC 10.2AMD AOCC 2.3LLVM Clang 12AMD AOCC 3.0246810SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 35.736.126.266.231. (CXX) g++ options: -O3 -march=native -pthread


Phoronix Test Suite v10.8.5