Ryzen 9 5950X AOCC 3.0 Compiler Benchmarking

Benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2103167-PTS-RYZEN95988&grr&sro.

System configuration

Configurations compared: GCC 10.2, LLVM Clang 12, AMD AOCC 2.3, AMD AOCC 3.0

Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 2000GB Corsair Force MP600 + 2000GB
Graphics: AMD NAVY_FLOUNDER 12GB (2855/1000MHz)
Audio: AMD Device ab28
Monitor: ASUS MG28U
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.10
Kernel: 5.11.6-051106-generic (x86_64)
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1)
Vulkan: 1.2.168
File-System: ext4
Screen Resolution: 3840x2160

Compiler, per configuration
- GCC 10.2: GCC 10.2.0
- LLVM Clang 12: Clang 12.0.0-++rc3-1~exp1~oibaf~g
- AMD AOCC 2.3: Clang 11.0.0
- AMD AOCC 3.0: Clang 12.0.0

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- GCC 10.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- AMD AOCC 2.3: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)
- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa201009

Python Details
- Python 3.8.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Result overview

Configurations: GCC 10.2, LLVM Clang 12, AMD AOCC 2.3, AMD AOCC 3.0

Tests, in overview order:
gnuradio: Hilbert Transform
gnuradio: FM Deemphasis Filter
gnuradio: IIR Filter
gnuradio: FIR Filter
gnuradio: Signal Source (Cosine)
gnuradio: Five Back to Back FIR Filters
build-llvm: Time To Compile
webp2: Quality 100, Lossless Compression
onnx: bertsquad-10 - OpenMP CPU
onnx: super-resolution-10 - OpenMP CPU
jpegxl: PNG - 8
onnx: shufflenet-v2-10 - OpenMP CPU
webp2: Quality 95, Compression Effort 7
gcrypt
cryptopp: Unkeyed Algorithms
onnx: fcn-resnet101-11 - OpenMP CPU
onnx: yolov4 - OpenMP CPU
ncnn: CPU - regnety_400m
ncnn: CPU - squeezenet_ssd
ncnn: CPU - yolov4-tiny
ncnn: CPU - resnet50
ncnn: CPU - alexnet
ncnn: CPU - resnet18
ncnn: CPU - vgg16
ncnn: CPU - googlenet
ncnn: CPU - blazeface
ncnn: CPU - efficientnet-b0
ncnn: CPU - mnasnet
ncnn: CPU - shufflenet-v2
ncnn: CPU-v3-v3 - mobilenet-v3
ncnn: CPU-v2-v2 - mobilenet-v2
ncnn: CPU - mobilenet
webp2: Quality 75, Compression Effort 7
openfoam: Motorbike 30M
jpegxl: PNG - 7
sysbench: CPU
onednn: Recurrent Neural Network Training - f32 - CPU
build-godot: Time To Compile
onednn: Recurrent Neural Network Inference - f32 - CPU
mnn: inception-v3
mnn: mobilenet-v1-1.0
mnn: MobileNetV2_224
mnn: resnet-v2-50
mnn: SqueezeNetV1.0
compress-zstd: 8 - Decompression Speed
compress-lz4: 9 - Decompression Speed
compress-lz4: 9 - Compression Speed
ngspice: C2670
aom-av1: Speed 4 Two-Pass
compress-zstd: 8 - Compression Speed
simdjson: DistinctUserID
simdjson: PartialTweets
ngspice: C7552
graphics-magick: Sharpen
graphics-magick: Enhanced
graphics-magick: Noise-Gaussian
graphics-magick: Resizing
graphics-magick: HWB Color Space
graphics-magick: Rotate
graphics-magick: Swirl
mrbayes: Primate Phylogeny Analysis
compress-lz4: 3 - Decompression Speed
compress-lz4: 3 - Compression Speed
simdjson: Kostya
astcenc: Exhaustive
quantlib
redis: GET
aom-av1: Speed 0 Two-Pass
simdjson: LargeRand
onednn: Deconvolution Batch shapes_1d - f32 - CPU
compress-zstd: 19, Long Mode - Decompression Speed
compress-zstd: 19, Long Mode - Compression Speed
tachyon: Total Time
jpegxl-decode: 1
redis: LPUSH
sqlite-speedtest: Timed Time - Size 1,000
avifenc: 0
jpegxl-decode: All
compress-zstd: 19 - Decompression Speed
compress-zstd: 19 - Compression Speed
redis: SADD
c-ray: Total Time - 4K, 16 Rays Per Pixel
compress-zstd: 8, Long Mode - Compression Speed
compress-zstd: 8, Long Mode - Decompression Speed
compress-zstd: 3, Long Mode - Decompression Speed
compress-zstd: 3, Long Mode - Compression Speed
compress-lz4: 1 - Decompression Speed
compress-lz4: 1 - Compression Speed
redis: LPOP
synthmark: VoiceMark_100
webp: Quality 100, Lossless, Highest Compression
avifenc: 6, Lossless
redis: SET
basis: UASTC Level 3
aom-av1: Speed 6 Two-Pass
povray: Trace Time
liquid-dsp: 1 - 256 - 57
avifenc: 2
x265: Bosphorus 4K
libraw: Post-Processing Benchmark
basis: ETC1S
liquid-dsp: 32 - 256 - 57
liquid-dsp: 16 - 256 - 57
etcpak: ETC2
jpegxl: PNG - 5
tnn: CPU - MobileNet v2
encode-wavpack: WAV To WavPack
crafty: Elapsed Time
aom-av1: Speed 6 Realtime
basis: UASTC Level 2
jpegxl: JPEG - 5
onednn: IP Shapes 1D - f32 - CPU
encode-ogg: WAV To Ogg
dav1d: Summer Nature 4K
etcpak: ETC1
tnn: CPU - SqueezeNet v1.1
rnnoise
webp: Quality 100, Lossless
svt-av1: Enc Mode 4 - 1080p
jpegxl: JPEG - 7
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p
astcenc: Thorough
onednn: IP Shapes 3D - f32 - CPU
encode-opus: WAV To Opus Encode
x264: H.264 Video Encoding
avifenc: 6
jpegxl: JPEG - 8
x265: Bosphorus 1080p
webp2: Quality 100, Compression Effort 5
svt-av1: Enc Mode 8 - 1080p
onednn: Convolution Batch Shapes Auto - f32 - CPU
basis: UASTC Level 0
aom-av1: Speed 8 Realtime
webp: Quality 100, Highest Compression
avifenc: 10, Lossless
smallpt: Global Illumination Renderer; 128 Samples
dav1d: Summer Nature 1080p
astcenc: Medium
onednn: Deconvolution Batch shapes_3d - f32 - CPU
svt-vp9: Visual Quality Optimized - Bosphorus 1080p
avifenc: 10
webp2: Default
etcpak: DXT1
webp: Quality 100
tscp: AI Chess Performance
webp: Default

GNU Radio

Test: Hilbert Transform

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 534.8 (SE +/- 1.95, N = 9)
AMD AOCC 3.0: 523.6 (SE +/- 1.23, N = 8)
GCC 10.2: 515.8 (SE +/- 0.63, N = 9)
LLVM Clang 12: 522.8 (SE +/- 0.58, N = 9)
1. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 1061.0 (SE +/- 15.16, N = 9)
AMD AOCC 3.0: 1055.8 (SE +/- 3.09, N = 8)
GCC 10.2: 1055.0 (SE +/- 0.78, N = 9)
LLVM Clang 12: 1054.9 (SE +/- 0.98, N = 9)
1. 3.8.1.0

GNU Radio

Test: IIR Filter

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 853.3 (SE +/- 2.72, N = 9)
AMD AOCC 3.0: 838.3 (SE +/- 2.72, N = 8)
GCC 10.2: 843.1 (SE +/- 1.32, N = 9)
LLVM Clang 12: 835.4 (SE +/- 1.22, N = 9)
1. 3.8.1.0

GNU Radio

Test: FIR Filter

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 1080.3 (SE +/- 4.68, N = 9)
AMD AOCC 3.0: 1065.4 (SE +/- 3.65, N = 8)
GCC 10.2: 1063.5 (SE +/- 2.09, N = 9)
LLVM Clang 12: 1060.1 (SE +/- 3.22, N = 9)
1. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 4661.6 (SE +/- 22.36, N = 9)
AMD AOCC 3.0: 4704.7 (SE +/- 60.32, N = 8)
GCC 10.2: 4715.4 (SE +/- 16.39, N = 9)
LLVM Clang 12: 4769.8 (SE +/- 10.26, N = 9)
1. 3.8.1.0

GNU Radio

Test: Five Back to Back FIR Filters

GNU Radio, MiB/s (more is better)
AMD AOCC 2.3: 931.6 (SE +/- 20.04, N = 9)
AMD AOCC 3.0: 929.1 (SE +/- 20.61, N = 8)
GCC 10.2: 920.8 (SE +/- 19.67, N = 9)
LLVM Clang 12: 911.2 (SE +/- 17.91, N = 9)
1. 3.8.1.0
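
The standard errors reported with each GNU Radio result give a rough way to judge whether the gap between two compilers exceeds run-to-run noise. The following is a minimal Python sketch and is not part of the exported result. It assumes the runs are independent and roughly normally distributed, and the helper name `z_score` is hypothetical.

```python
import math

def z_score(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error."""
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)

# Hilbert Transform: AMD AOCC 2.3 vs GCC 10.2 (MiB/s and SE as listed above)
hilbert = z_score(534.8, 1.95, 515.8, 0.63)

# Five Back to Back FIR Filters: AMD AOCC 2.3 vs LLVM Clang 12
five_fir = z_score(931.6, 20.04, 911.2, 17.91)

print(f"Hilbert Transform z = {hilbert:.2f}")
print(f"Five Back to Back FIR Filters z = {five_fir:.2f}")
```

Under these assumptions, a magnitude well above 2 (as for Hilbert Transform) suggests a difference larger than the noise, while a magnitude below 2 (as for Five Back to Back FIR Filters) suggests the two results are hard to tell apart.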

Timed LLVM Compilation

Time To Compile

Timed LLVM Compilation 10.0, Seconds (fewer is better)
AMD AOCC 2.3: 576.19 (SE +/- 5.39, N = 3)
AMD AOCC 3.0: 610.96 (SE +/- 5.16, N = 3)
GCC 10.2: 370.57 (SE +/- 2.79, N = 3)
LLVM Clang 12: 302.70 (SE +/- 1.23, N = 3)
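
For a fewer-is-better result such as this one, dividing each time by the fastest time makes the gaps easier to read. A small illustrative Python sketch follows, using only the four values listed for Timed LLVM Compilation. The variable names are hypothetical and the calculation is not part of the export.

```python
# Time To Compile in seconds, copied from the Timed LLVM Compilation result
times = {
    "AMD AOCC 2.3": 576.19,
    "AMD AOCC 3.0": 610.96,
    "GCC 10.2": 370.57,
    "LLVM Clang 12": 302.70,
}

fastest = min(times.values())

# Ratio of each build time to the fastest one (1.00 means fastest)
relative = {name: t / fastest for name, t in times.items()}

for name, ratio in sorted(relative.items(), key=lambda kv: kv[1]):
    print(f"{name}: {ratio:.2f}x the fastest time")
```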

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

WebP2 Image Encode 20210126, Seconds (fewer is better)
AMD AOCC 2.3: 357.90 (SE +/- 1.28, N = 3)
AMD AOCC 3.0: 356.82 (SE +/- 1.38, N = 3)
GCC 10.2: 367.37 (SE +/- 0.42, N = 3)
LLVM Clang 12: 349.02 (SE +/- 0.58, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (more is better)
AMD AOCC 2.3: 646 (SE +/- 6.07, N = 12; -fopenmp=libomp)
AMD AOCC 3.0: 649 (SE +/- 5.59, N = 12; -fopenmp=libomp)
GCC 10.2: 614 (SE +/- 6.71, N = 3; -fopenmp)
LLVM Clang 12: 634 (SE +/- 5.80, N = 3; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (more is better)
AMD AOCC 2.3: 6067 (SE +/- 34.74, N = 3; -fopenmp=libomp)
AMD AOCC 3.0: 5976 (SE +/- 38.28, N = 3; -fopenmp=libomp)
GCC 10.2: 6721 (SE +/- 215.50, N = 12; -fopenmp)
LLVM Clang 12: 5937 (SE +/- 55.09, N = 12; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

JPEG XL

Input: PNG - Encode Speed: 8

JPEG XL 0.3.3, MP/s (more is better)
AMD AOCC 2.3: 1.04 (SE +/- 0.01, N = 3)
AMD AOCC 3.0: 1.04 (SE +/- 0.00, N = 3)
GCC 10.2: 1.14 (SE +/- 0.00, N = 3)
LLVM Clang 12: 1.06 (SE +/- 0.01, N = 3)
Additional option -Xclang -mrelax-all listed for three of the four builds.
1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (more is better)
AMD AOCC 2.3: 17105 (SE +/- 193.85, N = 4; -fopenmp=libomp)
AMD AOCC 3.0: 15474 (SE +/- 177.70, N = 3; -fopenmp=libomp)
GCC 10.2: 15049 (SE +/- 134.84, N = 3; -fopenmp)
LLVM Clang 12: 14972 (SE +/- 123.46, N = 12; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

WebP2 Image Encode 20210126, Seconds (fewer is better)
AMD AOCC 2.3: 193.60 (SE +/- 0.03, N = 3)
AMD AOCC 3.0: 193.43 (SE +/- 0.56, N = 3)
GCC 10.2: 203.81 (SE +/- 0.04, N = 3)
LLVM Clang 12: 188.31 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Gcrypt Library

Gcrypt Library 1.9, Seconds (fewer is better)
AMD AOCC 2.3: 173.31 (SE +/- 1.67, N = 3)
AMD AOCC 3.0: 175.69 (SE +/- 0.17, N = 3)
GCC 10.2: 171.19 (SE +/- 0.29, N = 3)
LLVM Clang 12: 172.90 (SE +/- 1.10, N = 3)
1. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2, MiB/second (more is better)
AMD AOCC 2.3: 550.46 (SE +/- 1.69, N = 3)
AMD AOCC 3.0: 538.88 (SE +/- 2.13, N = 3)
GCC 10.2: 545.91 (SE +/- 3.29, N = 15)
LLVM Clang 12: 552.38 (SE +/- 1.73, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (more is better)
AMD AOCC 2.3: 103 (SE +/- 0.44, N = 3; -fopenmp=libomp)
AMD AOCC 3.0: 102 (SE +/- 0.44, N = 3; -fopenmp=libomp)
GCC 10.2: 99 (SE +/- 0.17, N = 3; -fopenmp)
LLVM Clang 12: 104 (SE +/- 0.29, N = 3; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (more is better)
AMD AOCC 2.3: 456 (SE +/- 2.95, N = 3; -fopenmp=libomp)
AMD AOCC 3.0: 465 (SE +/- 1.36, N = 3; -fopenmp=libomp)
GCC 10.2: 433 (SE +/- 1.96, N = 3; -fopenmp)
LLVM Clang 12: 426 (SE +/- 2.13, N = 3; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 12.19 (SE +/- 0.11, N = 3; MIN: 11.89 / MAX: 13.6; -lomp)
AMD AOCC 3.0: 12.30 (SE +/- 0.09, N = 3; MIN: 11.96 / MAX: 17.61; -lomp)
GCC 10.2: 17.61 (SE +/- 0.06, N = 15; MIN: 16.94 / MAX: 25.97; -lgomp)
LLVM Clang 12: 17.06 (SE +/- 0.06, N = 3; MIN: 16.85 / MAX: 20.53; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 12.74 (SE +/- 0.10, N = 3; MIN: 12 / MAX: 19.89; -lomp)
AMD AOCC 3.0: 12.40 (SE +/- 0.24, N = 4; MIN: 11.72 / MAX: 15.69; -lomp)
GCC 10.2: 13.77 (SE +/- 0.06, N = 15; MIN: 13.25 / MAX: 23.45; -lgomp)
LLVM Clang 12: 12.50 (SE +/- 0.13, N = 3; MIN: 12.23 / MAX: 16.6; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 21.66 (SE +/- 0.14, N = 3; MIN: 21.17 / MAX: 27.18; -lomp)
AMD AOCC 3.0: 21.93 (SE +/- 0.11, N = 4; MIN: 21.28 / MAX: 24.81; -lomp)
GCC 10.2: 20.77 (SE +/- 0.17, N = 15; MIN: 19.69 / MAX: 43.19; -lgomp)
LLVM Clang 12: 21.70 (SE +/- 0.13, N = 3; MIN: 21.21 / MAX: 30.18; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 23.31 (SE +/- 0.09, N = 3; MIN: 22.75 / MAX: 33.51; -lomp)
AMD AOCC 3.0: 23.29 (SE +/- 0.20, N = 4; MIN: 22.43 / MAX: 25.17; -lomp)
GCC 10.2: 25.67 (SE +/- 0.21, N = 15; MIN: 24.52 / MAX: 35.96; -lgomp)
LLVM Clang 12: 23.54 (SE +/- 0.20, N = 3; MIN: 22.92 / MAX: 26.57; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 11.01 (SE +/- 0.00, N = 2; MIN: 10.84 / MAX: 12.26; -lomp)
AMD AOCC 3.0: 11.11 (SE +/- 0.04, N = 4; MIN: 10.92 / MAX: 15.76; -lomp)
GCC 10.2: 10.82 (SE +/- 0.09, N = 15; MIN: 10.41 / MAX: 17.59; -lgomp)
LLVM Clang 12: 11.14 (SE +/- 0.04, N = 3; MIN: 10.96 / MAX: 13.28; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 14.08 (SE +/- 0.21, N = 3; MIN: 13.56 / MAX: 21.13; -lomp)
AMD AOCC 3.0: 13.86 (SE +/- 0.11, N = 4; MIN: 13.44 / MAX: 16.35; -lomp)
GCC 10.2: 14.11 (SE +/- 0.05, N = 15; MIN: 13.84 / MAX: 23.15; -lgomp)
LLVM Clang 12: 13.91 (SE +/- 0.08, N = 3; MIN: 13.66 / MAX: 14.54; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 58.11 (SE +/- 0.01, N = 3; MIN: 56.81 / MAX: 67.53; -lomp)
AMD AOCC 3.0: 58.96 (SE +/- 0.19, N = 4; MIN: 57.64 / MAX: 66.68; -lomp)
GCC 10.2: 57.89 (SE +/- 0.12, N = 15; MIN: 55.89 / MAX: 80.86; -lgomp)
LLVM Clang 12: 57.51 (SE +/- 0.17, N = 3; MIN: 56.17 / MAX: 62.53; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 11.94 (SE +/- 0.08, N = 3; MIN: 11.62 / MAX: 12.42; -lomp)
AMD AOCC 3.0: 11.96 (SE +/- 0.13, N = 4; MIN: 11.46 / MAX: 13.27; -lomp)
GCC 10.2: 12.76 (SE +/- 0.06, N = 15; MIN: 12.19 / MAX: 19.36; -lgomp)
LLVM Clang 12: 12.53 (SE +/- 0.12, N = 3; MIN: 12.12 / MAX: 17.12; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 1.57 (SE +/- 0.01, N = 3; MIN: 1.54 / MAX: 1.75; -lomp)
AMD AOCC 3.0: 1.56 (SE +/- 0.02, N = 4; MIN: 1.46 / MAX: 6.9; -lomp)
GCC 10.2: 1.83 (SE +/- 0.01, N = 15; MIN: 1.77 / MAX: 3.9; -lgomp)
LLVM Clang 12: 1.73 (SE +/- 0.02, N = 3; MIN: 1.68 / MAX: 1.79; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 4.53 (SE +/- 0.07, N = 3; MIN: 4.35 / MAX: 6.86; -lomp)
AMD AOCC 3.0: 4.50 (SE +/- 0.03, N = 4; MIN: 4.34 / MAX: 5.7; -lomp)
GCC 10.2: 5.32 (SE +/- 0.02, N = 15; MIN: 5.15 / MAX: 13.83; -lgomp)
LLVM Clang 12: 4.80 (SE +/- 0.02, N = 3; MIN: 4.71 / MAX: 6.61; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 3.16 (SE +/- 0.03, N = 3; MIN: 3.06 / MAX: 4.05; -lomp)
AMD AOCC 3.0: 3.18 (SE +/- 0.03, N = 4; MIN: 3.04 / MAX: 4.48; -lomp)
GCC 10.2: 3.93 (SE +/- 0.02, N = 15; MIN: 3.71 / MAX: 6.06; -lgomp)
LLVM Clang 12: 3.45 (SE +/- 0.02, N = 3; MIN: 3.37 / MAX: 4.6; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 3.79 (SE +/- 0.05, N = 3; MIN: 3.64 / MAX: 4.86; -lomp)
AMD AOCC 3.0: 3.87 (SE +/- 0.06, N = 4; MIN: 3.67 / MAX: 12.94; -lomp)
GCC 10.2: 4.23 (SE +/- 0.01, N = 15; MIN: 4.15 / MAX: 9.05; -lgomp)
LLVM Clang 12: 4.04 (SE +/- 0.05, N = 3; MIN: 3.88 / MAX: 5.03; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 3.06 (SE +/- 0.03, N = 3; MIN: 2.98 / MAX: 4.3; -lomp)
AMD AOCC 3.0: 3.07 (SE +/- 0.07, N = 4; MIN: 2.9 / MAX: 4.41; -lomp)
GCC 10.2: 3.85 (SE +/- 0.02, N = 15; MIN: 3.74 / MAX: 10.85; -lgomp)
LLVM Clang 12: 3.33 (SE +/- 0.05, N = 3; MIN: 3.19 / MAX: 5.6; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 3.52 (SE +/- 0.03, N = 3; MIN: 3.34 / MAX: 4.84; -lomp)
AMD AOCC 3.0: 3.53 (SE +/- 0.06, N = 4; MIN: 3.27 / MAX: 4.75; -lomp)
GCC 10.2: 4.43 (SE +/- 0.01, N = 15; MIN: 4.19 / MAX: 11.09; -lgomp)
LLVM Clang 12: 3.79 (SE +/- 0.04, N = 3; MIN: 3.63 / MAX: 5.2; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218, ms (fewer is better)
AMD AOCC 2.3: 10.96 (SE +/- 0.09, N = 3; MIN: 10.51 / MAX: 16.79; -lomp)
AMD AOCC 3.0: 11.28 (SE +/- 0.12, N = 4; MIN: 10.61 / MAX: 20.99; -lomp)
GCC 10.2: 12.42 (SE +/- 0.16, N = 15; MIN: 11.7 / MAX: 20.08; -lgomp)
LLVM Clang 12: 11.53 (SE +/- 0.05, N = 3; MIN: 11.09 / MAX: 12.2; -lomp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread
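
With fifteen NCNN models reported, a geometric mean of per-model latency ratios is one common way to summarize a compiler against a baseline. The Python sketch below is illustrative only and not part of the export. It compares AMD AOCC 3.0 with GCC 10.2 using the average latencies listed in the NCNN results above. The baseline choice and variable names are assumptions made for the example.

```python
from statistics import geometric_mean

# Average latency in ms per NCNN model: (AMD AOCC 3.0, GCC 10.2)
latency_ms = {
    "regnety_400m": (12.30, 17.61),
    "squeezenet_ssd": (12.40, 13.77),
    "yolov4-tiny": (21.93, 20.77),
    "resnet50": (23.29, 25.67),
    "alexnet": (11.11, 10.82),
    "resnet18": (13.86, 14.11),
    "vgg16": (58.96, 57.89),
    "googlenet": (11.96, 12.76),
    "blazeface": (1.56, 1.83),
    "efficientnet-b0": (4.50, 5.32),
    "mnasnet": (3.18, 3.93),
    "shufflenet-v2": (3.87, 4.23),
    "mobilenet-v3": (3.07, 3.85),
    "mobilenet-v2": (3.53, 4.43),
    "mobilenet": (11.28, 12.42),
}

# Ratio below 1.0 means AOCC 3.0 had the lower (better) latency on that model
ratios = {model: aocc / gcc for model, (aocc, gcc) in latency_ms.items()}
overall = geometric_mean(ratios.values())

print(f"AOCC 3.0 / GCC 10.2 latency, geometric mean: {overall:.3f}")
```

A geometric mean is used here rather than an arithmetic mean so that models with large absolute latencies, such as vgg16, do not dominate the summary.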

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

WebP2 Image Encode 20210126, Seconds (fewer is better)
AMD AOCC 2.3: 106.38 (SE +/- 0.42, N = 3)
AMD AOCC 3.0: 105.72 (SE +/- 0.88, N = 3)
GCC 10.2: 111.80 (SE +/- 1.06, N = 3)
LLVM Clang 12: 103.01 (SE +/- 0.95, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

OpenFOAM

Input: Motorbike 30M

OpenFOAM 8, Seconds (fewer is better)
GCC 10.2: 97.75 (SE +/- 0.08, N = 3)
LLVM Clang 12: 100.16 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

JPEG XL

Input: PNG - Encode Speed: 7

JPEG XL 0.3.3, MP/s (more is better)
AMD AOCC 2.3: 11.31 (SE +/- 0.03, N = 3)
AMD AOCC 3.0: 11.17 (SE +/- 0.06, N = 3)
GCC 10.2: 11.20 (SE +/- 0.03, N = 3)
LLVM Clang 12: 11.41 (SE +/- 0.06, N = 3)
Additional option -Xclang -mrelax-all listed for three of the four builds.
1. (CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

Sysbench

Test: CPU

Sysbench 1.0.20, Events Per Second (more is better)
AMD AOCC 2.3: 210804861.92 (SE +/- 301436.60, N = 3)
AMD AOCC 3.0: 210533984.51 (SE +/- 204338.54, N = 3)
GCC 10.2: 91743.72 (SE +/- 115.96, N = 3)
LLVM Clang 12: 2445437.63 (SE +/- 5355.39, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2, ms (fewer is better)
AMD AOCC 3.0: 2760.94 (SE +/- 17.19, N = 3; MIN: 2717.59; -fopenmp=libomp)
GCC 10.2: 2757.52 (SE +/- 2.01, N = 3; MIN: 2719.35; -fopenmp)
LLVM Clang 12: 2757.75 (SE +/- 5.95, N = 3; MIN: 2734.73; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3, Seconds (fewer is better)
AMD AOCC 2.3: 79.94 (SE +/- 0.28, N = 3)
AMD AOCC 3.0: 79.87 (SE +/- 0.16, N = 3)
GCC 10.2: 79.52 (SE +/- 0.19, N = 3)
LLVM Clang 12: 80.21 (SE +/- 0.09, N = 3)

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2, ms (fewer is better)
AMD AOCC 3.0: 1760.57 (SE +/- 3.74, N = 3; MIN: 1745.87; -fopenmp=libomp)
GCC 10.2: 1773.67 (SE +/- 5.00, N = 3; MIN: 1750.26; -fopenmp)
LLVM Clang 12: 1792.27 (SE +/- 9.12, N = 3; MIN: 1766.32; -fopenmp=libomp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3, ms (fewer is better)
GCC 10.2: 32.34 (SE +/- 0.09, N = 3; MIN: 31.33 / MAX: 42.61)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3, ms (fewer is better)
GCC 10.2: 2.351 (SE +/- 0.027, N = 3; MIN: 2.27 / MAX: 7.49)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3, ms (fewer is better)
GCC 10.2: 3.240 (SE +/- 0.049, N = 3; MIN: 3.12 / MAX: 11.31)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3, ms (fewer is better)
GCC 10.2: 25.07 (SE +/- 0.02, N = 3; MIN: 23.97 / MAX: 39.95)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3, ms (fewer is better)
GCC 10.2: 5.081 (SE +/- 0.010, N = 3; MIN: 4.92 / MAX: 14.74)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.4.9, MB/s (more is better)
AMD AOCC 2.3: 4468.2
AMD AOCC 3.0: 4463.1
GCC 10.2: 4617.1
LLVM Clang 12: 4352.4
Standard errors as exported (three listed for four results): SE +/- 26.73, N = 3; SE +/- 8.16, N = 11; SE +/- 37.90, N = 2
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3, MB/s (more is better)
AMD AOCC 2.3: 13188.4 (SE +/- 53.22, N = 3)
AMD AOCC 3.0: 12981.8 (SE +/- 24.92, N = 6)
GCC 10.2: 13397.7 (SE +/- 35.65, N = 6)
LLVM Clang 12: 13129.4 (SE +/- 21.95, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3, MB/s (more is better)
AMD AOCC 2.3: 68.80 (SE +/- 0.51, N = 3)
AMD AOCC 3.0: 68.40 (SE +/- 0.68, N = 6)
GCC 10.2: 71.13 (SE +/- 0.68, N = 6)
LLVM Clang 12: 64.43 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -O3

Ngspice

Circuit: C2670

Ngspice 34, Seconds (fewer is better)
AMD AOCC 2.3: 72.78 (SE +/- 0.06, N = 3)
AMD AOCC 3.0: 73.28 (SE +/- 0.13, N = 3)
GCC 10.2: 71.60 (SE +/- 0.21, N = 3)
LLVM Clang 12: 72.46 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

AOM AV1

Encoder Mode: Speed 4 Two-Pass

AOM AV1 2.1-rc, Frames Per Second (more is better)
GCC 10.2: 9.20 (SE +/- 0.02, N = 3)
LLVM Clang 12: 9.74 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9, MB/s (more is better)
AMD AOCC 2.3: 1117.6 (SE +/- 9.29, N = 3)
AMD AOCC 3.0: 1096.3 (SE +/- 9.53, N = 15)
GCC 10.2: 1057.4 (SE +/- 3.93, N = 3)
LLVM Clang 12: 1043.2 (SE +/- 11.61, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2, GB/s (more is better)
AMD AOCC 2.3: 6.12 (SE +/- 0.06, N = 3)
AMD AOCC 3.0: 6.23 (SE +/- 0.02, N = 3)
GCC 10.2: 5.73 (SE +/- 0.02, N = 3)
LLVM Clang 12: 6.26 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2, GB/s (more is better)
AMD AOCC 2.3: 5.92 (SE +/- 0.03, N = 3)
AMD AOCC 3.0: 6.04 (SE +/- 0.01, N = 3)
GCC 10.2: 5.64 (SE +/- 0.05, N = 3)
LLVM Clang 12: 6.18 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

Ngspice

Circuit: C7552

Ngspice 34, Seconds (fewer is better)
AMD AOCC 2.3: 64.89 (SE +/- 0.54, N = 3)
AMD AOCC 3.0: 64.91 (SE +/- 0.08, N = 3)
GCC 10.2: 62.82 (SE +/- 0.15, N = 3)
LLVM Clang 12: 64.52 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33, Iterations Per Minute (more is better)
AMD AOCC 2.3: 241 (SE +/- 0.58, N = 3)
AMD AOCC 3.0: 240 (SE +/- 0.67, N = 3)
GCC 10.2: 375 (SE +/- 1.00, N = 3)
LLVM Clang 12: 237 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33, Iterations Per Minute (more is better)
AMD AOCC 2.3: 461
AMD AOCC 3.0: 452
GCC 10.2: 439
LLVM Clang 12: 457
Standard errors as exported (three listed for four results): SE +/- 0.33, N = 3; SE +/- 0.33, N = 3; SE +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 - Operation: Noise-Gaussian (Iterations Per Minute, more is better)
AMD AOCC 2.3: 402
AMD AOCC 3.0: 392
GCC 10.2: 454
LLVM Clang 12: 398
SE as exported (three entries, not attributed per compiler): +/- 0.33, N = 3; +/- 0.33, N = 3; +/- 0.67, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 - Operation: Resizing (Iterations Per Minute, more is better)
AMD AOCC 2.3: 1824
AMD AOCC 3.0: 1720
GCC 10.2: 2165
LLVM Clang 12: 1789
SE as exported (three entries, not attributed per compiler): +/- 2.65, N = 3; +/- 1.45, N = 3; +/- 1.15, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 - Operation: HWB Color Space (Iterations Per Minute, more is better)
AMD AOCC 2.3: 848
AMD AOCC 3.0: 805
GCC 10.2: 1115
LLVM Clang 12: 844
SE as exported (three entries, not attributed per compiler): +/- 3.06, N = 3; +/- 1.33, N = 3; +/- 1.00, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 - Operation: Rotate (Iterations Per Minute, more is better)
AMD AOCC 2.3: 928 (SE +/- 8.67, N = 3)
AMD AOCC 3.0: 867 (SE +/- 2.03, N = 3)
GCC 10.2: 1056 (SE +/- 3.51, N = 3)
LLVM Clang 12: 1016 (SE +/- 1.86, N = 3)
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 - Operation: Swirl (Iterations Per Minute, more is better)
AMD AOCC 2.3: 1131 (SE +/- 3.71, N = 3)
AMD AOCC 3.0: 1083 (SE +/- 3.84, N = 3)
GCC 10.2: 1166 (SE +/- 3.67, N = 3)
LLVM Clang 12: 1108 (SE +/- 4.48, N = 3)
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, fewer is better)
AMD AOCC 2.3: 59.07 (SE +/- 0.10, N = 3)
AMD AOCC 3.0: 57.99 (SE +/- 0.10, N = 3)
GCC 10.2: 59.87 (SE +/- 0.15, N = 3)
LLVM Clang 12: 59.23 (SE +/- 0.81, N = 3)
Per-result flag -mabm appears once in the export, without compiler attribution.
(CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm -lreadline

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 13212.6 (SE +/- 15.17, N = 5)
AMD AOCC 3.0: 13010.7 (SE +/- 30.75, N = 3)
GCC 10.2: 13400.1 (SE +/- 48.22, N = 3)
LLVM Clang 12: 13082.4 (SE +/- 46.92, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 72.21 (SE +/- 0.77, N = 5)
AMD AOCC 3.0: 72.30 (SE +/- 0.21, N = 3)
GCC 10.2: 72.36 (SE +/- 0.86, N = 3)
LLVM Clang 12: 68.30 (SE +/- 0.19, N = 3)
(CC) gcc options: -O3

simdjson

Throughput Test: Kostya

simdjson 0.8.2 - Throughput Test: Kostya (GB/s, more is better)
AMD AOCC 2.3: 3.53 (SE +/- 0.03, N = 3)
AMD AOCC 3.0: 3.61 (SE +/- 0.01, N = 3)
GCC 10.2: 3.72 (SE +/- 0.03, N = 3)
LLVM Clang 12: 3.71 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -march=native -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4 - Preset: Exhaustive (Seconds, fewer is better)
AMD AOCC 2.3: 50.81 (SE +/- 0.07, N = 3)
AMD AOCC 3.0: 51.45 (SE +/- 0.09, N = 3)
GCC 10.2: 52.93 (SE +/- 0.09, N = 3)
LLVM Clang 12: 51.66 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O3 -march=native -flto -pthread

QuantLib

QuantLib 1.21 (MFLOPS, more is better)
AMD AOCC 2.3: 3710.4 (SE +/- 28.46, N = 10)
AMD AOCC 3.0: 3646.4 (SE +/- 27.64, N = 10)
GCC 10.2: 3196.9 (SE +/- 33.41, N = 5)
LLVM Clang 12: 3538.5 (SE +/- 49.56, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic

Redis

Test: GET

Redis 6.0.9 - Test: GET (Requests Per Second, more is better)
AMD AOCC 2.3: 3658044.77 (SE +/- 58906.79, N = 15)
AMD AOCC 3.0: 3545388.92 (SE +/- 11517.61, N = 3)
GCC 10.2: 3470419.90 (SE +/- 36718.95, N = 15)
LLVM Clang 12: 3624414.37 (SE +/- 47796.61, N = 15)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

AOM AV1

Encoder Mode: Speed 0 Two-Pass

AOM AV1 2.1-rc - Encoder Mode: Speed 0 Two-Pass (Frames Per Second, more is better)
GCC 10.2: 0.37 (SE +/- 0.00, N = 3)
LLVM Clang 12: 0.42 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 - Throughput Test: LargeRandom (GB/s, more is better)
AMD AOCC 2.3: 1.11 (SE +/- 0.01, N = 3)
AMD AOCC 3.0: 1.12 (SE +/- 0.01, N = 3)
GCC 10.2: 1.22 (SE +/- 0.01, N = 3)
LLVM Clang 12: 1.14 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -pthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 2.46850 (SE +/- 0.00340, N = 3; -fopenmp=libomp; MIN: 2.36)
GCC 10.2: 4.46777 (SE +/- 0.30276, N = 15; -fopenmp; MIN: 2.86)
LLVM Clang 12: 2.46561 (SE +/- 0.00451, N = 3; -fopenmp=libomp; MIN: 2.33)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 4024.7 (SE +/- 16.46, N = 3)
AMD AOCC 3.0: 3978.2 (SE +/- 39.89, N = 3)
GCC 10.2: 4350.9 (SE +/- 72.38, N = 3)
LLVM Clang 12: 3957.9 (SE +/- 25.47, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 36.7 (SE +/- 0.03, N = 3)
AMD AOCC 3.0: 36.4 (SE +/- 0.00, N = 3)
GCC 10.2: 36.6 (SE +/- 0.03, N = 3)
LLVM Clang 12: 36.8 (SE +/- 0.03, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Tachyon

Total Time

Tachyon 0.99b6 - Total Time (Seconds, fewer is better)
AMD AOCC 2.3: 46.13 (SE +/- 0.20, N = 3)
AMD AOCC 3.0: 45.00 (SE +/- 0.14, N = 3)
GCC 10.2: 44.39 (SE +/- 0.13, N = 3)
LLVM Clang 12: 45.07 (SE +/- 0.09, N = 3)
(CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

JPEG XL Decoding

CPU Threads: 1

JPEG XL Decoding 0.3.3 - CPU Threads: 1 (MP/s, more is better)
AMD AOCC 2.3: 64.34 (SE +/- 0.11, N = 3)
AMD AOCC 3.0: 59.92 (SE +/- 0.03, N = 3)
GCC 10.2: 56.53 (SE +/- 0.05, N = 3)
LLVM Clang 12: 62.27 (SE +/- 0.04, N = 3)

Redis

Test: LPUSH

Redis 6.0.9 - Test: LPUSH (Requests Per Second, more is better)
AMD AOCC 2.3: 2351340.56 (SE +/- 27675.97, N = 4)
AMD AOCC 3.0: 2345671.03 (SE +/- 35143.82, N = 15)
GCC 10.2: 2222217.52 (SE +/- 23396.73, N = 15)
LLVM Clang 12: 2212779.00 (SE +/- 30760.12, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, fewer is better)
AMD AOCC 2.3: 44.11 (SE +/- 0.03, N = 3)
GCC 10.2: 42.60 (SE +/- 0.13, N = 3)
LLVM Clang 12: 43.26 (SE +/- 0.13, N = 3)
(CC) gcc options: -O3 -march=native -ldl -lz -lpthread

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0 - Encoder Speed: 0 (Seconds, fewer is better)
AMD AOCC 2.3: 40.75 (SE +/- 0.15, N = 3)
AMD AOCC 3.0: 41.03 (SE +/- 0.05, N = 3)
GCC 10.2: 43.62 (SE +/- 0.21, N = 3)
LLVM Clang 12: 41.08 (SE +/- 0.18, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

JPEG XL Decoding

CPU Threads: All

JPEG XL Decoding 0.3.3 - CPU Threads: All (MP/s, more is better)
AMD AOCC 2.3: 213.67 (SE +/- 0.40, N = 3)
AMD AOCC 3.0: 191.91 (SE +/- 0.21, N = 3)
GCC 10.2: 210.99 (SE +/- 0.29, N = 3)
LLVM Clang 12: 196.34 (SE +/- 0.05, N = 3)

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 19 - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 4097.3 (SE +/- 12.18, N = 3)
AMD AOCC 3.0: 3608.8 (SE +/- 417.40, N = 3)
GCC 10.2: 4251.7 (SE +/- 6.53, N = 3)
LLVM Clang 12: 4000.9 (SE +/- 50.52, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.4.9 - Compression Level: 19 - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 50.7 (SE +/- 0.07, N = 3)
AMD AOCC 3.0: 50.5 (SE +/- 0.06, N = 3)
GCC 10.2: 51.6 (SE +/- 0.20, N = 3)
LLVM Clang 12: 51.3 (SE +/- 0.03, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Redis

Test: SADD

Redis 6.0.9 - Test: SADD (Requests Per Second, more is better)
AMD AOCC 2.3: 2961165.67 (SE +/- 40118.44, N = 3)
AMD AOCC 3.0: 2948093.68 (SE +/- 27502.49, N = 15)
GCC 10.2: 3041527.37 (SE +/- 39730.96, N = 15)
LLVM Clang 12: 2954866.80 (SE +/- 29853.73, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, fewer is better)
AMD AOCC 2.3: 44.53 (SE +/- 0.06, N = 3)
AMD AOCC 3.0: 44.33 (SE +/- 0.08, N = 3)
GCC 10.2: 25.09 (SE +/- 0.07, N = 3)
LLVM Clang 12: 44.89 (SE +/- 0.08, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 1025.4 (SE +/- 6.07, N = 3)
AMD AOCC 3.0: 1023.5 (SE +/- 5.99, N = 3)
GCC 10.2: 1122.6 (SE +/- 2.15, N = 3)
LLVM Clang 12: 1034.5 (SE +/- 2.69, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 4805.6
GCC 10.2: 4886.2
SE as exported (one entry, not attributed per compiler): +/- 29.99, N = 3
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 3, Long Mode - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 4586.2 (SE +/- 31.63, N = 3)
AMD AOCC 3.0: 4543.8 (SE +/- 33.94, N = 3)
GCC 10.2: 4737.1 (SE +/- 46.74, N = 3)
LLVM Clang 12: 4456.6 (SE +/- 2.17, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.4.9 - Compression Level: 3, Long Mode - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 1166.4 (SE +/- 4.71, N = 3)
AMD AOCC 3.0: 1186.0 (SE +/- 2.80, N = 3)
GCC 10.2: 1425.9 (SE +/- 2.43, N = 3)
LLVM Clang 12: 1191.6 (SE +/- 1.19, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lz -llzma

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, more is better)
AMD AOCC 2.3: 13595.3 (SE +/- 59.50, N = 3)
AMD AOCC 3.0: 13144.8 (SE +/- 106.53, N = 3)
GCC 10.2: 13771.1 (SE +/- 38.97, N = 3)
LLVM Clang 12: 13305.3 (SE +/- 93.71, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, more is better)
AMD AOCC 2.3: 12456.17 (SE +/- 57.72, N = 3)
AMD AOCC 3.0: 12124.81 (SE +/- 49.14, N = 3)
GCC 10.2: 12330.56 (SE +/- 76.55, N = 3)
LLVM Clang 12: 12227.47 (SE +/- 66.76, N = 3)
(CC) gcc options: -O3

Redis

Test: LPOP

Redis 6.0.9 - Test: LPOP (Requests Per Second, more is better)
AMD AOCC 2.3: 3589202.59 (SE +/- 30792.29, N = 8)
AMD AOCC 3.0: 3766645.92 (SE +/- 45854.80, N = 15)
GCC 10.2: 3549910.50 (SE +/- 26197.04, N = 3)
LLVM Clang 12: 3649832.58 (SE +/- 31635.54, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, more is better)
AMD AOCC 2.3: 807.37 (SE +/- 5.04, N = 3)
AMD AOCC 3.0: 789.22 (SE +/- 5.41, N = 3)
GCC 10.2: 966.30 (SE +/- 1.26, N = 3)
LLVM Clang 12: 795.81 (SE +/- 4.01, N = 3)
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, fewer is better)
AMD AOCC 2.3: 28.43 (SE +/- 0.06, N = 3)
AMD AOCC 3.0: 28.19 (SE +/- 0.04, N = 3)
GCC 10.2: 28.81 (SE +/- 0.08, N = 3)
LLVM Clang 12: 28.67 (SE +/- 0.03, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0 - Encoder Speed: 6, Lossless (Seconds, fewer is better)
AMD AOCC 2.3: 27.62 (SE +/- 0.10, N = 3)
AMD AOCC 3.0: 27.91 (SE +/- 0.03, N = 3)
GCC 10.2: 30.98 (SE +/- 0.06, N = 3)
LLVM Clang 12: 27.85 (SE +/- 0.11, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

Redis

Test: SET

Redis 6.0.9 - Test: SET (Requests Per Second, more is better)
AMD AOCC 2.3: 2719539.83 (SE +/- 23132.25, N = 3)
AMD AOCC 3.0: 2719036.20 (SE +/- 14014.88, N = 3)
GCC 10.2: 2640316.17 (SE +/- 26145.63, N = 15)
LLVM Clang 12: 2762047.50 (SE +/- 28596.87, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 - Settings: UASTC Level 3 (Seconds, fewer is better)
AMD AOCC 2.3: 28.15 (SE +/- 0.05, N = 3)
AMD AOCC 3.0: 28.17 (SE +/- 0.03, N = 3)
GCC 10.2: 28.13 (SE +/- 0.04, N = 3)
LLVM Clang 12: 28.22 (SE +/- 0.04, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

AOM AV1 2.1-rc - Encoder Mode: Speed 6 Two-Pass (Frames Per Second, more is better)
GCC 10.2: 29.43 (SE +/- 0.26, N = 3)
LLVM Clang 12: 30.03 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

POV-Ray

Trace Time

POV-Ray 3.7.0.7 - Trace Time (Seconds, fewer is better)
AMD AOCC 2.3: 22.54 (SE +/- 0.01, N = 3)
AMD AOCC 3.0: 22.54 (SE +/- 0.02, N = 3)
GCC 10.2: 24.09 (SE +/- 0.09, N = 3)
LLVM Clang 12: 22.12 (SE +/- 0.04, N = 3)
(CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
AMD AOCC 2.3: 75031667 (SE +/- 78876.13, N = 3)
AMD AOCC 3.0: 78734000 (SE +/- 601612.28, N = 3)
GCC 10.2: 81844000 (SE +/- 828458.69, N = 5)
LLVM Clang 12: 77794333 (SE +/- 171803.51, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 - Encoder Speed: 2 (Seconds, fewer is better)
AMD AOCC 2.3: 21.83 (SE +/- 0.10, N = 3)
AMD AOCC 3.0: 22.01 (SE +/- 0.09, N = 3)
GCC 10.2: 23.54 (SE +/- 0.02, N = 3)
LLVM Clang 12: 22.07 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

x265

Video Input: Bosphorus 4K

x265 3.4 - Video Input: Bosphorus 4K (Frames Per Second, more is better)
AMD AOCC 2.3: 27.49 (SE +/- 0.08, N = 3)
AMD AOCC 3.0: 26.96 (SE +/- 0.12, N = 3)
GCC 10.2: 27.83 (SE +/- 0.16, N = 3)
LLVM Clang 12: 28.02 (SE +/- 0.09, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

LibRaw

Post-Processing Benchmark

LibRaw 0.20 - Post-Processing Benchmark (Mpix/sec, more is better)
AMD AOCC 2.3: 50.37 (SE +/- 0.08, N = 3)
AMD AOCC 3.0: 52.68 (SE +/- 0.09, N = 3)
GCC 10.2: 78.66 (SE +/- 0.16, N = 3)
LLVM Clang 12: 54.14 (SE +/- 0.11, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

Basis Universal

Settings: ETC1S

Basis Universal 1.13 - Settings: ETC1S (Seconds, fewer is better)
AMD AOCC 2.3: 21.22 (SE +/- 0.18, N = 3)
AMD AOCC 3.0: 21.37 (SE +/- 0.21, N = 3)
GCC 10.2: 19.90 (SE +/- 0.04, N = 3)
LLVM Clang 12: 21.38 (SE +/- 0.07, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
AMD AOCC 2.3: 1332333333 (SE +/- 1125956.38, N = 3)
AMD AOCC 3.0: 1334900000 (SE +/- 1422439.22, N = 3)
GCC 10.2: 1164966667 (SE +/- 497772.82, N = 3)
LLVM Clang 12: 1335233333 (SE +/- 240370.09, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
AMD AOCC 2.3: 1067266667 (SE +/- 3628743.28, N = 3)
AMD AOCC 3.0: 1086033333 (SE +/- 3939684.14, N = 3)
GCC 10.2: 1111200000 (SE +/- 5768882.04, N = 3)
LLVM Clang 12: 1067666667 (SE +/- 3699249.17, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Etcpak

Configuration: ETC2

Etcpak 0.7 - Configuration: ETC2 (Mpx/s, more is better)
AMD AOCC 2.3: 236.47 (SE +/- 2.43, N = 3)
AMD AOCC 3.0: 242.03 (SE +/- 0.45, N = 3)
GCC 10.2: 245.04 (SE +/- 1.65, N = 3)
LLVM Clang 12: 272.99 (SE +/- 0.09, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

Input: PNG - Encode Speed: 5

JPEG XL 0.3.3 - Input: PNG - Encode Speed: 5 (MP/s, more is better)
AMD AOCC 2.3: 74.61 (SE +/- 0.11, N = 3)
AMD AOCC 3.0: 73.64 (SE +/- 0.15, N = 3)
GCC 10.2: 74.12 (SE +/- 0.03, N = 3)
LLVM Clang 12: 74.77 (SE +/- 0.04, N = 3)
Per-result flag -Xclang -mrelax-all appears three times in the export, without compiler attribution.
(CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, fewer is better)
AMD AOCC 2.3: 252.45 (SE +/- 0.35, N = 3; -fopenmp=libomp; MIN: 250.25 / MAX: 255.53)
AMD AOCC 3.0: 260.66 (SE +/- 0.69, N = 3; -fopenmp=libomp; MIN: 257.51 / MAX: 262.88)
GCC 10.2: 216.28 (SE +/- 0.56, N = 3; -fopenmp; MIN: 215.1 / MAX: 218.26)
LLVM Clang 12: 270.79 (SE +/- 0.58, N = 3; -fopenmp=libomp; MIN: 268.42 / MAX: 272.22)
(CXX) g++ options: -O3 -march=native -pthread -fvisibility=hidden -rdynamic -ldl

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 - WAV To WavPack (Seconds, fewer is better)
AMD AOCC 2.3: 10.29 (SE +/- 0.03, N = 5)
AMD AOCC 3.0: 10.34 (SE +/- 0.01, N = 5)
GCC 10.2: 10.15 (SE +/- 0.10, N = 5)
LLVM Clang 12: 10.31 (SE +/- 0.11, N = 5)
(CXX) g++ options: -O3 -march=native -rdynamic

Crafty

Elapsed Time

Crafty 25.2 - Elapsed Time (Nodes Per Second, more is better)
GCC 10.2: 11731249 (SE +/- 26371.45, N = 3)
(CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

AOM AV1

Encoder Mode: Speed 6 Realtime

AOM AV1 2.1-rc - Encoder Mode: Speed 6 Realtime (Frames Per Second, more is better)
GCC 10.2: 35.13 (SE +/- 0.16, N = 3)
LLVM Clang 12: 37.50 (SE +/- 0.31, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 - Settings: UASTC Level 2 (Seconds, fewer is better)
AMD AOCC 2.3: 15.98 (SE +/- 0.02, N = 3)
AMD AOCC 3.0: 16.06 (SE +/- 0.02, N = 3)
GCC 10.2: 15.90 (SE +/- 0.05, N = 3)
LLVM Clang 12: 15.91 (SE +/- 0.06, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

JPEG XL

Input: JPEG - Encode Speed: 5

JPEG XL 0.3.3 - Input: JPEG - Encode Speed: 5 (MP/s, more is better)
AMD AOCC 2.3: 89.51 (SE +/- 0.25, N = 3)
AMD AOCC 3.0: 83.54 (SE +/- 0.09, N = 3)
GCC 10.2: 87.35 (SE +/- 0.14, N = 3)
LLVM Clang 12: 85.67 (SE +/- 0.24, N = 3)
Per-result flag -Xclang -mrelax-all appears three times in the export, without compiler attribution.
(CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 4.09663 (SE +/- 0.01273, N = 3; -fopenmp=libomp; MIN: 3.9)
GCC 10.2: 3.95979 (SE +/- 0.00506, N = 3; -fopenmp; MIN: 3.76)
LLVM Clang 12: 4.11930 (SE +/- 0.01294, N = 3; -fopenmp=libomp; MIN: 3.88)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Ogg Audio Encoding

WAV To Ogg

Ogg Audio Encoding 1.3.4 - WAV To Ogg (Seconds, fewer is better)
AMD AOCC 2.3: 16.56 (SE +/- 0.08, N = 3)
AMD AOCC 3.0: 16.54 (SE +/- 0.09, N = 3)
GCC 10.2: 13.58 (SE +/- 0.04, N = 3)
LLVM Clang 12: 13.37 (SE +/- 0.12, N = 3)
(CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -march=native

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 - Video Input: Summer Nature 4K (FPS, more is better)
AMD AOCC 2.3: 244.15 (SE +/- 0.32, N = 3; MIN: 180.82 / MAX: 252.96)
AMD AOCC 3.0: 229.03 (SE +/- 0.43, N = 3; MIN: 171.52 / MAX: 237.17)
GCC 10.2: 243.69 (SE +/- 0.47, N = 3; -lm; MIN: 181.29 / MAX: 252.3)
LLVM Clang 12: 244.37 (SE +/- 0.04, N = 3; MIN: 182.08 / MAX: 252.22)
(CC) gcc options: -O3 -march=native -pthread

Etcpak

Configuration: ETC1

Etcpak 0.7 - Configuration: ETC1 (Mpx/s, more is better)
AMD AOCC 2.3: 285.29 (SE +/- 1.16, N = 3)
AMD AOCC 3.0: 286.93 (SE +/- 0.06, N = 3)
GCC 10.2: 386.56 (SE +/- 0.37, N = 3)
LLVM Clang 12: 383.47 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, fewer is better)
AMD AOCC 2.3: 203.64 (SE +/- 0.85, N = 3; -fopenmp=libomp; MIN: 201.91 / MAX: 206.13)
AMD AOCC 3.0: 204.60 (SE +/- 0.48, N = 3; -fopenmp=libomp; MIN: 203.72 / MAX: 206.33)
GCC 10.2: 211.57 (SE +/- 0.57, N = 3; -fopenmp; MIN: 206.88 / MAX: 212.83)
LLVM Clang 12: 206.36 (SE +/- 1.02, N = 3; -fopenmp=libomp; MIN: 204.24 / MAX: 209.24)
(CXX) g++ options: -O3 -march=native -pthread -fvisibility=hidden -rdynamic -ldl

RNNoise

RNNoise 2020-06-28 (Seconds, fewer is better)
AMD AOCC 2.3: 14.04 (SE +/- 0.17, N = 3)
AMD AOCC 3.0: 14.47 (SE +/- 0.16, N = 3)
GCC 10.2: 14.20 (SE +/- 0.04, N = 3)
LLVM Clang 12: 14.33 (SE +/- 0.17, N = 3)
(CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden

WebP Image Encode

Encode Settings: Quality 100, Lossless

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless (Encode Time - Seconds, fewer is better)
AMD AOCC 2.3: 13.64 (SE +/- 0.11, N = 3)
AMD AOCC 3.0: 13.84 (SE +/- 0.05, N = 3)
GCC 10.2: 13.99 (SE +/- 0.11, N = 3)
LLVM Clang 12: 13.91 (SE +/- 0.12, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, more is better)
AMD AOCC 2.3: 6.859 (SE +/- 0.004, N = 3)
AMD AOCC 3.0: 6.917 (SE +/- 0.055, N = 3)
GCC 10.2: 6.137 (SE +/- 0.014, N = 3)
LLVM Clang 12: 6.823 (SE +/- 0.033, N = 3)
(CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

JPEG XL

Input: JPEG - Encode Speed: 7

JPEG XL 0.3.3 - Input: JPEG - Encode Speed: 7 (MP/s, more is better)
AMD AOCC 2.3: 89.32 (SE +/- 0.25, N = 3)
AMD AOCC 3.0: 83.63 (SE +/- 0.01, N = 3)
GCC 10.2: 87.07 (SE +/- 0.19, N = 3)
LLVM Clang 12: 85.81 (SE +/- 0.20, N = 3)
Per-result flag -Xclang -mrelax-all appears three times in the export, without compiler attribution.
(CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 0.643093 (SE +/- 0.004823, N = 3; -fopenmp=libomp; MIN: 0.61)
GCC 10.2: 0.638664 (SE +/- 0.000722, N = 3; -fopenmp; MIN: 0.61)
LLVM Clang 12: 0.641231 (SE +/- 0.000908, N = 3; -fopenmp=libomp; MIN: 0.61)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, more is better)
AMD AOCC 2.3: 238.11 (SE +/- 2.24, N = 13)
AMD AOCC 3.0: 225.17 (SE +/- 2.47, N = 12)
GCC 10.2: 235.04 (SE +/- 2.40, N = 12)
LLVM Clang 12: 223.50 (SE +/- 2.40, N = 12)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 - Preset: Thorough (Seconds, fewer is better)
AMD AOCC 2.3: 9.2012 (SE +/- 0.0090, N = 3)
AMD AOCC 3.0: 9.3493 (SE +/- 0.0148, N = 3)
GCC 10.2: 6.9922 (SE +/- 0.0057, N = 3)
LLVM Clang 12: 9.4996 (SE +/- 0.0075, N = 3)
(CXX) g++ options: -O3 -march=native -flto -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 9.57194 (SE +/- 0.01936, N = 3; -fopenmp=libomp; MIN: 9.46)
GCC 10.2: 9.25967 (SE +/- 0.01340, N = 3; -fopenmp; MIN: 9.1)
LLVM Clang 12: 9.59442 (SE +/- 0.01452, N = 3; -fopenmp=libomp; MIN: 9.47)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 - WAV To Opus Encode (Seconds, fewer is better)
GCC 10.2: 5.484 (SE +/- 0.031, N = 5)
LLVM Clang 12: 5.589 (SE +/- 0.037, N = 5)
Per-result flag -fvisibility=hidden appears once in the export, without compiler attribution.
(CXX) g++ options: -O3 -march=native -logg -lm

x264

H.264 Video Encoding

x264 2019-12-17 - H.264 Video Encoding (Frames Per Second, more is better)
AMD AOCC 2.3: 210.72 (SE +/- 1.86, N = 8)
AMD AOCC 3.0: 210.35 (SE +/- 1.75, N = 9)
GCC 10.2: 208.93 (SE +/- 1.66, N = 9)
LLVM Clang 12: 213.57 (SE +/- 1.62, N = 12)
Per-result flag -mstack-alignment=64 appears three times in the export, without compiler attribution.
(CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 - Encoder Speed: 6 (Seconds, fewer is better)
AMD AOCC 2.3: 8.384 (SE +/- 0.027, N = 3)
AMD AOCC 3.0: 8.309 (SE +/- 0.012, N = 3)
GCC 10.2: 8.927 (SE +/- 0.048, N = 3)
LLVM Clang 12: 8.342 (SE +/- 0.055, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

JPEG XL

Input: JPEG - Encode Speed: 8

JPEG XL 0.3.3 - Input: JPEG - Encode Speed: 8 (MP/s, more is better)
AMD AOCC 2.3: 35.87 (SE +/- 0.09, N = 3)
AMD AOCC 3.0: 34.34 (SE +/- 0.05, N = 3)
GCC 10.2: 38.13 (SE +/- 0.02, N = 3)
LLVM Clang 12: 36.44 (SE +/- 0.06, N = 3)
Per-result flag -Xclang -mrelax-all appears three times in the export, without compiler attribution.
(CXX) g++ options: -O3 -march=native -funwind-tables -O2 -pthread -fPIE -pie -ldl

x265

Video Input: Bosphorus 1080p

x265 3.4 - Video Input: Bosphorus 1080p (Frames Per Second, more is better)
AMD AOCC 2.3: 89.74 (SE +/- 0.28, N = 3)
AMD AOCC 3.0: 88.70 (SE +/- 0.35, N = 3)
GCC 10.2: 89.80 (SE +/- 0.19, N = 3)
LLVM Clang 12: 92.16 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5 (Seconds, fewer is better)
AMD AOCC 2.3: 6.288 (SE +/- 0.015, N = 3)
AMD AOCC 3.0: 6.364 (SE +/- 0.010, N = 3)
GCC 10.2: 6.414 (SE +/- 0.011, N = 3)
LLVM Clang 12: 6.789 (SE +/- 0.010, N = 3)
(CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

SVT-AV1 0.8 - Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, more is better)
AMD AOCC 2.3: 65.75 (SE +/- 0.13, N = 3)
AMD AOCC 3.0: 64.28 (SE +/- 0.66, N = 3)
GCC 10.2: 51.77 (SE +/- 0.24, N = 3)
LLVM Clang 12: 65.31 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 17.36 (SE +/- 0.03, N = 3; -fopenmp=libomp; MIN: 16.83)
GCC 10.2: 17.29 (SE +/- 0.09, N = 3; -fopenmp; MIN: 16.58)
LLVM Clang 12: 17.34 (SE +/- 0.01, N = 3; -fopenmp=libomp; MIN: 16.81)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 - Settings: UASTC Level 0 (Seconds, fewer is better)
AMD AOCC 2.3: 5.453 (SE +/- 0.009, N = 3)
AMD AOCC 3.0: 5.639 (SE +/- 0.015, N = 3)
GCC 10.2: 5.157 (SE +/- 0.023, N = 3)
LLVM Clang 12: 5.522 (SE +/- 0.015, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

AOM AV1 2.1-rc - Encoder Mode: Speed 8 Realtime (Frames Per Second, more is better)
GCC 10.2: 121.13 (SE +/- 0.75, N = 3)
LLVM Clang 12: 118.22 (SE +/- 1.02, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.1 - Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, fewer is better)
AMD AOCC 2.3: 4.609 (SE +/- 0.016, N = 3)
AMD AOCC 3.0: 4.937 (SE +/- 0.020, N = 3)
GCC 10.2: 5.242 (SE +/- 0.018, N = 3)
LLVM Clang 12: 4.674 (SE +/- 0.055, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0 - Encoder Speed: 10, Lossless (Seconds, fewer is better)
AMD AOCC 2.3: 4.832 (SE +/- 0.041, N = 3)
AMD AOCC 3.0: 4.837 (SE +/- 0.015, N = 3)
GCC 10.2: 4.875 (SE +/- 0.022, N = 3)
LLVM Clang 12: 4.807 (SE +/- 0.038, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 - Global Illumination Renderer; 128 Samples (Seconds, fewer is better)
GCC 10.2: 4.674 (SE +/- 0.015, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=native

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 - Video Input: Summer Nature 1080p (FPS, more is better)
AMD AOCC 2.3: 976.93 (SE +/- 9.00, N = 3; MIN: 633.01 / MAX: 1069.88)
AMD AOCC 3.0: 959.29 (SE +/- 1.29, N = 3; MIN: 714.89 / MAX: 1039.77)
GCC 10.2: 971.79 (SE +/- 1.38, N = 3; -lm; MIN: 732.02 / MAX: 1055.82)
LLVM Clang 12: 979.34 (SE +/- 2.97, N = 3; MIN: 717.55 / MAX: 1062.34)
(CC) gcc options: -O3 -march=native -pthread

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 - Preset: Medium (Seconds, fewer is better)
AMD AOCC 2.3: 3.2899 (SE +/- 0.0273, N = 3)
AMD AOCC 3.0: 3.4040 (SE +/- 0.0017, N = 3)
GCC 10.2: 4.0524 (SE +/- 0.0178, N = 3)
LLVM Clang 12: 3.5076 (SE +/- 0.0018, N = 3)
(CXX) g++ options: -O3 -march=native -flto -pthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, fewer is better)
AMD AOCC 3.0: 3.58364 (SE +/- 0.00604, N = 3; -fopenmp=libomp; MIN: 3.44)
GCC 10.2: 3.55467 (SE +/- 0.00753, N = 3; -fopenmp; MIN: 3.46)
LLVM Clang 12: 3.64485 (SE +/- 0.01444, N = 3; -fopenmp=libomp; MIN: 3.5)
(CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
AMD AOCC 2.3: 230.19 (SE +/- 0.90, N = 3)
AMD AOCC 3.0: 221.59 (SE +/- 0.39, N = 3)
GCC 10.2: 228.96 (SE +/- 0.68, N = 3)
LLVM Clang 12: 219.12 (SE +/- 0.82, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0 - Encoder Speed: 10
Seconds, Fewer Is Better
AMD AOCC 2.3: 2.933 (SE +/- 0.035, N = 3)
AMD AOCC 3.0: 2.941 (SE +/- 0.006, N = 3)
GCC 10.2: 2.934 (SE +/- 0.014, N = 3)
LLVM Clang 12: 2.952 (SE +/- 0.016, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

WebP2 Image Encode

Encode Settings: Default

WebP2 Image Encode 20210126 - Encode Settings: Default
Seconds, Fewer Is Better
AMD AOCC 2.3: 2.144 (SE +/- 0.024, N = 3)
AMD AOCC 3.0: 2.165 (SE +/- 0.025, N = 3)
GCC 10.2: 2.274 (SE +/- 0.005, N = 3)
LLVM Clang 12: 2.134 (SE +/- 0.011, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Etcpak

Configuration: DXT1

Etcpak 0.7 - Configuration: DXT1
Mpx/s, More Is Better
AMD AOCC 2.3: 2986.71 (SE +/- 5.75, N = 3)
AMD AOCC 3.0: 3583.01 (SE +/- 7.06, N = 3)
GCC 10.2: 1546.30 (SE +/- 2.21, N = 3)
LLVM Clang 12: 3669.19 (SE +/- 26.84, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP Image Encode

Encode Settings: Quality 100

WebP Image Encode 1.1 - Encode Settings: Quality 100
Encode Time - Seconds, Fewer Is Better
AMD AOCC 2.3: 1.673 (SE +/- 0.006, N = 3)
AMD AOCC 3.0: 1.669 (SE +/- 0.011, N = 3)
GCC 10.2: 1.652 (SE +/- 0.018, N = 4)
LLVM Clang 12: 1.638 (SE +/- 0.003, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff

TSCP

AI Chess Performance

TSCP 1.81 - AI Chess Performance
Nodes Per Second, More Is Better
AMD AOCC 2.3: 2314225 (SE +/- 4348.49, N = 5)
AMD AOCC 3.0: 2283512 (SE +/- 3546.09, N = 5)
GCC 10.2: 1965773 (SE +/- 7442.75, N = 5)
LLVM Clang 12: 2148154 (SE +/- 4267.44, N = 5)
1. (CC) gcc options: -O3 -march=native

WebP Image Encode

Encode Settings: Default

WebP Image Encode 1.1 - Encode Settings: Default
Encode Time - Seconds, Fewer Is Better
AMD AOCC 2.3: 0.979 (SE +/- 0.006, N = 3)
AMD AOCC 3.0: 1.007 (SE +/- 0.005, N = 3)
GCC 10.2: 1.042 (SE +/- 0.008, N = 3)
LLVM Clang 12: 0.977 (SE +/- 0.014, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16 -ltiff
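
The results above mix "Fewer Is Better" and "More Is Better" metrics, so raw numbers are not directly comparable across tests. One possible way to put them on a common scale is to express each compiler relative to GCC 10.2. The following is only a minimal sketch: the helper name `relative_to_baseline` is hypothetical, the choice of GCC 10.2 as baseline is an assumption, and the two data sets are the Etcpak DXT1 and ASTC Encoder (Medium) values copied from this page.

```python
def relative_to_baseline(results, baseline, higher_is_better):
    """Return each compiler's performance relative to the baseline.

    1.0 means equal to the baseline; values above 1.0 mean faster.
    (Hypothetical helper, not part of the Phoronix Test Suite.)
    """
    base = results[baseline]
    if higher_is_better:
        # Throughput-style metric: a larger value is a better result.
        return {name: value / base for name, value in results.items()}
    # Time-style metric: a smaller value is a better result, so invert.
    return {name: base / value for name, value in results.items()}


# Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better)
etcpak_dxt1 = {
    "AMD AOCC 2.3": 2986.71,
    "AMD AOCC 3.0": 3583.01,
    "GCC 10.2": 1546.30,
    "LLVM Clang 12": 3669.19,
}

# ASTC Encoder 2.4, Preset: Medium (Seconds, Fewer Is Better)
astc_medium = {
    "AMD AOCC 2.3": 3.2899,
    "AMD AOCC 3.0": 3.4040,
    "GCC 10.2": 4.0524,
    "LLVM Clang 12": 3.5076,
}

tests = (
    ("Etcpak DXT1", etcpak_dxt1, True),
    ("ASTC Encoder Medium", astc_medium, False),
)

for title, data, higher_is_better in tests:
    print(title)
    ratios = relative_to_baseline(data, "GCC 10.2", higher_is_better)
    for name, ratio in ratios.items():
        print(f"  {name}: {ratio:.2f}x vs GCC 10.2")
```

Because the ratio is inverted for time-based metrics, a value above 1.0 always means "faster than the baseline", whichever direction the original metric uses.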


Phoronix Test Suite v10.8.5