AMD EPYC 7773X Compilers

AMD EPYC 7773X GCC / Clang / AOCC compiler benchmarking by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2204117-NE-EPYC7773X86.

AMD EPYC 7773X CompilersProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionGCC 11.2Clang 14.0AMD AOCC 3.22 x AMD EPYC 7773X 64-Core @ 2.20GHz (128 Cores / 256 Threads)AMD DAYTONA_X (TYM1008C BIOS)AMD Starship/Matisse16 x 32 GB DDR4-3200MT/s 36ASF4G72PZ-3G2E2800GB INTEL SSDPF21Q800GBASPEEDVE2282 x Mellanox MT27710Ubuntu 22.045.17.0-051700rc8-generic (x86_64)GNOME Shell 42.0X Server1.2.204GCC 11.2.0ext41920x1080Clang 14.0.0-1ubuntu1Clang 13.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 11.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - AMD AOCC 3.2: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver3 Disk Details- GCC 11.2: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001228 Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 7773X Compilersquantlib: etcpak: DXT1etcpak: ETC2lczero: BLASlczero: Eigenlammps: 20k Atomslammps: Rhodopsin Proteinwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionxmrig: Monero - 1Mxmrig: Wownero - 1Mcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedjpegxl: PNG - 8jpegxl: JPEG - 7jpegxl-decode: 1jpegxl-decode: Alltscp: AI Chess Performancegraphics-magick: Rotategraphics-magick: Resizingkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Very Fastsvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080px265: Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessprimesieve: 1e12 Prime Number Generationaobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3openjpeg: NASA Curiosity Panorama M34openssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 57tjbench: Decompression Throughputastcenc: Thoroughastcenc: Exhaustivetoktx: UASTC 3toktx: Zstd Compression 9toktx: Zstd Compression 19toktx: UASTC 3 + Zstd Compression 19toktx: UASTC 4 + Zstd Compression 19draco: LionGCC 11.2Clang 14.0AMD AOCC 3.22125.3844.045134.0434159418735.97228.4681.6952.90624.5618.79850.64241100.542309.798.82269.90.7271.1346.70564.28109414153716431.5243.644.082107.652140.461288.97376.34330.0019.374447513.21180188.75748.7755.0208.3182.61349.74421.5168.98836248315640448103026996.81770209.958256000006127133333163.4869906.42225.87924.6043.83921.4419.01535.58359522240.31926.467166.3924224510736.01029.9471.7012.87123.8688.58545.08940338.240922.799.02086.00.6778.0253.78605.1812316674509333.2145.284.005104.559129.325303.09437.24362.4221.313705497.01560987.25147.5124.9117.9762.61150.30321.8349.98634904017035010915026924.91774479.058679333336229033333150.7200215.95805.77374.7383.88622.9899.38935.53161762251.42012.565158.3474551557036.31630.3531.6602.75922.7598.02645.90138790.736998.098.72311.10.7982.7557.14599.49117243861626832.9245.364.483110.839132.782308.11472.24367.8125.034242648.79819484.81546.2714.6357.8222.56447.94121.8049.78235284817638514841326972.41768233.259490000007010600000160.2910465.73785.56864.6843.83221.8139.07135.3865859OpenBenchmarking.org

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21GCC 11.2Clang 14.0AMD AOCC 3.25001000150020002500SE +/- 9.30, N = 3SE +/- 5.25, N = 3SE +/- 14.43, N = 32125.32240.32251.41. (CXX) g++ options: -O3 -march=native -rdynamic

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1GCC 11.2Clang 14.0AMD AOCC 3.2400800120016002000SE +/- 2.54, N = 3SE +/- 29.34, N = 15SE +/- 23.71, N = 15844.051926.472012.571. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2GCC 11.2Clang 14.0AMD AOCC 3.24080120160200SE +/- 1.46, N = 3SE +/- 0.04, N = 3SE +/- 1.75, N = 4134.04166.39158.351. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASGCC 11.2Clang 14.0AMD AOCC 3.210002000300040005000SE +/- 38.97, N = 3SE +/- 23.81, N = 3SE +/- 44.95, N = 94159422445511. (CXX) g++ options: -flto -O3 -march=native -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenGCC 11.2Clang 14.0AMD AOCC 3.212002400360048006000SE +/- 48.22, N = 3SE +/- 57.43, N = 9SE +/- 67.86, N = 94187510755701. (CXX) g++ options: -flto -O3 -march=native -pthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsGCC 11.2Clang 14.0AMD AOCC 3.2816243240SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.19, N = 335.9736.0136.321. (CXX) g++ options: -O3 -march=native -flto -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinGCC 11.2Clang 14.0AMD AOCC 3.2714212835SE +/- 0.40, N = 15SE +/- 0.30, N = 15SE +/- 0.44, N = 1528.4729.9530.351. (CXX) g++ options: -O3 -march=native -flto -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultGCC 11.2Clang 14.0AMD AOCC 3.20.38270.76541.14811.53081.9135SE +/- 0.001, N = 3SE +/- 0.015, N = 15SE +/- 0.003, N = 31.6951.7011.6601. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100GCC 11.2Clang 14.0AMD AOCC 3.20.65391.30781.96172.61563.2695SE +/- 0.029, N = 3SE +/- 0.029, N = 15SE +/- 0.004, N = 32.9062.8712.7591. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 11.2Clang 14.0AMD AOCC 3.2612182430SE +/- 0.22, N = 3SE +/- 0.22, N = 15SE +/- 0.01, N = 324.5623.8722.761. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionGCC 11.2Clang 14.0AMD AOCC 3.2246810SE +/- 0.004, N = 3SE +/- 0.013, N = 3SE +/- 0.005, N = 38.7988.5858.0261. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionGCC 11.2Clang 14.0AMD AOCC 3.21122334455SE +/- 0.51, N = 5SE +/- 0.11, N = 3SE +/- 0.02, N = 350.6445.0945.901. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MGCC 11.2Clang 14.0AMD AOCC 3.29K18K27K36K45KSE +/- 47.93, N = 3SE +/- 142.04, N = 3SE +/- 276.55, N = 341100.540338.238790.7-static-libgcc -static-libstdc++-funroll-loops-funroll-loops1. (CXX) g++ options: -O3 -march=native -flto -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MGCC 11.2Clang 14.0AMD AOCC 3.29K18K27K36K45KSE +/- 63.43, N = 3SE +/- 526.98, N = 3SE +/- 352.95, N = 342309.740922.736998.0-static-libgcc -static-libstdc++-funroll-loops-funroll-loops1. (CXX) g++ options: -O3 -march=native -flto -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedGCC 11.2Clang 14.0AMD AOCC 3.220406080100SE +/- 1.34, N = 3SE +/- 1.18, N = 4SE +/- 1.21, N = 398.899.098.71. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedGCC 11.2Clang 14.0AMD AOCC 3.25001000150020002500SE +/- 4.90, N = 3SE +/- 64.15, N = 4SE +/- 6.09, N = 32269.92086.02311.11. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

JPEG XL libjxl

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 8GCC 11.2Clang 14.0AMD AOCC 3.20.17780.35560.53340.71120.889SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.720.670.79-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 7GCC 11.2Clang 14.0AMD AOCC 3.220406080100SE +/- 1.50, N = 15SE +/- 1.11, N = 3SE +/- 0.90, N = 471.1378.0282.75-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie

JPEG XL Decoding libjxl

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding libjxl 0.6.1CPU Threads: 1GCC 11.2Clang 14.0AMD AOCC 3.21326395265SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.15, N = 346.7053.7857.14

JPEG XL Decoding libjxl

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding libjxl 0.6.1CPU Threads: AllGCC 11.2Clang 14.0AMD AOCC 3.2130260390520650SE +/- 7.30, N = 3SE +/- 1.33, N = 3SE +/- 6.36, N = 3564.28605.18599.49

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 11.2Clang 14.0AMD AOCC 3.2300K600K900K1200K1500KSE +/- 2612.05, N = 5SE +/- 4052.15, N = 5SE +/- 4599.34, N = 51094141123166711724381. (CC) gcc options: -O3 -march=native -flto

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 11.2Clang 14.0AMD AOCC 3.2130260390520650SE +/- 7.17, N = 3SE +/- 8.14, N = 15SE +/- 3.76, N = 35374506161. (CC) gcc options: -fopenmp -O3 -march=native -flto -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 11.2Clang 14.0AMD AOCC 3.260120180240300SE +/- 10.44, N = 15SE +/- 1.33, N = 3SE +/- 3.79, N = 3164932681. (CC) gcc options: -fopenmp -O3 -march=native -flto -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: MediumGCC 11.2Clang 14.0AMD AOCC 3.2816243240SE +/- 0.17, N = 3SE +/- 0.03, N = 3SE +/- 0.20, N = 331.5233.2132.92-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastGCC 11.2Clang 14.0AMD AOCC 3.21020304050SE +/- 0.56, N = 3SE +/- 0.50, N = 15SE +/- 0.64, N = 1543.6445.2845.36-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 4 - Input: Bosphorus 4KGCC 11.2Clang 14.0AMD AOCC 3.21.00872.01743.02614.03485.0435SE +/- 0.036, N = 3SE +/- 0.015, N = 3SE +/- 0.052, N = 34.0824.0054.4831. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 10 - Input: Bosphorus 4KGCC 11.2Clang 14.0AMD AOCC 3.220406080100SE +/- 0.35, N = 3SE +/- 0.85, N = 3SE +/- 0.22, N = 3107.65104.56110.841. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 11.2Clang 14.0AMD AOCC 3.2306090120150SE +/- 1.52, N = 5SE +/- 0.12, N = 3SE +/- 1.72, N = 3140.46129.33132.781. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pGCC 11.2Clang 14.0AMD AOCC 3.270140210280350SE +/- 2.37, N = 3SE +/- 1.03, N = 3SE +/- 2.88, N = 3288.97303.09308.111. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pGCC 11.2Clang 14.0AMD AOCC 3.2100200300400500SE +/- 3.15, N = 13SE +/- 5.37, N = 3SE +/- 3.33, N = 3376.34437.24472.241. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 11.2Clang 14.0AMD AOCC 3.280160240320400SE +/- 1.46, N = 3SE +/- 3.50, N = 3SE +/- 0.54, N = 3330.00362.42367.811. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 11.2Clang 14.0AMD AOCC 3.2612182430SE +/- 0.21, N = 15SE +/- 0.21, N = 15SE +/- 0.18, N = 319.3721.3125.031. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 11.2Clang 14.0AMD AOCC 3.21000K2000K3000K4000K5000KSE +/- 9967.53, N = 3SE +/- 34949.41, N = 3SE +/- 14710.43, N = 34447513.213705497.024242648.801. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 0GCC 11.2Clang 14.0AMD AOCC 3.220406080100SE +/- 0.43, N = 3SE +/- 0.37, N = 3SE +/- 0.48, N = 388.7687.2584.821. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 2GCC 11.2Clang 14.0AMD AOCC 3.21122334455SE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 348.7847.5146.271. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6GCC 11.2Clang 14.0AMD AOCC 3.21.12952.2593.38854.5185.6475SE +/- 0.046, N = 7SE +/- 0.057, N = 4SE +/- 0.047, N = 35.0204.9114.6351. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, LosslessGCC 11.2Clang 14.0AMD AOCC 3.2246810SE +/- 0.057, N = 3SE +/- 0.038, N = 3SE +/- 0.109, N = 38.3187.9767.8221. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.71e12 Prime Number GenerationGCC 11.2Clang 14.0AMD AOCC 3.20.58791.17581.76372.35162.9395SE +/- 0.029, N = 5SE +/- 0.030, N = 15SE +/- 0.028, N = 52.6132.6112.5641. (CXX) g++ options: -O3 -march=native -flto

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 11.2Clang 14.0AMD AOCC 3.21122334455SE +/- 0.13, N = 3SE +/- 0.25, N = 3SE +/- 0.41, N = 349.7450.3047.941. (CC) gcc options: -lm -O3 -march=native -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.3WAV To FLACGCC 11.2Clang 14.0AMD AOCC 3.2510152025SE +/- 0.02, N = 5SE +/- 0.07, N = 5SE +/- 0.04, N = 521.5221.8321.80-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -flto -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 11.2Clang 14.0AMD AOCC 3.23691215SE +/- 0.024, N = 3SE +/- 0.017, N = 3SE +/- 0.020, N = 38.9889.9869.782-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -flto -lncurses -lm

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34GCC 11.2Clang 14.0AMD AOCC 3.280K160K240K320K400KSE +/- 3417.00, N = 3SE +/- 935.91, N = 3SE +/- 3533.23, N = 153624833490403528481. (CXX) g++ options: -O3 -march=native -flto -rdynamic

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256GCC 11.2Clang 14.0AMD AOCC 3.240000M80000M120000M160000M200000MSE +/- 330832787.20, N = 3SE +/- 326147908.01, N = 3SE +/- 144731144.77, N = 3156404481030170350109150176385148413-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096GCC 11.2Clang 14.0AMD AOCC 3.26K12K18K24K30KSE +/- 8.41, N = 3SE +/- 26.21, N = 3SE +/- 9.62, N = 326996.826924.926972.4-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096GCC 11.2Clang 14.0AMD AOCC 3.2400K800K1200K1600K2000KSE +/- 368.08, N = 3SE +/- 686.16, N = 3SE +/- 526.74, N = 31770209.91774479.01768233.2-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57GCC 11.2Clang 14.0AMD AOCC 3.21300M2600M3900M5200M6500MSE +/- 7813023.32, N = 3SE +/- 2434018.17, N = 3SE +/- 3257811.13, N = 35825600000586793333359490000001. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57GCC 11.2Clang 14.0AMD AOCC 3.21500M3000M4500M6000M7500MSE +/- 1604507.54, N = 3SE +/- 1414606.34, N = 3SE +/- 1311487.70, N = 36127133333622903333370106000001. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 11.2Clang 14.0AMD AOCC 3.24080120160200SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.62, N = 3163.49150.72160.291. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: ThoroughGCC 11.2Clang 14.0AMD AOCC 3.2246810SE +/- 0.0328, N = 3SE +/- 0.0253, N = 3SE +/- 0.0468, N = 156.42225.95805.73781. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: ExhaustiveGCC 11.2Clang 14.0AMD AOCC 3.21.32282.64563.96845.29126.614SE +/- 0.0090, N = 3SE +/- 0.0041, N = 3SE +/- 0.0070, N = 35.87925.77375.56861. (CXX) g++ options: -O3 -march=native -flto -pthread

KTX-Software toktx

Settings: UASTC 3

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3GCC 11.2Clang 14.0AMD AOCC 3.21.06612.13223.19834.26445.3305SE +/- 0.054, N = 3SE +/- 0.045, N = 15SE +/- 0.059, N = 34.6044.7384.684

KTX-Software toktx

Settings: Zstd Compression 9

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 9GCC 11.2Clang 14.0AMD AOCC 3.20.87441.74882.62323.49764.372SE +/- 0.052, N = 3SE +/- 0.036, N = 3SE +/- 0.021, N = 33.8393.8863.832

KTX-Software toktx

Settings: Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 19GCC 11.2Clang 14.0AMD AOCC 3.2612182430SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 321.4422.9921.81

KTX-Software toktx

Settings: UASTC 3 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3 + Zstd Compression 19GCC 11.2Clang 14.0AMD AOCC 3.23691215SE +/- 0.100, N = 3SE +/- 0.066, N = 15SE +/- 0.024, N = 39.0159.3899.071

KTX-Software toktx

Settings: UASTC 4 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 4 + Zstd Compression 19GCC 11.2Clang 14.0AMD AOCC 3.2816243240SE +/- 0.46, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 335.5835.5335.39

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: LionGCC 11.2Clang 14.0AMD AOCC 3.213002600390052006500SE +/- 7.94, N = 3SE +/- 17.33, N = 3SE +/- 8.33, N = 35952617658591. (CXX) g++ options: -O3 -march=native -flto


Phoronix Test Suite v10.8.4