AMD EPYC 7773X Compilers

AMD EPYC 7773X GCC / Clang / AOCC compiler benchmarking by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2204117-NE-EPYC7773X86&sor.

AMD EPYC 7773X CompilersProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionGCC 11.2Clang 14.0AMD AOCC 3.22 x AMD EPYC 7773X 64-Core @ 2.20GHz (128 Cores / 256 Threads)AMD DAYTONA_X (TYM1008C BIOS)AMD Starship/Matisse16 x 32 GB DDR4-3200MT/s 36ASF4G72PZ-3G2E2800GB INTEL SSDPF21Q800GBASPEEDVE2282 x Mellanox MT27710Ubuntu 22.045.17.0-051700rc8-generic (x86_64)GNOME Shell 42.0X Server1.2.204GCC 11.2.0ext41920x1080Clang 14.0.0-1ubuntu1Clang 13.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- GCC 11.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - AMD AOCC 3.2: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver3 Disk Details- GCC 11.2: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001228 Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 7773X Compilersquantlib: etcpak: DXT1etcpak: ETC2lczero: BLASlczero: Eigenlammps: 20k Atomslammps: Rhodopsin Proteinwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionxmrig: Monero - 1Mxmrig: Wownero - 1Mcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedjpegxl: PNG - 8jpegxl: JPEG - 7jpegxl-decode: 1jpegxl-decode: Alltscp: AI Chess Performancegraphics-magick: Rotategraphics-magick: Resizingkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Very Fastsvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080px265: Bosphorus 4Kcoremark: CoreMark Size 666 - Iterations Per Secondavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessprimesieve: 1e12 Prime Number Generationaobench: 2048 x 2048 - Total Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3openjpeg: NASA Curiosity Panorama M34openssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 57tjbench: Decompression Throughputastcenc: Thoroughastcenc: Exhaustivetoktx: UASTC 3toktx: Zstd Compression 9toktx: Zstd Compression 19toktx: UASTC 3 + Zstd Compression 19toktx: UASTC 4 + Zstd Compression 19draco: LionGCC 11.2Clang 14.0AMD AOCC 3.22125.3844.045134.0434159418735.97228.4681.6952.90624.5618.79850.64241100.542309.798.82269.90.7271.1346.70564.28109414153716431.5243.644.082107.652140.461288.97376.34330.0019.374447513.21180188.75748.7755.0208.3182.61349.74421.5168.98836248315640448103026996.81770209.958256000006127133333163.4869906.42225.87924.6043.83921.4419.01535.58359522240.31926.467166.3924224510736.01029.9471.7012.87123.8688.58545.08940338.240922.799.02086.00.6778.0253.78605.1812316674509333.2145.284.005104.559129.325303.09437.24362.4221.313705497.01560987.25147.5124.9117.9762.61150.30321.8349.98634904017035010915026924.91774479.058679333336229033333150.7200215.95805.77374.7383.88622.9899.38935.53161762251.42012.565158.3474551557036.31630.3531.6602.75922.7598.02645.90138790.736998.098.72311.10.7982.7557.14599.49117243861626832.9245.364.483110.839132.782308.11472.24367.8125.034242648.79819484.81546.2714.6357.8222.56447.94121.8049.78235284817638514841326972.41768233.259490000007010600000160.2910465.73785.56864.6843.83221.8139.07135.3865859OpenBenchmarking.org

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21AMD AOCC 3.2Clang 14.0GCC 11.25001000150020002500SE +/- 14.43, N = 3SE +/- 5.25, N = 3SE +/- 9.30, N = 32251.42240.32125.31. (CXX) g++ options: -O3 -march=native -rdynamic

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1AMD AOCC 3.2Clang 14.0GCC 11.2400800120016002000SE +/- 23.71, N = 15SE +/- 29.34, N = 15SE +/- 2.54, N = 32012.571926.47844.051. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Clang 14.0AMD AOCC 3.2GCC 11.24080120160200SE +/- 0.04, N = 3SE +/- 1.75, N = 4SE +/- 1.46, N = 3166.39158.35134.041. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAMD AOCC 3.2Clang 14.0GCC 11.210002000300040005000SE +/- 44.95, N = 9SE +/- 23.81, N = 3SE +/- 38.97, N = 34551422441591. (CXX) g++ options: -flto -O3 -march=native -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAMD AOCC 3.2Clang 14.0GCC 11.212002400360048006000SE +/- 67.86, N = 9SE +/- 57.43, N = 9SE +/- 48.22, N = 35570510741871. (CXX) g++ options: -flto -O3 -march=native -pthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsAMD AOCC 3.2Clang 14.0GCC 11.2816243240SE +/- 0.19, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 336.3236.0135.971. (CXX) g++ options: -O3 -march=native -flto -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinAMD AOCC 3.2Clang 14.0GCC 11.2714212835SE +/- 0.44, N = 15SE +/- 0.30, N = 15SE +/- 0.40, N = 1530.3529.9528.471. (CXX) g++ options: -O3 -march=native -flto -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultAMD AOCC 3.2GCC 11.2Clang 14.00.38270.76541.14811.53081.9135SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.015, N = 151.6601.6951.7011. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100AMD AOCC 3.2Clang 14.0GCC 11.20.65391.30781.96172.61563.2695SE +/- 0.004, N = 3SE +/- 0.029, N = 15SE +/- 0.029, N = 32.7592.8712.9061. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessAMD AOCC 3.2Clang 14.0GCC 11.2612182430SE +/- 0.01, N = 3SE +/- 0.22, N = 15SE +/- 0.22, N = 322.7623.8724.561. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionAMD AOCC 3.2Clang 14.0GCC 11.2246810SE +/- 0.005, N = 3SE +/- 0.013, N = 3SE +/- 0.004, N = 38.0268.5858.7981. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionClang 14.0AMD AOCC 3.2GCC 11.21122334455SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.51, N = 545.0945.9050.641. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MGCC 11.2Clang 14.0AMD AOCC 3.29K18K27K36K45KSE +/- 47.93, N = 3SE +/- 142.04, N = 3SE +/- 276.55, N = 341100.540338.238790.7-static-libgcc -static-libstdc++-funroll-loops-funroll-loops1. (CXX) g++ options: -O3 -march=native -flto -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MGCC 11.2Clang 14.0AMD AOCC 3.29K18K27K36K45KSE +/- 63.43, N = 3SE +/- 526.98, N = 3SE +/- 352.95, N = 342309.740922.736998.0-static-libgcc -static-libstdc++-funroll-loops-funroll-loops1. (CXX) g++ options: -O3 -march=native -flto -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression SpeedClang 14.0GCC 11.2AMD AOCC 3.220406080100SE +/- 1.18, N = 4SE +/- 1.34, N = 3SE +/- 1.21, N = 399.098.898.71. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression SpeedAMD AOCC 3.2GCC 11.2Clang 14.05001000150020002500SE +/- 6.09, N = 3SE +/- 4.90, N = 3SE +/- 64.15, N = 42311.12269.92086.01. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

JPEG XL libjxl

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: PNG - Encode Speed: 8AMD AOCC 3.2GCC 11.2Clang 14.00.17780.35560.53340.71120.889SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.790.720.67-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie

JPEG XL libjxl

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.6.1Input: JPEG - Encode Speed: 7AMD AOCC 3.2Clang 14.0GCC 11.220406080100SE +/- 0.90, N = 4SE +/- 1.11, N = 3SE +/- 1.50, N = 1582.7578.0271.13-Xclang -mrelax-all-Xclang -mrelax-all1. (CXX) g++ options: -O3 -march=native -flto -funwind-tables -O2 -fPIE -pie

JPEG XL Decoding libjxl

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding libjxl 0.6.1CPU Threads: 1AMD AOCC 3.2Clang 14.0GCC 11.21326395265SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 357.1453.7846.70

JPEG XL Decoding libjxl

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding libjxl 0.6.1CPU Threads: AllClang 14.0AMD AOCC 3.2GCC 11.2130260390520650SE +/- 1.33, N = 3SE +/- 6.36, N = 3SE +/- 7.30, N = 3605.18599.49564.28

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 14.0AMD AOCC 3.2GCC 11.2300K600K900K1200K1500KSE +/- 4052.15, N = 5SE +/- 4599.34, N = 5SE +/- 2612.05, N = 51231667117243810941411. (CC) gcc options: -O3 -march=native -flto

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateAMD AOCC 3.2GCC 11.2Clang 14.0130260390520650SE +/- 3.76, N = 3SE +/- 7.17, N = 3SE +/- 8.14, N = 156165374501. (CC) gcc options: -fopenmp -O3 -march=native -flto -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingAMD AOCC 3.2GCC 11.2Clang 14.060120180240300SE +/- 3.79, N = 3SE +/- 10.44, N = 15SE +/- 1.33, N = 3268164931. (CC) gcc options: -fopenmp -O3 -march=native -flto -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: MediumClang 14.0AMD AOCC 3.2GCC 11.2816243240SE +/- 0.03, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 333.2132.9231.52-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very FastAMD AOCC 3.2Clang 14.0GCC 11.21020304050SE +/- 0.64, N = 15SE +/- 0.50, N = 15SE +/- 0.56, N = 345.3645.2843.64-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lm -lrt

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 4 - Input: Bosphorus 4KAMD AOCC 3.2GCC 11.2Clang 14.01.00872.01743.02614.03485.0435SE +/- 0.052, N = 3SE +/- 0.036, N = 3SE +/- 0.015, N = 34.4834.0824.0051. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 10 - Input: Bosphorus 4KAMD AOCC 3.2GCC 11.2Clang 14.020406080100SE +/- 0.22, N = 3SE +/- 0.35, N = 3SE +/- 0.85, N = 3110.84107.65104.561. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.9Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 11.2AMD AOCC 3.2Clang 14.0306090120150SE +/- 1.52, N = 5SE +/- 1.72, N = 3SE +/- 0.12, N = 3140.46132.78129.331. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pAMD AOCC 3.2Clang 14.0GCC 11.270140210280350SE +/- 2.88, N = 3SE +/- 1.03, N = 3SE +/- 2.37, N = 3308.11303.09288.971. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pAMD AOCC 3.2Clang 14.0GCC 11.2100200300400500SE +/- 3.33, N = 3SE +/- 5.37, N = 3SE +/- 3.15, N = 13472.24437.24376.341. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pAMD AOCC 3.2Clang 14.0GCC 11.280160240320400SE +/- 0.54, N = 3SE +/- 3.50, N = 3SE +/- 1.46, N = 3367.81362.42330.001. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KAMD AOCC 3.2Clang 14.0GCC 11.2612182430SE +/- 0.18, N = 3SE +/- 0.21, N = 15SE +/- 0.21, N = 1525.0321.3119.371. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 11.2AMD AOCC 3.2Clang 14.01000K2000K3000K4000K5000KSE +/- 9967.53, N = 3SE +/- 14710.43, N = 3SE +/- 34949.41, N = 34447513.214242648.803705497.021. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 0AMD AOCC 3.2Clang 14.0GCC 11.220406080100SE +/- 0.48, N = 3SE +/- 0.37, N = 3SE +/- 0.43, N = 384.8287.2588.761. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 2AMD AOCC 3.2Clang 14.0GCC 11.21122334455SE +/- 0.17, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 346.2747.5148.781. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6AMD AOCC 3.2Clang 14.0GCC 11.21.12952.2593.38854.5185.6475SE +/- 0.047, N = 3SE +/- 0.057, N = 4SE +/- 0.046, N = 74.6354.9115.0201. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, LosslessAMD AOCC 3.2Clang 14.0GCC 11.2246810SE +/- 0.109, N = 3SE +/- 0.038, N = 3SE +/- 0.057, N = 37.8227.9768.3181. (CXX) g++ options: -O3 -fPIC -march=native -flto -lm

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.71e12 Prime Number GenerationAMD AOCC 3.2Clang 14.0GCC 11.20.58791.17581.76372.35162.9395SE +/- 0.028, N = 5SE +/- 0.030, N = 15SE +/- 0.029, N = 52.5642.6112.6131. (CXX) g++ options: -O3 -march=native -flto

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeAMD AOCC 3.2GCC 11.2Clang 14.01122334455SE +/- 0.41, N = 3SE +/- 0.13, N = 3SE +/- 0.25, N = 347.9449.7450.301. (CC) gcc options: -lm -O3 -march=native -flto

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.3WAV To FLACGCC 11.2AMD AOCC 3.2Clang 14.0510152025SE +/- 0.02, N = 5SE +/- 0.04, N = 5SE +/- 0.07, N = 521.5221.8021.83-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -flto -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 11.2AMD AOCC 3.2Clang 14.03691215SE +/- 0.024, N = 3SE +/- 0.020, N = 3SE +/- 0.017, N = 38.9889.7829.986-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -flto -lncurses -lm

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34Clang 14.0AMD AOCC 3.2GCC 11.280K160K240K320K400KSE +/- 935.91, N = 3SE +/- 3533.23, N = 15SE +/- 3417.00, N = 33490403528483624831. (CXX) g++ options: -O3 -march=native -flto -rdynamic

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256AMD AOCC 3.2Clang 14.0GCC 11.240000M80000M120000M160000M200000MSE +/- 144731144.77, N = 3SE +/- 326147908.01, N = 3SE +/- 330832787.20, N = 3176385148413170350109150156404481030-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096GCC 11.2AMD AOCC 3.2Clang 14.06K12K18K24K30KSE +/- 8.41, N = 3SE +/- 9.62, N = 3SE +/- 26.21, N = 326996.826972.426924.9-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096Clang 14.0GCC 11.2AMD AOCC 3.2400K800K1200K1600K2000KSE +/- 686.16, N = 3SE +/- 368.08, N = 3SE +/- 526.74, N = 31774479.01770209.91768233.2-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.2Clang 14.0GCC 11.21300M2600M3900M5200M6500MSE +/- 3257811.13, N = 3SE +/- 2434018.17, N = 3SE +/- 7813023.32, N = 35949000000586793333358256000001. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.2Clang 14.0GCC 11.21500M3000M4500M6000M7500MSE +/- 1311487.70, N = 3SE +/- 1414606.34, N = 3SE +/- 1604507.54, N = 37010600000622903333361271333331. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputGCC 11.2AMD AOCC 3.2Clang 14.04080120160200SE +/- 0.17, N = 3SE +/- 0.62, N = 3SE +/- 0.18, N = 3163.49160.29150.721. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: ThoroughAMD AOCC 3.2Clang 14.0GCC 11.2246810SE +/- 0.0468, N = 15SE +/- 0.0253, N = 3SE +/- 0.0328, N = 35.73785.95806.42221. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: ExhaustiveAMD AOCC 3.2Clang 14.0GCC 11.21.32282.64563.96845.29126.614SE +/- 0.0070, N = 3SE +/- 0.0041, N = 3SE +/- 0.0090, N = 35.56865.77375.87921. (CXX) g++ options: -O3 -march=native -flto -pthread

KTX-Software toktx

Settings: UASTC 3

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3GCC 11.2AMD AOCC 3.2Clang 14.01.06612.13223.19834.26445.3305SE +/- 0.054, N = 3SE +/- 0.059, N = 3SE +/- 0.045, N = 154.6044.6844.738

KTX-Software toktx

Settings: Zstd Compression 9

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 9AMD AOCC 3.2GCC 11.2Clang 14.00.87441.74882.62323.49764.372SE +/- 0.021, N = 3SE +/- 0.052, N = 3SE +/- 0.036, N = 33.8323.8393.886

KTX-Software toktx

Settings: Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 19GCC 11.2AMD AOCC 3.2Clang 14.0612182430SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 321.4421.8122.99

KTX-Software toktx

Settings: UASTC 3 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3 + Zstd Compression 19GCC 11.2AMD AOCC 3.2Clang 14.03691215SE +/- 0.100, N = 3SE +/- 0.024, N = 3SE +/- 0.066, N = 159.0159.0719.389

KTX-Software toktx

Settings: UASTC 4 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 4 + Zstd Compression 19AMD AOCC 3.2Clang 14.0GCC 11.2816243240SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.46, N = 335.3935.5335.58

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.0Model: LionAMD AOCC 3.2GCC 11.2Clang 14.013002600390052006500SE +/- 8.33, N = 3SE +/- 7.94, N = 3SE +/- 17.33, N = 35859595261761. (CXX) g++ options: -O3 -march=native -flto


Phoronix Test Suite v10.8.4