AMD AOCC 4.0 Benchmarks

AMD Ryzen 9 7950X compiler benchmarks by Michael Larabel for a future article.

AOCC 4.0

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR X670E HERO (0703 BIOS), Chipset: AMD Device 14d8, Memory: 32GB, Disk: 1000GB Sabrent Rocket 4.0 Plus, Graphics: AMD Radeon RX 6800 16GB (2475/1000MHz), Audio: AMD Navi 21/23, Monitor: ASUS MG28U, Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 14.0.6, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver4
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 14

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 14.0.6-2, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 12.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 13.0 14 Nov

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 13.0.0 20221114 + clang (GCC) 13.0.0 20221114 (experimental), File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: --disable-multilib
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 15.0.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 15.0.2-1, File-System: ext4, Screen Resolution: 3840x2160

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

QuadRay

VectorChief's QuadRay is a real-time ray-tracing engine written to support SIMD across ARM, MIPS, PPC, and x86/x86_64 processors. QuadRay supports SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 usage on Intel/AMD CPUs. Learn more via the OpenBenchmarking.org test page.

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

Renderer: Software CPU - AF: On - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google as the AV1 Codec Library. Learn more via the OpenBenchmarking.org test page.

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

Throughput Test: Kostya

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: TopTweet

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: LargeRandom

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: PartialTweets

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: DistinctUserID

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Model: GPT-2 - Device: CPU - Executor: Parallel