AMD EPYC Genoa-X Linux 6.9 Kernel Benchmarks

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2403282-NE-AMDEPYCGE98&hlc=1&sgm=1&ppt=D&sro.

System Details

Processor: 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads)
Motherboard: AMD Titanite_4G (RTI1007B BIOS)
Chipset: AMD Device 14a4
Memory: 1520GB
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive
Graphics: ASPEED
Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10
Kernel (Linux 6.8): 6.8.0-060800-generic (x86_64)
Kernel (Linux 6.9 27 Mar): 6.9.0-060900rc1daily20240327-generic (x86_64)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa10113e

Python Details
- Python 3.11.6

Security Details
- Linux 6.8: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- Linux 6.9 27 Mar: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Results Overview

Test                                                     Linux 6.8     Linux 6.9 27 Mar
svt-av1: Preset 12 - Bosphorus 4K                        159.513       164.308
hackbench: 32 - Process                                  10.836        10.425
stress-ng: NUMA                                          646.34        691.54
stress-ng: Semaphores                                    264112681.09  292541552.77
stress-ng: Vector Math                                   1306706.19    1349057.51
stress-ng: AVX-512 VNNI                                  18968297.94   19594674.67
stress-ng: Context Switching                             40075791.13   41854298.68
stress-ng: System V Message Passing                      25534145.87   28384870.15
openvino: Vehicle Detection FP16-INT8 - CPU (FPS)        11218.12      11719.99
openvino: Vehicle Detection FP16-INT8 - CPU (ms)         4.26          4.08
openvino: Road Segmentation ADAS FP16-INT8 - CPU (FPS)   3864.65       4063.11
openvino: Road Segmentation ADAS FP16-INT8 - CPU (ms)    12.40         11.80
openvino: Machine Translation EN To DE FP16 - CPU (FPS)  1086.48       1119.56
openvino: Machine Translation EN To DE FP16 - CPU (ms)   44.13         42.82
rocksdb: Overwrite                                       381523        416860
rocksdb: Rand Read                                       1047290863    1096081951
rocksdb: Update Rand                                     375714        406258
rocksdb: Read While Writing                              20665324      22418140
rocksdb: Read Rand Write Rand                            3182572       3484877
speedb: Update Rand                                      293746        313791
llamafile: llava-v1.5-7b-q4 - CPU                        12.03         21.10
llamafile: mistral-7b-instruct-v0.2.Q8_0 - CPU           12.79         13.46

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 2.0 - Frames Per Second, More Is Better
Linux 6.8: 159.51 (SE +/- 1.75, N = 3)
Linux 6.9 27 Mar: 164.31 (SE +/- 0.71, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Hackbench

Count: 32 - Type: Process

Hackbench - Seconds, Fewer Is Better
Linux 6.8: 10.84 (SE +/- 0.13, N = 4)
Linux 6.9 27 Mar: 10.43 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -lpthread

Stress-NG

Test: NUMA

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 646.34 (SE +/- 2.59, N = 3)
Linux 6.9 27 Mar: 691.54 (SE +/- 0.95, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Semaphores

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 264112681.09 (SE +/- 1107000.14, N = 3)
Linux 6.9 27 Mar: 292541552.77 (SE +/- 1842285.12, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Math

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 1306706.19 (SE +/- 1312.02, N = 3)
Linux 6.9 27 Mar: 1349057.51 (SE +/- 1326.11, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: AVX-512 VNNI

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 18968297.94 (SE +/- 19220.66, N = 3)
Linux 6.9 27 Mar: 19594674.67 (SE +/- 25357.28, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Context Switching

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 40075791.13 (SE +/- 130933.36, N = 3)
Linux 6.9 27 Mar: 41854298.68 (SE +/- 110197.66, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: System V Message Passing

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
Linux 6.8: 25534145.87 (SE +/- 770160.74, N = 12)
Linux 6.9 27 Mar: 28384870.15 (SE +/- 9139.64, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
Linux 6.8: 11218.12 (SE +/- 5.76, N = 3)
Linux 6.9 27 Mar: 11719.99 (SE +/- 24.42, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
Linux 6.8: 4.26 (SE +/- 0.00, N = 3; MIN: 3.76 / MAX: 20.9)
Linux 6.9 27 Mar: 4.08 (SE +/- 0.01, N = 3; MIN: 3.64 / MAX: 20.85)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
Linux 6.8: 3864.65 (SE +/- 1.98, N = 3)
Linux 6.9 27 Mar: 4063.11 (SE +/- 3.68, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
Linux 6.8: 12.40 (SE +/- 0.01, N = 3; MIN: 11.56 / MAX: 38.83)
Linux 6.9 27 Mar: 11.80 (SE +/- 0.01, N = 3; MIN: 11.09 / MAX: 37.4)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
Linux 6.8: 1086.48 (SE +/- 1.18, N = 3)
Linux 6.9 27 Mar: 1119.56 (SE +/- 0.57, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
Linux 6.8: 44.13 (SE +/- 0.05, N = 3; MIN: 36.04 / MAX: 216.03)
Linux 6.9 27 Mar: 42.82 (SE +/- 0.02, N = 3; MIN: 34.68 / MAX: 192.44)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

RocksDB

Test: Overwrite

RocksDB 9.0 - Op/s, More Is Better
Linux 6.8: 381523 (SE +/- 180.51, N = 3)
Linux 6.9 27 Mar: 416860 (SE +/- 1320.84, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Random Read

RocksDB 9.0 - Op/s, More Is Better
Linux 6.8: 1047290863 (SE +/- 4458137.36, N = 3)
Linux 6.9 27 Mar: 1096081951 (SE +/- 2941599.90, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Update Random

RocksDB 9.0 - Op/s, More Is Better
Linux 6.8: 375714 (SE +/- 379.66, N = 3)
Linux 6.9 27 Mar: 406258 (SE +/- 1155.95, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Read While Writing

RocksDB 9.0 - Op/s, More Is Better
Linux 6.8: 20665324 (SE +/- 155882.11, N = 3)
Linux 6.9 27 Mar: 22418140 (SE +/- 390827.28, N = 12)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Read Random Write Random

RocksDB 9.0 - Op/s, More Is Better
Linux 6.8: 3182572 (SE +/- 3761.31, N = 3)
Linux 6.9 27 Mar: 3484877 (SE +/- 3987.34, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Speedb

Test: Update Random

Speedb 2.7 - Op/s, More Is Better
Linux 6.8: 293746 (SE +/- 945.90, N = 3)
Linux 6.9 27 Mar: 313791 (SE +/- 2172.75, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Llamafile

Test: llava-v1.5-7b-q4 - Acceleration: CPU

Llamafile 0.6 - Tokens Per Second, More Is Better
Linux 6.8: 12.03 (SE +/- 0.42, N = 15)
Linux 6.9 27 Mar: 21.10 (SE +/- 0.49, N = 15)

Llamafile

Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU

Llamafile 0.6 - Tokens Per Second, More Is Better
Linux 6.8: 12.79 (SE +/- 0.04, N = 3)
Linux 6.9 27 Mar: 13.46 (SE +/- 0.04, N = 3)

Geometric Mean Of All Test Results

Result Composite - AMD EPYC Genoa-X Linux 6.9 Kernel Benchmarks

Geometric Mean, More Is Better
Linux 6.8: 27772.26
Linux 6.9 27 Mar: 30087.89
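
For readers who want per-test deltas alongside the composite, the following is a minimal Python sketch that turns the two result columns into speedup ratios and takes their geometric mean. The names `RESULTS` and `speedup` are hypothetical, the values are copied from the per-test results in this file, and "fewer is better" results (Hackbench seconds, OpenVINO ms) are inverted so that a ratio above 1.0 always favors Linux 6.9. This is an assumption-laden illustration, not necessarily how the Phoronix Test Suite derives its own composite figure.

```python
from statistics import geometric_mean

# (test, Linux 6.8, Linux 6.9 27 Mar, higher_is_better)
# Values copied from the results in this file; the tuple layout is illustrative.
RESULTS = [
    ("svt-av1: Preset 12 - Bosphorus 4K", 159.513, 164.308, True),
    ("hackbench: 32 - Process", 10.836, 10.425, False),
    ("stress-ng: NUMA", 646.34, 691.54, True),
    ("stress-ng: Semaphores", 264112681.09, 292541552.77, True),
    ("stress-ng: Vector Math", 1306706.19, 1349057.51, True),
    ("stress-ng: AVX-512 VNNI", 18968297.94, 19594674.67, True),
    ("stress-ng: Context Switching", 40075791.13, 41854298.68, True),
    ("stress-ng: System V Message Passing", 25534145.87, 28384870.15, True),
    ("openvino: Vehicle Detection FP16-INT8 (FPS)", 11218.12, 11719.99, True),
    ("openvino: Vehicle Detection FP16-INT8 (ms)", 4.26, 4.08, False),
    ("openvino: Road Segmentation ADAS FP16-INT8 (FPS)", 3864.65, 4063.11, True),
    ("openvino: Road Segmentation ADAS FP16-INT8 (ms)", 12.40, 11.80, False),
    ("openvino: Machine Translation EN To DE FP16 (FPS)", 1086.48, 1119.56, True),
    ("openvino: Machine Translation EN To DE FP16 (ms)", 44.13, 42.82, False),
    ("rocksdb: Overwrite", 381523, 416860, True),
    ("rocksdb: Rand Read", 1047290863, 1096081951, True),
    ("rocksdb: Update Rand", 375714, 406258, True),
    ("rocksdb: Read While Writing", 20665324, 22418140, True),
    ("rocksdb: Read Rand Write Rand", 3182572, 3484877, True),
    ("speedb: Update Rand", 293746, 313791, True),
    ("llamafile: llava-v1.5-7b-q4", 12.03, 21.10, True),
    ("llamafile: mistral-7b-instruct-v0.2.Q8_0", 12.79, 13.46, True),
]


def speedup(old: float, new: float, higher_is_better: bool) -> float:
    """Return a ratio where > 1.0 means the new kernel did better."""
    return new / old if higher_is_better else old / new


# Per-test ratio of Linux 6.9 relative to Linux 6.8.
ratios = {name: speedup(old, new, hib) for name, old, new, hib in RESULTS}

if __name__ == "__main__":
    for name, ratio in ratios.items():
        print(f"{name:52s} {100 * (ratio - 1):+6.1f}%")
    overall = geometric_mean(ratios.values())
    print(f"{'Geometric mean of ratios':52s} {100 * (overall - 1):+6.1f}%")
```

The geometric mean is used rather than the arithmetic mean so that one large swing, such as the llava-v1.5-7b-q4 result, does not dominate the summary as heavily.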


Phoronix Test Suite v10.8.5