linux-518-sched-core-amd-ryzen-imbalance-patch

AMD Ryzen 9 5950X 16-Core benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2202194-PTS-LINUX51892&grr&sro.

linux-518-sched-core-amd-ryzen-imbalance-patch: system details

Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3904 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 1000GB Sabrent Rocket 4.0 Plus
Graphics: llvmpipe
Audio: NVIDIA GA104 HD Audio
Monitor: ASUS MG28U
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 22.04
Kernel (Linux 5.17 Git): 5.17.0-051700rc4daily20220218-generic (x86_64)
Kernel (sched-core Git): 5.17.0-rc1-sched-core-phx (x86_64)
Desktop: GNOME Shell 41.3
Display Server: X Server 1.20.14
OpenGL: 4.5 Mesa 22.1.0-devel (git-9da3d71 2022-02-05 jammy-oibaf-ppa) (LLVM 13.0.0 256 bits)
Vulkan: 1.2.204
Compiler: GCC 11.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-O5cEXJ/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-O5cEXJ/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016
Python Details: Python 3.9.10
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

linux-518-sched-core-amd-ryzen-imbalance-patch: result overview

Test | Linux 5.17 Git | sched-core Git
lczero: Eigen | 682 | 680
lczero: BLAS | 718 | 681
blender: Barbershop - CPU-Only | 777.51 | 773.49
qe: AUSURF112 | 573.91 | 572.43
aom-av1: Speed 6 Realtime - Bosphorus 4K | 11.03 | 11.34
hpcg: | 4.24821 | 4.23549
openssl: SHA256 | 25840247707 | 25963568650
brl-cad: VGR Performance Metric | 259080 | 261146
gromacs: MPI CPU - water_GMX50_bare | 1.281 | 1.247
onnx: shufflenet-v2-10 - CPU | 22925 | 23004
stargate: 192000 - 512 | 3.102091 | 3.101217
sysbench: CPU | 90429.38 | 90605.16
stockfish: Total Time | 56328330 | 56460142
tachyon: Total Time | 79.3378 | 78.8535
blender: BMW27 - CPU-Only | 76.74 | 76.62
npb: EP.D | 1823.45 | 1829.08
aom-av1: Speed 8 Realtime - Bosphorus 4K | 45.27 | 45.36
graphics-magick: Rotate | 1055 | 1056
build-godot: Time To Compile | 68.796 | 69.337
tensorflow-lite: Inception V4 | 1335797 | 1334327
luxcorerender: Danish Mood - CPU | 2.93 | 3.10
tensorflow-lite: Inception ResNet V2 | 1201927 | 1200850
namd: ATPase Simulation - 327,506 Atoms | 1.10634 | 1.09615
luxcorerender: DLSC - CPU | 3.54 | 3.56
tensorflow-lite: SqueezeNet | 94209.9 | 94313.9
tensorflow-lite: Mobilenet Quant | 69679.9 | 69736.7
tensorflow-lite: NASNet Mobile | 102026 | 102763
tensorflow-lite: Mobilenet Float | 62398.6 | 62533.8
graphics-magick: Sharpen | 225 | 225
graphics-magick: Enhanced | 425 | 424
graphics-magick: Resizing | 1835 | 1829
graphics-magick: Noise-Gaussian | 438 | 437
graphics-magick: HWB Color Space | 1024 | 1034
graphics-magick: Swirl | 1075 | 1080
openssl: RSA4096 (verify/s) | 313385.8 | 313540.5
openssl: RSA4096 (sign/s) | 4780.2 | 4799.2
npb: IS.D | 622.78 | 623.89
stargate: 192000 - 1024 | 3.335339 | 3.330879
aom-av1: Speed 10 Realtime - Bosphorus 4K | 59.44 | 60.87
aom-av1: Speed 9 Realtime - Bosphorus 4K | 58.36 | 58.07
build-linux-kernel: defconfig | 49.094 | 49.194
compress-zstd: 19, Long Mode - Decompression Speed | 4186.5 | 4159.7
compress-zstd: 19, Long Mode - Compression Speed | 36.4 | 36.3
npb: SP.B | 7879.17 | 7876.93
embree: Pathtracer ISPC - Crown | 21.9574 | 22.0510
compress-zstd: 19 - Decompression Speed | 4168.7 | 4105.9
compress-zstd: 19 - Compression Speed | 51.1 | 50.9
stargate: 96000 - 512 | 4.693999 | 4.694527
astcenc: Exhaustive | 33.3440 | 33.1933
stargate: 96000 - 1024 | 4.897002 | 4.848712
avifenc: 6, Lossless | 31.959 | 31.549
build-mesa: Time To Compile | 30.381 | 30.572
compress-7zip: Decompression Rating | 134230 | 134635
compress-7zip: Compression Rating | 93914 | 92933
synthmark: VoiceMark_100 | 950.800 | 944.757
basis: UASTC Level 3 | 28.642 | 28.533
embree: Pathtracer ISPC - Asian Dragon | 22.9406 | 22.9995
coremark: CoreMark Size 666 - Iterations Per Second | 769998.973661 | 772603.654283
embree: Pathtracer - Crown | 23.8957 | 23.6964
embree: Pathtracer - Asian Dragon | 24.7468 | 24.6163
ospray: San Miguel - SciVis | 29.41 | 29.41
avifenc: 2 | 23.991 | 23.906
x265: Bosphorus 4K | 26.77 | 26.15
clomp: Static OMP Speedup | 20.5 | 20.1
kvazaar: Bosphorus 4K - Very Fast | 30.20 | 30.16
dav1d: Chimera 1080p 10-bit | 590.39 | 588.23
npb: MG.C | 9836.51 | 9880.60
dav1d: Summer Nature 4K | 239.79 | 238.98
webp: Quality 100, Lossless | 12.862 | 12.992
openjpeg: NASA Curiosity Panorama M34 | 53475 | 53198
svt-av1: Preset 8 - Bosphorus 4K | 49.943 | 49.787
kvazaar: Bosphorus 4K - Ultra Fast | 53.13 | 53.40
primesieve: 1e12 Prime Number Generation | 11.248 | 11.247
ospray: Magnetic Reconnection - SciVis | 21.74 | 21.74
avifenc: 6 | 9.066 | 9.052
svt-av1: Preset 10 - Bosphorus 4K | 93.121 | 93.166
x265: Bosphorus 1080p | 86.84 | 86.46
svt-hevc: 10 - Bosphorus 1080p | 364.10 | 364.50
lammps: Rhodopsin Protein | 12.526 | 12.639
npb: EP.C | 1811.13 | 1850.49
svt-av1: Preset 12 - Bosphorus 4K | 114.800 | 114.194
webp: Quality 100, Highest Compression | 5.342 | 5.350
x264: H.264 Video Encoding | 204.00 | 202.46
compress-pbzip2: FreeBSD-13.0-RELEASE-amd64-memstick.img Compression | 5.417 | 5.314
etcpak: DXT1 | 1546.791 | 1577.216
webp: Quality 100 | 1.724 | 1.719

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
Linux 5.17 Git: 682 (SE +/- 9.77, N = 9)
sched-core Git: 680 (SE +/- 7.23, N = 9)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
Linux 5.17 Git: 718 (SE +/- 9.07, N = 9)
sched-core Git: 681 (SE +/- 10.25, N = 9)
1. (CXX) g++ options: -flto -pthread

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 3.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 777.51 (SE +/- 0.14, N = 3)
sched-core Git: 773.49 (SE +/- 0.71, N = 3)

Quantum ESPRESSO

Input: AUSURF112

Quantum ESPRESSO 7.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 573.91 (SE +/- 0.95, N = 3)
sched-core Git: 572.43 (SE +/- 1.30, N = 3)
1. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.3 (Frames Per Second, More Is Better)
Linux 5.17 Git: 11.03 (SE +/- 0.13, N = 15)
sched-core Git: 11.34 (SE +/- 0.14, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
Linux 5.17 Git: 4.24821 (SE +/- 0.00122, N = 3)
sched-core Git: 4.23549 (SE +/- 0.00135, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenSSL

Algorithm: SHA256

OpenSSL 3.0 (byte/s, More Is Better)
Linux 5.17 Git: 25840247707 (SE +/- 16374725.02, N = 3)
sched-core Git: 25963568650 (SE +/- 51458203.55, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

BRL-CAD

VGR Performance Metric

BRL-CAD 7.32.2 (VGR Performance Metric, More Is Better)
Linux 5.17 Git: 259080
sched-core Git: 261146
1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -ldl -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2021.2 (Ns Per Day, More Is Better)
Linux 5.17 Git: 1.281 (SE +/- 0.003, N = 3)
sched-core Git: 1.247 (SE +/- 0.001, N = 3)
1. (CXX) g++ options: -O3

ONNX Runtime

Model: shufflenet-v2-10 - Device: CPU

ONNX Runtime 1.10 (Inferences Per Minute, More Is Better)
Linux 5.17 Git: 22925 (SE +/- 54.94, N = 3)
sched-core Git: 23004 (SE +/- 40.13, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 512

Stargate Digital Audio Workstation 21.10.9 (Render Ratio, More Is Better)
Linux 5.17 Git: 3.102091 (SE +/- 0.027371, N = 7)
sched-core Git: 3.101217 (SE +/- 0.023164, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
Linux 5.17 Git: 90429.38 (SE +/- 54.36, N = 3)
sched-core Git: 90605.16 (SE +/- 44.83, N = 3)
1. (CC) gcc options: -O2 -funroll-loops -rdynamic -ldl -laio -lm

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
Linux 5.17 Git: 56328330 (SE +/- 418294.34, N = 15)
sched-core Git: 56460142 (SE +/- 559448.47, N = 6)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

Tachyon

Total Time

Tachyon 0.99.2 (Seconds, Fewer Is Better)
Linux 5.17 Git: 79.34 (SE +/- 0.09, N = 3)
sched-core Git: 78.85 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 76.74 (SE +/- 0.17, N = 3)
sched-core Git: 76.62 (SE +/- 0.18, N = 3)

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.17 Git: 1823.45 (SE +/- 0.82, N = 3)
sched-core Git: 1829.08 (SE +/- 2.71, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.3 (Frames Per Second, More Is Better)
Linux 5.17 Git: 45.27 (SE +/- 0.36, N = 15)
sched-core Git: 45.36 (SE +/- 0.29, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 1055 (SE +/- 13.13, N = 4)
sched-core Git: 1056 (SE +/- 1.20, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
Linux 5.17 Git: 68.80 (SE +/- 0.14, N = 3)
sched-core Git: 69.34 (SE +/- 0.23, N = 3)

TensorFlow Lite

Model: Inception V4

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 1335797 (SE +/- 1055.47, N = 3)
sched-core Git: 1334327 (SE +/- 1295.59, N = 3)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
Linux 5.17 Git: 2.93 (SE +/- 0.01, N = 3; MIN: 1.06 / MAX: 3.42)
sched-core Git: 3.10 (SE +/- 0.03, N = 3; MIN: 1.28 / MAX: 3.55)

TensorFlow Lite

Model: Inception ResNet V2

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 1201927 (SE +/- 770.42, N = 3)
sched-core Git: 1200850 (SE +/- 378.99, N = 3)

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
Linux 5.17 Git: 1.10634 (SE +/- 0.00544, N = 3)
sched-core Git: 1.09615 (SE +/- 0.00281, N = 3)

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
Linux 5.17 Git: 3.54 (SE +/- 0.01, N = 3; MIN: 3.43 / MAX: 3.73)
sched-core Git: 3.56 (SE +/- 0.01, N = 3; MIN: 3.46 / MAX: 3.77)

TensorFlow Lite

Model: SqueezeNet

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 94209.9 (SE +/- 196.79, N = 3)
sched-core Git: 94313.9 (SE +/- 158.77, N = 3)

TensorFlow Lite

Model: Mobilenet Quant

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 69679.9 (SE +/- 98.88, N = 3)
sched-core Git: 69736.7 (SE +/- 70.99, N = 3)

TensorFlow Lite

Model: NASNet Mobile

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 102026 (SE +/- 299.04, N = 3)
sched-core Git: 102763 (SE +/- 146.64, N = 3)

TensorFlow Lite

Model: Mobilenet Float

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Linux 5.17 Git: 62398.6 (SE +/- 56.11, N = 3)
sched-core Git: 62533.8 (SE +/- 63.69, N = 3)

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 225 (SE +/- 0.58, N = 3)
sched-core Git: 225 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 425 (SE +/- 0.33, N = 3)
sched-core Git: 424 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 1835 (SE +/- 0.88, N = 3)
sched-core Git: 1829 (SE +/- 2.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 438 (SE +/- 0.58, N = 3)
sched-core Git: 437 (SE +/- 1.67, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 1024 (SE +/- 0.00, N = 3)
sched-core Git: 1034 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.17 Git: 1075 (SE +/- 0.58, N = 3)
sched-core Git: 1080 (SE +/- 1.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -lpthread

OpenSSL

Algorithm: RSA4096

OpenSSL 3.0 (verify/s, More Is Better)
Linux 5.17 Git: 313385.8 (SE +/- 62.26, N = 3)
sched-core Git: 313540.5 (SE +/- 193.77, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.0 (sign/s, More Is Better)
Linux 5.17 Git: 4780.2 (SE +/- 1.99, N = 3)
sched-core Git: 4799.2 (SE +/- 1.99, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.17 Git: 622.78 (SE +/- 1.21, N = 3)
sched-core Git: 623.89 (SE +/- 0.82, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

Stargate Digital Audio Workstation 21.10.9 (Render Ratio, More Is Better)
Linux 5.17 Git: 3.335339 (SE +/- 0.003007, N = 3)
sched-core Git: 3.330879 (SE +/- 0.011397, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

AOM AV1

Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K

AOM AV1 3.3 (Frames Per Second, More Is Better)
Linux 5.17 Git: 59.44 (SE +/- 1.37, N = 15)
sched-core Git: 60.87 (SE +/- 1.43, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.3 (Frames Per Second, More Is Better)
Linux 5.17 Git: 58.36 (SE +/- 0.88, N = 12)
sched-core Git: 58.07 (SE +/- 0.86, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 5.16 (Seconds, Fewer Is Better)
Linux 5.17 Git: 49.09 (SE +/- 0.06, N = 3)
sched-core Git: 49.19 (SE +/- 0.04, N = 3)

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
Linux 5.17 Git: 4186.5 (SE +/- 65.05, N = 3)
sched-core Git: 4159.7 (SE +/- 40.62, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
Linux 5.17 Git: 36.4 (SE +/- 0.00, N = 3)
sched-core Git: 36.3 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.17 Git: 7879.17 (SE +/- 6.13, N = 3)
sched-core Git: 7876.93 (SE +/- 3.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 3.13 (Frames Per Second, More Is Better)
Linux 5.17 Git: 21.96 (SE +/- 0.21, N = 6; MIN: 16.5 / MAX: 22.74)
sched-core Git: 22.05 (SE +/- 0.09, N = 3; MIN: 21.76 / MAX: 22.64)

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
Linux 5.17 Git: 4168.7 (SE +/- 32.27, N = 3)
sched-core Git: 4105.9 (SE +/- 19.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
Linux 5.17 Git: 51.1 (SE +/- 0.20, N = 3)
sched-core Git: 50.9 (SE +/- 0.26, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 512

Stargate Digital Audio Workstation 21.10.9 (Render Ratio, More Is Better)
Linux 5.17 Git: 4.693999 (SE +/- 0.010509, N = 3)
sched-core Git: 4.694527 (SE +/- 0.006775, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 3.2 (Seconds, Fewer Is Better)
Linux 5.17 Git: 33.34 (SE +/- 0.01, N = 3)
sched-core Git: 33.19 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

Stargate Digital Audio Workstation 21.10.9 (Render Ratio, More Is Better)
Linux 5.17 Git: 4.897002 (SE +/- 0.015880, N = 3)
sched-core Git: 4.848712 (SE +/- 0.039160, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 31.96 (SE +/- 0.08, N = 3)
sched-core Git: 31.55 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 30.38 (SE +/- 0.03, N = 3)
sched-core Git: 30.57 (SE +/- 0.04, N = 3)

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 21.06 (MIPS, More Is Better)
Linux 5.17 Git: 134230 (SE +/- 788.43, N = 3)
sched-core Git: 134635 (SE +/- 377.57, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

7-Zip Compression 21.06 (MIPS, More Is Better)
Linux 5.17 Git: 93914 (SE +/- 58.26, N = 3)
sched-core Git: 92933 (SE +/- 314.18, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109 (Voices, More Is Better)
Linux 5.17 Git: 950.80 (SE +/- 1.60, N = 3)
sched-core Git: 944.76 (SE +/- 4.79, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 (Seconds, Fewer Is Better)
Linux 5.17 Git: 28.64 (SE +/- 0.02, N = 3)
sched-core Git: 28.53 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 3.13 (Frames Per Second, More Is Better)
Linux 5.17 Git: 22.94 (SE +/- 0.09, N = 3; MIN: 22.64 / MAX: 23.37)
sched-core Git: 23.00 (SE +/- 0.02, N = 3; MIN: 22.86 / MAX: 23.48)

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
Linux 5.17 Git: 769998.97 (SE +/- 2600.56, N = 3)
sched-core Git: 772603.65 (SE +/- 1894.94, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

Embree

Binary: Pathtracer - Model: Crown

Embree 3.13 (Frames Per Second, More Is Better)
Linux 5.17 Git: 23.90 (SE +/- 0.08, N = 3; MIN: 23.56 / MAX: 24.55)
sched-core Git: 23.70 (SE +/- 0.04, N = 3; MIN: 23.38 / MAX: 24.28)

Embree

Binary: Pathtracer - Model: Asian Dragon

Embree 3.13 (Frames Per Second, More Is Better)
Linux 5.17 Git: 24.75 (SE +/- 0.03, N = 3; MIN: 24.57 / MAX: 25.31)
sched-core Git: 24.62 (SE +/- 0.13, N = 3; MIN: 24.36 / MAX: 25.32)

OSPray

Demo: San Miguel - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
Linux 5.17 Git: 29.41 (SE +/- 0.00, N = 3; MIN: 28.57 / MAX: 31.25)
sched-core Git: 29.41 (SE +/- 0.00, N = 3; MIN: 27.78 / MAX: 31.25)

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 23.99 (SE +/- 0.11, N = 3)
sched-core Git: 23.91 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
Linux 5.17 Git: 26.77 (SE +/- 0.05, N = 3)
sched-core Git: 26.15 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
Linux 5.17 Git: 20.5 (SE +/- 0.15, N = 3)
sched-core Git: 20.1 (SE +/- 0.27, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.1 (Frames Per Second, More Is Better)
Linux 5.17 Git: 30.20 (SE +/- 0.03, N = 3)
sched-core Git: 30.16 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.9.2 (FPS, More Is Better)
Linux 5.17 Git: 590.39 (SE +/- 0.55, N = 3; MIN: 486.11 / MAX: 723.97)
sched-core Git: 588.23 (SE +/- 0.38, N = 3; MIN: 488.46 / MAX: 714.79)
1. (CC) gcc options: -pthread -lm

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.17 Git: 9836.51 (SE +/- 4.48, N = 3)
sched-core Git: 9880.60 (SE +/- 1.65, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

dav1d

Video Input: Summer Nature 4K

dav1d 0.9.2 (FPS, More Is Better)
Linux 5.17 Git: 239.79 (SE +/- 0.17, N = 3; MIN: 181.39 / MAX: 247.94)
sched-core Git: 238.98 (SE +/- 0.12, N = 3; MIN: 179.5 / MAX: 246.87)
1. (CC) gcc options: -pthread -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
Linux 5.17 Git: 12.86 (SE +/- 0.07, N = 3)
sched-core Git: 12.99 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16 -ltiff

OpenJPEG

Encode: NASA Curiosity Panorama M34

OpenJPEG 2.4 (ms, Fewer Is Better)
Linux 5.17 Git: 53475 (SE +/- 499.18, N = 3)
sched-core Git: 53198 (SE +/- 395.71, N = 14)
1. (CXX) g++ options: -rdynamic

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 0.9 (Frames Per Second, More Is Better)
Linux 5.17 Git: 49.94 (SE +/- 0.56, N = 3)
sched-core Git: 49.79 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.1 (Frames Per Second, More Is Better)
Linux 5.17 Git: 53.13 (SE +/- 0.18, N = 3)
sched-core Git: 53.40 (SE +/- 0.15, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Primesieve

1e12 Prime Number Generation

Primesieve 7.7 (Seconds, Fewer Is Better)
Linux 5.17 Git: 11.25 (SE +/- 0.03, N = 3)
sched-core Git: 11.25 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

OSPray

Demo: Magnetic Reconnection - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
Linux 5.17 Git: 21.74 (SE +/- 0.00, N = 3; MIN: 21.28 / MAX: 22.22)
sched-core Git: 21.74 (SE +/- 0.00, N = 3; MIN: 20.41 / MAX: 22.22)

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Linux 5.17 Git: 9.066 (SE +/- 0.034, N = 3)
sched-core Git: 9.052 (SE +/- 0.024, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

SVT-AV1 0.9 (Frames Per Second, More Is Better)
Linux 5.17 Git: 93.12 (SE +/- 0.44, N = 3)
sched-core Git: 93.17 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
Linux 5.17 Git: 86.84 (SE +/- 0.06, N = 3)
sched-core Git: 86.46 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Linux 5.17 Git: 364.10 (SE +/- 2.90, N = 12)
sched-core Git: 364.50 (SE +/- 2.98, N = 9)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
Linux 5.17 Git: 12.53 (SE +/- 0.18, N = 3)
sched-core Git: 12.64 (SE +/- 0.10, N = 15)
1. (CXX) g++ options: -O3 -lm

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.17 Git: 1811.13 (SE +/- 21.44, N = 4)
sched-core Git: 1850.49 (SE +/- 3.42, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 0.9 (Frames Per Second, More Is Better)
Linux 5.17 Git: 114.80 (SE +/- 0.82, N = 3)
sched-core Git: 114.19 (SE +/- 1.06, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
Linux 5.17 Git: 5.342 (SE +/- 0.035, N = 3)
sched-core Git: 5.350 (SE +/- 0.053, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16 -ltiff

x264

H.264 Video Encoding

x264 2019-12-17 (Frames Per Second, More Is Better)
Linux 5.17 Git: 204.00 (SE +/- 2.22, N = 3)
sched-core Git: 202.46 (SE +/- 1.70, N = 8)
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Parallel BZIP2 Compression

FreeBSD-13.0-RELEASE-amd64-memstick.img Compression

Parallel BZIP2 Compression 1.1.13 (Seconds, Fewer Is Better)
Linux 5.17 Git: 5.417 (SE +/- 0.072, N = 3)
sched-core Git: 5.314 (SE +/- 0.047, N = 3)
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

Etcpak

Configuration: DXT1

Etcpak 0.7 (Mpx/s, More Is Better)
Linux 5.17 Git: 1546.79 (SE +/- 0.51, N = 3)
sched-core Git: 1577.22 (SE +/- 17.94, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP Image Encode

Encode Settings: Quality 100

WebP Image Encode 1.1 (Encode Time - Seconds, Fewer Is Better)
Linux 5.17 Git: 1.724 (SE +/- 0.014, N = 3)
sched-core Git: 1.719 (SE +/- 0.015, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16 -ltiff
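
Each result above reports a mean together with a standard error (SE) and a run count (N) for both kernels. One rough way to read a pair of results is to express the sched-core Git figure as a percentage change against Linux 5.17 Git and to compare the gap with the combined standard error. The sketch below is only an illustration: the function name `compare` and the choice to combine the two SE values in quadrature are assumptions made here, not something the Phoronix Test Suite is stated to do.

```python
import math


def compare(baseline, patched, se_baseline, se_patched, more_is_better=True):
    """Return (percent_change, se_ratio) for one benchmark pair.

    percent_change is positive when the patched kernel is better.
    se_ratio is the absolute gap divided by the two standard errors
    combined in quadrature (an assumption of this sketch).
    """
    diff = patched - baseline
    change = diff / baseline * 100.0
    if not more_is_better:
        # For "Fewer Is Better" tests a lower value is an improvement.
        change = -change
    combined_se = math.hypot(se_baseline, se_patched)
    if combined_se == 0:
        se_ratio = 0.0 if diff == 0 else math.inf
    else:
        se_ratio = abs(diff) / combined_se
    return change, se_ratio


# LeelaChessZero, Backend: Eigen (Nodes Per Second, More Is Better)
pct_eigen, ratio_eigen = compare(682, 680, 9.77, 7.23)
# Blender, Barbershop (Seconds, Fewer Is Better)
pct_blender, ratio_blender = compare(777.51, 773.49, 0.14, 0.71,
                                     more_is_better=False)

print(f"lczero Eigen: {pct_eigen:+.2f}%, gap = {ratio_eigen:.2f} x combined SE")
print(f"Blender Barbershop: {pct_blender:+.2f}%, gap = {ratio_blender:.2f} x combined SE")
```

A gap well below one combined SE, as in the LeelaChessZero Eigen pair, is hard to distinguish from run-to-run noise, while a gap of several combined SEs is more likely to reflect a real difference between the two kernels.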


Phoronix Test Suite v10.8.4