Threadripper PRO 5965WX Memory Channels

AMD Ryzen Threadripper PRO 5965WX Linux memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2208077-NE-TRPRO500044&grt&sro.

Threadripper PRO 5965WX Memory ChannelsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution8c6c4cAMD Ryzen Threadripper PRO 5965WX 24-Cores @ 3.80GHz (24 Cores / 48 Threads)ASUS Pro WS WRX80E-SAGE SE WIFI (1003 BIOS)AMD Starship/Matisse128GB1000GB Western Digital WDS100T1X0E-00AFY0ASUS NVIDIA NV106 2GBAMD Starship/MatisseVA24312 x Intel 10G X550T + Intel Wi-Fi 6 AX200Ubuntu 22.045.19.0-051900rc7-generic (x86_64)GNOME Shell 42.2X Server 1.21.1.3 + Waylandnouveau4.3 Mesa 22.0.11.2.204GCC 11.2.0ext41920x108096GB64GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa008203Java Details- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Threadripper PRO 5965WX Memory Channelscompress-7zip: Compression Ratingamg: astcenc: Fastblender: Classroom - CPU-Onlyblender: Barbershop - CPU-Onlyclickhouse: 100M Rows Web Analytics Dataset, First Run / Cold Cacheclickhouse: 100M Rows Web Analytics Dataset, Second Runclickhouse: 100M Rows Web Analytics Dataset, Third Rungraph500: 26graph500: 26graph500: 26graph500: 26gromacs: MPI CPU - water_GMX50_barehpcg: oidn: RT.hdr_alb_nrm.3840x2160oidn: RT.ldr_alb_nrm.3840x2160oidn: RTLightmap.hdr.4096x4096lulesh: namd: ATPase Simulation - 327,506 Atomsnpb: BT.Cnpb: CG.Cnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Cnwchem: C240 Buckyballopenfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Large Mesh Size - Mesh Timeopenfoam: drivaerFastback, Large Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalarospray-studio: 1 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracerospray-studio: 3 - 4K - 16 - Path Tracerospray-studio: 3 - 4K - 32 - Path Tracerospray-studio: 1 - 1080p - 1 - Path Tracerospray-studio: 2 - 1080p - 1 - Path Tracerospray-studio: 3 - 1080p - 1 - Path Tracerospray-studio: 1 - 1080p - 16 - Path Tracerospray-studio: 1 - 1080p - 32 - Path Tracerospray-studio: 2 - 1080p - 16 - Path Tracerospray-studio: 2 - 1080p - 32 - Path Tracerospray-studio: 3 - 1080p - 16 - Path Tracerospray-studio: 3 - 1080p - 32 - Path Tracerpennant: sedovbigpennant: leblancbigqe: AUSURF112stress-ng: Memory Copyingsvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 4Ksvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 4Ksvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 4Ksvt-vp9: Visual Quality Optimized - Bosphorus 1080pbuild-gcc: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-mplayer: Time To Compilewrf: conus 2.5kmincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Direction8c6c4c197316653248167455.5963130.04524.21287.86301.89315.993420140003480300001232430001643760003.48412.58911.081.080.539045.53600.6496062499.8015087.8438325.141560.3171618.7537047.6835607.3524929.103763.664.471088.1425819.45158.791313.36159.411316.02945540234131479668843134477704531362208127615820610101032120716201326221661533196193843863919.7680212.27072331.208094.03151.792189.506190.811104.81188.94586.83131.29466.67141.36471.58120.86388.72635.32435.830385.973227.596263.19812.92626021.0669.0714536041.6351471200943207473100462.2226131.41517.75306.59314.96324.023481080003540960001223040001601190003.2613.963871.011.010.509285.92540.7709327617.567330.5912166.64605.5331363.0811213.5911216.237198.483915.370.731181.1830373.88175.211686.71176.361663.197573952404247226795913172169199133757798361554969941010118415970318861620732476190073802365.0741040.28895344.758228.36160.112200.146194.706110.19194.61644.73124.77470.21136.95472.54123.85397.23617.89934.811375.094219.972255.57612.67930933.31622.879023293.6710790140026184538833451.3173130.00522.56238.02252.63256.13321128000324305000998528001308230002.4183.562930.880.890.447527.83430.8246724943.595404.7911378.66566.1730214.2510403.4810930.117090.304062.671.711638.8251240.35219.742429.98220.842469.98875340484139480668984134157707021368848142915858610111035120516196324791660533252193783861366.4106141.30835397.694210.74135.763165.358186.33397.50139.28524.9583.31346.3988.92353.8689.86323.03637.79838.575419.634254.606286.69713.43818.044858982.5536499OpenBenchmarking.org

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.00Test: Compression Rating4c6c8c40K80K120K160K200KSE +/- 124.84, N = 3SE +/- 524.95, N = 3SE +/- 504.48, N = 31400262009431973161. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.24c6c8c140M280M420M560M700MSE +/- 64488.56, N = 3SE +/- 215404.98, N = 3SE +/- 656104.93, N = 31845388332074731006532481671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Fast4c6c8c100200300400500SE +/- 0.69, N = 3SE +/- 0.14, N = 3SE +/- 1.23, N = 3451.32462.22455.601. (CXX) g++ options: -O3 -flto -pthread

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Classroom - Compute: CPU-Only4c6c8c306090120150SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 3130.00131.41130.04

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Barbershop - Compute: CPU-Only4c6c8c110220330440550SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.49, N = 3522.56517.75524.21

ClickHouse

100M Rows Web Analytics Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, First Run / Cold Cache4c6c8c70140210280350SE +/- 1.87, N = 10SE +/- 2.92, N = 15SE +/- 3.88, N = 3238.02306.59287.86MIN: 14.87 / MAX: 20000MIN: 23.87 / MAX: 30000MIN: 27.05 / MAX: 200001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Second Run4c6c8c70140210280350SE +/- 1.82, N = 10SE +/- 2.56, N = 15SE +/- 2.39, N = 3252.63314.96301.89MIN: 14.7 / MAX: 20000MIN: 24.41 / MAX: 30000MIN: 26.97 / MAX: 150001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Third Run4c6c8c70140210280350SE +/- 1.78, N = 10SE +/- 1.23, N = 15SE +/- 0.64, N = 3256.13324.02315.99MIN: 14.67 / MAX: 30000MIN: 24.02 / MAX: 30000MIN: 27.08 / MAX: 300001. ClickHouse server version 22.7.1.687 (official build).

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c70M140M210M280M350M3211280003481080003420140001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c80M160M240M320M400M3243050003540960003480300001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c30M60M90M120M150M998528001223040001232430001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c40M80M120M160M200M1308230001601190001643760001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare4c6c8c0.78391.56782.35173.13563.9195SE +/- 0.002, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 32.4183.2613.4841. (CXX) g++ options: -O3

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.14c6c8c3691215SE +/- 0.00250, N = 3SE +/- 0.01229, N = 3SE +/- 0.00191, N = 33.562933.9638712.589101. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x21604c6c8c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.881.011.08

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x21604c6c8c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.891.011.08

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x40964c6c8c0.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.440.500.53

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.34c6c8c2K4K6K8K10KSE +/- 83.82, N = 3SE +/- 20.87, N = 3SE +/- 67.93, N = 37527.839285.939045.541. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms4c6c8c0.18560.37120.55680.74240.928SE +/- 0.00160, N = 3SE +/- 0.00232, N = 3SE +/- 0.00063, N = 30.824670.770930.64960

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C4c6c8c13K26K39K52K65KSE +/- 66.33, N = 3SE +/- 70.37, N = 3SE +/- 59.44, N = 324943.5927617.5662499.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C4c6c8c3K6K9K12K15KSE +/- 17.52, N = 3SE +/- 78.48, N = 3SE +/- 125.06, N = 35404.797330.5915087.841. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C4c6c8c8K16K24K32K40KSE +/- 9.24, N = 3SE +/- 41.39, N = 3SE +/- 18.86, N = 311378.6612166.6438325.141. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D4c6c8c30060090012001500SE +/- 7.03, N = 4SE +/- 5.86, N = 3SE +/- 3.35, N = 3566.17605.531560.311. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C4c6c8c15K30K45K60K75KSE +/- 149.22, N = 3SE +/- 93.08, N = 3SE +/- 63.92, N = 330214.2531363.0871618.751. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C4c6c8c8K16K24K32K40KSE +/- 2.83, N = 3SE +/- 4.70, N = 3SE +/- 10.53, N = 310403.4811213.5937047.681. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B4c6c8c8K16K24K32K40KSE +/- 83.71, N = 3SE +/- 53.26, N = 3SE +/- 39.29, N = 310930.1111216.2335607.351. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C4c6c8c5K10K15K20K25KSE +/- 46.80, N = 3SE +/- 29.58, N = 3SE +/- 22.67, N = 37090.307198.4824929.101. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball4c6c8c90018002700360045004062.63915.33763.61. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: motorBike - Execution Time4c6c8c163248648071.7170.7364.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Mesh Time4c6c8c4008001200160020001638.821181.181088.141. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Execution Time4c6c8c11K22K33K44K55K51240.3530373.8825819.451. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Mesh Time4c6c8c50100150200250219.74175.21158.791. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Execution Time4c6c8c50010001500200025002429.981686.711313.361. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Mesh Time4c6c8c50100150200250220.84176.36159.411. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Execution Time4c6c8c50010001500200025002469.981663.101316.021. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPC4c6c8c20406080100SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3879794MIN: 7 / MAX: 1883MIN: 9 / MAX: 2123MIN: 8 / MAX: 2044

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark Scalar4c6c8c1326395265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3535755MIN: 4 / MAX: 1606MIN: 5 / MAX: 1779MIN: 4 / MAX: 1689

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c9001800270036004500SE +/- 5.69, N = 3SE +/- 5.20, N = 3SE +/- 4.81, N = 34048395240231. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c9001800270036004500SE +/- 5.51, N = 3SE +/- 3.33, N = 3SE +/- 3.93, N = 34139404241311. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c10002000300040005000SE +/- 7.69, N = 3SE +/- 7.54, N = 3SE +/- 3.53, N = 34806472247961. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c15K30K45K60K75KSE +/- 84.76, N = 3SE +/- 17.13, N = 3SE +/- 47.36, N = 36898467959688431. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 221.08, N = 3SE +/- 225.43, N = 3SE +/- 391.66, N = 31341571317211344771. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c15K30K45K60K75KSE +/- 151.58, N = 3SE +/- 121.88, N = 3SE +/- 30.48, N = 37070269199704531. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 280.26, N = 3SE +/- 392.43, N = 3SE +/- 171.91, N = 31368841337571362201. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c20K40K60K80K100KSE +/- 125.68, N = 3SE +/- 43.68, N = 3SE +/- 101.29, N = 38142979836812761. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 123.38, N = 3SE +/- 172.12, N = 3SE +/- 112.13, N = 31585861554961582061. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c2004006008001000SE +/- 1.45, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 3101199410101. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c2004006008001000SE +/- 1.33, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 31035101010321. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c30060090012001500SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 31205118412071. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c3K6K9K12K15KSE +/- 25.83, N = 3SE +/- 15.34, N = 3SE +/- 22.61, N = 31619615970162011. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c7K14K21K28K35KSE +/- 38.85, N = 3SE +/- 93.71, N = 3SE +/- 34.89, N = 33247931886326221. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c4K8K12K16K20KSE +/- 4.48, N = 3SE +/- 27.87, N = 3SE +/- 22.28, N = 31660516207166151. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c7K14K21K28K35KSE +/- 43.67, N = 3SE +/- 54.34, N = 3SE +/- 60.40, N = 33325232476331961. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c4K8K12K16K20KSE +/- 66.58, N = 3SE +/- 0.58, N = 3SE +/- 47.01, N = 31937819007193841. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c8K16K24K32K40KSE +/- 29.84, N = 3SE +/- 33.12, N = 3SE +/- 98.10, N = 33861338023386391. (CXX) g++ options: -O3 -ldl

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig4c6c8c1530456075SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 366.4165.0719.771. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig4c6c8c918273645SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 341.3140.2912.271. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF1124c6c8c90180270360450SE +/- 0.56, N = 3SE +/- 0.69, N = 3SE +/- 1.56, N = 3397.69344.75331.201. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copying4c6c8c2K4K6K8K10KSE +/- 34.95, N = 3SE +/- 14.52, N = 3SE +/- 19.40, N = 34210.748228.368094.031. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 10 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 1.07, N = 3SE +/- 0.32, N = 3SE +/- 0.04, N = 3135.76160.11151.791. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 12 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 0.43, N = 3SE +/- 1.06, N = 3SE +/- 0.38, N = 3165.36200.15189.511. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 8 - Input: Bosphorus 1080p4c6c8c4080120160200SE +/- 1.47, N = 3SE +/- 0.35, N = 3SE +/- 0.17, N = 3186.33194.71190.811. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4K4c6c8c20406080100SE +/- 0.67, N = 3SE +/- 0.21, N = 3SE +/- 0.62, N = 397.50110.19104.811. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 0.77, N = 3SE +/- 1.08, N = 3SE +/- 0.81, N = 3139.28194.61188.941. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p4c6c8c140280420560700SE +/- 1.66, N = 3SE +/- 3.02, N = 3SE +/- 6.13, N = 3524.95644.73586.831. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.74, N = 14SE +/- 1.64, N = 14SE +/- 1.94, N = 1283.31124.77131.291. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p4c6c8c100200300400500SE +/- 3.25, N = 13SE +/- 4.72, N = 12SE +/- 6.26, N = 12346.39470.21466.671. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.36, N = 3SE +/- 0.85, N = 3SE +/- 0.30, N = 388.92136.95141.361. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p4c6c8c100200300400500SE +/- 1.37, N = 3SE +/- 1.32, N = 3SE +/- 3.84, N = 3353.86472.54471.581. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.28, N = 3SE +/- 0.32, N = 3SE +/- 0.28, N = 389.86123.85120.861. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p4c6c8c90180270360450SE +/- 1.28, N = 3SE +/- 1.45, N = 3SE +/- 2.06, N = 3323.03397.23388.721. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 11.2.0Time To Compile4c6c8c140280420560700SE +/- 1.13, N = 3SE +/- 0.47, N = 3SE +/- 0.79, N = 3637.80617.90635.32

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: defconfig4c6c8c918273645SE +/- 0.37, N = 3SE +/- 0.34, N = 3SE +/- 0.37, N = 338.5834.8135.83

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: allmodconfig4c6c8c90180270360450SE +/- 0.55, N = 3SE +/- 0.20, N = 3SE +/- 0.08, N = 3419.63375.09385.97

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninja4c6c8c60120180240300SE +/- 0.88, N = 3SE +/- 0.07, N = 3SE +/- 0.72, N = 3254.61219.97227.60

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix Makefiles4c6c8c60120180240300SE +/- 2.73, N = 9SE +/- 2.39, N = 3SE +/- 2.27, N = 3286.70255.58263.20

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile4c6c8c3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 313.4412.6812.93

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km6c8c7K14K21K28K35K30933.3226021.071. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction4c6c8c510152025SE +/- 0.22101413, N = 3SE +/- 0.32246994, N = 3SE +/- 0.06683370, N = 1518.0448589022.879023209.071453601. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction4c6c8c20406080100SE +/- 0.56, N = 3SE +/- 0.58, N = 3SE +/- 0.39, N = 382.5593.6741.641. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz


Phoronix Test Suite v10.8.5