Threadripper PRO 5965WX Memory Channels

AMD Ryzen Threadripper PRO 5965WX Linux memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2208077-NE-TRPRO500044&grr&sro.

Threadripper PRO 5965WX Memory ChannelsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution8c6c4cAMD Ryzen Threadripper PRO 5965WX 24-Cores @ 3.80GHz (24 Cores / 48 Threads)ASUS Pro WS WRX80E-SAGE SE WIFI (1003 BIOS)AMD Starship/Matisse128GB1000GB Western Digital WDS100T1X0E-00AFY0ASUS NVIDIA NV106 2GBAMD Starship/MatisseVA24312 x Intel 10G X550T + Intel Wi-Fi 6 AX200Ubuntu 22.045.19.0-051900rc7-generic (x86_64)GNOME Shell 42.2X Server 1.21.1.3 + Waylandnouveau4.3 Mesa 22.0.11.2.204GCC 11.2.0ext41920x108096GB64GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa008203Java Details- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Threadripper PRO 5965WX Memory Channelsopenfoam: drivaerFastback, Large Mesh Size - Execution Timeopenfoam: drivaerFastback, Large Mesh Size - Mesh Timewrf: conus 2.5kmnwchem: C240 Buckyballopenvkl: vklBenchmark Scalaropenvkl: vklBenchmark ISPCgraph500: 26graph500: 26graph500: 26graph500: 26openfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timebuild-gcc: Time To Compileblender: Barbershop - CPU-Onlybuild-llvm: Unix Makefilesbuild-linux-kernel: allmodconfighpcg: qe: AUSURF112build-llvm: Ninjaclickhouse: 100M Rows Web Analytics Dataset, Third Runclickhouse: 100M Rows Web Analytics Dataset, Second Runclickhouse: 100M Rows Web Analytics Dataset, First Run / Cold Cacheospray-studio: 3 - 4K - 32 - Path Tracernpb: SP.Cospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracerospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 1 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerospray-studio: 3 - 1080p - 16 - Path Tracerblender: Classroom - CPU-Onlyospray-studio: 3 - 1080p - 32 - Path Tracerospray-studio: 2 - 1080p - 32 - Path Tracerospray-studio: 3 - 1080p - 1 - Path Tracerospray-studio: 1 - 1080p - 32 - Path Tracerospray-studio: 2 - 1080p - 1 - Path Tracerospray-studio: 1 - 1080p - 1 - Path Tracerospray-studio: 2 - 1080p - 16 - Path Tracerospray-studio: 1 - 1080p - 16 - Path Tracernpb: BT.Cospray-studio: 3 - 4K - 16 - Path Tracerospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracerincompact3d: input.i3d 193 Cells Per Directiongromacs: MPI CPU - water_GMX50_bareoidn: RTLightmap.hdr.4096x4096npb: LU.Cnpb: IS.Damg: pennant: sedovbignamd: ATPase Simulation - 327,506 Atomsbuild-linux-kernel: defconfigopenfoam: motorBike - Execution Timepennant: leblancbigsvt-vp9: VMAF Optimized - Bosphorus 4Koidn: RT.hdr_alb_nrm.3840x2160oidn: RT.ldr_alb_nrm.3840x2160stress-ng: Memory Copyingincompact3d: input.i3d 129 Cells Per Directionnpb: FT.Ccompress-7zip: Compression Ratingnpb: SP.Bnpb: CG.Cbuild-mplayer: Time To Compilenpb: MG.Castcenc: Fastsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 4Klulesh: svt-vp9: PSNR/SSIM Optimized - Bosphorus 4Ksvt-hevc: 7 - Bosphorus 4Ksvt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080p8c6c4c25819.451088.1426021.0663763.655941643760001232430003480300003420140001316.02159.411313.36158.79635.324524.21263.198385.97312.5891331.20227.596315.99301.89287.8615820624929.1047961362201344774023413119384130.04386393319612073262210321010166151620162499.8081276704536884341.63514713.4840.5371618.751560.3165324816719.768020.6496035.83064.4712.27072131.291.081.088094.039.0714536038325.1419731635607.3515087.8412.92637047.68455.5963466.67120.869045.5360141.36104.81151.792189.506188.94190.811388.72471.58586.8330373.881181.1830933.3163915.357971601190001223040003540960003481080001663.1176.361686.71175.21617.899517.75255.576375.0943.96387344.75219.972324.02314.96306.591554967198.4847221337571317213952404219007131.4138023324761184318861010994162071597027617.5679836691996795993.67107903.2610.5031363.08605.5320747310065.074100.7709334.81170.7340.28895124.771.011.018228.3622.879023212166.6420094311216.237330.5912.67911213.59462.2226470.21123.859285.9254136.95110.19160.112200.146194.61194.706397.23472.54644.7351240.351638.824062.65387130823000998528003243050003211280002469.98220.842429.98219.74637.798522.56286.697419.6343.56293397.69254.606256.13252.63238.021585867090.3048061368841341574048413919378130.00386133325212053247910351011166051619624943.5981429707026898482.55364992.4180.4430214.25566.1718453883366.410610.8246738.57571.7141.3083583.310.880.894210.7418.044858911378.6614002610930.115404.7913.43810403.48451.3173346.3989.867527.834388.9297.50135.763165.358139.28186.333323.03353.86524.95OpenBenchmarking.org

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Execution Time4c6c8c11K22K33K44K55K51240.3530373.8825819.451. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Mesh Time4c6c8c4008001200160020001638.821181.181088.141. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km6c8c7K14K21K28K35K30933.3226021.071. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball4c6c8c90018002700360045004062.63915.33763.61. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark Scalar4c6c8c1326395265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3535755MIN: 4 / MAX: 1606MIN: 5 / MAX: 1779MIN: 4 / MAX: 1689

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPC4c6c8c20406080100SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3879794MIN: 7 / MAX: 1883MIN: 9 / MAX: 2123MIN: 8 / MAX: 2044

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c40M80M120M160M200M1308230001601190001643760001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c30M60M90M120M150M998528001223040001232430001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c80M160M240M320M400M3243050003540960003480300001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 264c6c8c70M140M210M280M350M3211280003481080003420140001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Execution Time4c6c8c50010001500200025002469.981663.101316.021. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Mesh Time4c6c8c50100150200250220.84176.36159.411. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Execution Time4c6c8c50010001500200025002429.981686.711313.361. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Mesh Time4c6c8c50100150200250219.74175.21158.791. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 11.2.0Time To Compile4c6c8c140280420560700SE +/- 1.13, N = 3SE +/- 0.47, N = 3SE +/- 0.79, N = 3637.80617.90635.32

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Barbershop - Compute: CPU-Only4c6c8c110220330440550SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.49, N = 3522.56517.75524.21

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix Makefiles4c6c8c60120180240300SE +/- 2.73, N = 9SE +/- 2.39, N = 3SE +/- 2.27, N = 3286.70255.58263.20

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: allmodconfig4c6c8c90180270360450SE +/- 0.55, N = 3SE +/- 0.20, N = 3SE +/- 0.08, N = 3419.63375.09385.97

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.14c6c8c3691215SE +/- 0.00250, N = 3SE +/- 0.01229, N = 3SE +/- 0.00191, N = 33.562933.9638712.589101. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF1124c6c8c90180270360450SE +/- 0.56, N = 3SE +/- 0.69, N = 3SE +/- 1.56, N = 3397.69344.75331.201. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninja4c6c8c60120180240300SE +/- 0.88, N = 3SE +/- 0.07, N = 3SE +/- 0.72, N = 3254.61219.97227.60

ClickHouse

100M Rows Web Analytics Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Third Run4c6c8c70140210280350SE +/- 1.78, N = 10SE +/- 1.23, N = 15SE +/- 0.64, N = 3256.13324.02315.99MIN: 14.67 / MAX: 30000MIN: 24.02 / MAX: 30000MIN: 27.08 / MAX: 300001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Second Run4c6c8c70140210280350SE +/- 1.82, N = 10SE +/- 2.56, N = 15SE +/- 2.39, N = 3252.63314.96301.89MIN: 14.7 / MAX: 20000MIN: 24.41 / MAX: 30000MIN: 26.97 / MAX: 150001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, First Run / Cold Cache4c6c8c70140210280350SE +/- 1.87, N = 10SE +/- 2.92, N = 15SE +/- 3.88, N = 3238.02306.59287.86MIN: 14.87 / MAX: 20000MIN: 23.87 / MAX: 30000MIN: 27.05 / MAX: 200001. ClickHouse server version 22.7.1.687 (official build).

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 123.38, N = 3SE +/- 172.12, N = 3SE +/- 112.13, N = 31585861554961582061. (CXX) g++ options: -O3 -ldl

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C4c6c8c5K10K15K20K25KSE +/- 46.80, N = 3SE +/- 29.58, N = 3SE +/- 22.67, N = 37090.307198.4824929.101. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c10002000300040005000SE +/- 7.69, N = 3SE +/- 7.54, N = 3SE +/- 3.53, N = 34806472247961. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 280.26, N = 3SE +/- 392.43, N = 3SE +/- 171.91, N = 31368841337571362201. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c30K60K90K120K150KSE +/- 221.08, N = 3SE +/- 225.43, N = 3SE +/- 391.66, N = 31341571317211344771. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c9001800270036004500SE +/- 5.69, N = 3SE +/- 5.20, N = 3SE +/- 4.81, N = 34048395240231. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c9001800270036004500SE +/- 5.51, N = 3SE +/- 3.33, N = 3SE +/- 3.93, N = 34139404241311. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c4K8K12K16K20KSE +/- 66.58, N = 3SE +/- 0.58, N = 3SE +/- 47.01, N = 31937819007193841. (CXX) g++ options: -O3 -ldl

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Classroom - Compute: CPU-Only4c6c8c306090120150SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 3130.00131.41130.04

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c8K16K24K32K40KSE +/- 29.84, N = 3SE +/- 33.12, N = 3SE +/- 98.10, N = 33861338023386391. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c7K14K21K28K35KSE +/- 43.67, N = 3SE +/- 54.34, N = 3SE +/- 60.40, N = 33325232476331961. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c30060090012001500SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 31205118412071. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer4c6c8c7K14K21K28K35KSE +/- 38.85, N = 3SE +/- 93.71, N = 3SE +/- 34.89, N = 33247931886326221. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c2004006008001000SE +/- 1.33, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 31035101010321. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer4c6c8c2004006008001000SE +/- 1.45, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 3101199410101. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c4K8K12K16K20KSE +/- 4.48, N = 3SE +/- 27.87, N = 3SE +/- 22.28, N = 31660516207166151. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c3K6K9K12K15KSE +/- 25.83, N = 3SE +/- 15.34, N = 3SE +/- 22.61, N = 31619615970162011. (CXX) g++ options: -O3 -ldl

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C4c6c8c13K26K39K52K65KSE +/- 66.33, N = 3SE +/- 70.37, N = 3SE +/- 59.44, N = 324943.5927617.5662499.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c20K40K60K80K100KSE +/- 125.68, N = 3SE +/- 43.68, N = 3SE +/- 101.29, N = 38142979836812761. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c15K30K45K60K75KSE +/- 151.58, N = 3SE +/- 121.88, N = 3SE +/- 30.48, N = 37070269199704531. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer4c6c8c15K30K45K60K75KSE +/- 84.76, N = 3SE +/- 17.13, N = 3SE +/- 47.36, N = 36898467959688431. (CXX) g++ options: -O3 -ldl

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction4c6c8c20406080100SE +/- 0.56, N = 3SE +/- 0.58, N = 3SE +/- 0.39, N = 382.5593.6741.641. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare4c6c8c0.78391.56782.35173.13563.9195SE +/- 0.002, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 32.4183.2613.4841. (CXX) g++ options: -O3

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x40964c6c8c0.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.440.500.53

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C4c6c8c15K30K45K60K75KSE +/- 149.22, N = 3SE +/- 93.08, N = 3SE +/- 63.92, N = 330214.2531363.0871618.751. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D4c6c8c30060090012001500SE +/- 7.03, N = 4SE +/- 5.86, N = 3SE +/- 3.35, N = 3566.17605.531560.311. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.24c6c8c140M280M420M560M700MSE +/- 64488.56, N = 3SE +/- 215404.98, N = 3SE +/- 656104.93, N = 31845388332074731006532481671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig4c6c8c1530456075SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 366.4165.0719.771. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms4c6c8c0.18560.37120.55680.74240.928SE +/- 0.00160, N = 3SE +/- 0.00232, N = 3SE +/- 0.00063, N = 30.824670.770930.64960

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: defconfig4c6c8c918273645SE +/- 0.37, N = 3SE +/- 0.34, N = 3SE +/- 0.37, N = 338.5834.8135.83

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: motorBike - Execution Time4c6c8c163248648071.7170.7364.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig4c6c8c918273645SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 341.3140.2912.271. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.74, N = 14SE +/- 1.64, N = 14SE +/- 1.94, N = 1283.31124.77131.291. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x21604c6c8c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.881.011.08

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x21604c6c8c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.891.011.08

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copying4c6c8c2K4K6K8K10KSE +/- 34.95, N = 3SE +/- 14.52, N = 3SE +/- 19.40, N = 34210.748228.368094.031. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction4c6c8c510152025SE +/- 0.22101413, N = 3SE +/- 0.32246994, N = 3SE +/- 0.06683370, N = 1518.0448589022.879023209.071453601. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C4c6c8c8K16K24K32K40KSE +/- 9.24, N = 3SE +/- 41.39, N = 3SE +/- 18.86, N = 311378.6612166.6438325.141. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.00Test: Compression Rating4c6c8c40K80K120K160K200KSE +/- 124.84, N = 3SE +/- 524.95, N = 3SE +/- 504.48, N = 31400262009431973161. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B4c6c8c8K16K24K32K40KSE +/- 83.71, N = 3SE +/- 53.26, N = 3SE +/- 39.29, N = 310930.1111216.2335607.351. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C4c6c8c3K6K9K12K15KSE +/- 17.52, N = 3SE +/- 78.48, N = 3SE +/- 125.06, N = 35404.797330.5915087.841. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile4c6c8c3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 313.4412.6812.93

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C4c6c8c8K16K24K32K40KSE +/- 2.83, N = 3SE +/- 4.70, N = 3SE +/- 10.53, N = 310403.4811213.5937047.681. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Fast4c6c8c100200300400500SE +/- 0.69, N = 3SE +/- 0.14, N = 3SE +/- 1.23, N = 3451.32462.22455.601. (CXX) g++ options: -O3 -flto -pthread

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p4c6c8c100200300400500SE +/- 3.25, N = 13SE +/- 4.72, N = 12SE +/- 6.26, N = 12346.39470.21466.671. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.28, N = 3SE +/- 0.32, N = 3SE +/- 0.28, N = 389.86123.85120.861. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.34c6c8c2K4K6K8K10KSE +/- 83.82, N = 3SE +/- 20.87, N = 3SE +/- 67.93, N = 37527.839285.939045.541. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K4c6c8c306090120150SE +/- 0.36, N = 3SE +/- 0.85, N = 3SE +/- 0.30, N = 388.92136.95141.361. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 7 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4K4c6c8c20406080100SE +/- 0.67, N = 3SE +/- 0.21, N = 3SE +/- 0.62, N = 397.50110.19104.811. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 10 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 1.07, N = 3SE +/- 0.32, N = 3SE +/- 0.04, N = 3135.76160.11151.791. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 12 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 0.43, N = 3SE +/- 1.06, N = 3SE +/- 0.38, N = 3165.36200.15189.511. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4K4c6c8c4080120160200SE +/- 0.77, N = 3SE +/- 1.08, N = 3SE +/- 0.81, N = 3139.28194.61188.941. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 8 - Input: Bosphorus 1080p4c6c8c4080120160200SE +/- 1.47, N = 3SE +/- 0.35, N = 3SE +/- 0.17, N = 3186.33194.71190.811. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p4c6c8c90180270360450SE +/- 1.28, N = 3SE +/- 1.45, N = 3SE +/- 2.06, N = 3323.03397.23388.721. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p4c6c8c100200300400500SE +/- 1.37, N = 3SE +/- 1.32, N = 3SE +/- 3.84, N = 3353.86472.54471.581. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p4c6c8c140280420560700SE +/- 1.66, N = 3SE +/- 3.02, N = 3SE +/- 6.13, N = 3524.95644.73586.831. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.5