Threadripper PRO 5965WX Memory Channels

AMD Ryzen Threadripper PRO 5965WX Linux memory benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2208077-NE-TRPRO500044.

Threadripper PRO 5965WX Memory ChannelsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution8c6c4cAMD Ryzen Threadripper PRO 5965WX 24-Cores @ 3.80GHz (24 Cores / 48 Threads)ASUS Pro WS WRX80E-SAGE SE WIFI (1003 BIOS)AMD Starship/Matisse128GB1000GB Western Digital WDS100T1X0E-00AFY0ASUS NVIDIA NV106 2GBAMD Starship/MatisseVA24312 x Intel 10G X550T + Intel Wi-Fi 6 AX200Ubuntu 22.045.19.0-051900rc7-generic (x86_64)GNOME Shell 42.2X Server 1.21.1.3 + Waylandnouveau4.3 Mesa 22.0.11.2.204GCC 11.2.0ext41920x108096GB64GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa008203Java Details- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)Python Details- Python 3.10.4Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Threadripper PRO 5965WX Memory Channelshpcg: npb: BT.Cnpb: CG.Cnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Cnamd: ATPase Simulation - 327,506 Atomsamg: pennant: sedovbigpennant: leblancbignwchem: C240 Buckyballincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionopenfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Large Mesh Size - Mesh Timeopenfoam: drivaerFastback, Large Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeqe: AUSURF112lulesh: svt-av1: Preset 10 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 4Ksvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 4Ksvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 4Ksvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 4Ksvt-vp9: Visual Quality Optimized - Bosphorus 1080poidn: RT.hdr_alb_nrm.3840x2160oidn: RT.ldr_alb_nrm.3840x2160oidn: RTLightmap.hdr.4096x4096openvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalarcompress-7zip: Compression Ratingbuild-gcc: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-mplayer: Time To Compileospray-studio: 1 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracerospray-studio: 3 - 4K - 16 - Path Tracerospray-studio: 3 - 4K - 32 - Path Tracerospray-studio: 1 - 1080p - 1 - Path Tracerospray-studio: 2 - 1080p - 1 - Path Tracerospray-studio: 3 - 1080p - 1 - Path Tracerospray-studio: 1 - 1080p - 16 - Path Tracerospray-studio: 1 - 1080p - 32 - Path Tracerospray-studio: 2 - 1080p - 16 - Path Tracerospray-studio: 2 - 1080p - 32 - Path Tracerospray-studio: 3 - 1080p - 16 - Path Tracerospray-studio: 3 - 1080p - 32 - Path Tracerastcenc: Fastgraph500: 26graph500: 26graph500: 26graph500: 26gromacs: MPI CPU - water_GMX50_barestress-ng: Memory Copyingwrf: conus 2.5kmblender: Classroom - CPU-Onlyblender: Barbershop - CPU-Onlyclickhouse: 100M Rows Web Analytics Dataset, First Run / Cold Cacheclickhouse: 100M Rows Web Analytics Dataset, Second Runclickhouse: 100M Rows Web Analytics Dataset, Third Run8c6c4c12.589162499.8015087.8438325.141560.3171618.7537047.6835607.3524929.100.6496065324816719.7680212.270723763.69.0714536041.635147164.471088.1425819.45158.791313.36159.411316.02331.209045.5360151.792189.506190.811104.81188.94586.83131.29466.67141.36471.58120.86388.721.081.080.539455197316635.32435.830385.973227.596263.19812.926402341314796688431344777045313622081276158206101010321207162013262216615331961938438639455.59633420140003480300001232430001643760003.4848094.0326021.066130.04524.21287.86301.89315.993.9638727617.567330.5912166.64605.5331363.0811213.5911216.237198.480.7709320747310065.0741040.288953915.322.879023293.671079070.731181.1830373.88175.211686.71176.361663.1344.759285.9254160.112200.146194.706110.19194.61644.73124.77470.21136.95472.54123.85397.231.011.010.509757200943617.89934.811375.094219.972255.57612.67939524042472267959131721691991337577983615549699410101184159703188616207324761900738023462.22263481080003540960001223040001601190003.2618228.3630933.316131.41517.75306.59314.96324.023.5629324943.595404.7911378.66566.1730214.2510403.4810930.117090.300.8246718453883366.4106141.308354062.618.044858982.553649971.711638.8251240.35219.742429.98220.842469.98397.697527.8343135.763165.358186.33397.50139.28524.9583.31346.3988.92353.8689.86323.030.880.890.448753140026637.79838.575419.634254.606286.69713.438404841394806689841341577070213688481429158586101110351205161963247916605332521937838613451.3173321128000324305000998528001308230002.4184210.74130.00522.56238.02252.63256.13OpenBenchmarking.org

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.18c6c4c3691215SE +/- 0.00191, N = 3SE +/- 0.01229, N = 3SE +/- 0.00250, N = 312.589103.963873.562931. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C8c6c4c13K26K39K52K65KSE +/- 59.44, N = 3SE +/- 70.37, N = 3SE +/- 66.33, N = 362499.8027617.5624943.591. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C8c6c4c3K6K9K12K15KSE +/- 125.06, N = 3SE +/- 78.48, N = 3SE +/- 17.52, N = 315087.847330.595404.791. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C8c6c4c8K16K24K32K40KSE +/- 18.86, N = 3SE +/- 41.39, N = 3SE +/- 9.24, N = 338325.1412166.6411378.661. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D8c6c4c30060090012001500SE +/- 3.35, N = 3SE +/- 5.86, N = 3SE +/- 7.03, N = 41560.31605.53566.171. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C8c6c4c15K30K45K60K75KSE +/- 63.92, N = 3SE +/- 93.08, N = 3SE +/- 149.22, N = 371618.7531363.0830214.251. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C8c6c4c8K16K24K32K40KSE +/- 10.53, N = 3SE +/- 4.70, N = 3SE +/- 2.83, N = 337047.6811213.5910403.481. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B8c6c4c8K16K24K32K40KSE +/- 39.29, N = 3SE +/- 53.26, N = 3SE +/- 83.71, N = 335607.3511216.2310930.111. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C8c6c4c5K10K15K20K25KSE +/- 22.67, N = 3SE +/- 29.58, N = 3SE +/- 46.80, N = 324929.107198.487090.301. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms8c6c4c0.18560.37120.55680.74240.928SE +/- 0.00063, N = 3SE +/- 0.00232, N = 3SE +/- 0.00160, N = 30.649600.770930.82467

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.28c6c4c140M280M420M560M700MSE +/- 656104.93, N = 3SE +/- 215404.98, N = 3SE +/- 64488.56, N = 36532481672074731001845388331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig8c6c4c1530456075SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 319.7765.0766.411. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig8c6c4c918273645SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 312.2740.2941.311. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball8c6c4c90018002700360045003763.63915.34062.61. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction8c6c4c510152025SE +/- 0.06683370, N = 15SE +/- 0.32246994, N = 3SE +/- 0.22101413, N = 39.0714536022.8790232018.044858901. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction8c6c4c20406080100SE +/- 0.39, N = 3SE +/- 0.58, N = 3SE +/- 0.56, N = 341.6493.6782.551. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: motorBike - Execution Time8c6c4c163248648064.4770.7371.711. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Mesh Time8c6c4c4008001200160020001088.141181.181638.821. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Large Mesh Size - Execution Time8c6c4c11K22K33K44K55K25819.4530373.8851240.351. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Mesh Time8c6c4c50100150200250158.79175.21219.741. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Small Mesh Size - Execution Time8c6c4c50010001500200025001313.361686.712429.981. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Mesh Time8c6c4c50100150200250159.41176.36220.841. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Execution Time8c6c4c50010001500200025001316.021663.102469.981. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF1128c6c4c90180270360450SE +/- 1.56, N = 3SE +/- 0.69, N = 3SE +/- 0.56, N = 3331.20344.75397.691. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.38c6c4c2K4K6K8K10KSE +/- 67.93, N = 3SE +/- 20.87, N = 3SE +/- 83.82, N = 39045.549285.937527.831. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

SVT-AV1

Encoder Mode: Preset 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 10 - Input: Bosphorus 4K8c6c4c4080120160200SE +/- 0.04, N = 3SE +/- 0.32, N = 3SE +/- 1.07, N = 3151.79160.11135.761. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 12 - Input: Bosphorus 4K8c6c4c4080120160200SE +/- 0.38, N = 3SE +/- 1.06, N = 3SE +/- 0.43, N = 3189.51200.15165.361. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.0Encoder Mode: Preset 8 - Input: Bosphorus 1080p8c6c4c4080120160200SE +/- 0.17, N = 3SE +/- 0.35, N = 3SE +/- 1.47, N = 3190.81194.71186.331. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

Tuning: 7 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 4K8c6c4c20406080100SE +/- 0.62, N = 3SE +/- 0.21, N = 3SE +/- 0.67, N = 3104.81110.1997.501. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4K8c6c4c4080120160200SE +/- 0.81, N = 3SE +/- 1.08, N = 3SE +/- 0.77, N = 3188.94194.61139.281. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p8c6c4c140280420560700SE +/- 6.13, N = 3SE +/- 3.02, N = 3SE +/- 1.66, N = 3586.83644.73524.951. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 4K8c6c4c306090120150SE +/- 1.94, N = 12SE +/- 1.64, N = 14SE +/- 0.74, N = 14131.29124.7783.311. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p8c6c4c100200300400500SE +/- 6.26, N = 12SE +/- 4.72, N = 12SE +/- 3.25, N = 13466.67470.21346.391. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K8c6c4c306090120150SE +/- 0.30, N = 3SE +/- 0.85, N = 3SE +/- 0.36, N = 3141.36136.9588.921. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p8c6c4c100200300400500SE +/- 3.84, N = 3SE +/- 1.32, N = 3SE +/- 1.37, N = 3471.58472.54353.861. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 4K8c6c4c306090120150SE +/- 0.28, N = 3SE +/- 0.32, N = 3SE +/- 0.28, N = 3120.86123.8589.861. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p8c6c4c90180270360450SE +/- 2.06, N = 3SE +/- 1.45, N = 3SE +/- 1.28, N = 3388.72397.23323.031. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x21608c6c4c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.081.010.88

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x21608c6c4c0.2430.4860.7290.9721.215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.081.010.89

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x40968c6c4c0.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.530.500.44

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPC8c6c4c20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3949787MIN: 8 / MAX: 2044MIN: 9 / MAX: 2123MIN: 7 / MAX: 1883

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark Scalar8c6c4c1326395265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3555753MIN: 4 / MAX: 1689MIN: 5 / MAX: 1779MIN: 4 / MAX: 1606

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.00Test: Compression Rating8c6c4c40K80K120K160K200KSE +/- 504.48, N = 3SE +/- 524.95, N = 3SE +/- 124.84, N = 31973162009431400261. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 11.2.0Time To Compile8c6c4c140280420560700SE +/- 0.79, N = 3SE +/- 0.47, N = 3SE +/- 1.13, N = 3635.32617.90637.80

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: defconfig8c6c4c918273645SE +/- 0.37, N = 3SE +/- 0.34, N = 3SE +/- 0.37, N = 335.8334.8138.58

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.18Build: allmodconfig8c6c4c90180270360450SE +/- 0.08, N = 3SE +/- 0.20, N = 3SE +/- 0.55, N = 3385.97375.09419.63

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninja8c6c4c60120180240300SE +/- 0.72, N = 3SE +/- 0.07, N = 3SE +/- 0.88, N = 3227.60219.97254.61

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix Makefiles8c6c4c60120180240300SE +/- 2.27, N = 3SE +/- 2.39, N = 3SE +/- 2.73, N = 9263.20255.58286.70

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile8c6c4c3691215SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 312.9312.6813.44

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c9001800270036004500SE +/- 4.81, N = 3SE +/- 5.20, N = 3SE +/- 5.69, N = 34023395240481. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c9001800270036004500SE +/- 3.93, N = 3SE +/- 3.33, N = 3SE +/- 5.51, N = 34131404241391. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c10002000300040005000SE +/- 3.53, N = 3SE +/- 7.54, N = 3SE +/- 7.69, N = 34796472248061. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c15K30K45K60K75KSE +/- 47.36, N = 3SE +/- 17.13, N = 3SE +/- 84.76, N = 36884367959689841. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c30K60K90K120K150KSE +/- 391.66, N = 3SE +/- 225.43, N = 3SE +/- 221.08, N = 31344771317211341571. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c15K30K45K60K75KSE +/- 30.48, N = 3SE +/- 121.88, N = 3SE +/- 151.58, N = 37045369199707021. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c30K60K90K120K150KSE +/- 171.91, N = 3SE +/- 392.43, N = 3SE +/- 280.26, N = 31362201337571368841. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c20K40K60K80K100KSE +/- 101.29, N = 3SE +/- 43.68, N = 3SE +/- 125.68, N = 38127679836814291. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c30K60K90K120K150KSE +/- 112.13, N = 3SE +/- 172.12, N = 3SE +/- 123.38, N = 31582061554961585861. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c2004006008001000SE +/- 1.20, N = 3SE +/- 0.88, N = 3SE +/- 1.45, N = 3101099410111. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c2004006008001000SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 1.33, N = 31032101010351. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer8c6c4c30060090012001500SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 1.15, N = 31207118412051. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c3K6K9K12K15KSE +/- 22.61, N = 3SE +/- 15.34, N = 3SE +/- 25.83, N = 31620115970161961. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c7K14K21K28K35KSE +/- 34.89, N = 3SE +/- 93.71, N = 3SE +/- 38.85, N = 33262231886324791. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c4K8K12K16K20KSE +/- 22.28, N = 3SE +/- 27.87, N = 3SE +/- 4.48, N = 31661516207166051. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c7K14K21K28K35KSE +/- 60.40, N = 3SE +/- 54.34, N = 3SE +/- 43.67, N = 33319632476332521. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer8c6c4c4K8K12K16K20KSE +/- 47.01, N = 3SE +/- 0.58, N = 3SE +/- 66.58, N = 31938419007193781. (CXX) g++ options: -O3 -ldl

OSPRay Studio

Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer8c6c4c8K16K24K32K40KSE +/- 98.10, N = 3SE +/- 33.12, N = 3SE +/- 29.84, N = 33863938023386131. (CXX) g++ options: -O3 -ldl

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Fast8c6c4c100200300400500SE +/- 1.23, N = 3SE +/- 0.14, N = 3SE +/- 0.69, N = 3455.60462.22451.321. (CXX) g++ options: -O3 -flto -pthread

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 268c6c4c70M140M210M280M350M3420140003481080003211280001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 268c6c4c80M160M240M320M400M3480300003540960003243050001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 268c6c4c30M60M90M120M150M123243000122304000998528001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 268c6c4c40M80M120M160M200M1643760001601190001308230001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare8c6c4c0.78391.56782.35173.13563.9195SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 33.4843.2612.4181. (CXX) g++ options: -O3

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copying8c6c4c2K4K6K8K10KSE +/- 19.40, N = 3SE +/- 14.52, N = 3SE +/- 34.95, N = 38094.038228.364210.741. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km8c6c7K14K21K28K35K26021.0730933.321. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Classroom - Compute: CPU-Only8c6c4c306090120150SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 3130.04131.41130.00

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.2Blend File: Barbershop - Compute: CPU-Only8c6c4c110220330440550SE +/- 0.49, N = 3SE +/- 0.20, N = 3SE +/- 0.34, N = 3524.21517.75522.56

ClickHouse

100M Rows Web Analytics Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, First Run / Cold Cache8c6c4c70140210280350SE +/- 3.88, N = 3SE +/- 2.92, N = 15SE +/- 1.87, N = 10287.86306.59238.02MIN: 27.05 / MAX: 20000MIN: 23.87 / MAX: 30000MIN: 14.87 / MAX: 200001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Second Run8c6c4c70140210280350SE +/- 2.39, N = 3SE +/- 2.56, N = 15SE +/- 1.82, N = 10301.89314.96252.63MIN: 26.97 / MAX: 15000MIN: 24.41 / MAX: 30000MIN: 14.7 / MAX: 200001. ClickHouse server version 22.7.1.687 (official build).

ClickHouse

100M Rows Web Analytics Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse100M Rows Web Analytics Dataset, Third Run8c6c4c70140210280350SE +/- 0.64, N = 3SE +/- 1.23, N = 15SE +/- 1.78, N = 10315.99324.02256.13MIN: 27.08 / MAX: 30000MIN: 24.02 / MAX: 30000MIN: 14.67 / MAX: 300001. ClickHouse server version 22.7.1.687 (official build).


Phoronix Test Suite v10.8.4