Linux 5.12 Scheduler

AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102172-PTS-LINUX51261&sro&grs.

Linux 5.12 SchedulerProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen ResolutionLinux 5.115.12 schedAMD Ryzen 9 5950X 16-Core @ 6.92GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)AMD Starship/Matisse32GB2000GB Corsair Force MP600AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.105.11.0-051100-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.94.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)1.2.145GCC 10.2.0ext43840x21605.11.0-sched (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Linux 5.12 Scheduleretcpak: ETC2graphics-magick: Rotateaskap: tConvolve MPI - Degriddingbuild-linux-kernel: Time To Compileqmcpack: simple-H2Otesseract: 3840 x 2160stockfish: Total Timenpb: CG.Caskap: tConvolve OpenMP - Griddinggraphics-magick: Resizingfinancebench: Bonds OpenMPdaphne: OpenMP - Points2Imageetcpak: DXT1dav1d: Summer Nature 1080pdaphne: OpenMP - NDT Mappingparaview: Wavelet Volume - 3840 x 2160paraview: Wavelet Volume - 3840 x 2160financebench: Repo OpenMPv-ray: CPUnpb: FT.Cbuild-godot: Time To Compileaskap: tConvolve MPI - Griddingnpb: LU.Cindigobench: CPU - Bedroomindigobench: CPU - Supercarparaview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080askap: Hogbom Clean OpenMPoidn: Memorialnpb: EP.Copenvkl: vklBenchmarkdav1d: Chimera 1080p 10-bitnpb: BT.Cbuild-gdb: Time To Compilewarsow: 3840 x 2160paraview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080ttsiod-renderer: Phong Rendering With Soft-Shadow Mappingnpb: IS.Dn-queens: Elapsed Timeaskap: tConvolve OpenMP - Degriddingwebp2: Quality 75, Compression Effort 7daphne: OpenMP - Euclidean Clusterparaview: Wavelet Contour - 3840 x 2160paraview: Wavelet Contour - 3840 x 2160dav1d: Summer Nature 4Krawtherapee: Total Benchmark Timegromacs: water_GMX50_barewebp2: Quality 95, Compression Effort 7openfoam: Motorbike 30Mdav1d: Chimera 1080pm-queens: Time To Solveparaview: Many Spheres - 3840 x 2160askap: tConvolve MT - Degriddingparaview: Many Spheres - 3840 x 2160npb: MG.Cnpb: SP.Baskap: tConvolve MT - Griddingnamd: ATPase Simulation - 327,506 Atomsjpegxl-decode: Allclomp: Static OMP Speedupsimdjson: PartialTweetssimdjson: Kostyaparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080Linux 5.115.12 sched241.11410686758.7245.59822.350398.4990431336457018.382722.08189239971.76432327781.1264230971533.222911.62882.03264.674234.71327369.5234372150612182.0678.9626672.0228328.864.1398.68266.036620.179216.93414.571917.44292121.6624129.9262.436431.73896.310373.881043.96646.035.6223252.89116.3351493.342653.882254.66240.7745.7641.264214.94397.64837.0830.8146294.7681344.5962.789935.407886.19785.0161.08110202.6021.20.900.677823.400488.96235.61410536846.9345.02822.608394.3750435540036957.052698.79187639641.30468827567.0671601481521.611904.93888.36266.264260.09727208.0826822162612122.5179.3136642.9328209.454.1228.71766.296645.758217.70914.521910.88293122.0624051.4362.234430.43884.718372.771046.99647.675.6363260.38116.5991496.142649.139254.21241.1845.8371.266215.27797.49838.2530.8556291.4321343.8962.759930.817883.48784.7751.08100202.6121.20.940.717416.986463.56OpenBenchmarking.org

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC25.12 schedLinux 5.1150100150200250SE +/- 1.05, N = 3SE +/- 3.17, N = 3235.61241.111. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate5.12 schedLinux 5.112004006008001000SE +/- 5.86, N = 3SE +/- 2.91, N = 3105310681. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding5.12 schedLinux 5.1115003000450060007500SE +/- 79.27, N = 3SE +/- 77.23, N = 36846.936758.721. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compile5.12 schedLinux 5.111020304050SE +/- 0.33, N = 3SE +/- 0.33, N = 345.0345.60

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O5.12 schedLinux 5.11510152025SE +/- 0.24, N = 5SE +/- 0.10, N = 322.6122.351. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Tesseract

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterTesseract 2014-05-12Resolution: 3840 x 21605.12 schedLinux 5.1190180270360450SE +/- 3.88, N = 15SE +/- 3.68, N = 6394.38398.50

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time5.12 schedLinux 5.119M18M27M36M45MSE +/- 444782.82, N = 5SE +/- 412700.59, N = 343554003431336451. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C5.12 schedLinux 5.1115003000450060007500SE +/- 11.23, N = 3SE +/- 60.15, N = 36957.057018.381. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding5.12 schedLinux 5.116001200180024003000SE +/- 18.11, N = 3SE +/- 23.99, N = 72698.792722.081. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing5.12 schedLinux 5.11400800120016002000SE +/- 6.56, N = 3SE +/- 4.37, N = 3187618921. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP5.12 schedLinux 5.119K18K27K36K45KSE +/- 34.33, N = 3SE +/- 31.83, N = 339641.3039971.761. (CXX) g++ options: -O3 -march=native -fopenmp

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2Image5.12 schedLinux 5.116K12K18K24K30KSE +/- 392.26, N = 3SE +/- 342.19, N = 327567.0727781.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT15.12 schedLinux 5.1130060090012001500SE +/- 4.22, N = 3SE +/- 1.30, N = 31521.611533.221. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p5.12 schedLinux 5.112004006008001000SE +/- 0.24, N = 3SE +/- 4.97, N = 3904.93911.62MIN: 648.7 / MAX: 987.05MIN: 618.67 / MAX: 1003.61. (CC) gcc options: -pthread

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT Mapping5.12 schedLinux 5.112004006008001000SE +/- 2.37, N = 3SE +/- 6.01, N = 3888.36882.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 21605.12 schedLinux 5.1160120180240300SE +/- 0.26, N = 3SE +/- 1.81, N = 12266.26264.67

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 21605.12 schedLinux 5.119001800270036004500SE +/- 4.11, N = 3SE +/- 28.98, N = 124260.104234.71

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP5.12 schedLinux 5.116K12K18K24K30KSE +/- 33.25, N = 3SE +/- 288.55, N = 327208.0827369.521. (CXX) g++ options: -O3 -march=native -fopenmp

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU5.12 schedLinux 5.115K10K15K20K25KSE +/- 221.33, N = 3SE +/- 99.08, N = 32162621506

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C5.12 schedLinux 5.113K6K9K12K15KSE +/- 4.72, N = 3SE +/- 6.14, N = 312122.5112182.061. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile5.12 schedLinux 5.1120406080100SE +/- 0.12, N = 3SE +/- 0.18, N = 379.3178.96

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding5.12 schedLinux 5.1114002800420056007000SE +/- 0.00, N = 3SE +/- 56.07, N = 36642.936672.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C5.12 schedLinux 5.116K12K18K24K30KSE +/- 19.13, N = 3SE +/- 8.24, N = 328209.4528328.861. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom5.12 schedLinux 5.110.93131.86262.79393.72524.6565SE +/- 0.016, N = 3SE +/- 0.017, N = 34.1224.139

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar5.12 schedLinux 5.11246810SE +/- 0.023, N = 3SE +/- 0.011, N = 38.7178.682

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 10805.12 schedLinux 5.111530456075SE +/- 0.15, N = 3SE +/- 0.22, N = 366.2966.03

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 10805.12 schedLinux 5.1114002800420056007000SE +/- 14.64, N = 3SE +/- 22.52, N = 36645.766620.18

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP5.12 schedLinux 5.1150100150200250SE +/- 0.42, N = 3SE +/- 1.25, N = 3217.71216.931. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: Memorial5.12 schedLinux 5.1148121620SE +/- 0.02, N = 3SE +/- 0.02, N = 314.5214.57

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C5.12 schedLinux 5.11400800120016002000SE +/- 5.65, N = 3SE +/- 4.39, N = 31910.881917.441. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmark5.12 schedLinux 5.1160120180240300293292MIN: 1 / MAX: 1137MIN: 1 / MAX: 1136

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit5.12 schedLinux 5.11306090120150SE +/- 1.09, N = 3SE +/- 0.66, N = 3122.06121.66MIN: 86.2 / MAX: 274.09MIN: 87.02 / MAX: 270.791. (CC) gcc options: -pthread

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C5.12 schedLinux 5.115K10K15K20K25KSE +/- 23.23, N = 3SE +/- 23.90, N = 324051.4324129.921. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To Compile5.12 schedLinux 5.111428425670SE +/- 0.31, N = 3SE +/- 0.23, N = 362.2362.44

Warsow

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 3840 x 21605.12 schedLinux 5.1190180270360450SE +/- 0.38, N = 3SE +/- 0.70, N = 3430.4431.7

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10805.12 schedLinux 5.118001600240032004000SE +/- 1.80, N = 3SE +/- 1.65, N = 33884.723896.31

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10805.12 schedLinux 5.1180160240320400SE +/- 0.17, N = 3SE +/- 0.16, N = 3372.77373.88

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow Mapping5.12 schedLinux 5.112004006008001000SE +/- 2.77, N = 3SE +/- 9.06, N = 31046.991043.961. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D5.12 schedLinux 5.11140280420560700SE +/- 0.65, N = 3SE +/- 2.89, N = 3647.67646.031. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Time5.12 schedLinux 5.111.26812.53623.80435.07246.3405SE +/- 0.003, N = 3SE +/- 0.003, N = 35.6365.6221. (CC) gcc options: -static -fopenmp -O3 -march=native

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding5.12 schedLinux 5.117001400210028003500SE +/- 13.36, N = 3SE +/- 10.36, N = 73260.383252.891. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 75.12 schedLinux 5.11306090120150SE +/- 0.55, N = 3SE +/- 1.18, N = 3116.60116.341. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Euclidean Cluster5.12 schedLinux 5.1130060090012001500SE +/- 6.52, N = 3SE +/- 3.74, N = 31496.141493.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 21605.12 schedLinux 5.116001200180024003000SE +/- 0.80, N = 3SE +/- 0.59, N = 32649.142653.88

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 21605.12 schedLinux 5.1160120180240300SE +/- 0.08, N = 3SE +/- 0.06, N = 3254.21254.66

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K5.12 schedLinux 5.1150100150200250SE +/- 0.13, N = 3SE +/- 0.34, N = 3241.18240.77MIN: 181.33 / MAX: 249.24MIN: 181.75 / MAX: 249.171. (CC) gcc options: -pthread

RawTherapee

Total Benchmark Time

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time5.12 schedLinux 5.111020304050SE +/- 0.22, N = 3SE +/- 0.26, N = 345.8445.761. RawTherapee, version 5.8, command line.

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare5.12 schedLinux 5.110.28490.56980.85471.13961.4245SE +/- 0.002, N = 3SE +/- 0.002, N = 31.2661.2641. (CXX) g++ options: -O3 -pthread

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 75.12 schedLinux 5.1150100150200250SE +/- 0.64, N = 3SE +/- 0.69, N = 3215.28214.941. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M5.12 schedLinux 5.1120406080100SE +/- 0.14, N = 3SE +/- 0.07, N = 397.4997.641. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p5.12 schedLinux 5.112004006008001000SE +/- 5.53, N = 3SE +/- 7.91, N = 3838.25837.08MIN: 588.61 / MAX: 1047.17MIN: 547.13 / MAX: 1054.471. (CC) gcc options: -pthread

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solve5.12 schedLinux 5.11714212835SE +/- 0.03, N = 3SE +/- 0.02, N = 330.8630.811. (CXX) g++ options: -fopenmp -O2 -march=native

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 21605.12 schedLinux 5.1113002600390052006500SE +/- 3.70, N = 3SE +/- 1.21, N = 36291.436294.77

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding5.12 schedLinux 5.1130060090012001500SE +/- 2.63, N = 3SE +/- 1.91, N = 31343.891344.591. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 21605.12 schedLinux 5.111428425670SE +/- 0.04, N = 3SE +/- 0.01, N = 362.7562.78

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C5.12 schedLinux 5.112K4K6K8K10KSE +/- 6.34, N = 3SE +/- 3.64, N = 39930.819935.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B5.12 schedLinux 5.112K4K6K8K10KSE +/- 21.12, N = 3SE +/- 10.12, N = 37883.487886.191. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding5.12 schedLinux 5.112004006008001000SE +/- 2.00, N = 3SE +/- 1.97, N = 3784.78785.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms5.12 schedLinux 5.110.24320.48640.72960.97281.216SE +/- 0.00363, N = 3SE +/- 0.00393, N = 31.081001.08110

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All5.12 schedLinux 5.114080120160200SE +/- 0.33, N = 3SE +/- 0.31, N = 3202.61202.60

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup5.12 schedLinux 5.11510152025SE +/- 0.07, N = 3SE +/- 0.09, N = 321.221.21. (CC) gcc options: -fopenmp -O3 -lm

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets5.12 schedLinux 5.110.21150.4230.63450.8461.0575SE +/- 0.02, N = 15SE +/- 0.01, N = 150.940.901. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya5.12 schedLinux 5.110.15980.31960.47940.63920.799SE +/- 0.03, N = 15SE +/- 0.02, N = 120.710.671. (CXX) g++ options: -O3 -pthread

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10805.12 schedLinux 5.112K4K6K8K10KSE +/- 229.82, N = 12SE +/- 31.61, N = 37416.997823.40

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10805.12 schedLinux 5.11110220330440550SE +/- 14.36, N = 12SE +/- 1.98, N = 3463.56488.96


Phoronix Test Suite v10.8.5