Linux 5.12 Scheduler

AMD Ryzen 9 5950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102172-PTS-LINUX51261&sro&grr.

Linux 5.12 SchedulerProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen ResolutionLinux 5.115.12 schedAMD Ryzen 9 5950X 16-Core @ 6.92GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)AMD Starship/Matisse32GB2000GB Corsair Force MP600AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.105.11.0-051100-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.94.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)1.2.145GCC 10.2.0ext43840x21605.11.0-sched (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201009Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Linux 5.12 Schedulersimdjson: Kostyasimdjson: PartialTweetswebp2: Quality 95, Compression Effort 7askap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddinggromacs: water_GMX50_barenpb: BT.Cwebp2: Quality 75, Compression Effort 7tesseract: 3840 x 2160openfoam: Motorbike 30Mdav1d: Chimera 1080p 10-bitopenvkl: vklBenchmarkwarsow: 3840 x 2160build-godot: Time To Compilenpb: LU.Cv-ray: CPUdaphne: OpenMP - Points2Imageaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingbuild-gdb: Time To Compileindigobench: CPU - Bedroomindigobench: CPU - Supercarnamd: ATPase Simulation - 327,506 Atomsgraphics-magick: Resizinggraphics-magick: Rotatenpb: IS.Dstockfish: Total Timerawtherapee: Total Benchmark Timefinancebench: Bonds OpenMPnpb: SP.Bbuild-linux-kernel: Time To Compilejpegxl-decode: Allnpb: FT.Cfinancebench: Repo OpenMPm-queens: Time To Solveqmcpack: simple-H2Oparaview: Many Spheres - 3840 x 2160paraview: Many Spheres - 3840 x 2160paraview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080npb: CG.Cdaphne: OpenMP - NDT Mappingetcpak: ETC2ttsiod-renderer: Phong Rendering With Soft-Shadow Mappingclomp: Static OMP Speedupparaview: Wavelet Volume - 3840 x 2160paraview: Wavelet Volume - 3840 x 2160askap: Hogbom Clean OpenMPnpb: MG.Cparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080dav1d: Summer Nature 4Kdaphne: OpenMP - Euclidean Clusteroidn: Memorialdav1d: Chimera 1080paskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingparaview: Wavelet Contour - 3840 x 2160paraview: Wavelet Contour - 3840 x 2160paraview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080n-queens: Elapsed Timenpb: EP.Cdav1d: Summer Nature 1080petcpak: DXT1Linux 5.115.12 sched0.670.90214.9431344.59785.0161.26424129.92116.335398.499097.64121.66292431.778.96228328.862150627781.1264230976672.026758.7262.4364.1398.6821.0811018921068646.034313364545.76439971.7643237886.1945.598202.6012182.0627369.52343730.81422.3506294.76862.786620.17966.037018.38882.03241.1141043.9621.24234.713264.67216.9349935.407823.400488.96240.771493.3414.57837.083252.892722.082653.882254.663896.310373.885.6221917.44911.621533.2220.710.94215.2771343.89784.7751.26624051.43116.599394.375097.49122.06293430.479.31328209.452162627567.0671601486642.936846.9362.2344.1228.7171.0810018761053647.674355400345.83739641.3046887883.4845.028202.6112122.5127208.08268230.85522.6086291.43262.756645.75866.296957.05888.36235.6141046.9921.24260.097266.26217.7099930.817416.986463.56241.181496.1414.52838.253260.382698.792649.139254.213884.718372.775.6361910.88904.931521.611OpenBenchmarking.org

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya5.12 schedLinux 5.110.15980.31960.47940.63920.799SE +/- 0.03, N = 15SE +/- 0.02, N = 120.710.671. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets5.12 schedLinux 5.110.21150.4230.63450.8461.0575SE +/- 0.02, N = 15SE +/- 0.01, N = 150.940.901. (CXX) g++ options: -O3 -pthread

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 75.12 schedLinux 5.1150100150200250SE +/- 0.64, N = 3SE +/- 0.69, N = 3215.28214.941. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding5.12 schedLinux 5.1130060090012001500SE +/- 2.63, N = 3SE +/- 1.91, N = 31343.891344.591. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding5.12 schedLinux 5.112004006008001000SE +/- 2.00, N = 3SE +/- 1.97, N = 3784.78785.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare5.12 schedLinux 5.110.28490.56980.85471.13961.4245SE +/- 0.002, N = 3SE +/- 0.002, N = 31.2661.2641. (CXX) g++ options: -O3 -pthread

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C5.12 schedLinux 5.115K10K15K20K25KSE +/- 23.23, N = 3SE +/- 23.90, N = 324051.4324129.921. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 75.12 schedLinux 5.11306090120150SE +/- 0.55, N = 3SE +/- 1.18, N = 3116.60116.341. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Tesseract

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterTesseract 2014-05-12Resolution: 3840 x 21605.12 schedLinux 5.1190180270360450SE +/- 3.88, N = 15SE +/- 3.68, N = 6394.38398.50

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M5.12 schedLinux 5.1120406080100SE +/- 0.14, N = 3SE +/- 0.07, N = 397.4997.641. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit5.12 schedLinux 5.11306090120150SE +/- 1.09, N = 3SE +/- 0.66, N = 3122.06121.66MIN: 86.2 / MAX: 274.09MIN: 87.02 / MAX: 270.791. (CC) gcc options: -pthread

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmark5.12 schedLinux 5.1160120180240300293292MIN: 1 / MAX: 1137MIN: 1 / MAX: 1136

Warsow

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 3840 x 21605.12 schedLinux 5.1190180270360450SE +/- 0.38, N = 3SE +/- 0.70, N = 3430.4431.7

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile5.12 schedLinux 5.1120406080100SE +/- 0.12, N = 3SE +/- 0.18, N = 379.3178.96

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C5.12 schedLinux 5.116K12K18K24K30KSE +/- 19.13, N = 3SE +/- 8.24, N = 328209.4528328.861. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU5.12 schedLinux 5.115K10K15K20K25KSE +/- 221.33, N = 3SE +/- 99.08, N = 32162621506

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2Image5.12 schedLinux 5.116K12K18K24K30KSE +/- 392.26, N = 3SE +/- 342.19, N = 327567.0727781.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding5.12 schedLinux 5.1114002800420056007000SE +/- 0.00, N = 3SE +/- 56.07, N = 36642.936672.021. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding5.12 schedLinux 5.1115003000450060007500SE +/- 79.27, N = 3SE +/- 77.23, N = 36846.936758.721. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 9.1Time To Compile5.12 schedLinux 5.111428425670SE +/- 0.31, N = 3SE +/- 0.23, N = 362.2362.44

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom5.12 schedLinux 5.110.93131.86262.79393.72524.6565SE +/- 0.016, N = 3SE +/- 0.017, N = 34.1224.139

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar5.12 schedLinux 5.11246810SE +/- 0.023, N = 3SE +/- 0.011, N = 38.7178.682

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms5.12 schedLinux 5.110.24320.48640.72960.97281.216SE +/- 0.00363, N = 3SE +/- 0.00393, N = 31.081001.08110

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing5.12 schedLinux 5.11400800120016002000SE +/- 6.56, N = 3SE +/- 4.37, N = 3187618921. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate5.12 schedLinux 5.112004006008001000SE +/- 5.86, N = 3SE +/- 2.91, N = 3105310681. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D5.12 schedLinux 5.11140280420560700SE +/- 0.65, N = 3SE +/- 2.89, N = 3647.67646.031. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time5.12 schedLinux 5.119M18M27M36M45MSE +/- 444782.82, N = 5SE +/- 412700.59, N = 343554003431336451. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

RawTherapee

Total Benchmark Time

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time5.12 schedLinux 5.111020304050SE +/- 0.22, N = 3SE +/- 0.26, N = 345.8445.761. RawTherapee, version 5.8, command line.

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP5.12 schedLinux 5.119K18K27K36K45KSE +/- 34.33, N = 3SE +/- 31.83, N = 339641.3039971.761. (CXX) g++ options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B5.12 schedLinux 5.112K4K6K8K10KSE +/- 21.12, N = 3SE +/- 10.12, N = 37883.487886.191. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compile5.12 schedLinux 5.111020304050SE +/- 0.33, N = 3SE +/- 0.33, N = 345.0345.60

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All5.12 schedLinux 5.114080120160200SE +/- 0.33, N = 3SE +/- 0.31, N = 3202.61202.60

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C5.12 schedLinux 5.113K6K9K12K15KSE +/- 4.72, N = 3SE +/- 6.14, N = 312122.5112182.061. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP5.12 schedLinux 5.116K12K18K24K30KSE +/- 33.25, N = 3SE +/- 288.55, N = 327208.0827369.521. (CXX) g++ options: -O3 -march=native -fopenmp

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solve5.12 schedLinux 5.11714212835SE +/- 0.03, N = 3SE +/- 0.02, N = 330.8630.811. (CXX) g++ options: -fopenmp -O2 -march=native

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O5.12 schedLinux 5.11510152025SE +/- 0.24, N = 5SE +/- 0.10, N = 322.6122.351. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 21605.12 schedLinux 5.1113002600390052006500SE +/- 3.70, N = 3SE +/- 1.21, N = 36291.436294.77

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 21605.12 schedLinux 5.111428425670SE +/- 0.04, N = 3SE +/- 0.01, N = 362.7562.78

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 10805.12 schedLinux 5.1114002800420056007000SE +/- 14.64, N = 3SE +/- 22.52, N = 36645.766620.18

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 10805.12 schedLinux 5.111530456075SE +/- 0.15, N = 3SE +/- 0.22, N = 366.2966.03

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C5.12 schedLinux 5.1115003000450060007500SE +/- 11.23, N = 3SE +/- 60.15, N = 36957.057018.381. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT Mapping5.12 schedLinux 5.112004006008001000SE +/- 2.37, N = 3SE +/- 6.01, N = 3888.36882.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC25.12 schedLinux 5.1150100150200250SE +/- 1.05, N = 3SE +/- 3.17, N = 3235.61241.111. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow Mapping5.12 schedLinux 5.112004006008001000SE +/- 2.77, N = 3SE +/- 9.06, N = 31046.991043.961. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup5.12 schedLinux 5.11510152025SE +/- 0.07, N = 3SE +/- 0.09, N = 321.221.21. (CC) gcc options: -fopenmp -O3 -lm

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 21605.12 schedLinux 5.119001800270036004500SE +/- 4.11, N = 3SE +/- 28.98, N = 124260.104234.71

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 21605.12 schedLinux 5.1160120180240300SE +/- 0.26, N = 3SE +/- 1.81, N = 12266.26264.67

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP5.12 schedLinux 5.1150100150200250SE +/- 0.42, N = 3SE +/- 1.25, N = 3217.71216.931. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C5.12 schedLinux 5.112K4K6K8K10KSE +/- 6.34, N = 3SE +/- 3.64, N = 39930.819935.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10805.12 schedLinux 5.112K4K6K8K10KSE +/- 229.82, N = 12SE +/- 31.61, N = 37416.997823.40

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 10805.12 schedLinux 5.11110220330440550SE +/- 14.36, N = 12SE +/- 1.98, N = 3463.56488.96

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K5.12 schedLinux 5.1150100150200250SE +/- 0.13, N = 3SE +/- 0.34, N = 3241.18240.77MIN: 181.33 / MAX: 249.24MIN: 181.75 / MAX: 249.171. (CC) gcc options: -pthread

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Euclidean Cluster5.12 schedLinux 5.1130060090012001500SE +/- 6.52, N = 3SE +/- 3.74, N = 31496.141493.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: Memorial5.12 schedLinux 5.1148121620SE +/- 0.02, N = 3SE +/- 0.02, N = 314.5214.57

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p5.12 schedLinux 5.112004006008001000SE +/- 5.53, N = 3SE +/- 7.91, N = 3838.25837.08MIN: 588.61 / MAX: 1047.17MIN: 547.13 / MAX: 1054.471. (CC) gcc options: -pthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding5.12 schedLinux 5.117001400210028003500SE +/- 13.36, N = 3SE +/- 10.36, N = 73260.383252.891. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding5.12 schedLinux 5.116001200180024003000SE +/- 18.11, N = 3SE +/- 23.99, N = 72698.792722.081. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 21605.12 schedLinux 5.116001200180024003000SE +/- 0.80, N = 3SE +/- 0.59, N = 32649.142653.88

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 21605.12 schedLinux 5.1160120180240300SE +/- 0.08, N = 3SE +/- 0.06, N = 3254.21254.66

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10805.12 schedLinux 5.118001600240032004000SE +/- 1.80, N = 3SE +/- 1.65, N = 33884.723896.31

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10805.12 schedLinux 5.1180160240320400SE +/- 0.17, N = 3SE +/- 0.16, N = 3372.77373.88

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Time5.12 schedLinux 5.111.26812.53623.80435.07246.3405SE +/- 0.003, N = 3SE +/- 0.003, N = 35.6365.6221. (CC) gcc options: -static -fopenmp -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C5.12 schedLinux 5.11400800120016002000SE +/- 5.65, N = 3SE +/- 4.39, N = 31910.881917.441. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p5.12 schedLinux 5.112004006008001000SE +/- 0.24, N = 3SE +/- 4.97, N = 3904.93911.62MIN: 648.7 / MAX: 987.05MIN: 618.67 / MAX: 1003.61. (CC) gcc options: -pthread

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT15.12 schedLinux 5.1130060090012001500SE +/- 4.22, N = 3SE +/- 1.30, N = 31521.611533.221. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread


Phoronix Test Suite v10.8.5