Linux 5.12 Scheduler

AMD Ryzen 9 5950X 16-Core testing with an ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102172-PTS-LINUX51261&grr.

Linux 5.12 Scheduler: System Details

Configurations: Linux 5.11, 5.12 sched

Processor: AMD Ryzen 9 5950X 16-Core @ 6.92GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 2000GB Corsair Force MP600
Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: ASUS MG28U
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.10
Kernel (Linux 5.11): 5.11.0-051100-generic (x86_64)
Kernel (5.12 sched): 5.11.0-sched (x86_64)
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)
Vulkan: 1.2.145
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0xa201009

Python Details
- Python 3.8.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Linux 5.12 Scheduler: Result Overview

Test | Linux 5.11 | 5.12 sched
simdjson: Kostya | 0.67 | 0.71
simdjson: PartialTweets | 0.90 | 0.94
webp2: Quality 95, Compression Effort 7 | 214.943 | 215.277
askap: tConvolve MT - Degridding | 1344.59 | 1343.89
askap: tConvolve MT - Gridding | 785.016 | 784.775
gromacs: water_GMX50_bare | 1.264 | 1.266
npb: BT.C | 24129.92 | 24051.43
webp2: Quality 75, Compression Effort 7 | 116.335 | 116.599
tesseract: 3840 x 2160 | 398.4990 | 394.3750
openfoam: Motorbike 30M | 97.64 | 97.49
dav1d: Chimera 1080p 10-bit | 121.66 | 122.06
openvkl: vklBenchmark | 292 | 293
warsow: 3840 x 2160 | 431.7 | 430.4
build-godot: Time To Compile | 78.962 | 79.313
npb: LU.C | 28328.86 | 28209.45
v-ray: CPU | 21506 | 21626
daphne: OpenMP - Points2Image | 27781.126423097 | 27567.067160148
askap: tConvolve MPI - Gridding | 6672.02 | 6642.93
askap: tConvolve MPI - Degridding | 6758.72 | 6846.93
build-gdb: Time To Compile | 62.436 | 62.234
indigobench: CPU - Bedroom | 4.139 | 4.122
indigobench: CPU - Supercar | 8.682 | 8.717
namd: ATPase Simulation - 327,506 Atoms | 1.08110 | 1.08100
graphics-magick: Resizing | 1892 | 1876
graphics-magick: Rotate | 1068 | 1053
npb: IS.D | 646.03 | 647.67
stockfish: Total Time | 43133645 | 43554003
rawtherapee: Total Benchmark Time | 45.764 | 45.837
financebench: Bonds OpenMP | 39971.764323 | 39641.304688
npb: SP.B | 7886.19 | 7883.48
build-linux-kernel: Time To Compile | 45.598 | 45.028
jpegxl-decode: All | 202.60 | 202.61
npb: FT.C | 12182.06 | 12122.51
financebench: Repo OpenMP | 27369.523437 | 27208.082682
m-queens: Time To Solve | 30.814 | 30.855
qmcpack: simple-H2O | 22.350 | 22.608
paraview: Many Spheres - 3840 x 2160 (MiPolys / Sec) | 6294.768 | 6291.432
paraview: Many Spheres - 3840 x 2160 (Frames / Sec) | 62.78 | 62.75
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec) | 6620.179 | 6645.758
paraview: Many Spheres - 1920 x 1080 (Frames / Sec) | 66.03 | 66.29
npb: CG.C | 7018.38 | 6957.05
daphne: OpenMP - NDT Mapping | 882.03 | 888.36
etcpak: ETC2 | 241.114 | 235.614
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 1043.96 | 1046.99
clomp: Static OMP Speedup | 21.2 | 21.2
paraview: Wavelet Volume - 3840 x 2160 (MiVoxels / Sec) | 4234.713 | 4260.097
paraview: Wavelet Volume - 3840 x 2160 (Frames / Sec) | 264.67 | 266.26
askap: Hogbom Clean OpenMP | 216.934 | 217.709
npb: MG.C | 9935.40 | 9930.81
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec) | 7823.400 | 7416.986
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec) | 488.96 | 463.56
dav1d: Summer Nature 4K | 240.77 | 241.18
daphne: OpenMP - Euclidean Cluster | 1493.34 | 1496.14
oidn: Memorial | 14.57 | 14.52
dav1d: Chimera 1080p | 837.08 | 838.25
askap: tConvolve OpenMP - Degridding | 3252.89 | 3260.38
askap: tConvolve OpenMP - Gridding | 2722.08 | 2698.79
paraview: Wavelet Contour - 3840 x 2160 (MiPolys / Sec) | 2653.882 | 2649.139
paraview: Wavelet Contour - 3840 x 2160 (Frames / Sec) | 254.66 | 254.21
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec) | 3896.310 | 3884.718
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec) | 373.88 | 372.77
n-queens: Elapsed Time | 5.622 | 5.636
npb: EP.C | 1917.44 | 1910.88
dav1d: Summer Nature 1080p | 911.62 | 904.93
etcpak: DXT1 | 1533.222 | 1521.611
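Because some results are "More Is Better" and others "Fewer Is Better", comparing the two kernels by eye is error-prone. The following is a minimal Python sketch of one way to express each pair as a signed percent change, where a positive number means the 5.12 sched run did better. The helper name `relative_change` and the three sample rows are illustrative choices; the numeric values are copied from this result file, and the sketch is not part of the Phoronix Test Suite.

```python
def relative_change(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Percent change of candidate vs. baseline, signed so that positive means candidate is better."""
    change = (candidate - baseline) / baseline * 100.0
    # For "Fewer Is Better" metrics (e.g. seconds), a decrease is an improvement, so flip the sign.
    return change if higher_is_better else -change


# (test, Linux 5.11, 5.12 sched, higher_is_better) taken from the results in this file
samples = [
    ("simdjson: Kostya (GB/s)", 0.67, 0.71, True),
    ("build-linux-kernel: Time To Compile (Seconds)", 45.598, 45.028, False),
    ("paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec)", 7823.400, 7416.986, True),
]

for name, linux_511, sched_512, higher_is_better in samples:
    pct = relative_change(linux_511, sched_512, higher_is_better)
    print(f"{name}: {pct:+.2f}%")
```

A percent change alone does not say whether a difference is meaningful; the per-test SE and N values listed with each result should be read alongside it.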

simdjson

Throughput Test: Kostya

simdjson 0.7.1 (GB/s, More Is Better)
Linux 5.11: 0.67 (SE +/- 0.02, N = 12)
5.12 sched: 0.71 (SE +/- 0.03, N = 15)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.7.1 (GB/s, More Is Better)
Linux 5.11: 0.90 (SE +/- 0.01, N = 15)
5.12 sched: 0.94 (SE +/- 0.02, N = 15)
1. (CXX) g++ options: -O3 -pthread

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Linux 5.11: 214.94 (SE +/- 0.69, N = 3)
5.12 sched: 215.28 (SE +/- 0.64, N = 3)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ASKAP

Test: tConvolve MT - Degridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
Linux 5.11: 1344.59 (SE +/- 1.91, N = 3)
5.12 sched: 1343.89 (SE +/- 2.63, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
Linux 5.11: 785.02 (SE +/- 1.97, N = 3)
5.12 sched: 784.78 (SE +/- 2.00, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Input: water_GMX50_bare

GROMACS 2021 (Ns Per Day, More Is Better)
Linux 5.11: 1.264 (SE +/- 0.002, N = 3)
5.12 sched: 1.266 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -O3 -pthread

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 24129.92 (SE +/- 23.90, N = 3)
5.12 sched: 24051.43 (SE +/- 23.23, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
Linux 5.11: 116.34 (SE +/- 1.18, N = 3)
5.12 sched: 116.60 (SE +/- 0.55, N = 3)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Tesseract

Resolution: 3840 x 2160

Tesseract 2014-05-12 (Frames Per Second, More Is Better)
Linux 5.11: 398.50 (SE +/- 3.68, N = 6)
5.12 sched: 394.38 (SE +/- 3.88, N = 15)

OpenFOAM

Input: Motorbike 30M

OpenFOAM 8 (Seconds, Fewer Is Better)
Linux 5.11: 97.64 (SE +/- 0.07, N = 3)
5.12 sched: 97.49 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.1 (FPS, More Is Better)
Linux 5.11: 121.66 (SE +/- 0.66, N = 3; MIN: 87.02 / MAX: 270.79)
5.12 sched: 122.06 (SE +/- 1.09, N = 3; MIN: 86.2 / MAX: 274.09)
1. (CC) gcc options: -pthread

OpenVKL

Benchmark: vklBenchmark

OpenVKL 0.9 (Items / Sec, More Is Better)
Linux 5.11: 292 (MIN: 1 / MAX: 1136)
5.12 sched: 293 (MIN: 1 / MAX: 1137)

Warsow

Resolution: 3840 x 2160

Warsow 2.5 Beta (Frames Per Second, More Is Better)
Linux 5.11: 431.7 (SE +/- 0.70, N = 3)
5.12 sched: 430.4 (SE +/- 0.38, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
Linux 5.11: 78.96 (SE +/- 0.18, N = 3)
5.12 sched: 79.31 (SE +/- 0.12, N = 3)

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 28328.86 (SE +/- 8.24, N = 3)
5.12 sched: 28209.45 (SE +/- 19.13, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Chaos Group V-RAY

Mode: CPU

Chaos Group V-RAY 5 (vsamples, More Is Better)
Linux 5.11: 21506 (SE +/- 99.08, N = 3)
5.12 sched: 21626 (SE +/- 221.33, N = 3)

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

Darmstadt Automotive Parallel Heterogeneous Suite (Test Cases Per Minute, More Is Better)
Linux 5.11: 27781.13 (SE +/- 342.19, N = 3)
5.12 sched: 27567.07 (SE +/- 392.26, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 (Mpix/sec, More Is Better)
Linux 5.11: 6672.02 (SE +/- 56.07, N = 3)
5.12 sched: 6642.93 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 (Mpix/sec, More Is Better)
Linux 5.11: 6758.72 (SE +/- 77.23, N = 3)
5.12 sched: 6846.93 (SE +/- 79.27, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Timed GDB GNU Debugger Compilation

Time To Compile

Timed GDB GNU Debugger Compilation 9.1 (Seconds, Fewer Is Better)
Linux 5.11: 62.44 (SE +/- 0.23, N = 3)
5.12 sched: 62.23 (SE +/- 0.31, N = 3)

IndigoBench

Acceleration: CPU - Scene: Bedroom

IndigoBench 4.4 (M samples/s, More Is Better)
Linux 5.11: 4.139 (SE +/- 0.017, N = 3)
5.12 sched: 4.122 (SE +/- 0.016, N = 3)

IndigoBench

Acceleration: CPU - Scene: Supercar

IndigoBench 4.4 (M samples/s, More Is Better)
Linux 5.11: 8.682 (SE +/- 0.011, N = 3)
5.12 sched: 8.717 (SE +/- 0.023, N = 3)

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
Linux 5.11: 1.08110 (SE +/- 0.00393, N = 3)
5.12 sched: 1.08100 (SE +/- 0.00363, N = 3)

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.11: 1892 (SE +/- 4.37, N = 3)
5.12 sched: 1876 (SE +/- 6.56, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Linux 5.11: 1068 (SE +/- 2.91, N = 3)
5.12 sched: 1053 (SE +/- 5.86, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 646.03 (SE +/- 2.89, N = 3)
5.12 sched: 647.67 (SE +/- 0.65, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Stockfish

Total Time

Stockfish 12 (Nodes Per Second, More Is Better)
Linux 5.11: 43133645 (SE +/- 412700.59, N = 3)
5.12 sched: 43554003 (SE +/- 444782.82, N = 5)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

RawTherapee

Total Benchmark Time

RawTherapee (Seconds, Fewer Is Better)
Linux 5.11: 45.76 (SE +/- 0.26, N = 3)
5.12 sched: 45.84 (SE +/- 0.22, N = 3)
1. RawTherapee, version 5.8, command line.

FinanceBench

Benchmark: Bonds OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Linux 5.11: 39971.76 (SE +/- 31.83, N = 3)
5.12 sched: 39641.30 (SE +/- 34.33, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 7886.19 (SE +/- 10.12, N = 3)
5.12 sched: 7883.48 (SE +/- 21.12, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.4 (Seconds, Fewer Is Better)
Linux 5.11: 45.60 (SE +/- 0.33, N = 3)
5.12 sched: 45.03 (SE +/- 0.33, N = 3)

JPEG XL Decoding

CPU Threads: All

JPEG XL Decoding 0.3.1 (MP/s, More Is Better)
Linux 5.11: 202.60 (SE +/- 0.31, N = 3)
5.12 sched: 202.61 (SE +/- 0.33, N = 3)

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 12182.06 (SE +/- 6.14, N = 3)
5.12 sched: 12122.51 (SE +/- 4.72, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

FinanceBench

Benchmark: Repo OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Linux 5.11: 27369.52 (SE +/- 288.55, N = 3)
5.12 sched: 27208.08 (SE +/- 33.25, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
Linux 5.11: 30.81 (SE +/- 0.02, N = 3)
5.12 sched: 30.86 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native

QMCPACK

Input: simple-H2O

QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
Linux 5.11: 22.35 (SE +/- 0.10, N = 3)
5.12 sched: 22.61 (SE +/- 0.24, N = 5)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

ParaView 5.9 (MiPolys / Sec, More Is Better)
Linux 5.11: 6294.77 (SE +/- 1.21, N = 3)
5.12 sched: 6291.43 (SE +/- 3.70, N = 3)

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 62.78 (SE +/- 0.01, N = 3)
5.12 sched: 62.75 (SE +/- 0.04, N = 3)

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

ParaView 5.9 (MiPolys / Sec, More Is Better)
Linux 5.11: 6620.18 (SE +/- 22.52, N = 3)
5.12 sched: 6645.76 (SE +/- 14.64, N = 3)

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 66.03 (SE +/- 0.22, N = 3)
5.12 sched: 66.29 (SE +/- 0.15, N = 3)

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 7018.38 (SE +/- 60.15, N = 3)
5.12 sched: 6957.05 (SE +/- 11.23, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

Darmstadt Automotive Parallel Heterogeneous Suite (Test Cases Per Minute, More Is Better)
Linux 5.11: 882.03 (SE +/- 6.01, N = 3)
5.12 sched: 888.36 (SE +/- 2.37, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Etcpak

Configuration: ETC2

Etcpak 0.7 (Mpx/s, More Is Better)
Linux 5.11: 241.11 (SE +/- 3.17, N = 3)
5.12 sched: 235.61 (SE +/- 1.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

TTSIOD 3D Renderer 2.3b (FPS, More Is Better)
Linux 5.11: 1043.96 (SE +/- 9.06, N = 3)
5.12 sched: 1046.99 (SE +/- 2.77, N = 3)
1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
Linux 5.11: 21.2 (SE +/- 0.09, N = 3)
5.12 sched: 21.2 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

ParaView 5.9 (MiVoxels / Sec, More Is Better)
Linux 5.11: 4234.71 (SE +/- 28.98, N = 12)
5.12 sched: 4260.10 (SE +/- 4.11, N = 3)

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 264.67 (SE +/- 1.81, N = 12)
5.12 sched: 266.26 (SE +/- 0.26, N = 3)

ASKAP

Test: Hogbom Clean OpenMP

ASKAP 1.0 (Iterations Per Second, More Is Better)
Linux 5.11: 216.93 (SE +/- 1.25, N = 3)
5.12 sched: 217.71 (SE +/- 0.42, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 9935.40 (SE +/- 3.64, N = 3)
5.12 sched: 9930.81 (SE +/- 6.34, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

ParaView 5.9 (MiVoxels / Sec, More Is Better)
Linux 5.11: 7823.40 (SE +/- 31.61, N = 3)
5.12 sched: 7416.99 (SE +/- 229.82, N = 12)

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 488.96 (SE +/- 1.98, N = 3)
5.12 sched: 463.56 (SE +/- 14.36, N = 12)

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.1 (FPS, More Is Better)
Linux 5.11: 240.77 (SE +/- 0.34, N = 3; MIN: 181.75 / MAX: 249.17)
5.12 sched: 241.18 (SE +/- 0.13, N = 3; MIN: 181.33 / MAX: 249.24)
1. (CC) gcc options: -pthread

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

Darmstadt Automotive Parallel Heterogeneous Suite (Test Cases Per Minute, More Is Better)
Linux 5.11: 1493.34 (SE +/- 3.74, N = 3)
5.12 sched: 1496.14 (SE +/- 6.52, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Intel Open Image Denoise

Scene: Memorial

Intel Open Image Denoise 1.2.0 (Images / Sec, More Is Better)
Linux 5.11: 14.57 (SE +/- 0.02, N = 3)
5.12 sched: 14.52 (SE +/- 0.02, N = 3)

dav1d

Video Input: Chimera 1080p

dav1d 0.8.1 (FPS, More Is Better)
Linux 5.11: 837.08 (SE +/- 7.91, N = 3; MIN: 547.13 / MAX: 1054.47)
5.12 sched: 838.25 (SE +/- 5.53, N = 3; MIN: 588.61 / MAX: 1047.17)
1. (CC) gcc options: -pthread

ASKAP

Test: tConvolve OpenMP - Degridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
Linux 5.11: 3252.89 (SE +/- 10.36, N = 7)
5.12 sched: 3260.38 (SE +/- 13.36, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
Linux 5.11: 2722.08 (SE +/- 23.99, N = 7)
5.12 sched: 2698.79 (SE +/- 18.11, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

ParaView 5.9 (MiPolys / Sec, More Is Better)
Linux 5.11: 2653.88 (SE +/- 0.59, N = 3)
5.12 sched: 2649.14 (SE +/- 0.80, N = 3)

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 254.66 (SE +/- 0.06, N = 3)
5.12 sched: 254.21 (SE +/- 0.08, N = 3)

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

ParaView 5.9 (MiPolys / Sec, More Is Better)
Linux 5.11: 3896.31 (SE +/- 1.65, N = 3)
5.12 sched: 3884.72 (SE +/- 1.80, N = 3)

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

ParaView 5.9 (Frames / Sec, More Is Better)
Linux 5.11: 373.88 (SE +/- 0.16, N = 3)
5.12 sched: 372.77 (SE +/- 0.17, N = 3)

N-Queens

Elapsed Time

N-Queens 1.0 (Seconds, Fewer Is Better)
Linux 5.11: 5.622 (SE +/- 0.003, N = 3)
5.12 sched: 5.636 (SE +/- 0.003, N = 3)
1. (CC) gcc options: -static -fopenmp -O3 -march=native

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Linux 5.11: 1917.44 (SE +/- 4.39, N = 3)
5.12 sched: 1910.88 (SE +/- 5.65, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.1 (FPS, More Is Better)
Linux 5.11: 911.62 (SE +/- 4.97, N = 3; MIN: 618.67 / MAX: 1003.6)
5.12 sched: 904.93 (SE +/- 0.24, N = 3; MIN: 648.7 / MAX: 987.05)
1. (CC) gcc options: -pthread

Etcpak

Configuration: DXT1

Etcpak 0.7 (Mpx/s, More Is Better)
Linux 5.11: 1533.22 (SE +/- 1.30, N = 3)
5.12 sched: 1521.61 (SE +/- 4.22, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread


Phoronix Test Suite v10.8.4