MGLRU Kernel Tests

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2201118-NE-MGLRUKERN50
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
Timed Code Compilation 4 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 13 Tests
Creator Workloads 6 Tests
Fortran Tests 5 Tests
Game Development 2 Tests
Go Language Tests 2 Tests
HPC - High Performance Computing 11 Tests
Common Kernel Benchmarks 2 Tests
LAPACK (Linear Algebra Pack) Tests 2 Tests
Linear Algebra 2 Tests
Machine Learning 2 Tests
Molecular Dynamics 3 Tests
MPI Benchmarks 3 Tests
Multi-Core 16 Tests
NVIDIA GPU Compute 3 Tests
Intel oneAPI 3 Tests
OpenMPI Tests 7 Tests
Programmer / Developer System Benchmarks 6 Tests
Python Tests 5 Tests
Renderers 2 Tests
Scientific Computing 7 Tests
Server 3 Tests
Server CPU Tests 8 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
MGLRU Enabled
January 09 2022
  9 Hours, 21 Minutes
MGLRU Disabled
January 10 2022
  8 Hours, 7 Minutes
Invert Hiding All Results Option
  8 Hours, 44 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


MGLRU Kernel Tests - Phoronix Test Suite

MGLRU Kernel Tests

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2201118-NE-MGLRUKERN50&grw&sor.

MGLRU Kernel TestsProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionMGLRU EnabledMGLRU Disabled2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse128GB280GB INTEL SSDPE21D280GAASPEEDVE2282 x Intel 10G X550TUbuntu 21.105.16.0-rc8-mglru-pts (x86_64)GNOME Shell 40.5X Server1.1.182GCC 11.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 Java Details- MGLRU Enabled: OpenJDK Runtime Environment (build 11.0.12+7-Ubuntu-0ubuntu3)- MGLRU Disabled: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.10)Python Details- Python 3.9.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

MGLRU Kernel Testsjava-gradle-perf: Reactorxmrig: Monero - 1Mxmrig: Wownero - 1Mplaidml: No - Inference - VGG16 - CPUplaidml: No - Inference - VGG19 - CPUplaidml: No - Inference - ResNet 50 - CPUonnx: yolov4 - CPUonnx: fcn-resnet101-11 - CPUonnx: shufflenet-v2-10 - CPUonnx: super-resolution-10 - CPUnpb: EP.Dnpb: MG.Crodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusternamd: ATPase Simulation - 327,506 Atomsmt-dgemm: Sustained Floating-Point Rateamg: nwchem: C240 Buckyballmocassin: Dust 2D tau100.0incompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 193 Cells Per Directionqe: AUSURF112stockfish: Total Timecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingbuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-linux-kernel: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigsvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Kbuild-godot: Time To Compileembree: Pathtracer - Crownembree: Pathtracer ISPC - Crownopenvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalarluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUospray: San Miguel - SciVisospray: San Miguel - Path Tracerbuild-mesa: Time To Compileliquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 57nginx: 500nginx: 1000apache: 500apache: 1000pgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 500 - Read Onlypgbench: 100 - 500 - Read Only - Average LatencyMGLRU EnabledMGLRU Disabled374.74940029.953546.028.0924.194.49212180555359238589.0374668.5233.040104.60146.1108.9699.7290.2715828.60169812491416672161.2230463.57981413.3743223330.62249652058409975594185110.353196.91919.82820.453157.8444.47652.65557.65166.204959.859417511810.365.3583.336.5221.1445100733333551176666789324.7491024.3176184.7394120.0019351500.12919222850.260380.44140078.353670.326.2823.534.40230177544767118547.9174790.1233.295105.13747.0779.2889.8030.2725728.82657512446050002154.7230463.23082513.3808743330.96250125465397755594384110.285196.72919.80420.618159.2804.53052.28858.11266.228760.101117611810.285.2583.336.6521.1305093333333552620000089985.5191601.6580632.5987890.8620004450.12519366630.258OpenBenchmarking.org

Java Gradle Build

Gradle Build: Reactor

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: ReactorMGLRU EnabledMGLRU Disabled80160240320400SE +/- 5.21, N = 9SE +/- 5.30, N = 3374.75380.44

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MMGLRU DisabledMGLRU Enabled9K18K27K36K45KSE +/- 67.91, N = 3SE +/- 178.74, N = 340078.340029.91. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MMGLRU DisabledMGLRU Enabled11K22K33K44K55KSE +/- 176.64, N = 3SE +/- 192.97, N = 353670.353546.01. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUMGLRU EnabledMGLRU Disabled714212835SE +/- 0.29, N = 15SE +/- 0.25, N = 1528.0926.28

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUMGLRU EnabledMGLRU Disabled612182430SE +/- 0.23, N = 3SE +/- 0.23, N = 1524.1923.53

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUMGLRU EnabledMGLRU Disabled1.01032.02063.03094.04125.0515SE +/- 0.04, N = 3SE +/- 0.02, N = 34.494.40

ONNX Runtime

Model: yolov4 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: yolov4 - Device: CPUMGLRU DisabledMGLRU Enabled50100150200250SE +/- 0.87, N = 3SE +/- 1.95, N = 122302121. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: fcn-resnet101-11 - Device: CPUMGLRU EnabledMGLRU Disabled4080120160200SE +/- 4.29, N = 12SE +/- 1.80, N = 31801771. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: shufflenet-v2-10 - Device: CPUMGLRU EnabledMGLRU Disabled12002400360048006000SE +/- 54.18, N = 3SE +/- 71.82, N = 3555354471. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: super-resolution-10 - Device: CPUMGLRU DisabledMGLRU Enabled14002800420056007000SE +/- 5.04, N = 3SE +/- 176.40, N = 12671159231. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DMGLRU EnabledMGLRU Disabled2K4K6K8K10KSE +/- 4.21, N = 3SE +/- 47.65, N = 38589.038547.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CMGLRU DisabledMGLRU Enabled16K32K48K64K80KSE +/- 406.38, N = 3SE +/- 370.74, N = 374790.1274668.521. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDMGLRU EnabledMGLRU Disabled816243240SE +/- 0.15, N = 3SE +/- 0.17, N = 333.0433.301. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DMGLRU EnabledMGLRU Disabled20406080100SE +/- 0.54, N = 3SE +/- 0.89, N = 3104.60105.141. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteMGLRU EnabledMGLRU Disabled1122334455SE +/- 0.32, N = 13SE +/- 0.20, N = 346.1147.081. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverMGLRU EnabledMGLRU Disabled3691215SE +/- 0.083, N = 6SE +/- 0.124, N = 128.9699.2881. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterMGLRU EnabledMGLRU Disabled3691215SE +/- 0.134, N = 15SE +/- 0.201, N = 149.7299.8031. (CXX) g++ options: -O2 -lOpenCL

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsMGLRU EnabledMGLRU Disabled0.06130.12260.18390.24520.3065SE +/- 0.00314, N = 3SE +/- 0.00229, N = 30.271580.27257

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateMGLRU DisabledMGLRU Enabled714212835SE +/- 0.17, N = 3SE +/- 0.17, N = 328.8328.601. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2MGLRU EnabledMGLRU Disabled300M600M900M1200M1500MSE +/- 738959.93, N = 3SE +/- 2472880.37, N = 3124914166712446050001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballMGLRU DisabledMGLRU Enabled50010001500200025002154.72161.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0MGLRU EnabledMGLRU Disabled50100150200250SE +/- 0.67, N = 3SE +/- 0.00, N = 32302301. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dMGLRU DisabledMGLRU Enabled100200300400500SE +/- 1.49, N = 3SE +/- 0.82, N = 3463.23463.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionMGLRU EnabledMGLRU Disabled3691215SE +/- 0.03, N = 3SE +/- 0.02, N = 313.3713.381. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF112MGLRU EnabledMGLRU Disabled70140210280350SE +/- 0.20, N = 3SE +/- 0.36, N = 3330.62330.961. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeMGLRU DisabledMGLRU Enabled50M100M150M200M250MSE +/- 2532421.76, N = 6SE +/- 2261974.30, N = 32501254652496520581. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression RatingMGLRU EnabledMGLRU Disabled90K180K270K360K450KSE +/- 5744.62, N = 3SE +/- 1852.98, N = 34099753977551. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression RatingMGLRU DisabledMGLRU Enabled130K260K390K520K650KSE +/- 7344.83, N = 3SE +/- 6316.43, N = 35943845941851. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: NinjaMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.23, N = 3SE +/- 0.14, N = 3110.29110.35

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix MakefilesMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.47, N = 3SE +/- 0.16, N = 3196.73196.92

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.13, N = 14SE +/- 0.14, N = 1319.8019.83

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: defconfigMGLRU EnabledMGLRU Disabled510152025SE +/- 0.07, N = 3SE +/- 0.02, N = 320.4520.62

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: allmodconfigMGLRU EnabledMGLRU Disabled4080120160200SE +/- 0.55, N = 3SE +/- 0.79, N = 3157.84159.28

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KMGLRU DisabledMGLRU Enabled1.01932.03863.05794.07725.0965SE +/- 0.005, N = 3SE +/- 0.008, N = 34.5304.4761. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KMGLRU EnabledMGLRU Disabled1224364860SE +/- 0.18, N = 3SE +/- 0.18, N = 352.6652.291. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileMGLRU EnabledMGLRU Disabled1326395265SE +/- 0.08, N = 3SE +/- 0.12, N = 357.6558.11

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: CrownMGLRU DisabledMGLRU Enabled1530456075SE +/- 0.24, N = 3SE +/- 0.15, N = 366.2366.20MIN: 61.28 / MAX: 74.94MIN: 61.78 / MAX: 73.44

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownMGLRU DisabledMGLRU Enabled1326395265SE +/- 0.18, N = 3SE +/- 0.42, N = 360.1059.86MIN: 56.51 / MAX: 68.5MIN: 55.99 / MAX: 67.12

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPCMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.00, N = 3SE +/- 0.67, N = 3176175MIN: 16 / MAX: 2455MIN: 14 / MAX: 2362

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ScalarMGLRU DisabledMGLRU Enabled306090120150SE +/- 1.00, N = 3SE +/- 0.33, N = 3118118MIN: 11 / MAX: 2528MIN: 11 / MAX: 2529

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUMGLRU EnabledMGLRU Disabled3691215SE +/- 0.07, N = 3SE +/- 0.07, N = 310.3610.28MIN: 9.71 / MAX: 14.1MIN: 9.66 / MAX: 14.1

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUMGLRU EnabledMGLRU Disabled1.20382.40763.61144.81526.019SE +/- 0.09, N = 15SE +/- 0.08, N = 155.355.25MIN: 1.85 / MAX: 7.13MIN: 1.73 / MAX: 7.07

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 383.3383.33MIN: 47.62 / MAX: 100MIN: 35.71 / MAX: 100

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerMGLRU DisabledMGLRU Enabled246810SE +/- 0.08, N = 3SE +/- 0.08, N = 36.656.52MIN: 5.46 / MAX: 7.14MIN: 5.41 / MAX: 7.09

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 321.1321.14

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57MGLRU EnabledMGLRU Disabled1100M2200M3300M4400M5500MSE +/- 11970148.05, N = 3SE +/- 7872808.34, N = 3510073333350933333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57MGLRU DisabledMGLRU Enabled1200M2400M3600M4800M6000MSE +/- 16977730.51, N = 3SE +/- 18720428.53, N = 3552620000055117666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 202.70, N = 3SE +/- 134.98, N = 389985.5189324.741. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 270.06, N = 3SE +/- 265.34, N = 391601.6591024.311. (CC) gcc options: -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 232.53, N = 3SE +/- 976.98, N = 380632.5976184.731. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 908.68, N = 6SE +/- 1001.52, N = 494120.0087890.861. (CC) gcc options: -shared -fPIC -O2

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 19866.33, N = 3SE +/- 5692.58, N = 3200044519351501. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.0290.0580.0870.1160.145SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1250.1291. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 21901.77, N = 3SE +/- 11345.70, N = 3193666319222851. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.05850.1170.17550.2340.2925SE +/- 0.003, N = 3SE +/- 0.002, N = 30.2580.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm


Phoronix Test Suite v10.8.4