MGLRU Kernel Tests

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2201118-NE-MGLRUKERN50&sro&grt.

MGLRU Kernel TestsProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionMGLRU EnabledMGLRU Disabled2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse128GB280GB INTEL SSDPE21D280GAASPEEDVE2282 x Intel 10G X550TUbuntu 21.105.16.0-rc8-mglru-pts (x86_64)GNOME Shell 40.5X Server1.1.182GCC 11.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 Java Details- MGLRU Enabled: OpenJDK Runtime Environment (build 11.0.12+7-Ubuntu-0ubuntu3)- MGLRU Disabled: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.10)Python Details- Python 3.9.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

MGLRU Kernel Testscompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingmt-dgemm: Sustained Floating-Point Rateamg: apache: 500apache: 1000embree: Pathtracer - Crownembree: Pathtracer ISPC - Crownjava-gradle-perf: Reactorliquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 57luxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUmocassin: Dust 2D tau100.0namd: ATPase Simulation - 327,506 Atomsnpb: EP.Dnpb: MG.Cnginx: 500nginx: 1000nwchem: C240 Buckyballonnx: yolov4 - CPUonnx: fcn-resnet101-11 - CPUonnx: shufflenet-v2-10 - CPUonnx: super-resolution-10 - CPUopenvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalarospray: San Miguel - SciVisospray: San Miguel - Path Tracerplaidml: No - Inference - VGG16 - CPUplaidml: No - Inference - VGG19 - CPUplaidml: No - Inference - ResNet 50 - CPUpgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 500 - Read Onlypgbench: 100 - 500 - Read Only - Average Latencyqe: AUSURF112rodinia: OpenMP LavaMDrodinia: OpenMP HotSpot3Drodinia: OpenMP Leukocyterodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterstockfish: Total Timesvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Kbuild-godot: Time To Compilebuild-linux-kernel: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-mesa: Time To Compileincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 193 Cells Per Directionxmrig: Monero - 1Mxmrig: Wownero - 1MMGLRU EnabledMGLRU Disabled40997559418528.601698124914166776184.7394120.0066.204959.8594374.7495100733333551176666710.365.352300.271588589.0374668.5289324.7491024.312161.22121805553592317511883.336.5228.0924.194.4919351500.12919222850.260330.6233.040104.60146.1108.9699.7292496520584.47652.65557.65119.82820.453157.844110.353196.91921.144463.57981413.374322340029.953546.039775559438428.826575124460500080632.5987890.8666.228760.1011380.4415093333333552620000010.285.252300.272578547.9174790.1289985.5191601.652154.72301775447671117611883.336.6526.2823.534.4020004450.12519366630.258330.9633.295105.13747.0779.2889.8032501254654.53052.28858.11219.80420.618159.280110.285196.72921.130463.23082513.380874340078.353670.3OpenBenchmarking.org

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression RatingMGLRU DisabledMGLRU Enabled90K180K270K360K450KSE +/- 1852.98, N = 3SE +/- 5744.62, N = 33977554099751. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression RatingMGLRU DisabledMGLRU Enabled130K260K390K520K650KSE +/- 7344.83, N = 3SE +/- 6316.43, N = 35943845941851. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateMGLRU DisabledMGLRU Enabled714212835SE +/- 0.17, N = 3SE +/- 0.17, N = 328.8328.601. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2MGLRU DisabledMGLRU Enabled300M600M900M1200M1500MSE +/- 2472880.37, N = 3SE +/- 738959.93, N = 3124460500012491416671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 232.53, N = 3SE +/- 976.98, N = 380632.5976184.731. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 1001.52, N = 4SE +/- 908.68, N = 687890.8694120.001. (CC) gcc options: -shared -fPIC -O2

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: CrownMGLRU DisabledMGLRU Enabled1530456075SE +/- 0.24, N = 3SE +/- 0.15, N = 366.2366.20MIN: 61.28 / MAX: 74.94MIN: 61.78 / MAX: 73.44

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownMGLRU DisabledMGLRU Enabled1326395265SE +/- 0.18, N = 3SE +/- 0.42, N = 360.1059.86MIN: 56.51 / MAX: 68.5MIN: 55.99 / MAX: 67.12

Java Gradle Build

Gradle Build: Reactor

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: ReactorMGLRU DisabledMGLRU Enabled80160240320400SE +/- 5.30, N = 3SE +/- 5.21, N = 9380.44374.75

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57MGLRU DisabledMGLRU Enabled1100M2200M3300M4400M5500MSE +/- 7872808.34, N = 3SE +/- 11970148.05, N = 3509333333351007333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57MGLRU DisabledMGLRU Enabled1200M2400M3600M4800M6000MSE +/- 16977730.51, N = 3SE +/- 18720428.53, N = 3552620000055117666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUMGLRU DisabledMGLRU Enabled3691215SE +/- 0.07, N = 3SE +/- 0.07, N = 310.2810.36MIN: 9.66 / MAX: 14.1MIN: 9.71 / MAX: 14.1

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUMGLRU DisabledMGLRU Enabled1.20382.40763.61144.81526.019SE +/- 0.08, N = 15SE +/- 0.09, N = 155.255.35MIN: 1.73 / MAX: 7.07MIN: 1.85 / MAX: 7.13

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0MGLRU DisabledMGLRU Enabled50100150200250SE +/- 0.00, N = 3SE +/- 0.67, N = 32302301. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsMGLRU DisabledMGLRU Enabled0.06130.12260.18390.24520.3065SE +/- 0.00229, N = 3SE +/- 0.00314, N = 30.272570.27158

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DMGLRU DisabledMGLRU Enabled2K4K6K8K10KSE +/- 47.65, N = 3SE +/- 4.21, N = 38547.918589.031. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CMGLRU DisabledMGLRU Enabled16K32K48K64K80KSE +/- 406.38, N = 3SE +/- 370.74, N = 374790.1274668.521. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 202.70, N = 3SE +/- 134.98, N = 389985.5189324.741. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 270.06, N = 3SE +/- 265.34, N = 391601.6591024.311. (CC) gcc options: -lcrypt -lz -O3 -march=native

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballMGLRU DisabledMGLRU Enabled50010001500200025002154.72161.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

ONNX Runtime

Model: yolov4 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: yolov4 - Device: CPUMGLRU DisabledMGLRU Enabled50100150200250SE +/- 0.87, N = 3SE +/- 1.95, N = 122302121. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: fcn-resnet101-11 - Device: CPUMGLRU DisabledMGLRU Enabled4080120160200SE +/- 1.80, N = 3SE +/- 4.29, N = 121771801. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: shufflenet-v2-10 - Device: CPUMGLRU DisabledMGLRU Enabled12002400360048006000SE +/- 71.82, N = 3SE +/- 54.18, N = 3544755531. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: super-resolution-10 - Device: CPUMGLRU DisabledMGLRU Enabled14002800420056007000SE +/- 5.04, N = 3SE +/- 176.40, N = 12671159231. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPCMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.00, N = 3SE +/- 0.67, N = 3176175MIN: 16 / MAX: 2455MIN: 14 / MAX: 2362

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ScalarMGLRU DisabledMGLRU Enabled306090120150SE +/- 1.00, N = 3SE +/- 0.33, N = 3118118MIN: 11 / MAX: 2528MIN: 11 / MAX: 2529

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 383.3383.33MIN: 47.62 / MAX: 100MIN: 35.71 / MAX: 100

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerMGLRU DisabledMGLRU Enabled246810SE +/- 0.08, N = 3SE +/- 0.08, N = 36.656.52MIN: 5.46 / MAX: 7.14MIN: 5.41 / MAX: 7.09

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUMGLRU DisabledMGLRU Enabled714212835SE +/- 0.25, N = 15SE +/- 0.29, N = 1526.2828.09

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUMGLRU DisabledMGLRU Enabled612182430SE +/- 0.23, N = 15SE +/- 0.23, N = 323.5324.19

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUMGLRU DisabledMGLRU Enabled1.01032.02063.03094.04125.0515SE +/- 0.02, N = 3SE +/- 0.04, N = 34.404.49

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 19866.33, N = 3SE +/- 5692.58, N = 3200044519351501. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.0290.0580.0870.1160.145SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1250.1291. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 21901.77, N = 3SE +/- 11345.70, N = 3193666319222851. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.05850.1170.17550.2340.2925SE +/- 0.003, N = 3SE +/- 0.002, N = 30.2580.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF112MGLRU DisabledMGLRU Enabled70140210280350SE +/- 0.36, N = 3SE +/- 0.20, N = 3330.96330.621. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDMGLRU DisabledMGLRU Enabled816243240SE +/- 0.17, N = 3SE +/- 0.15, N = 333.3033.041. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.89, N = 3SE +/- 0.54, N = 3105.14104.601. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteMGLRU DisabledMGLRU Enabled1122334455SE +/- 0.20, N = 3SE +/- 0.32, N = 1347.0846.111. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverMGLRU DisabledMGLRU Enabled3691215SE +/- 0.124, N = 12SE +/- 0.083, N = 69.2888.9691. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterMGLRU DisabledMGLRU Enabled3691215SE +/- 0.201, N = 14SE +/- 0.134, N = 159.8039.7291. (CXX) g++ options: -O2 -lOpenCL

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeMGLRU DisabledMGLRU Enabled50M100M150M200M250MSE +/- 2532421.76, N = 6SE +/- 2261974.30, N = 32501254652496520581. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KMGLRU DisabledMGLRU Enabled1.01932.03863.05794.07725.0965SE +/- 0.005, N = 3SE +/- 0.008, N = 34.5304.4761. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KMGLRU DisabledMGLRU Enabled1224364860SE +/- 0.18, N = 3SE +/- 0.18, N = 352.2952.661. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileMGLRU DisabledMGLRU Enabled1326395265SE +/- 0.12, N = 3SE +/- 0.08, N = 358.1157.65

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.13, N = 14SE +/- 0.14, N = 1319.8019.83

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: defconfigMGLRU DisabledMGLRU Enabled510152025SE +/- 0.02, N = 3SE +/- 0.07, N = 320.6220.45

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: allmodconfigMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.79, N = 3SE +/- 0.55, N = 3159.28157.84

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: NinjaMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.23, N = 3SE +/- 0.14, N = 3110.29110.35

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix MakefilesMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.47, N = 3SE +/- 0.16, N = 3196.73196.92

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 321.1321.14

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dMGLRU DisabledMGLRU Enabled100200300400500SE +/- 1.49, N = 3SE +/- 0.82, N = 3463.23463.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionMGLRU DisabledMGLRU Enabled3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 313.3813.371. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MMGLRU DisabledMGLRU Enabled9K18K27K36K45KSE +/- 67.91, N = 3SE +/- 178.74, N = 340078.340029.91. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MMGLRU DisabledMGLRU Enabled11K22K33K44K55KSE +/- 176.64, N = 3SE +/- 192.97, N = 353670.353546.01. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc


Phoronix Test Suite v10.8.5