2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2201118-NE-MGLRUKERN50 MGLRU Kernel Tests - Phoronix Test Suite MGLRU Kernel Tests 2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2201118-NE-MGLRUKERN50 .
MGLRU Kernel Tests Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution MGLRU Enabled MGLRU Disabled 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads) Supermicro H11DSi-NT v2.00 (2.1 BIOS) AMD Starship/Matisse 128GB 280GB INTEL SSDPE21D280GA ASPEED VE228 2 x Intel 10G X550T Ubuntu 21.10 5.16.0-rc8-mglru-pts (x86_64) GNOME Shell 40.5 X Server 1.1.182 GCC 11.2.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 Java Details - MGLRU Enabled: OpenJDK Runtime Environment (build 11.0.12+7-Ubuntu-0ubuntu3) - MGLRU Disabled: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.10) Python Details - Python 3.9.7 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
MGLRU Kernel Tests npb: EP.D npb: MG.C rodinia: OpenMP LavaMD rodinia: OpenMP HotSpot3D rodinia: OpenMP Leukocyte rodinia: OpenMP CFD Solver rodinia: OpenMP Streamcluster namd: ATPase Simulation - 327,506 Atoms amg: nwchem: C240 Buckyball incompact3d: X3D-benchmarking input.i3d incompact3d: input.i3d 193 Cells Per Direction mocassin: Dust 2D tau100.0 qe: AUSURF112 xmrig: Monero - 1M xmrig: Wownero - 1M java-gradle-perf: Reactor luxcorerender: DLSC - CPU luxcorerender: Danish Mood - CPU ospray: San Miguel - SciVis ospray: San Miguel - Path Tracer embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K mt-dgemm: Sustained Floating-Point Rate openvkl: vklBenchmark ISPC openvkl: vklBenchmark Scalar compress-7zip: Compression Rating compress-7zip: Decompression Rating stockfish: Total Time build-godot: Time To Compile build-linux-kernel: Time To Compile build-llvm: Ninja build-llvm: Unix Makefiles build-mesa: Time To Compile liquid-dsp: 128 - 256 - 57 liquid-dsp: 256 - 256 - 57 pgbench: 100 - 250 - Read Only pgbench: 100 - 250 - Read Only - Average Latency pgbench: 100 - 500 - Read Only pgbench: 100 - 500 - Read Only - Average Latency plaidml: No - Inference - VGG16 - CPU plaidml: No - Inference - VGG19 - CPU plaidml: No - Inference - ResNet 50 - CPU nginx: 500 nginx: 1000 onnx: yolov4 - CPU onnx: fcn-resnet101-11 - CPU onnx: shufflenet-v2-10 - CPU onnx: super-resolution-10 - CPU apache: 500 apache: 1000 build-linux-kernel: defconfig build-linux-kernel: allmodconfig MGLRU Enabled MGLRU Disabled 8589.03 74668.52 33.040 104.601 46.110 8.969 9.729 0.27158 1249141667 2161.2 463.579814 13.3743223 230 330.62 40029.9 53546.0 374.749 10.36 5.35 83.33 6.52 66.2049 59.8594 4.476 52.655 28.601698 175 118 409975 594185 249652058 57.651 19.828 110.353 196.919 21.144 5100733333 5511766667 1935150 0.129 1922285 0.260 28.09 24.19 4.49 89324.74 91024.31 212 180 5553 5923 76184.73 94120.00 20.453 157.844 8547.91 74790.12 33.295 105.137 47.077 9.288 9.803 0.27257 1244605000 2154.7 463.230825 13.3808743 230 330.96 40078.3 53670.3 380.441 10.28 5.25 83.33 6.65 66.2287 60.1011 4.530 52.288 28.826575 176 118 397755 594384 250125465 58.112 19.804 110.285 196.729 21.130 5093333333 5526200000 2000445 0.125 1936663 0.258 26.28 23.53 4.40 89985.51 91601.65 230 177 5447 6711 80632.59 87890.86 20.618 159.280 OpenBenchmarking.org
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D MGLRU Enabled MGLRU Disabled 2K 4K 6K 8K 10K SE +/- 4.21, N = 3 SE +/- 47.65, N = 3 8589.03 8547.91 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C MGLRU Enabled MGLRU Disabled 16K 32K 48K 64K 80K SE +/- 370.74, N = 3 SE +/- 406.38, N = 3 74668.52 74790.12 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD MGLRU Enabled MGLRU Disabled 8 16 24 32 40 SE +/- 0.15, N = 3 SE +/- 0.17, N = 3 33.04 33.30 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D MGLRU Enabled MGLRU Disabled 20 40 60 80 100 SE +/- 0.54, N = 3 SE +/- 0.89, N = 3 104.60 105.14 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte MGLRU Enabled MGLRU Disabled 11 22 33 44 55 SE +/- 0.32, N = 13 SE +/- 0.20, N = 3 46.11 47.08 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver MGLRU Enabled MGLRU Disabled 3 6 9 12 15 SE +/- 0.083, N = 6 SE +/- 0.124, N = 12 8.969 9.288 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster MGLRU Enabled MGLRU Disabled 3 6 9 12 15 SE +/- 0.134, N = 15 SE +/- 0.201, N = 14 9.729 9.803 1. (CXX) g++ options: -O2 -lOpenCL
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms MGLRU Enabled MGLRU Disabled 0.0613 0.1226 0.1839 0.2452 0.3065 SE +/- 0.00314, N = 3 SE +/- 0.00229, N = 3 0.27158 0.27257
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 MGLRU Enabled MGLRU Disabled 300M 600M 900M 1200M 1500M SE +/- 738959.93, N = 3 SE +/- 2472880.37, N = 3 1249141667 1244605000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball MGLRU Enabled MGLRU Disabled 500 1000 1500 2000 2500 2161.2 2154.7 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d MGLRU Enabled MGLRU Disabled 100 200 300 400 500 SE +/- 0.82, N = 3 SE +/- 1.49, N = 3 463.58 463.23 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction MGLRU Enabled MGLRU Disabled 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 13.37 13.38 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 MGLRU Enabled MGLRU Disabled 50 100 150 200 250 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 230 230 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 7.0 Input: AUSURF112 MGLRU Enabled MGLRU Disabled 70 140 210 280 350 SE +/- 0.20, N = 3 SE +/- 0.36, N = 3 330.62 330.96 1. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M MGLRU Enabled MGLRU Disabled 9K 18K 27K 36K 45K SE +/- 178.74, N = 3 SE +/- 67.91, N = 3 40029.9 40078.3 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M MGLRU Enabled MGLRU Disabled 11K 22K 33K 44K 55K SE +/- 192.97, N = 3 SE +/- 176.64, N = 3 53546.0 53670.3 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Java Gradle Build Gradle Build: Reactor OpenBenchmarking.org Seconds, Fewer Is Better Java Gradle Build Gradle Build: Reactor MGLRU Enabled MGLRU Disabled 80 160 240 320 400 SE +/- 5.21, N = 9 SE +/- 5.30, N = 3 374.75 380.44
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU MGLRU Enabled MGLRU Disabled 3 6 9 12 15 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 10.36 10.28 MIN: 9.71 / MAX: 14.1 MIN: 9.66 / MAX: 14.1
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU MGLRU Enabled MGLRU Disabled 1.2038 2.4076 3.6114 4.8152 6.019 SE +/- 0.09, N = 15 SE +/- 0.08, N = 15 5.35 5.25 MIN: 1.85 / MAX: 7.13 MIN: 1.73 / MAX: 7.07
OSPray Demo: San Miguel - Renderer: SciVis OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: SciVis MGLRU Enabled MGLRU Disabled 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 83.33 83.33 MIN: 35.71 / MAX: 100 MIN: 47.62 / MAX: 100
OSPray Demo: San Miguel - Renderer: Path Tracer OpenBenchmarking.org FPS, More Is Better OSPray 1.8.5 Demo: San Miguel - Renderer: Path Tracer MGLRU Enabled MGLRU Disabled 2 4 6 8 10 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 6.52 6.65 MIN: 5.41 / MAX: 7.09 MIN: 5.46 / MAX: 7.14
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown MGLRU Enabled MGLRU Disabled 15 30 45 60 75 SE +/- 0.15, N = 3 SE +/- 0.24, N = 3 66.20 66.23 MIN: 61.78 / MAX: 73.44 MIN: 61.28 / MAX: 74.94
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown MGLRU Enabled MGLRU Disabled 13 26 39 52 65 SE +/- 0.42, N = 3 SE +/- 0.18, N = 3 59.86 60.10 MIN: 55.99 / MAX: 67.12 MIN: 56.51 / MAX: 68.5
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K MGLRU Enabled MGLRU Disabled 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.008, N = 3 SE +/- 0.005, N = 3 4.476 4.530 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K MGLRU Enabled MGLRU Disabled 12 24 36 48 60 SE +/- 0.18, N = 3 SE +/- 0.18, N = 3 52.66 52.29 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate MGLRU Enabled MGLRU Disabled 7 14 21 28 35 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 28.60 28.83 1. (CC) gcc options: -O3 -march=native -fopenmp
OpenVKL Benchmark: vklBenchmark ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark ISPC MGLRU Enabled MGLRU Disabled 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 175 176 MIN: 14 / MAX: 2362 MIN: 16 / MAX: 2455
OpenVKL Benchmark: vklBenchmark Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.0 Benchmark: vklBenchmark Scalar MGLRU Enabled MGLRU Disabled 30 60 90 120 150 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 118 118 MIN: 11 / MAX: 2529 MIN: 11 / MAX: 2528
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Compression Rating MGLRU Enabled MGLRU Disabled 90K 180K 270K 360K 450K SE +/- 5744.62, N = 3 SE +/- 1852.98, N = 3 409975 397755 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Decompression Rating MGLRU Enabled MGLRU Disabled 130K 260K 390K 520K 650K SE +/- 6316.43, N = 3 SE +/- 7344.83, N = 3 594185 594384 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time MGLRU Enabled MGLRU Disabled 50M 100M 150M 200M 250M SE +/- 2261974.30, N = 3 SE +/- 2532421.76, N = 6 249652058 250125465 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile MGLRU Enabled MGLRU Disabled 13 26 39 52 65 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 57.65 58.11
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.14 Time To Compile MGLRU Enabled MGLRU Disabled 5 10 15 20 25 SE +/- 0.14, N = 13 SE +/- 0.13, N = 14 19.83 19.80
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 13.0 Build System: Ninja MGLRU Enabled MGLRU Disabled 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.23, N = 3 110.35 110.29
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 13.0 Build System: Unix Makefiles MGLRU Enabled MGLRU Disabled 40 80 120 160 200 SE +/- 0.16, N = 3 SE +/- 0.47, N = 3 196.92 196.73
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile MGLRU Enabled MGLRU Disabled 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 21.14 21.13
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 MGLRU Enabled MGLRU Disabled 1100M 2200M 3300M 4400M 5500M SE +/- 11970148.05, N = 3 SE +/- 7872808.34, N = 3 5100733333 5093333333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 256 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 256 - Buffer Length: 256 - Filter Length: 57 MGLRU Enabled MGLRU Disabled 1200M 2400M 3600M 4800M 6000M SE +/- 18720428.53, N = 3 SE +/- 16977730.51, N = 3 5511766667 5526200000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only MGLRU Enabled MGLRU Disabled 400K 800K 1200K 1600K 2000K SE +/- 5692.58, N = 3 SE +/- 19866.33, N = 3 1935150 2000445 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency MGLRU Enabled MGLRU Disabled 0.029 0.058 0.087 0.116 0.145 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.129 0.125 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 500 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 500 - Mode: Read Only MGLRU Enabled MGLRU Disabled 400K 800K 1200K 1600K 2000K SE +/- 11345.70, N = 3 SE +/- 21901.77, N = 3 1922285 1936663 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL pgbench Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL pgbench 14.0 Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency MGLRU Enabled MGLRU Disabled 0.0585 0.117 0.1755 0.234 0.2925 SE +/- 0.002, N = 3 SE +/- 0.003, N = 3 0.260 0.258 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU MGLRU Enabled MGLRU Disabled 7 14 21 28 35 SE +/- 0.29, N = 15 SE +/- 0.25, N = 15 28.09 26.28
PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU MGLRU Enabled MGLRU Disabled 6 12 18 24 30 SE +/- 0.23, N = 3 SE +/- 0.23, N = 15 24.19 23.53
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU MGLRU Enabled MGLRU Disabled 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 4.49 4.40
nginx Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 500 MGLRU Enabled MGLRU Disabled 20K 40K 60K 80K 100K SE +/- 134.98, N = 3 SE +/- 202.70, N = 3 89324.74 89985.51 1. (CC) gcc options: -lcrypt -lz -O3 -march=native
nginx Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1000 MGLRU Enabled MGLRU Disabled 20K 40K 60K 80K 100K SE +/- 265.34, N = 3 SE +/- 270.06, N = 3 91024.31 91601.65 1. (CC) gcc options: -lcrypt -lz -O3 -march=native
ONNX Runtime Model: yolov4 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: yolov4 - Device: CPU MGLRU Enabled MGLRU Disabled 50 100 150 200 250 SE +/- 1.95, N = 12 SE +/- 0.87, N = 3 212 230 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: fcn-resnet101-11 - Device: CPU MGLRU Enabled MGLRU Disabled 40 80 120 160 200 SE +/- 4.29, N = 12 SE +/- 1.80, N = 3 180 177 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: shufflenet-v2-10 - Device: CPU MGLRU Enabled MGLRU Disabled 1200 2400 3600 4800 6000 SE +/- 54.18, N = 3 SE +/- 71.82, N = 3 5553 5447 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: super-resolution-10 - Device: CPU MGLRU Enabled MGLRU Disabled 1400 2800 4200 5600 7000 SE +/- 176.40, N = 12 SE +/- 5.04, N = 3 5923 6711 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt
Apache HTTP Server Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 500 MGLRU Enabled MGLRU Disabled 20K 40K 60K 80K 100K SE +/- 976.98, N = 3 SE +/- 232.53, N = 3 76184.73 80632.59 1. (CC) gcc options: -shared -fPIC -O2
Apache HTTP Server Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1000 MGLRU Enabled MGLRU Disabled 20K 40K 60K 80K 100K SE +/- 908.68, N = 6 SE +/- 1001.52, N = 4 94120.00 87890.86 1. (CC) gcc options: -shared -fPIC -O2
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.16 Build: defconfig MGLRU Enabled MGLRU Disabled 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 20.45 20.62
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.16 Build: allmodconfig MGLRU Enabled MGLRU Disabled 40 80 120 160 200 SE +/- 0.55, N = 3 SE +/- 0.79, N = 3 157.84 159.28
Phoronix Test Suite v10.8.4