Benchmarks by Michael Larabel for a future article.
MGLRU Disabled
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8608102
Graphics Notes: GLAMOR - BAR1 / Visible vRAM Size: 2048 MB - vBIOS Version: 113-LUCIENNE-014
Python Notes: Python 3.8.10
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
MGLRU Default Enabled
Processor: AMD Ryzen 5 5500U @ 2.10GHz (6 Cores / 12 Threads), Motherboard: LENOVO LNVNB161216 (GLCN22WW BIOS), Chipset: AMD Renoir Root Complex, Memory: 6GB, Disk: 256GB SAMSUNG MZALQ256HBJD-00BL2, Graphics: AMD RENOIR 2GB (1800/400MHz), Audio: AMD Device 1637, Network: Qualcomm Atheros QCA6174 802.11ac
OS: Ubuntu 20.04, Kernel: 5.19.0-rc2-phx-mglru-v12 (x86_64), Desktop: GNOME Shell 3.36.9, Display Server: X Server 1.20.13, OpenGL: 4.6 Mesa 21.2.6 (LLVM 12.0.0), Vulkan: 1.2.182, Compiler: GCC 9.4.0, File-System: ext4, Screen Resolution: 1920x1080
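The two configurations differ only in whether MGLRU is active on the 5.19.0-rc2-phx-mglru-v12 kernel. The page does not say how the feature was toggled for these runs. As a rough sketch, assuming the kernel exposes the `/sys/kernel/mm/lru_gen/enabled` file described in the kernel's multi-gen LRU admin guide (a hexadecimal bitmask, where 0 means disabled), the state could be checked like this. The helper names `parse_lru_gen_mask` and `mglru_state` are made up for illustration.

```python
from pathlib import Path
from typing import Optional

# Assumed location of the MGLRU switch; only present on kernels built with MGLRU support.
LRU_GEN_ENABLED = Path("/sys/kernel/mm/lru_gen/enabled")


def parse_lru_gen_mask(text: str) -> int:
    """Parse the hexadecimal bitmask string (for example "0x0007") into an int."""
    return int(text.strip(), 16)


def mglru_state(path: Path = LRU_GEN_ENABLED) -> Optional[int]:
    """Return the MGLRU bitmask, or None if the file is missing or unreadable."""
    try:
        return parse_lru_gen_mask(path.read_text())
    except (OSError, ValueError):
        return None


mask = mglru_state()
if mask is None:
    print("MGLRU state not available on this kernel")
else:
    print("MGLRU enabled" if mask else "MGLRU disabled", f"(mask {mask:#06x})")
```

A non-zero mask means at least one MGLRU component is switched on; a value of zero corresponds to the "MGLRU Disabled" configuration in this comparison, under the assumption stated above.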
MGLRU Disabled vs. MGLRU Default Enabled Comparison (Phoronix Test Suite)
Memtier_benchmark - Redis: 545.7% (MGLRU Default Enabled ahead)
Redis - GET: 496.7% (MGLRU Default Enabled ahead)
Redis - SET: 357.2% (MGLRU Default Enabled ahead)
InfluxDB - 64 - 10000 - 2,5000,1 - 10000: 196.7% (MGLRU Default Enabled ahead)
InfluxDB - 4 - 10000 - 2,5000,1 - 10000: 193.5% (MGLRU Default Enabled ahead)
KeyDB: 82.4% (MGLRU Default Enabled ahead)
x264 - Bosphorus 4K: 2.2% (MGLRU Disabled ahead)
OSPRay - gravity_spheres_volume/dim_512/ao/real_time: 2.1% (MGLRU Disabled ahead)
Timed Erlang/OTP Compilation - Time To Compile: 2% (MGLRU Default Enabled ahead)
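The percentages in the comparison above can be reproduced from the raw results on this page. A minimal sketch, assuming each figure is the ratio of the better result to the worse one, minus one; `percent_faster` is a hypothetical helper name, and the numbers are copied from the result table.

```python
def percent_faster(baseline: float, other: float, lower_is_better: bool = False) -> float:
    """Return how far `other` is ahead of `baseline`, in percent.

    For "fewer is better" metrics (for example seconds), the ratio is inverted.
    """
    ratio = baseline / other if lower_is_better else other / baseline
    return (ratio - 1.0) * 100.0


# (MGLRU Disabled, MGLRU Default Enabled, lower_is_better), values from this page.
results = {
    "Memtier_benchmark Redis (Ops/sec)": (258495.88, 1669128.81, False),
    "Redis GET (Requests Per Second)": (351787.54, 2099165.54, False),
    "KeyDB (Ops/sec)": (199480.15, 363879.49, False),
    "Timed Erlang/OTP Compilation (Seconds)": (174.817, 171.318, True),
}

for name, (disabled, enabled, lower) in results.items():
    print(f"{name}: {percent_faster(disabled, enabled, lower):+.1f}%")
```

The `lower_is_better` flag matters for the compile-time tests, where a smaller number of seconds is the better result.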
5500U MGLRU (OpenBenchmarking.org)
Test | MGLRU Disabled | MGLRU Default Enabled
paraview: Wavelet Volume - 1920 x 1080 | 78.49 | 78.31
paraview: Wavelet Volume - 1920 x 1080 | 1255.941 | 1253.020
paraview: Wavelet Contour - 1920 x 1080 | 61.48 | 61.43
paraview: Wavelet Contour - 1920 x 1080 | 640.698 | 640.187
vkmark: 1920 x 1080 - Mailbox | 2106 | 2106
hpcg | 3.90382 | 3.90758
namd: ATPase Simulation - 327,506 Atoms | 4.10251 | 4.10896
chia-vdf: Square Plain C++ | 156433 | 155667
chia-vdf: Square Assembly Optimized | 143467 | 144433
compress-zstd: 19 - Compression Speed | 18.2 | 18.2
compress-zstd: 19 - Decompression Speed | 2445.5 | 2446.8
compress-zstd: 19, Long Mode - Compression Speed | 15.6 | 15.6
compress-zstd: 19, Long Mode - Decompression Speed | 2570.7 | 2565.9
svt-av1: Preset 8 - Bosphorus 1080p | 55.468 | 55.544
svt-av1: Preset 10 - Bosphorus 1080p | 116.372 | 115.671
svt-av1: Preset 12 - Bosphorus 1080p | 222.964 | 219.307
svt-hevc: 1 - Bosphorus 4K | 1.01 | 1.01
svt-hevc: 7 - Bosphorus 4K | 19.97 | 19.83
svt-hevc: 10 - Bosphorus 4K | 37.57 | 37.57
svt-hevc: 1 - Bosphorus 1080p | 4.12 | 4.10
svt-hevc: 7 - Bosphorus 1080p | 65.73 | 65.54
svt-hevc: 10 - Bosphorus 1080p | 133.28 | 133.16
svt-vp9: VMAF Optimized - Bosphorus 4K | 25.98 | 26.08
svt-vp9: VMAF Optimized - Bosphorus 1080p | 97.77 | 97.87
svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K | 27.45 | 27.58
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 99.75 | 99.67
svt-vp9: Visual Quality Optimized - Bosphorus 4K | 21.56 | 21.61
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 83.04 | 83.50
x264: Bosphorus 4K | 14.85 | 14.53
x265: Bosphorus 4K | 8.30 | 8.34
ospray: particle_volume/ao/real_time | 1.48457 | 1.48209
ospray: particle_volume/scivis/real_time | 1.46862 | 1.46482
ospray: particle_volume/pathtracer/real_time | 56.4170 | 56.1881
ospray: gravity_spheres_volume/dim_512/ao/real_time | 0.729518 | 0.714683
ospray: gravity_spheres_volume/dim_512/scivis/real_time | 0.684936 | 0.672614
ospray: gravity_spheres_volume/dim_512/pathtracer/real_time | 1.14543 | 1.14532
compress-7zip: Compression Rating | 37664 | 37704
compress-7zip: Decompression Rating | 36988 | 36841
stockfish: Total Time | 12398522 | 12199806
avifenc: 2 | 130.977 | 130.748
avifenc: 6 | 18.689 | 18.675
avifenc: 6, Lossless | 23.031 | 23.078
avifenc: 10, Lossless | 8.497 | 8.538
build-godot: Time To Compile | 259.583 | 261.194
build-linux-kernel: defconfig | 189.844 | 190.732
build-linux-kernel: allmodconfig | 2449.410 | 2464.625
build-mplayer: Time To Compile | 75.815 | 75.265
ospray-studio: 3 - 1080p - 32 - Path Tracer | 275686 | 274524
build-erlang: Time To Compile | 174.817 | 171.318
build-wasmer: Time To Compile | 155.136 | 154.310
keydb | 199480.15 | 363879.49
gromacs: MPI CPU - water_GMX50_bare | 0.481 | 0.484
astcenc: Medium | 6.0297 | 6.0170
astcenc: Thorough | 14.7898 | 14.7355
astcenc: Exhaustive | 147.6319 | 146.9477
memtier-benchmark: Redis | 258495.88 | 1669128.81
redis: GET | 351787.54 | 2099165.54
redis: SET | 333875.28 | 1526376.83
blender: BMW27 - CPU-Only | 298.47 | 296.53
blender: Fishy Cat - CPU-Only | 391.09 | 389.79
blender: Pabellon Barcelona - CPU-Only | 1044.84 | 1036.40
appleseed: Emily | 711.638008 | 706.780478
appleseed: Disney Material | 425.039255 | 417.827137
appleseed: Material Tester | 383.888588 | 379.575504
v-ray: CPU | 5073 | 5127
influxdb: 4 - 10000 - 2,5000,1 - 10000 | 309539.9 | 908567.9
influxdb: 64 - 10000 - 2,5000,1 - 10000 | 317642.9 | 942491.7
ParaView This test runs ParaView benchmarks: an open-source data analytics and visualization application. ParaView describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient, a newer scientific benchmark from Sandia National Labs aimed at supercomputer testing with modern real-world workloads, as compared to HPCC. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1
MGLRU Default Enabled: 3.90758 (SE +/- 0.00665, N = 3)
MGLRU Disabled: 3.90382 (SE +/- 0.00464, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
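Each result on this page is reported with a standard error (SE) over N runs. A minimal sketch of how such a figure is commonly computed, and of a rough check for whether two results differ by more than run-to-run noise, follows. It assumes the SE is the sample standard deviation divided by the square root of N, which is the textbook definition and not confirmed here as the Phoronix Test Suite's exact method. The function names and the threshold `k` are illustrative choices.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


def within_noise(a, se_a, b, se_b, k=2.0):
    """Rough heuristic: treat two means as indistinguishable when their
    difference is at most k times the combined standard error."""
    return abs(a - b) <= k * math.hypot(se_a, se_b)


# HPCG (GFLOP/s): the two configurations are within noise of each other.
print(within_noise(3.90758, 0.00665, 3.90382, 0.00464))
# KeyDB (Ops/sec): the gap is far larger than the reported standard errors.
print(within_noise(363879.49, 782.03, 199480.15, 102.35))
```

With N = 3 the standard errors are themselves rough estimates, so this check is only a sanity filter for separating the large server-workload differences from the many results that are effectively ties.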
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms
MGLRU Default Enabled: 4.10896 (SE +/- 0.00128, N = 3)
MGLRU Disabled: 4.10251 (SE +/- 0.00711, N = 3)
Chia Blockchain VDF Chia is a blockchain and smart transaction platform based on proofs of space and time rather than the proofs of work used by other cryptocurrencies. This test profile benchmarks CPU performance for the Chia VDF using the Chia VDF benchmark. The Chia VDF is the Chia Verifiable Delay Function (Proof of Time). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.7 Test: Square Plain C++
MGLRU Default Enabled: 155667 (SE +/- 433.33, N = 3)
MGLRU Disabled: 156433 (SE +/- 466.67, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
OpenBenchmarking.org IPS, More Is Better Chia Blockchain VDF 1.0.7 Test: Square Assembly Optimized
MGLRU Default Enabled: 144433 (SE +/- 260.34, N = 3)
MGLRU Disabled: 143467 (SE +/- 296.27, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread
Zstd Compression This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed
MGLRU Default Enabled: 18.2 (SE +/- 0.03, N = 3)
MGLRU Disabled: 18.2 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed
MGLRU Default Enabled: 2446.8 (SE +/- 4.82, N = 3)
MGLRU Disabled: 2445.5 (SE +/- 1.43, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed
MGLRU Default Enabled: 15.6 (SE +/- 0.03, N = 3)
MGLRU Disabled: 15.6 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed
MGLRU Default Enabled: 2565.9 (SE +/- 2.90, N = 3)
MGLRU Disabled: 2570.7 (SE +/- 2.57, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma
SVT-AV1 This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 1080p
MGLRU Default Enabled: 55.54 (SE +/- 0.39, N = 3)
MGLRU Disabled: 55.47 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 1080p
MGLRU Default Enabled: 115.67 (SE +/- 0.21, N = 3)
MGLRU Disabled: 116.37 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 1080p
MGLRU Default Enabled: 219.31 (SE +/- 0.45, N = 3)
MGLRU Disabled: 222.96 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K
MGLRU Default Enabled: 1.01 (SE +/- 0.00, N = 3)
MGLRU Disabled: 1.01 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K
MGLRU Default Enabled: 19.83 (SE +/- 0.11, N = 3)
MGLRU Disabled: 19.97 (SE +/- 0.16, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K
MGLRU Default Enabled: 37.57 (SE +/- 0.39, N = 3)
MGLRU Disabled: 37.57 (SE +/- 0.28, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p
MGLRU Default Enabled: 4.10 (SE +/- 0.00, N = 3)
MGLRU Disabled: 4.12 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p
MGLRU Default Enabled: 65.54 (SE +/- 0.43, N = 3)
MGLRU Disabled: 65.73 (SE +/- 0.35, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p
MGLRU Default Enabled: 133.16 (SE +/- 0.04, N = 3)
MGLRU Disabled: 133.28 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K
MGLRU Default Enabled: 26.08 (SE +/- 0.09, N = 3)
MGLRU Disabled: 25.98 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p
MGLRU Default Enabled: 97.87 (SE +/- 1.21, N = 3)
MGLRU Disabled: 97.77 (SE +/- 1.37, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K
MGLRU Default Enabled: 27.58 (SE +/- 0.15, N = 3)
MGLRU Disabled: 27.45 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
MGLRU Default Enabled: 99.67 (SE +/- 1.39, N = 3)
MGLRU Disabled: 99.75 (SE +/- 1.32, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K
MGLRU Default Enabled: 21.61 (SE +/- 0.14, N = 3)
MGLRU Disabled: 21.56 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
MGLRU Default Enabled: 83.50 (SE +/- 0.94, N = 3)
MGLRU Disabled: 83.04 (SE +/- 0.92, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K
MGLRU Default Enabled: 8.34 (SE +/- 0.06, N = 3)
MGLRU Disabled: 8.30 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/ao/real_time
MGLRU Default Enabled: 1.48209 (SE +/- 0.00076, N = 3)
MGLRU Disabled: 1.48457 (SE +/- 0.00073, N = 3)
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/scivis/real_time
MGLRU Default Enabled: 1.46482 (SE +/- 0.00031, N = 3)
MGLRU Disabled: 1.46862 (SE +/- 0.00049, N = 3)
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: particle_volume/pathtracer/real_time
MGLRU Default Enabled: 56.19 (SE +/- 0.07, N = 3)
MGLRU Disabled: 56.42 (SE +/- 0.02, N = 3)
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/ao/real_time
MGLRU Default Enabled: 0.714683 (SE +/- 0.000369, N = 3)
MGLRU Disabled: 0.729518 (SE +/- 0.001071, N = 3)
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
MGLRU Default Enabled: 0.672614 (SE +/- 0.000632, N = 3)
MGLRU Disabled: 0.684936 (SE +/- 0.000115, N = 3)
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
MGLRU Default Enabled: 1.14532 (SE +/- 0.00209, N = 3)
MGLRU Disabled: 1.14543 (SE +/- 0.00318, N = 3)
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.00 Test: Decompression Rating
MGLRU Default Enabled: 36841 (SE +/- 379.90, N = 3)
MGLRU Disabled: 36988 (SE +/- 371.22, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time
MGLRU Default Enabled: 12199806 (SE +/- 130192.51, N = 3)
MGLRU Disabled: 12398522 (SE +/- 170319.30, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6
MGLRU Default Enabled: 18.68 (SE +/- 0.18, N = 6)
MGLRU Disabled: 18.69 (SE +/- 0.20, N = 5)
1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6, Lossless
MGLRU Default Enabled: 23.08 (SE +/- 0.23, N = 3)
MGLRU Disabled: 23.03 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 10, Lossless
MGLRU Default Enabled: 8.538 (SE +/- 0.011, N = 3)
MGLRU Disabled: 8.497 (SE +/- 0.007, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer
MGLRU Default Enabled: 274524 (SE +/- 766.64, N = 3)
MGLRU Disabled: 275686 (SE +/- 414.54, N = 3)
1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Timed Wasmer Compilation This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and Emscripten. This test profile builds Wasmer with the Cranelift and Singlepass compiler features enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 2.2 Time To Compile
MGLRU Default Enabled: 154.31 (SE +/- 0.27, N = 3)
MGLRU Disabled: 155.14 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -m64 -ldl -lxkbcommon -lgcc_s -lutil -lrt -lpthread -lm -lc -pie -nodefaultlibs
KeyDB A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.2.0
MGLRU Default Enabled: 363879.49 (SE +/- 782.03, N = 3)
MGLRU Disabled: 199480.15 (SE +/- 102.35, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare
MGLRU Default Enabled: 0.484 (SE +/- 0.000, N = 3)
MGLRU Disabled: 0.481 (SE +/- 0.001, N = 3)
1. (CXX) g++ options: -O3 -pthread
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Medium
MGLRU Default Enabled: 6.0170 (SE +/- 0.0135, N = 3)
MGLRU Disabled: 6.0297 (SE +/- 0.0084, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Thorough
MGLRU Default Enabled: 14.74 (SE +/- 0.16, N = 3)
MGLRU Disabled: 14.79 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Exhaustive
MGLRU Default Enabled: 146.95 (SE +/- 0.23, N = 3)
MGLRU Disabled: 147.63 (SE +/- 0.21, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
Memtier_benchmark Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool. This test profile currently just stresses the Redis protocol and basic options exposed with a 1:1 Set/Get ratio, 30 pipeline, 100 clients per thread, and thread count equal to the number of CPU cores/threads present. Patches to extend the test are welcome as always. Currently this test profile uses Memtier_benchmark 1.3 and Redis 6. Learn more via the OpenBenchmarking.org test page.
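The parameters listed in the description above map onto memtier_benchmark command-line options. A sketch of an equivalent invocation, built as an argument list, is shown here. It assumes a Redis server on 127.0.0.1:6379, which the page does not state; `memtier_args` is a hypothetical helper, and the exact command used by the test profile may differ.

```python
import os


def memtier_args(threads=None, server="127.0.0.1", port=6379):
    """Build a memtier_benchmark argument list matching the described settings.

    The server address and port are assumptions for illustration only.
    """
    threads = threads or os.cpu_count() or 1
    return [
        "memtier_benchmark",
        "--protocol=redis",      # stress the Redis protocol
        f"--server={server}",
        f"--port={port}",
        "--ratio=1:1",           # 1:1 Set/Get ratio
        "--pipeline=30",         # 30 pipelined requests
        "--clients=100",         # 100 clients per thread
        f"--threads={threads}",  # one thread per CPU core/thread present
    ]


print(" ".join(memtier_args()))
```

On the Ryzen 5 5500U tested here (6 cores / 12 threads), the thread count would come out as 12, which gives 1,200 client connections in total.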
OpenBenchmarking.org Ops/sec, More Is Better Memtier_benchmark 1.3 Protocol: Redis
MGLRU Default Enabled: 1669128.81 (SE +/- 13910.00, N = 3)
MGLRU Disabled: 258495.88 (SE +/- 266.88, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET
MGLRU Default Enabled: 2099165.54 (SE +/- 8633.37, N = 3)
MGLRU Disabled: 351787.54 (SE +/- 425.32, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET
MGLRU Default Enabled: 1526376.83 (SE +/- 17602.93, N = 3)
MGLRU Disabled: 333875.28 (SE +/- 1446.54, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Blender Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported, as is HIP for AMD Radeon GPUs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: BMW27 - Compute: CPU-Only
MGLRU Default Enabled: 296.53 (SE +/- 0.57, N = 3)
MGLRU Disabled: 298.47 (SE +/- 0.88, N = 3)
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: Fishy Cat - Compute: CPU-Only
MGLRU Default Enabled: 389.79 (SE +/- 0.52, N = 3)
MGLRU Disabled: 391.09 (SE +/- 0.39, N = 3)
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: Pabellon Barcelona - Compute: CPU-Only
MGLRU Default Enabled: 1036.40 (SE +/- 0.69, N = 3)
MGLRU Disabled: 1044.84 (SE +/- 0.66, N = 3)
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5.02 Mode: CPU
MGLRU Default Enabled: 5127
MGLRU Disabled: 5073
SE +/- 16.02, N = 3 (reported for one configuration only)
InfluxDB This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
MGLRU Default Enabled: 908567.9 (SE +/- 508.05, N = 3)
MGLRU Disabled: 309539.9 (SE +/- 90.52, N = 3)
OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
MGLRU Default Enabled: 942491.7 (SE +/- 1011.38, N = 3)
MGLRU Disabled: 317642.9 (SE +/- 331.53, N = 3)
MGLRU Disabled: Testing initiated at 29 June 2022 13:31 by user phoronix.
MGLRU Default Enabled: Testing initiated at 28 June 2022 20:33 by user phoronix.