marco

description

HTML result view exported from: https://openbenchmarking.org/result/2502128-NE-MARCO211143&gru.

marco

Systems: marco, marcotestcuda

Processor: AMD Ryzen 5 3500U @ 2.10GHz (4 Cores / 8 Threads)
Motherboard: DL Aspire A315-23 Lotus_DA (V1.20 BIOS)
Chipset: AMD Raven/Raven2
Memory: 2 x 4GB DDR4-2400MT/s Samsung M471A5244CB0-CTD
Disk: 256GB Western Digital PC SN530 SDBPNPZ-256G-1114 + 1000GB TOSHIBA MQ04ABF1
Graphics: AMD Radeon Vega 8 2GB (1200/1200MHz)
Audio: AMD Raven/Raven2/Fenghuang
Network: Realtek RTL8111/8168/8411 + Intel Dual Band-AC 3168NGW
OS: Linuxmint 21.2
Kernel: 5.15.0-131-generic (x86_64)
Desktop: Cinnamon 5.8.4
Display Server: X Server 1.21.1.4
OpenGL: 4.6 Mesa 23.2.1-1ubuntu3.1~22.04.3 (LLVM 15.0.7 DRM 3.42)
Vulkan: 1.3.255
Compiler: GCC 11.4.0 (marco); GCC 11.4.0 + CUDA 11.5 (marcotestcuda)
File-System: ext4
Screen Resolution: 1366x768

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- NVM_CD_FLAGS=-q

Processor Details
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0x8108109

Graphics Details
- GLAMOR
- BAR1 / Visible vRAM Size: 2048 MB
- vBIOS Version: 113-PICASSO-117

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Mitigation of untrained return thunk; SMT vulnerable
- spec_rstack_overflow: Mitigation of safe RET
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

Compiler Details
- marcotestcuda: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

marco

Test                                       marco    marcotestcuda
cuda-mini-nbody: Original                           85528.595
cuda-mini-nbody: Cache Blocking                     99226.673
cuda-mini-nbody: Loop Unrolling                     99102.503
cuda-mini-nbody: SOA Data Layout                    101298.891
cuda-mini-nbody: Flush Denormals To Zero            96305.881
tf2: 1366 x 768

CUDA Mini-Nbody

Test: Original

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better
marcotestcuda: 85528.60 (SE +/- 667.67, N = 15)

CUDA Mini-Nbody

Test: Cache Blocking

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better
marcotestcuda: 99226.67 (SE +/- 1119.11, N = 4)

CUDA Mini-Nbody

Test: Loop Unrolling

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better
marcotestcuda: 99102.50 (SE +/- 1302.07, N = 12)

CUDA Mini-Nbody

Test: SOA Data Layout

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better
marcotestcuda: 101298.89 (SE +/- 1182.29, N = 4)

CUDA Mini-Nbody

Test: Flush Denormals To Zero

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better
marcotestcuda: 96305.88 (SE +/- 608.03, N = 3)


Phoronix Test Suite v10.8.5