Project-PhysX-OpenCL-Benchmark-iGPU-vs-dGPU-tests

AMD Ryzen 9 7945HX testing with an Alienware 0DWD2H (1.13.1 BIOS) and NVIDIA GeForce RTX 4090 Laptop GPU 16GB on cachyos rolling via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2409210-EIRI-240308551&sro&gru.

Intel HD 4600 HAS GT2
Processor: Intel Core i5-4300M @ 3.30GHz (2 Cores / 4 Threads)
Motherboard: Dell 0VWNW8 (A26 BIOS)
Chipset: Intel Xeon E3-1200 v3/4th
Memory: 8GB
Disk: 128GB SAMSUNG SSD PM85
Graphics: Intel HD 4600 HSW GT2 2GB (1250MHz)
Audio: Intel Xeon E3-1200 v3/4th
Network: Intel I217-LM + Intel Centrino Ultimate-N 6300
OS: cachyos rolling
Kernel: 6.7.9-1-cachyos-rt-bore-lto (x86_64)
Desktop: KDE Plasma 6.0.1
Display Server: X Server 1.21.1.11
OpenGL: 4.6 Mesa 24.0.2-arch1.2
OpenCL: OpenCL 2.0 beignet 1.4 (git-f72309a5)
Compiler: GCC 13.2.1 20230801 + Clang 17.0.6 + LLVM 17.0.6
File-System: xfs
Screen Resolution: 1920x1080

nVidia RTX 4090 mobile
Processor: AMD Ryzen 9 7945HX @ 5.46GHz (16 Cores / 32 Threads)
Motherboard: Alienware 0DWD2H (1.13.1 BIOS)
Chipset: AMD Device 14d8
Memory: 62GB
Disk: PC SN810 NVMe WDC 2048GB + 4001GB CT4000P3SSD8
Graphics: NVIDIA GeForce RTX 4090 Laptop GPU 16GB
Audio: NVIDIA Device 22bb
Network: Realtek RTL8125 2.5GbE + Qualcomm QCNFA765
OS: cachyos rolling
Kernel: 6.11.0-5-cachyos-lto (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server 1.21.1.13
Display Driver: NVIDIA 560.35.03
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 + OpenCL 2.1 AMD-APP.dbg (3602.0) + OpenCL 3.0 CUDA 12.6.65 + OpenCL 2.0 AMD-APP (1800.8)
Compiler: GCC 14.2.1 20240910 + Clang 18.1.8 + LLVM 18.1.8 + CUDA 12.6
File-System: zfs
Screen Resolution: 2560x1600

RTX 4090 mobile
Same as nVidia RTX 4090 mobile, except:
OpenCL: OpenCL 3.0 CUDA 12.6.65

Compiler Details
- Intel HD 4600 HAS GT2: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
- nVidia RTX 4090 mobile, RTX 4090 mobile: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++,rust --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu

Processor Details
- Intel HD 4600 HAS GT2: Scaling Governor: intel_cpufreq performance - CPU Microcode: 0x28
- nVidia RTX 4090 mobile, RTX 4090 mobile: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601206

Graphics Details
- Intel HD 4600 HAS GT2: GLAMOR
- nVidia RTX 4090 mobile, RTX 4090 mobile: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.2a.00.4c

Security Details
- Intel HD 4600 HAS GT2: gather_data_sampling: Not affected + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Unknown: No mitigations + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
- nVidia RTX 4090 mobile, RTX 4090 mobile: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Kernel Details
- nVidia RTX 4090 mobile, RTX 4090 mobile: Transparent Huge Pages: always

Environment Details
- nVidia RTX 4090 mobile, RTX 4090 mobile: MUTTER_DEBUG_KMS_THREAD_TYPE=user

OpenCL Details
- nVidia RTX 4090 mobile, RTX 4090 mobile: GPU Compute Cores: 9728

Results overview (Intel HD 4600 HAS GT2 / RTX 4090 mobile; no results are listed for nVidia RTX 4090 mobile)
opencl-benchmark: Memory Bandwidth Coalesced Read (GB/s): 17.82 / 548.59
opencl-benchmark: Memory Bandwidth Coalesced Write (GB/s): 21.62 / 561.76
opencl-benchmark: FP32 Compute (TFLOPs/s): 0.07 / 42.728
opencl-benchmark: FP64 Compute (TFLOPs/s): no result / 0.69
opencl-benchmark: INT64 Compute (TIOPs/s): 0.007 / 3.454
opencl-benchmark: INT32 Compute (TIOPs/s): 0.033 / 18.074
opencl-benchmark: INT16 Compute (TIOPs/s): 0.101 / 14.934
opencl-benchmark: INT8 Compute (TIOPs/s): 0.134 / 11.731
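To make the iGPU-vs-dGPU gap easier to read, the overview figures can be turned into ratios. The following is a minimal Python sketch, not part of the Phoronix Test Suite output; the names `RESULTS`, `speedups`, and `SPEEDUPS` are hypothetical, and the numbers are copied from the overview above. Because the Intel values carry only one to four significant digits, the ratios are rough.

```python
# Results copied from the overview (units: GB/s, TFLOPs/s, TIOPs/s).
# Each tuple is (Intel HD 4600 HAS GT2, RTX 4090 mobile).
# None marks a test with no reported result for that system.
RESULTS = {
    "Memory Bandwidth Coalesced Read": (17.82, 548.59),
    "Memory Bandwidth Coalesced Write": (21.62, 561.76),
    "FP32 Compute": (0.07, 42.728),
    "FP64 Compute": (None, 0.69),
    "INT64 Compute": (0.007, 3.454),
    "INT32 Compute": (0.033, 18.074),
    "INT16 Compute": (0.101, 14.934),
    "INT8 Compute": (0.134, 11.731),
}


def speedups(results):
    """Return the dGPU/iGPU ratio per test, skipping tests that lack a value."""
    return {
        name: dgpu / igpu
        for name, (igpu, dgpu) in results.items()
        if igpu is not None and dgpu is not None
    }


SPEEDUPS = speedups(RESULTS)

for name, ratio in SPEEDUPS.items():
    print(f"{name}: {ratio:.1f}x")
```

FP64 Compute is skipped because the Intel system has no FP64 result to divide by.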

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.2, Operation: Memory Bandwidth Coalesced Read (GB/s, More Is Better)
Intel HD 4600 HAS GT2: 17.82 (SE +/- 0.06, N = 3)
RTX 4090 mobile: 548.59 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.2, Operation: Memory Bandwidth Coalesced Write (GB/s, More Is Better)
Intel HD 4600 HAS GT2: 21.62 (SE +/- 0.06, N = 3)
RTX 4090 mobile: 561.76 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: FP32 Compute (TFLOPs/s, More Is Better)
Intel HD 4600 HAS GT2: 0.070 (SE +/- 0.000, N = 3)
RTX 4090 mobile: 42.728 (SE +/- 0.016, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: FP64 Compute (TFLOPs/s, More Is Better)
RTX 4090 mobile: 0.69 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: INT64 Compute (TIOPs/s, More Is Better)
Intel HD 4600 HAS GT2: 0.007 (SE +/- 0.000, N = 3)
RTX 4090 mobile: 3.454 (SE +/- 0.009, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: INT32 Compute (TIOPs/s, More Is Better)
Intel HD 4600 HAS GT2: 0.033 (SE +/- 0.000, N = 3)
RTX 4090 mobile: 18.074 (SE +/- 0.095, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: INT16 Compute (TIOPs/s, More Is Better)
Intel HD 4600 HAS GT2: 0.101 (SE +/- 0.000, N = 3)
RTX 4090 mobile: 14.934 (SE +/- 0.179, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.2, Operation: INT8 Compute (TIOPs/s, More Is Better)
Intel HD 4600 HAS GT2: 0.134 (SE +/- 0.000, N = 3)
RTX 4090 mobile: 11.731 (SE +/- 0.062, N = 3)
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
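Each result above is reported with a standard error (SE) over N = 3 runs. One rough way to compare run-to-run noise across tests with different units is the SE as a percentage of the reported value. The Python sketch below does this for the RTX 4090 mobile results only; the names `RTX_4090_MOBILE`, `relative_se_percent`, `REL_SE`, and `NOISIEST` are hypothetical, and the sketch assumes that the first SE printed for each test belongs to the Intel system and the second to the RTX 4090 mobile. The printed SE values are rounded, so an SE of 0.00 means "below the printed precision", not exactly zero.

```python
# (reported value, standard error) for RTX 4090 mobile, copied from the results above.
RTX_4090_MOBILE = {
    "Memory Bandwidth Coalesced Read": (548.59, 0.02),
    "Memory Bandwidth Coalesced Write": (561.76, 0.04),
    "FP32 Compute": (42.728, 0.016),
    "FP64 Compute": (0.69, 0.00),
    "INT64 Compute": (3.454, 0.009),
    "INT32 Compute": (18.074, 0.095),
    "INT16 Compute": (14.934, 0.179),
    "INT8 Compute": (11.731, 0.062),
}


def relative_se_percent(value, se):
    """Standard error expressed as a percentage of the reported value."""
    return 100.0 * se / value


REL_SE = {
    name: relative_se_percent(value, se)
    for name, (value, se) in RTX_4090_MOBILE.items()
}

# Test with the largest relative run-to-run spread.
NOISIEST = max(REL_SE, key=REL_SE.get)

for name, pct in REL_SE.items():
    print(f"{name}: {pct:.2f}% relative SE")
print(f"Largest relative SE: {NOISIEST}")
```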


Phoronix Test Suite v10.8.5