2307020-PTS-GPUREVIEW1 Pny RTX 4080 16GB REVIEW By Gaojie20
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2307091-NE-2307020PT51 RTX 4090 24GB -Nvidia Processor: Intel Xeon w9-3495X @ 4.80GHz (56 Cores / 112 Threads), Motherboard: ASUS Pro WS W790E-SAGE SE (0506 BIOS), Chipset: Intel Device 7aa7, Memory: 8 x 32 GB DDR5-5614MT/s Hynix HMCG88AEBRA115N, Disk: 6401GB Micron_9300_MTFDHAL6T4TDR + 0GB Virtual HDisk0, Graphics: NVIDIA GeForce RTX 4090 24GB, Audio: Realtek ALC1220, Monitor: BenQ PD2720U, Network: 2 x Intel X710 for 10GBASE-T
OS: Ubuntu 22.04, Kernel: 6.3.0-060300-generic (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 530.41.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.1.98, Vulkan: 1.3.236, Compiler: GCC 11.3.0 + CUDA 12.1, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x2b000390Graphics Notes: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.03OpenCL Notes: GPU Compute Cores: 16384Python Notes: Python 3.10.6Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
RTX 2080 Ti 22GB -Dell Changed Graphics to NVIDIA GeForce RTX 2080 Ti 22GB .
Graphics Change: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.30.40.4dOpenCL Change: GPU Compute Cores: 4352
RTX 3090 24GB -Zotac Changed Graphics to NVIDIA GeForce RTX 3090 24GB .
Graphics Change: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.26.48.65OpenCL Change: GPU Compute Cores: 10496
RTX 4080 16GB -Pny Changed Graphics to NVIDIA GeForce RTX 4080 16GB .
Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.67OpenCL Change: GPU Compute Cores: 9728
2307020-PTS-GPUREVIEW1 OpenBenchmarking.org Phoronix Test Suite Intel Xeon w9-3495X @ 4.80GHz (56 Cores / 112 Threads) ASUS Pro WS W790E-SAGE SE (0506 BIOS) Intel Device 7aa7 8 x 32 GB DDR5-5614MT/s Hynix HMCG88AEBRA115N 6401GB Micron_9300_MTFDHAL6T4TDR + 0GB Virtual HDisk0 NVIDIA GeForce RTX 4090 24GB NVIDIA GeForce RTX 2080 Ti 22GB NVIDIA GeForce RTX 3090 24GB NVIDIA GeForce RTX 4080 16GB Realtek ALC1220 BenQ PD2720U 2 x Intel X710 for 10GBASE-T Ubuntu 22.04 6.3.0-060300-generic (x86_64) GNOME Shell 42.5 X Server 1.21.1.4 NVIDIA 530.41.03 4.6.0 OpenCL 3.0 CUDA 12.1.98 1.3.236 GCC 11.3.0 + CUDA 12.1 ext4 3840x2160 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 2307020-PTS-GPUREVIEW1 Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x2b000390 - RTX 4090 24GB -Nvidia: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.03 - RTX 2080 Ti 22GB -Dell: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.30.40.4d - RTX 3090 24GB -Zotac: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.26.48.65 - RTX 4080 16GB -Pny: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.67 - RTX 4090 24GB -Nvidia: GPU Compute Cores: 16384 - RTX 2080 Ti 22GB -Dell: GPU Compute Cores: 4352 - RTX 3090 24GB -Zotac: GPU Compute Cores: 10496 - RTX 4080 16GB -Pny: GPU Compute Cores: 9728 - Python 3.10.6 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
RTX 4090 24GB -Nvidia RTX 2080 Ti 22GB -Dell RTX 3090 24GB -Zotac RTX 4080 16GB -Pny Result Overview Phoronix Test Suite 100% 184% 267% 351% 434% Chaos Group V-RAY OctaneBench NCNN Hashcat clpeak FinanceBench Blender vkpeak GROMACS IndigoBench SHOC Scalable HeterOgeneous Computing VkResample RealSR-NCNN MandelGPU NAMD CUDA NeatBench ArrayFire Waifu2x-NCNN Vulkan VkFFT cl-mem FAHBench ViennaCL LeelaChessZero
2307020-PTS-GPUREVIEW1 hashcat: MD5 hashcat: SHA1 hashcat: SHA-512 hashcat: 7-Zip hashcat: TrueCrypt RIPEMD160 + XTS fahbench: gromacs: NVIDIA CUDA GPU - water_GMX50_bare namd-cuda: ATPase Simulation - 327,506 Atoms octanebench: Total Score rodinia: OpenCL Particle Filter arrayfire: Conjugate Gradient OpenCL clpeak: Global Memory Bandwidth clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Integer Compute INT neatbench: GPU financebench: Black-Scholes OpenCL lczero: OpenCL cl-mem: Read cl-mem: Write cl-mem: Copy mandelgpu: GPU viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Max SP Flops shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - FFT SP shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - Triad shoc: OpenCL - S3D indigobench: OpenCL GPU - Supercar indigobench: OpenCL GPU - Bedroom v-ray: NVIDIA CUDA GPU v-ray: NVIDIA RTX GPU blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Pabellon Barcelona - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX caffe: AlexNet - NVIDIA CUDA - 100 caffe: AlexNet - NVIDIA CUDA - 200 caffe: AlexNet - NVIDIA CUDA - 1000 caffe: GoogleNet - NVIDIA CUDA - 100 caffe: GoogleNet - NVIDIA CUDA - 200 caffe: GoogleNet - NVIDIA CUDA - 1000 vkfft: vkresample: 2x - Single vkresample: 2x - Double vkpeak: fp32-scalar vkpeak: fp32-vec4 vkpeak: fp16-scalar vkpeak: fp16-vec4 vkpeak: fp64-scalar vkpeak: fp64-vec4 vkpeak: int32-scalar vkpeak: int32-vec4 vkpeak: int16-scalar vkpeak: int16-vec4 ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - FastestDet realsr-ncnn: 4x - Yes realsr-ncnn: 4x - No waifu2x-ncnn: 2x - 3 - Yes RTX 4090 24GB -Nvidia RTX 2080 Ti 22GB -Dell RTX 3090 24GB -Zotac RTX 4080 16GB -Pny 154600000000 49437766667 6301316667 2753238 1855288 424.8095 43.412 0.04749 1328.691152 0.8759 872.60 79605.72 1391.27 40715.61 4090 2.915 15250 886.1 788.2 410.2 957952307.2 489 602 450 661 772 726 220 442 1150 1275 1290 1340 1927 2367 781 888 1030 645 188 772 123 134 141 146 25.1128 26.3686 88620.8 2973.55 2780.22 27196.3 93.5897 964.971 25.0746 642.650 78.514 35.369 4272 5455 3.64 7.47 5.66 8.44 30.49 446.178 882.810 4374.27 1659.52 3306.62 16481.4 55798 7.807 55.967 44310.49 58585.88 44263.50 87693.33 1393.74 1394.98 44268.28 44053.50 29500.04 39200.71 5.00 1.34 2.07 1.32 1.32 2.70 0.72 2.44 4.78 1.36 1.31 2.21 8.89 2.81 2.13 30.99 1.42 20.233 5.114 2.555 51608966667 16298850000 2057233333 837550 572287 293.1585 15.367 0.09552 351.01544 4.481 1.665 505.24 13801.11 502.34 11898.05 2080 8.691 13072 544.2 454.3 319.4 445572952.0 304 393 305 475 517 524 296 370 459 459 456 461 1756 2244 760 883 1028 644 181 755 105.3 113 136 147 12.8419 13.2047 16152.9 1147.57 1506.64 4832.96 32.4925 366.490 12.5929 270.821 32.618 11.150 953 1297 8.85 22.67 16.64 27.39 92.64 33770 14.743 153.158 16016.43 15855.33 15477.16 30660.54 501.16 503.55 15927.93 15716.22 10277.12 12949.30 39.84 1.45 2.51 1.54 1.42 14.32 0.70 16.15 3.39 3.12 1.14 9.62 48.32 54.16 2.06 70.01 1.51 52.665 9.202 4.299 64410383333 20456100000 2565050000 1098825 748667 315.9118 23.734 0.07376 672.091431 3.751 1.586 812.52 34929.04 642.08 17879.77 3090 5.798 14369 822.6 735.1 358.3 560060544.4 364 495 370 600 716 657 186 371 585 587 583 584 1845 2273 772 853 1018 612 183 748 106 119 127 122 25.1558 26.3706 37860.8 2154.11 2346.58 7902.02 41.3159 393.321 24.5492 427.445 51.695 20.827 2059 2836 6.05 14.26 10.75 16.10 52.20 668.023 1325.86 6603.35 2425.16 4835.58 24255.9 42832 9.395 121.413 20374.61 26345.43 20038.97 39938.83 642.88 642.44 20292.97 20024.77 13260.98 16198.52 33.57 1.47 2.38 1.50 1.83 19.06 0.74 24.15 16.85 20.67 5.74 14.84 43.66 57.58 2.14 45.05 1.63 30.997 6.443 3.502 96868900000 30876583333 3934933333 1740900 1161011 422.1435 30.724 0.06247 974.892694 2.675 0.8713 611.54 47788.14 868.47 24511.74 4080 4.387 14877 624.1 567.4 381.9 742930877.1 385 488 418 541 608 597 224 436 774 797 831 852 1867 2313 799 969 1093 669 187 776 110 131 142 144 12.4460 13.2190 55546.2 3077.39 1833.12 17914.8 60.2617 1008.202 12.4919 425.050 65.860 26.105 3135 4033 4.36 9.33 7.50 10.41 37.91 532.765 1053.27 5212.82 1644.47 3271.14 16298.5 47295 11.545 88.909 27813.44 36737.85 27791.99 55051.22 873.67 873.65 27772.31 27652.07 18525.84 24631.63 4.99 1.42 1.97 1.32 1.44 2.73 0.67 2.35 2.34 1.56 1.08 2.07 9.54 2.99 1.77 36.80 1.36 24.309 5.597 2.753 OpenBenchmarking.org
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
NAMD CUDA NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.
Result
OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms RTX 4090 24GB -Nvidia RTX 2080 Ti 22GB -Dell RTX 3090 24GB -Zotac RTX 4080 16GB -Pny 0.0215 0.043 0.0645 0.086 0.1075 SE +/- 0.00011, N = 5 SE +/- 0.00075, N = 15 SE +/- 0.00070, N = 15 SE +/- 0.00095, N = 15 0.04749 0.09552 0.07376 0.06247
GPU Power Consumption
Min Avg Max RTX 4090 24GB -Nvidia 16.0 74.0 262.6 RTX 2080 Ti 22GB -Dell 23.5 126.1 278.9 RTX 3090 24GB -Zotac 27.9 150.8 338.1 RTX 4080 16GB -Pny 12.7 68.1 228.4 OpenBenchmarking.org Watts, Fewer Is Better NAMD CUDA 2.14 GPU Power Consumption Monitor 80 160 240 320 400 1. RTX 4090 24GB -Nvidia: Approximate power consumption of 843 Joules per run. 2. RTX 2080 Ti 22GB -Dell: Approximate power consumption of 1412 Joules per run. 3. RTX 3090 24GB -Zotac: Approximate power consumption of 1779 Joules per run. 4. RTX 4080 16GB -Pny: Approximate power consumption of 781 Joules per run.
GPU Temp
Min Avg Max RTX 4090 24GB -Nvidia 39.0 41.4 49.0 RTX 2080 Ti 22GB -Dell 61.0 67.6 74.0 RTX 3090 24GB -Zotac 48.0 61.2 77.0 RTX 4080 16GB -Pny 35.0 38.4 49.0 OpenBenchmarking.org Celsius, Fewer Is Better NAMD CUDA 2.14 GPU Temperature Monitor 20 40 60 80 100
Result Confidence
OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms RTX 4090 24GB -Nvidia RTX 2080 Ti 22GB -Dell RTX 3090 24GB -Zotac RTX 4080 16GB -Pny 1 2 3 4 5 Min: 0.05 / Avg: 0.05 / Max: 0.05 Min: 0.09 / Avg: 0.1 / Max: 0.1 Min: 0.07 / Avg: 0.07 / Max: 0.08 Min: 0.06 / Avg: 0.06 / Max: 0.07
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Test: OpenCL Particle Filter
RTX 4090 24GB -Nvidia: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ERROR: clEnqueueWriteBuffer seed_GPU (size:400000) => -1263408085
FinanceBench FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
SHOC Scalable HeterOgeneous Computing The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.