NVIDIA GeForce RTX 3090

AMD Ryzen 9 7950X 16-Core testing with a ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2210138-NE-2112069PT61
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 3 Tests
C++ Boost Tests 2 Tests
CPU Massive 5 Tests
Creator Workloads 6 Tests
Desktop Graphics 2 Tests
Game Development 2 Tests
HPC - High Performance Computing 8 Tests
Machine Learning 5 Tests
Multi-Core 8 Tests
NVIDIA GPU Compute 31 Tests
OpenCL 6 Tests
OpenMPI Tests 2 Tests
Python Tests 3 Tests
Renderers 4 Tests
Scientific Computing 2 Tests
Server CPU Tests 2 Tests
Vulkan Compute 8 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 3090
December 05 2021
  2 Hours, 29 Minutes
NVIDIA RTX 3090
December 05 2021
  7 Hours, 31 Minutes
NVIDIA 3090
December 06 2021
  2 Hours, 25 Minutes
RTX 4090
October 13 2022
  6 Hours, 50 Minutes
Invert Hiding All Results Option
  4 Hours, 49 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA GeForce RTX 3090 - Phoronix Test Suite

NVIDIA GeForce RTX 3090

AMD Ryzen 9 7950X 16-Core testing with a ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2210138-NE-2112069PT61&grr&sor.

NVIDIA GeForce RTX 3090ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3801 BIOS)AMD Starship/Matisse32GB1000GB Sabrent Rocket 4.0 PlusNVIDIA GeForce RTX 3090 24GBNVIDIA GA102 HD AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 21.105.13.0-22-generic (x86_64)GNOME Shell 40.5X Server 1.20.13NVIDIA 495.444.6.0OpenCL 3.0 CUDA 11.5.1001.2.186GCC 11.2.0 + Clang 13.0.0-2ext43840x2160AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS)AMD Device 14d82048GB XPG GAMMIX S70 BLADE + 4001GB SSD 870 QVO 4TBGigabyte NVIDIA GeForce RTX 4090 24GBNVIDIA Device 22baPI-KVM VideoRealtek RTL8125 2.5GbE + MEDIATEK Device 0608Ubuntu 22.045.15.0-25-generic (x86_64)GNOME Shell 42.4X Server 1.21.1.3NVIDIA 520.56.06OpenCL 3.0 CUDA 11.8.871.3.205GCC 13.0.0 20221013 + Clang 14.0.0-1ubuntu11920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- RTX 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NVIDIA RTX 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NVIDIA 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - RTX 4090: --build=x86_64-linux-gnu --disable-multilib --enable-checking=release --enable-languages=c,c++ --host=x86_64-linux-gnu --target=x86_64-linux-gnu -vProcessor Details- RTX 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- NVIDIA RTX 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- NVIDIA 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- RTX 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB- NVIDIA RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB- NVIDIA 3090: BAR1 / Visible vRAM Size: 32768 MiB- RTX 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.18.00.d2OpenCL Details- RTX 3090: GPU Compute Cores: 10496- NVIDIA RTX 3090: GPU Compute Cores: 10496- NVIDIA 3090: GPU Compute Cores: 10496- RTX 4090: GPU Compute Cores: 16384Python Details- RTX 3090: Python 3.9.7- NVIDIA RTX 3090: Python 3.9.7- NVIDIA 3090: Python 3.9.7- RTX 4090: Python 3.10.6Security Details- RTX 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - NVIDIA RTX 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - NVIDIA 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - RTX 4090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NVIDIA GeForce RTX 3090shoc: OpenCL - Max SP Flopsvkfft: etlegacy: 3840 x 2160vkpeak: fp64-vec4vkpeak: int16-vec4vkpeak: int16-scalarvkpeak: int32-vec4vkpeak: int32-scalarvkpeak: fp64-scalarvkpeak: fp16-vec4vkpeak: fp16-scalarvkpeak: fp32-vec4vkpeak: fp32-scalarlczero: OpenCLoctanebench: Total Scorewarsow: 1920 x 1080fahbench: warsow: 1920 x 1200warsow: 2560 x 1440warsow: 3840 x 2160luxcorerender: DLSC - GPUblender: Barbershop - CUDAluxcorerender: LuxCore Benchmark - GPUv-ray: NVIDIA RTX GPUv-ray: NVIDIA CUDA GPUluxcorerender: Orange Juice - GPUindigobench: OpenCL GPU - Bedroomindigobench: OpenCL GPU - Supercarluxcorerender: Danish Mood - GPUnamd-cuda: ATPase Simulation - 327,506 Atomsblender: Barbershop - NVIDIA OptiXunvanquished: 2560 x 1440 - Ultrancnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetblender: Pabellon Barcelona - CUDAclpeak: Double-Precision Doublexonotic: 3840 x 2160 - Ultimateblender: BMW27 - CUDAunvanquished: 1920 x 1080 - Ultrarealsr-ncnn: 4x - Yesxonotic: 3840 x 2160 - Ultraunvanquished: 3840 x 2160 - Ultraunvanquished: 1920 x 1200 - Ultraunvanquished: 1920 x 1080 - Highunvanquished: 3840 x 2160 - Highunvanquished: 2560 x 1440 - Highunvanquished: 1920 x 1200 - Highxonotic: 3840 x 2160 - Highvkresample: 2x - Doubleviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYunvanquished: 2560 x 1440 - Mediumunvanquished: 3840 x 2160 - Mediumunvanquished: 1920 x 1200 - Mediumunvanquished: 1920 x 1080 - Mediumblender: Fishy Cat - CUDAviennacl: OpenCL BLAS - dGEMM-TTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sCOPYblender: Classroom - CUDAxonotic: 3840 x 2160 - Lowblender: Fishy Cat - NVIDIA OptiXparaview: Many Spheres - 3840 x 2160paraview: Many Spheres - 3840 x 2160paraview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1200paraview: Many Spheres - 1920 x 1200paraview: Many Spheres - 2560 x 1440paraview: Many Spheres - 2560 x 1440shoc: OpenCL - Texture Read Bandwidthblender: Pabellon Barcelona - NVIDIA OptiXblender: Classroom - NVIDIA OptiXetlegacy: 2560 x 1440blender: BMW27 - NVIDIA OptiXetlegacy: 1920 x 1080etlegacy: 1920 x 1200vkresample: 2x - Singleparaview: Wavelet Contour - 3840 x 2160paraview: Wavelet Contour - 3840 x 2160hashcat: SHA-512paraview: Wavelet Volume - 3840 x 2160paraview: Wavelet Volume - 3840 x 2160paraview: Wavelet Contour - 2560 x 1440paraview: Wavelet Contour - 2560 x 1440paraview: Wavelet Contour - 1920 x 1200paraview: Wavelet Contour - 1920 x 1200paraview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080hashcat: SHA1hashcat: MD5hashcat: TrueCrypt RIPEMD160 + XTSluxcorerender: Rainbow Colors and Prism - GPUparaview: Wavelet Volume - 2560 x 1440paraview: Wavelet Volume - 2560 x 1440paraview: Wavelet Volume - 1920 x 1200paraview: Wavelet Volume - 1920 x 1200paraview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080realsr-ncnn: 4x - Norodinia: OpenCL Particle Filterhashcat: 7-Ziparrayfire: Conjugate Gradient OpenCLshoc: OpenCL - Bus Speed Readbackwaifu2x-ncnn: 2x - 3 - Yescl-mem: Copycl-mem: Readcl-mem: Writeshoc: OpenCL - GEMM SGEMM_Nmandelgpu: GPUshoc: OpenCL - S3Dshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Triadclpeak: Integer Compute INTshoc: OpenCL - FFT SPshoc: OpenCL - Reductionshoc: OpenCL - MD5 Hashclpeak: Single-Precision Floatclpeak: Global Memory Bandwidthfinancebench: Black-Scholes OpenCLneatbench: GPUmixbench: NVIDIA CUDA - IntegerRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409040753.945831648.416885.7913708.5820672.1920769.27653.6141184.8920949.6427806.7420927.9922739686.231258984.3353.8915984.8984.7960.891.132856211910.4520.96153.4389.180.1279256.19461.72.5415.967.023.551.911.674.175.851.063.262.081.943.171.984.2848.11658.16374.428625511.04470.828.949499.2632615456.8463482463.6469464.9556.9297001117.81791.993.888.481.576.168.142.434.923.314099.666.1492.5490.1473.6496.122.8660259960460137723966072260637550336822.76655.14258811.229169.63691.469419.39993.959414.68693.919343.81793.22246.1717.9617.4640.26.94652.3645.39.2724134.488396.7428870000006157.361384.845366.18514.935849.675561.326274.519602.09225993000007144690000084610032.338944.746559.0510151.478634.4710874.708679.675.6213.84311385001.4727.12523.313364.1794.6743.18102.38481322759.6429.08226.334225.478117725.142100.51391.20944.193235227.21813.426.25940238.844252638.7651.1716880.0113657.3620594.7820689.72649.5340926.1720717.3127457.9020826.6622711685.109656975.5353.3272980.3985.7980.711.5791.2911.202856211210.4120.95153.5009.200.1291256.35452.52.5317.316.723.551.921.674.186.071.033.252.081.82.241.974.2248.18657.71369.924219411.02469.529.064492.4660277461.2469.8479.0471.0467.6478.3567.7214300118.51790.792.584.481.975.567.141.634.723.213798.265.5488.5485.6488.5487.322.8760160160560037724065772260537150136422.69659.905939411.329217.22891.949429.53694.059451.90394.289356.71093.332240.0918.0017.43647.66.91654.5659.19.2924070.881390.6428846666676028.366376.775374.983515.785867.721563.056206.035595.52226785000007141136666782563332.608719.750544.9810300.044643.7511036.994689.815.7153.69411401001.47027.12523.362364.2794.3744.78336.71475794831.7430.34726.328225.484018044.252101.08391.57344.545335136.60813.456.25840566.244147635.5658.5117012.2313709.4120672.2120924.98658.541496.6820953.7227806.5520960.2523029681.126108984.8354.4623984.9985.5978.611.6190.3611.222829211510.4320.94353.69.010.1277956.34466.62.5319.737.433.561.91.74.175.751.053.422.081.82.241.974.2348.09658.16367.105263211.09474.528.891490.2778714469.7471.1440472.9475.7474.6566.1228693119.07390.892.886.882.675.467.542.234.323.213798.365.3481.2483484491.922.7860560260660437723763772360837150036422.64645.851590811.229226.85492.039402.6393.799236.62992.139380.93293.572245.2317.9617.98643.76.92659.8657.89.2914081.298391.6328929000006107.065381.695430.386521.095836.216560.036247.735599.52228131000007143620000081630032.259049.543565.610497.894656.1211143.486696.475.6733.62711415001.48527.09043.318362.8795.2742.88098.46472928214.8430.27926.320225.450518742.72101.82392.0344.535835225.4813.476.25688132.776599219.31410.4439651.7529775.3844563.4944768.161409.1988463.0144666.1159114.7544774.481300.965825971.1425.8766973.0980.1985.421.3849.3118.3117.4733.04474.72817.390.1678833.55608.21.643.434.802.811.871.133.682.690.782.111.321.251.51.273.0522.221408.89457.497331510.25611.318.731586.6807687597.3615.4620.3607.3618.8628.5701.857374855.37111411810711113393.896.987.658.0332311206637.5636.9638.9640.112.59135012971283116044122172077165944757545213.30830.01988466.0517496.043174.5217448.869174.0417473.094174.2917441.984173.982939.5911.0611.13799.74.48794.0811.47.7608296.768796.14630023333318202.5411137.668282.332794.768344.584800.738320.030798.3749722966667151433333333182761435.7418033.6441127.1017621.0591101.3217786.5291111.664.1662.05525477000.93473.35792.506411.7887.9804.327192.4587587462.0646.0623.38783.366341856.422787.07953.25194.619581211.79870.622.9534090OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309020K40K60K80K100KSE +/- 781.51, N = 3SE +/- 297.72, N = 388132.740753.940566.240238.81. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1RTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 309016K32K48K64K80KSE +/- 489.60, N = 3SE +/- 384.18, N = 9765994583144252441471. (CXX) g++ options: -O3

ET: Legacy

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 6.04, N = 3SE +/- 113.10, N = 6648.4638.7635.5219.3

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4RTX 4090NVIDIA 3090NVIDIA RTX 309030060090012001500SE +/- 1.24, N = 3SE +/- 1.42, N = 31410.44658.51651.17

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-vec4RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30908K16K24K32K40KSE +/- 18.88, N = 3SE +/- 2.98, N = 339651.7517012.2316885.7916880.01

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-scalarRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30906K12K18K24K30KSE +/- 36.89, N = 3SE +/- 29.87, N = 329775.3813709.4113708.5813657.36

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309010K20K30K40K50KSE +/- 2.83, N = 3SE +/- 44.80, N = 344563.4920672.2120672.1920594.78

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309010K20K30K40K50KSE +/- 1.91, N = 3SE +/- 44.97, N = 344768.1620924.9820769.2720689.72

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309030060090012001500SE +/- 0.03, N = 3SE +/- 0.83, N = 31409.19658.50653.61649.53

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-vec4RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309020K40K60K80K100KSE +/- 3.14, N = 3SE +/- 52.39, N = 388463.0141496.6841184.8940926.17

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-scalarRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309010K20K30K40K50KSE +/- 58.62, N = 3SE +/- 45.34, N = 344666.1120953.7220949.6420717.31

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309013K26K39K52K65KSE +/- 88.56, N = 3SE +/- 69.31, N = 359114.7527806.7427806.5527457.90

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309010K20K30K40K50KSE +/- 61.65, N = 3SE +/- 65.72, N = 344774.4820960.2520927.9920826.66

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLNVIDIA 3090RTX 3090NVIDIA RTX 30905K10K15K20K25KSE +/- 98.83, N = 32302922739227111. (CXX) g++ options: -flto -pthread

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090300600900120015001300.97686.23685.11681.13

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080NVIDIA 3090RTX 3090NVIDIA RTX 3090RTX 40902004006008001000SE +/- 5.54, N = 3SE +/- 7.67, N = 10984.8984.3975.5971.1

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309090180270360450SE +/- 1.01, N = 3SE +/- 0.32, N = 3425.88354.46353.89353.33

Warsow

Resolution: 1920 x 1200

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1200NVIDIA 3090RTX 3090NVIDIA RTX 3090RTX 40902004006008001000SE +/- 5.60, N = 3SE +/- 4.70, N = 3984.9984.8980.3973.0

Warsow

Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 2560 x 1440NVIDIA RTX 3090NVIDIA 3090RTX 3090RTX 40902004006008001000SE +/- 0.35, N = 3SE +/- 5.45, N = 3985.7985.5984.7980.1

Warsow

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 3840 x 2160RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30902004006008001000SE +/- 0.07, N = 3SE +/- 0.78, N = 3985.4980.7978.6960.8

LuxCoreRender

Scene: DLSC - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: GPURTX 4090NVIDIA 3090NVIDIA RTX 3090510152025SE +/- 0.02, N = 3SE +/- 0.03, N = 321.3811.6111.57MIN: 18.47 / MAX: 21.85MIN: 11.41 / MAX: 11.72MIN: 11.39 / MAX: 11.73

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Barbershop - Compute: CUDARTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309020406080100SE +/- 0.17, N = 3SE +/- 0.17, N = 349.3190.3691.1391.29

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: GPURTX 4090NVIDIA 3090NVIDIA RTX 3090510152025SE +/- 0.09, N = 3SE +/- 0.01, N = 318.3111.2211.20MIN: 7.1 / MAX: 23.24MIN: 3.57 / MAX: 13.17MIN: 3.54 / MAX: 13.16

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

OpenBenchmarking.orgvrays, More Is BetterChaos Group V-RAY 5Mode: NVIDIA RTX GPUNVIDIA RTX 3090RTX 3090NVIDIA 30906001200180024003000SE +/- 11.72, N = 3285628562829

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 5Mode: NVIDIA CUDA GPURTX 3090NVIDIA 3090NVIDIA RTX 30905001000150020002500SE +/- 0.33, N = 3211921152112

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: GPURTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309048121620SE +/- 0.04, N = 3SE +/- 0.10, N = 317.4710.4510.4310.41MIN: 0.39 / MAX: 25.03MIN: 8.53 / MAX: 13.8MIN: 8.54 / MAX: 13.75MIN: 8.47 / MAX: 13.76

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 333.0420.9620.9520.94

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 309020406080100SE +/- 0.02, N = 3SE +/- 0.03, N = 374.7353.6053.5053.44

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: GPURTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 309048121620SE +/- 0.20, N = 3SE +/- 0.08, N = 317.399.209.189.01MIN: 0.07 / MAX: 21.12MIN: 3.03 / MAX: 10.91MIN: 3.53 / MAX: 10.86MIN: 3.3 / MAX: 10.77

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsNVIDIA 3090RTX 3090NVIDIA RTX 3090RTX 40900.03780.07560.11340.15120.189SE +/- 0.00047, N = 3SE +/- 0.00009, N = 30.127790.127920.129120.16788

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Barbershop - Compute: NVIDIA OptiXRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30901326395265SE +/- 0.02, N = 3SE +/- 0.06, N = 333.5556.1956.3456.35

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: UltraRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090130260390520650SE +/- 0.07, N = 3SE +/- 14.16, N = 12608.2466.6461.7452.5

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.57151.1431.71452.2862.8575SE +/- 0.00, N = 3SE +/- 0.01, N = 31.642.532.532.54MIN: 1.61 / MAX: 2.15MIN: 2.49 / MAX: 3.64MIN: 2.51 / MAX: 2.76MIN: 2.5 / MAX: 3.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090510152025SE +/- 0.03, N = 3SE +/- 1.24, N = 33.4315.9617.3119.73MIN: 3.34 / MAX: 4.06MIN: 5.77 / MAX: 36.07MIN: 5.63 / MAX: 36.32MIN: 7.36 / MAX: 35.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyRTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 3090246810SE +/- 0.00, N = 3SE +/- 0.04, N = 34.806.727.027.43MIN: 4.73 / MAX: 5.37MIN: 6.3 / MAX: 19.53MIN: 6.34 / MAX: 30.81MIN: 6.36 / MAX: 36.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50RTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 30900.8011.6022.4033.2044.005SE +/- 0.00, N = 3SE +/- 0.00, N = 32.813.553.553.56MIN: 2.79 / MAX: 3.38MIN: 3.52 / MAX: 3.69MIN: 3.52 / MAX: 3.75MIN: 3.53 / MAX: 3.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30900.4320.8641.2961.7282.16SE +/- 0.01, N = 3SE +/- 0.01, N = 31.871.901.911.92MIN: 1.83 / MAX: 2.45MIN: 1.87 / MAX: 2.6MIN: 1.86 / MAX: 9.04MIN: 1.87 / MAX: 7.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18RTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 30900.38250.7651.14751.531.9125SE +/- 0.01, N = 3SE +/- 0.00, N = 31.131.671.671.70MIN: 1.11 / MAX: 1.43MIN: 1.64 / MAX: 1.9MIN: 1.64 / MAX: 4.75MIN: 1.65 / MAX: 9.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30900.94051.8812.82153.7624.7025SE +/- 0.00, N = 3SE +/- 0.02, N = 33.684.174.174.18MIN: 3.66 / MAX: 4.19MIN: 4.12 / MAX: 4.86MIN: 4.13 / MAX: 4.34MIN: 4.12 / MAX: 11.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090246810SE +/- 0.03, N = 3SE +/- 0.26, N = 32.695.755.856.07MIN: 2.2 / MAX: 4.95MIN: 3.72 / MAX: 31MIN: 3.71 / MAX: 29.49MIN: 3.71 / MAX: 31.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.23850.4770.71550.9541.1925SE +/- 0.01, N = 3SE +/- 0.00, N = 30.781.031.051.06MIN: 0.76 / MAX: 1.68MIN: 1 / MAX: 2.07MIN: 1.01 / MAX: 2.18MIN: 1.02 / MAX: 2.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30900.76951.5392.30853.0783.8475SE +/- 0.00, N = 3SE +/- 0.00, N = 32.113.253.263.42MIN: 2.09 / MAX: 2.75MIN: 3.22 / MAX: 4.42MIN: 3.23 / MAX: 3.42MIN: 3.24 / MAX: 4.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 30900.4680.9361.4041.8722.34SE +/- 0.00, N = 3SE +/- 0.01, N = 31.322.082.082.08MIN: 1.31 / MAX: 2MIN: 2.06 / MAX: 2.29MIN: 2.05 / MAX: 3.23MIN: 2.06 / MAX: 2.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.43650.8731.30951.7462.1825SE +/- 0.00, N = 3SE +/- 0.00, N = 31.251.801.801.94MIN: 1.23 / MAX: 2.16MIN: 1.76 / MAX: 3MIN: 1.77 / MAX: 2.69MIN: 1.78 / MAX: 2.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.71331.42662.13992.85323.5665SE +/- 0.00, N = 3SE +/- 0.00, N = 31.502.242.243.17MIN: 1.47 / MAX: 2.71MIN: 2.21 / MAX: 3.37MIN: 2.21 / MAX: 4.61MIN: 2.21 / MAX: 20.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.44550.8911.33651.7822.2275SE +/- 0.00, N = 3SE +/- 0.00, N = 31.271.971.971.98MIN: 1.25 / MAX: 2.23MIN: 1.94 / MAX: 3.15MIN: 1.94 / MAX: 2.26MIN: 1.94 / MAX: 6.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30900.9631.9262.8893.8524.815SE +/- 0.00, N = 3SE +/- 0.01, N = 33.054.224.234.28MIN: 3.02 / MAX: 4.05MIN: 4.13 / MAX: 4.48MIN: 4.16 / MAX: 4.41MIN: 4.15 / MAX: 15.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Pabellon Barcelona - Compute: CUDARTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30901122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 322.2248.0948.1148.18

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 309030060090012001500SE +/- 0.23, N = 3SE +/- 0.03, N = 31408.89658.16658.16657.711. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Ultimate

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: UltimateRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090100200300400500SE +/- 0.84, N = 3SE +/- 1.53, N = 3457.50374.43369.92367.11MIN: 72 / MAX: 1128MIN: 65 / MAX: 751MIN: 58 / MAX: 764MIN: 65 / MAX: 749

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: BMW27 - Compute: CUDARTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30903691215SE +/- 3.94, N = 15SE +/- 0.01, N = 310.2511.0211.0411.09

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: UltraRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090130260390520650SE +/- 0.91, N = 3SE +/- 1.10, N = 3611.3474.5470.8469.5

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090714212835SE +/- 0.00, N = 3SE +/- 0.06, N = 318.7328.8928.9529.06

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: UltraRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090130260390520650SE +/- 0.80, N = 3SE +/- 1.15, N = 3586.68499.26492.47490.28MIN: 218 / MAX: 1144MIN: 117 / MAX: 913MIN: 93 / MAX: 902MIN: 122 / MAX: 891

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: UltraRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090130260390520650SE +/- 1.07, N = 3SE +/- 4.41, N = 3597.3469.7461.2456.8

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: UltraRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090130260390520650SE +/- 0.79, N = 3SE +/- 5.55, N = 3615.4471.1469.8463.0

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: HighRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090130260390520650SE +/- 1.32, N = 3SE +/- 2.19, N = 3620.3482.0479.0440.0

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: HighRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090130260390520650SE +/- 1.30, N = 3SE +/- 3.53, N = 3607.3472.9471.0463.6

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: HighRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090130260390520650SE +/- 1.30, N = 3SE +/- 2.97, N = 3618.8475.7469.0467.6

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: HighRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 3090140280420560700SE +/- 3.34, N = 3SE +/- 4.62, N = 3628.5478.3474.6464.9

Xonotic

Resolution: 3840 x 2160 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: HighRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 3090150300450600750SE +/- 3.30, N = 3SE +/- 1.97, N = 3701.86567.72566.12556.93MIN: 193 / MAX: 1442MIN: 91 / MAX: 1163MIN: 113 / MAX: 1151MIN: 114 / MAX: 1122

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090306090120150SE +/- 0.01, N = 3SE +/- 0.08, N = 355.37117.82118.52119.071. (CXX) g++ options: -O3

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 3090306090120150SE +/- 0.26, N = 3114.091.990.890.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 3090306090120150SE +/- 0.67, N = 3SE +/- 0.32, N = 3118.093.892.892.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309020406080100SE +/- 0.12, N = 3107.088.486.884.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 309020406080100SE +/- 0.18, N = 3111.082.681.981.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090306090120150SE +/- 0.36, N = 3133.076.175.575.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309020406080100SE +/- 9.12, N = 3SE +/- 0.17, N = 393.868.167.567.11. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309020406080100SE +/- 0.12, N = 3SE +/- 0.42, N = 396.942.442.241.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 309020406080100SE +/- 0.06, N = 3SE +/- 0.00, N = 387.634.934.734.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30901326395265SE +/- 0.06, N = 3SE +/- 0.00, N = 358.023.323.223.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309070140210280350SE +/- 2.67, N = 3SE +/- 0.67, N = 33321401371371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309070140210280350SE +/- 1.45, N = 3SE +/- 0.68, N = 3311.099.698.398.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 309050100150200250SE +/- 0.88, N = 3SE +/- 0.09, N = 3206.066.165.565.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: MediumRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090140280420560700SE +/- 0.49, N = 3SE +/- 3.75, N = 3637.5492.5488.5481.2

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: MediumRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090140280420560700SE +/- 6.63, N = 3SE +/- 2.17, N = 3636.9490.1485.6483.0

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: MediumRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 3090140280420560700SE +/- 2.89, N = 3SE +/- 1.98, N = 3638.9488.5484.0473.6

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: MediumRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 3090140280420560700SE +/- 5.77, N = 3SE +/- 3.91, N = 3640.1496.1491.9487.3

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Fishy Cat - Compute: CUDARTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090510152025SE +/- 0.00, N = 3SE +/- 0.04, N = 312.5922.7822.8622.87

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30903006009001200150013506056026011. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 309030060090012001500SE +/- 3.33, N = 3SE +/- 1.33, N = 312976026015991. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 309030060090012001500SE +/- 3.33, N = 3SE +/- 1.67, N = 312836066056041. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090200400600800100011606046016001. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090100200300400500SE +/- 0.33, N = 34413773773771. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NNVIDIA RTX 3090RTX 3090NVIDIA 3090RTX 409050100150200250SE +/- 0.33, N = 32402392372211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090160320480640800SE +/- 0.58, N = 37206606576371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090170340510680850SE +/- 0.58, N = 3SE +/- 0.00, N = 37717237227221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 3090140280420560700SE +/- 0.33, N = 36596086066051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 3090100200300400500SE +/- 0.58, N = 34473753713711. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090120240360480600SE +/- 0.33, N = 35755035015001. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 3090100200300400500SE +/- 0.88, N = 34523683643641. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Classroom - Compute: CUDARTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 313.3022.6422.6922.76

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Low

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: LowRTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30902004006008001000SE +/- 2.14, N = 3SE +/- 0.55, N = 3830.02659.91655.14645.85MIN: 217 / MAX: 1735MIN: 109 / MAX: 1303MIN: 103 / MAX: 1300MIN: 117 / MAX: 1283

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30903691215SE +/- 0.05, N = 13SE +/- 0.09, N = 36.0511.2211.2211.32

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 2160RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904K8K12K16K20KSE +/- 24.18, N = 3SE +/- 6.01, N = 317496.049226.859217.239169.64

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 2160RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904080120160200SE +/- 0.24, N = 3SE +/- 0.06, N = 3174.5292.0391.9491.46

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30904K8K12K16K20KSE +/- 9.92, N = 3SE +/- 4.72, N = 317448.879429.549419.409402.63

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30904080120160200SE +/- 0.10, N = 3SE +/- 0.05, N = 3174.0494.0593.9593.79

ParaView

Test: Many Spheres - Resolution: 1920 x 1200

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1200RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30904K8K12K16K20KSE +/- 25.11, N = 3SE +/- 5.73, N = 317473.099451.909414.699236.63

ParaView

Test: Many Spheres - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1200RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30904080120160200SE +/- 0.25, N = 3SE +/- 0.06, N = 3174.2994.2893.9192.13

ParaView

Test: Many Spheres - Resolution: 2560 x 1440

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 2560 x 1440RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904K8K12K16K20KSE +/- 2.22, N = 3SE +/- 7.77, N = 317441.989380.939356.719343.82

ParaView

Test: Many Spheres - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 2560 x 1440RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904080120160200SE +/- 0.02, N = 3SE +/- 0.08, N = 3173.9893.5793.3393.20

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30906001200180024003000SE +/- 0.34, N = 3SE +/- 3.07, N = 32939.592246.172245.232240.091. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309048121620SE +/- 0.01, N = 3SE +/- 0.02, N = 311.0617.9617.9618.00

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Classroom - Compute: NVIDIA OptiXRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 309048121620SE +/- 0.02, N = 3SE +/- 0.02, N = 311.1317.4017.4317.98

ET: Legacy

Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 2560 x 1440RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30902004006008001000SE +/- 6.23, N = 3SE +/- 7.88, N = 4799.7647.6643.7640.2

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: BMW27 - Compute: NVIDIA OptiXRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 3090246810SE +/- 0.04, N = 15SE +/- 0.01, N = 34.486.916.926.94

ET: Legacy

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 1920 x 1080RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 3.83, N = 3SE +/- 3.73, N = 3794.0659.8654.5652.3

ET: Legacy

Resolution: 1920 x 1200

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 1920 x 1200RTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 30902004006008001000SE +/- 0.47, N = 3SE +/- 2.46, N = 3811.4659.1657.8645.3

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30903691215SE +/- 0.002, N = 3SE +/- 0.003, N = 37.7609.2729.2919.2921. (CXX) g++ options: -O3

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 2160RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30902K4K6K8K10KSE +/- 17.54, N = 3SE +/- 31.15, N = 38296.774134.494081.304070.88

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 2160RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30902004006008001000SE +/- 1.68, N = 3SE +/- 2.99, N = 3796.14396.74391.63390.64

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30901300M2600M3900M5200M6500MSE +/- 1942792.95, N = 3SE +/- 2179704.36, N = 36300233333289290000028870000002884666667

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 2160RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30904K8K12K16K20KSE +/- 157.61, N = 3SE +/- 65.03, N = 518202.546157.366107.076028.37

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 2160RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30902004006008001000SE +/- 9.85, N = 3SE +/- 4.06, N = 51137.66384.84381.69376.77

ParaView

Test: Wavelet Contour - Resolution: 2560 x 1440

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 2560 x 1440RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902K4K6K8K10KSE +/- 11.72, N = 3SE +/- 16.63, N = 38282.335430.395374.985366.18

ParaView

Test: Wavelet Contour - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 2560 x 1440RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 1.13, N = 3SE +/- 1.60, N = 3794.76521.09515.78514.93

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1200

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1200RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30902K4K6K8K10KSE +/- 3.68, N = 3SE +/- 54.66, N = 38344.585867.725849.685836.22

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1200RTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30902004006008001000SE +/- 0.35, N = 3SE +/- 5.24, N = 3800.73563.05561.32560.03

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30902K4K6K8K10KSE +/- 16.65, N = 3SE +/- 54.90, N = 38320.036274.526247.746206.04

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30902004006008001000SE +/- 1.60, N = 3SE +/- 5.27, N = 3798.37602.09599.52595.52

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 309011000M22000M33000M44000M55000MSE +/- 15117135.24, N = 3SE +/- 32122473.96, N = 349722966667228131000002267850000022599300000

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309030000M60000M90000M120000M150000MSE +/- 166666666.67, N = 3SE +/- 78065236.25, N = 3151433333333714469000007143620000071411366667

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090400K800K1200K1600K2000KSE +/- 16650.59, N = 7SE +/- 4520.08, N = 31827614846100825633816300

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: GPURTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 3090816243240SE +/- 0.22, N = 3SE +/- 0.21, N = 335.7432.6032.3332.25MIN: 34.18 / MAX: 37.28MIN: 30.12 / MAX: 34.6MIN: 30.03 / MAX: 34.38MIN: 29.03 / MAX: 34.55

ParaView

Test: Wavelet Volume - Resolution: 2560 x 1440

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 2560 x 1440RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30904K8K12K16K20KSE +/- 106.54, N = 3SE +/- 49.80, N = 318033.649049.548944.758719.75

ParaView

Test: Wavelet Volume - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 2560 x 1440RTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30902004006008001000SE +/- 6.66, N = 3SE +/- 3.11, N = 31127.10565.60559.05544.98

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1200

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1200RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904K8K12K16K20KSE +/- 64.74, N = 3SE +/- 74.83, N = 317621.0610497.8910300.0410151.48

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1200RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 4.05, N = 3SE +/- 4.68, N = 31101.32656.12643.75634.47

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30904K8K12K16K20KSE +/- 42.71, N = 3SE +/- 88.00, N = 317786.5311143.4911036.9910874.71

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080RTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 2.67, N = 3SE +/- 5.50, N = 31111.66696.47689.81679.67

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30901.28592.57183.85775.14366.4295SE +/- 0.005, N = 3SE +/- 0.022, N = 34.1665.6215.6735.715

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30900.86471.72942.59413.45884.3235SE +/- 0.050, N = 3SE +/- 0.013, N = 32.0553.6273.6943.8431. (CXX) g++ options: -O2 -lOpenCL

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090500K1000K1500K2000K2500KSE +/- 4152.51, N = 3SE +/- 4014.97, N = 32547700114150011401001138500

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 30900.33410.66821.00231.33641.6705SE +/- 0.0019, N = 3SE +/- 0.0055, N = 30.93471.47001.47001.48501. (CXX) g++ options: -rdynamic

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackNVIDIA RTX 3090RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0002, N = 3SE +/- 0.0000, N = 327.125227.125227.09043.35791. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 30900.75651.5132.26953.0263.7825SE +/- 0.006, N = 3SE +/- 0.004, N = 32.5063.3133.3183.362

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 309090180270360450SE +/- 0.17, N = 3SE +/- 0.19, N = 3411.7364.2364.1362.81. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRTX 4090NVIDIA 3090RTX 3090NVIDIA RTX 30902004006008001000SE +/- 0.09, N = 3SE +/- 0.84, N = 3887.9795.2794.6794.31. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30902004006008001000SE +/- 0.43, N = 3SE +/- 0.33, N = 3804.3744.7743.1742.81. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRTX 4090NVIDIA RTX 3090RTX 3090NVIDIA 30906K12K18K24K30KSE +/- 2.47, N = 3SE +/- 67.62, N = 327192.408336.718102.388098.461. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURTX 4090RTX 3090NVIDIA RTX 3090NVIDIA 3090130M260M390M520M650MSE +/- 430679.16, N = 3SE +/- 1595019.30, N = 3587587462.0481322759.6475794831.7472928214.81. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 3090140280420560700SE +/- 0.34, N = 3SE +/- 0.21, N = 3646.06430.35430.28429.081. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0129, N = 3SE +/- 0.0000, N = 326.334226.328226.32023.38781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadNVIDIA RTX 3090RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0039, N = 3SE +/- 0.0005, N = 325.484025.478125.45053.36631. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30909K18K27K36K45KSE +/- 370.89, N = 3SE +/- 115.05, N = 341856.4218742.7018044.2517725.141. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30906001200180024003000SE +/- 1.73, N = 3SE +/- 0.42, N = 32787.072101.822101.082100.511. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 0.25, N = 3SE +/- 0.19, N = 3953.25392.03391.57391.211. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRTX 4090NVIDIA RTX 3090NVIDIA 3090RTX 309020406080100SE +/- 1.03, N = 15SE +/- 0.00, N = 394.6244.5544.5444.191. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRTX 4090RTX 3090NVIDIA 3090NVIDIA RTX 309020K40K60K80K100KSE +/- 435.51, N = 3SE +/- 88.80, N = 381211.7935227.2135225.4035136.601. (CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 30902004006008001000SE +/- 0.19, N = 3SE +/- 0.02, N = 3870.62813.47813.45813.421. (CXX) g++ options: -O3 -rdynamic -lOpenCL

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLRTX 4090NVIDIA 3090NVIDIA RTX 3090RTX 3090246810SE +/- 0.024, N = 3SE +/- 0.004, N = 32.9536.2566.2586.2591. (CXX) g++ options: -O3 -march=native -fopenmp

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPURTX 409090018002700360045004090


Phoronix Test Suite v10.8.4