NVIDIA GeForce RTX 3090

AMD Ryzen 9 7950X 16-Core testing with a ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2210138-NE-2112069PT61
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 3 Tests
C++ Boost Tests 2 Tests
CPU Massive 5 Tests
Creator Workloads 6 Tests
Desktop Graphics 2 Tests
Game Development 2 Tests
HPC - High Performance Computing 8 Tests
Machine Learning 5 Tests
Multi-Core 8 Tests
NVIDIA GPU Compute 31 Tests
OpenCL 6 Tests
OpenMPI Tests 2 Tests
Python Tests 3 Tests
Renderers 4 Tests
Scientific Computing 2 Tests
Server CPU Tests 2 Tests
Vulkan Compute 8 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 3090
December 05 2021
  2 Hours, 29 Minutes
NVIDIA RTX 3090
December 05 2021
  7 Hours, 31 Minutes
NVIDIA 3090
December 06 2021
  2 Hours, 25 Minutes
RTX 4090
October 13 2022
  6 Hours, 50 Minutes
Invert Hiding All Results Option
  4 Hours, 49 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA GeForce RTX 3090 - Phoronix Test Suite

NVIDIA GeForce RTX 3090

AMD Ryzen 9 7950X 16-Core testing with a ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS) and Gigabyte NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2210138-NE-2112069PT61&grs&rdt.

NVIDIA GeForce RTX 3090ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3801 BIOS)AMD Starship/Matisse32GB1000GB Sabrent Rocket 4.0 PlusNVIDIA GeForce RTX 3090 24GBNVIDIA GA102 HD AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 21.105.13.0-22-generic (x86_64)GNOME Shell 40.5X Server 1.20.13NVIDIA 495.444.6.0OpenCL 3.0 CUDA 11.5.1001.2.186GCC 11.2.0 + Clang 13.0.0-2ext43840x2160AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)ASUS TUF GAMING X670E-PLUS WIFI (0613 BIOS)AMD Device 14d82048GB XPG GAMMIX S70 BLADE + 4001GB SSD 870 QVO 4TBGigabyte NVIDIA GeForce RTX 4090 24GBNVIDIA Device 22baPI-KVM VideoRealtek RTL8125 2.5GbE + MEDIATEK Device 0608Ubuntu 22.045.15.0-25-generic (x86_64)GNOME Shell 42.4X Server 1.21.1.3NVIDIA 520.56.06OpenCL 3.0 CUDA 11.8.871.3.205GCC 13.0.0 20221013 + Clang 14.0.0-1ubuntu11920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- RTX 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NVIDIA RTX 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NVIDIA 3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - RTX 4090: --build=x86_64-linux-gnu --disable-multilib --enable-checking=release --enable-languages=c,c++ --host=x86_64-linux-gnu --target=x86_64-linux-gnu -vProcessor Details- RTX 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- NVIDIA RTX 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- NVIDIA 3090: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016- RTX 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203Graphics Details- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB- NVIDIA RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB- NVIDIA 3090: BAR1 / Visible vRAM Size: 32768 MiB- RTX 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.18.00.d2OpenCL Details- RTX 3090: GPU Compute Cores: 10496- NVIDIA RTX 3090: GPU Compute Cores: 10496- NVIDIA 3090: GPU Compute Cores: 10496- RTX 4090: GPU Compute Cores: 16384Python Details- RTX 3090: Python 3.9.7- NVIDIA RTX 3090: Python 3.9.7- NVIDIA 3090: Python 3.9.7- RTX 4090: Python 3.10.6Security Details- RTX 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - NVIDIA RTX 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - NVIDIA 3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - RTX 4090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NVIDIA GeForce RTX 3090shoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Triadshoc: OpenCL - GEMM SGEMM_Nviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYparaview: Wavelet Volume - 3840 x 2160paraview: Wavelet Volume - 3840 x 2160viennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYshoc: OpenCL - Reductionviennacl: CPU BLAS - sDOTclpeak: Integer Compute INTvkpeak: int16-vec4viennacl: CPU BLAS - dDOTclpeak: Single-Precision Floatviennacl: OpenCL BLAS - dGEMM-TThashcat: TrueCrypt RIPEMD160 + XTShashcat: 7-Ziphashcat: SHA1shoc: OpenCL - Max SP Flopshashcat: SHA-512vkpeak: int16-scalarvkpeak: fp64-scalarblender: Pabellon Barcelona - CUDAvkpeak: fp64-vec4viennacl: OpenCL BLAS - dGEMM-TNvkpeak: int32-vec4vkpeak: int32-scalarvkpeak: fp16-vec4vkpeak: fp16-scalarvkpeak: fp32-vec4vkresample: 2x - Doublevkpeak: fp32-scalarclpeak: Double-Precision Doubleshoc: OpenCL - MD5 Hashviennacl: OpenCL BLAS - dGEMM-NThashcat: MD5financebench: Black-Scholes OpenCLncnn: Vulkan GPU-v3-v3 - mobilenet-v3paraview: Wavelet Volume - 2560 x 1440paraview: Wavelet Volume - 2560 x 1440paraview: Wavelet Contour - 3840 x 2160paraview: Wavelet Contour - 3840 x 2160viennacl: OpenCL BLAS - dGEMM-NNluxcorerender: Danish Mood - GPUoctanebench: Total Scoreparaview: Many Spheres - 3840 x 2160paraview: Many Spheres - 3840 x 2160paraview: Many Spheres - 1920 x 1200paraview: Many Spheres - 1920 x 1200blender: Fishy Cat - NVIDIA OptiXrodinia: OpenCL Particle Filterparaview: Many Spheres - 2560 x 1440paraview: Many Spheres - 2560 x 1440paraview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080blender: Barbershop - CUDAluxcorerender: DLSC - GPUblender: Fishy Cat - CUDAviennacl: CPU BLAS - dGEMV-Tparaview: Wavelet Volume - 1920 x 1200paraview: Wavelet Volume - 1920 x 1200vkfft: blender: Classroom - CUDAblender: Barbershop - NVIDIA OptiXluxcorerender: Orange Juice - GPUparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080luxcorerender: LuxCore Benchmark - GPUblender: Pabellon Barcelona - NVIDIA OptiXncnn: Vulkan GPU - efficientnet-b0blender: Classroom - NVIDIA OptiXarrayfire: Conjugate Gradient OpenCLindigobench: OpenCL GPU - Bedroomncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - shufflenet-v2realsr-ncnn: 4x - Yesblender: BMW27 - NVIDIA OptiXncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - yolov4-tinyparaview: Wavelet Contour - 2560 x 1440paraview: Wavelet Contour - 2560 x 1440shoc: OpenCL - S3Dncnn: Vulkan GPU - resnet18paraview: Wavelet Contour - 1920 x 1200paraview: Wavelet Contour - 1920 x 1200unvanquished: 1920 x 1080 - Highncnn: Vulkan GPU - mobilenetindigobench: OpenCL GPU - Supercarrealsr-ncnn: 4x - Noviennacl: CPU BLAS - dGEMM-NNncnn: Vulkan GPU - blazefaceunvanquished: 1920 x 1200 - Highunvanquished: 1920 x 1200 - Mediumwaifu2x-ncnn: 2x - 3 - Yesparaview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080unvanquished: 1920 x 1200 - Ultrashoc: OpenCL - FFT SPunvanquished: 2560 x 1440 - Mediumunvanquished: 2560 x 1440 - Highunvanquished: 3840 x 2160 - Mediumnamd-cuda: ATPase Simulation - 327,506 Atomsunvanquished: 1920 x 1080 - Mediumshoc: OpenCL - Texture Read Bandwidthunvanquished: 3840 x 2160 - Highunvanquished: 3840 x 2160 - Ultraunvanquished: 1920 x 1080 - Ultraxonotic: 3840 x 2160 - Lowviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTncnn: Vulkan GPU - resnet50xonotic: 3840 x 2160 - Highetlegacy: 1920 x 1200viennacl: CPU BLAS - dGEMM-TTetlegacy: 2560 x 1440xonotic: 3840 x 2160 - Ultimatemandelgpu: GPUviennacl: OpenCL BLAS - sCOPYetlegacy: 1920 x 1080fahbench: viennacl: OpenCL BLAS - sDOTvkresample: 2x - Singlexonotic: 3840 x 2160 - Ultraviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - sAXPYncnn: Vulkan GPU - vgg16cl-mem: Copyviennacl: OpenCL BLAS - dDOTcl-mem: Readluxcorerender: Rainbow Colors and Prism - GPUviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dGEMV-Ncl-mem: Writeclpeak: Global Memory Bandwidthviennacl: OpenCL BLAS - dAXPYncnn: Vulkan GPU - alexnetwarsow: 3840 x 2160warsow: 1920 x 1080lczero: OpenCLwarsow: 1920 x 1200v-ray: NVIDIA RTX GPUwarsow: 2560 x 1440v-ray: NVIDIA CUDA GPUneatbench: GPUblender: BMW27 - CUDAncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - googlenetviennacl: CPU BLAS - dGEMV-Nunvanquished: 2560 x 1440 - Ultraetlegacy: 3840 x 2160waifu2x-ncnn: 2x - 3 - NoRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409027.125226.334225.47818102.3899.666.1384.846157.36134.923.3391.20914017725.1416885.7942.435227.2160284610011385002259930000040753.9288700000013708.58653.6148.1159920672.1920769.2741184.8920949.6427806.74117.81720927.99658.1644.1932604714469000006.2593.17559.058944.7464134.488396.746019.18686.23125891.469169.63693.919414.68611.223.84393.29343.8179419.39993.9591.1322.8676.110151.478634.474583122.7656.1910.45679.6710874.70817.963.2617.41.4720.9612.081.981.9428.9496.942.547.02514.935366.18429.0821.67561.325849.6754824.2853.4385.62181.51.06464.9473.63.3136274.519602.094632100.51492.5469490.10.12792496.12246.17463.6456.8470.8655.14258893.888.43.55556.9297001645.391.9640.2374.4286255481322759.6368652.3353.89153759.272499.26326153775034.17364.1660794.632.33606239743.1813.427221.91960.8984.322739984.82856984.7211911.0415.965.8568.1461.7648.427.125226.328225.48408336.7198.265.5376.776028.36634.723.2391.57313718044.2516880.0141.635136.6060182563311401002267850000040238.8288466666713657.36649.5348.18651.1760120594.7820689.7240926.1720717.3127457.90118.51720826.66657.7144.5453605714113666676.2582.24544.988719.7504070.881390.646009.20685.10965691.949217.22894.289451.90311.323.69493.339356.7109429.53694.0591.2911.5722.8775.510300.044643.754425222.6956.3510.41689.8111036.99411.2018.003.2517.431.47020.9512.081.971.829.0646.912.536.72515.785374.983430.3471.67563.055867.721479.04.2253.5005.71581.91.03478.3488.53.3626206.035595.52469.82101.08488.5467.6485.60.12912487.32240.09471.0461.2469.5659.905939492.584.43.55567.7214300659.190.7647.6369.9242194475794831.7364654.5353.32723719.292492.46602773775014.18364.2657794.332.60605240744.7813.457221.92980.7975.522711980.32856985.7211211.0217.316.0767.1452.5638.727.090426.320225.45058098.4698.365.3381.696107.06534.323.2392.0313718742.717012.2342.235225.460581630011415002281310000040566.2289290000013709.41658.548.09658.5160220672.2120924.9841496.6820953.7227806.55119.07320960.25658.1644.5358606714362000006.2562.24565.69049.5434081.298391.636049.01681.12610892.039226.85492.139236.62911.223.62793.579380.9329402.6393.7990.3611.6122.7875.410497.894656.124414722.6456.3410.43696.4711143.48611.2217.963.4217.981.48520.9432.081.971.828.8916.922.537.43521.095430.386430.2791.7560.035836.2164404.2353.65.67382.61.05474.64843.3186247.735599.52471.12101.82481.2475.74830.12779491.92245.23472.9469.7474.5645.851590892.886.83.56566.1228693657.890.8643.7367.1052632472928214.8364659.8354.46233719.291490.27787143775004.17362.8637795.232.25608237742.8813.477231.9978.6984.823029984.92829985.5211511.0919.735.7567.5466.6635.53.35793.38783.366327192.43112061137.6618202.54187.658.0953.25133241856.4239651.7596.981211.791350182761425477004972296666788132.7630023333329775.381409.1922.221410.44129744563.4944768.1688463.0144666.1159114.7555.37144774.481408.8994.619512831514333333332.9531.51127.1018033.6448296.768796.14116017.391300.965825174.5217496.043174.2917473.0946.052.055173.9817441.98417448.869174.0449.3121.3812.5913317621.0591101.327659913.3033.5517.471111.6617786.52918.3111.062.1111.130.934733.0441.321.271.2518.7314.481.644.80794.768282.332646.0621.13800.738344.584620.33.0574.7284.1661110.78628.5638.92.5068320.030798.37615.42787.07637.5618.8636.90.16788640.12939.59607.3597.3611.3830.01988461181072.81701.8573748811.4114799.7457.4973315587587462.0452794.0425.87664477.760586.68076874415753.68411.7720887.935.74659221804.3870.627711.87985.4971.1973.0980.1409010.253.432.6993.8608.2219.3OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0002, N = 3SE +/- 0.0000, N = 327.125227.125227.09043.35791. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0129, N = 3SE +/- 0.0000, N = 326.334226.328226.32023.38781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090612182430SE +/- 0.0039, N = 3SE +/- 0.0005, N = 325.478125.484025.45053.36631. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40906K12K18K24K30KSE +/- 67.62, N = 3SE +/- 2.47, N = 38102.388336.718098.4627192.401. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409070140210280350SE +/- 0.68, N = 3SE +/- 1.45, N = 399.698.298.3311.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409050100150200250SE +/- 0.09, N = 3SE +/- 0.88, N = 366.165.565.3206.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 4.06, N = 5SE +/- 9.85, N = 3384.84376.77381.691137.66

ParaView

Test: Wavelet Volume - Resolution: 3840 x 2160

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 65.03, N = 5SE +/- 157.61, N = 36157.366028.376107.0718202.54

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.00, N = 3SE +/- 0.06, N = 334.934.734.387.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40901326395265SE +/- 0.00, N = 3SE +/- 0.06, N = 323.323.223.258.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.19, N = 3SE +/- 0.25, N = 3391.21391.57392.03953.251. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409070140210280350SE +/- 0.67, N = 3SE +/- 2.67, N = 31401371373321. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40909K18K27K36K45KSE +/- 115.05, N = 3SE +/- 370.89, N = 317725.1418044.2518742.7041856.421. (CXX) g++ options: -O3 -rdynamic -lOpenCL

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-vec4RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40908K16K24K32K40KSE +/- 2.98, N = 3SE +/- 18.88, N = 316885.7916880.0117012.2339651.75

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.42, N = 3SE +/- 0.12, N = 342.441.642.296.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020K40K60K80K100KSE +/- 88.80, N = 3SE +/- 435.51, N = 335227.2135136.6035225.4081211.791. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40903006009001200150060260160513501. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090400K800K1200K1600K2000KSE +/- 4520.08, N = 3SE +/- 16650.59, N = 78461008256338163001827614

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090500K1000K1500K2000K2500KSE +/- 4014.97, N = 3SE +/- 4152.51, N = 31138500114010011415002547700

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409011000M22000M33000M44000M55000MSE +/- 32122473.96, N = 3SE +/- 15117135.24, N = 322599300000226785000002281310000049722966667

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020K40K60K80K100KSE +/- 297.72, N = 3SE +/- 781.51, N = 340753.940238.840566.288132.71. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40901300M2600M3900M5200M6500MSE +/- 2179704.36, N = 3SE +/- 1942792.95, N = 32887000000288466666728929000006300233333

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-scalarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40906K12K18K24K30KSE +/- 29.87, N = 3SE +/- 36.89, N = 313708.5813657.3613709.4129775.38

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500SE +/- 0.83, N = 3SE +/- 0.03, N = 3653.61649.53658.501409.19

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Pabellon Barcelona - Compute: CUDARTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40901122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 348.1148.1848.0922.22

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500SE +/- 1.42, N = 3SE +/- 1.24, N = 3651.17658.511410.44

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500SE +/- 1.33, N = 3SE +/- 3.33, N = 359960160212971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409010K20K30K40K50KSE +/- 44.80, N = 3SE +/- 2.83, N = 320672.1920594.7820672.2144563.49

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409010K20K30K40K50KSE +/- 44.97, N = 3SE +/- 1.91, N = 320769.2720689.7220924.9844768.16

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-vec4RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020K40K60K80K100KSE +/- 52.39, N = 3SE +/- 3.14, N = 341184.8940926.1741496.6888463.01

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-scalarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409010K20K30K40K50KSE +/- 45.34, N = 3SE +/- 58.62, N = 320949.6420717.3120953.7244666.11

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409013K26K39K52K65KSE +/- 69.31, N = 3SE +/- 88.56, N = 327806.7427457.9027806.5559114.75

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090306090120150SE +/- 0.08, N = 3SE +/- 0.01, N = 3117.82118.52119.0755.371. (CXX) g++ options: -O3

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409010K20K30K40K50KSE +/- 65.72, N = 3SE +/- 61.65, N = 320927.9920826.6620960.2544774.48

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500SE +/- 0.03, N = 3SE +/- 0.23, N = 3658.16657.71658.161408.891. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.00, N = 3SE +/- 1.03, N = 1544.1944.5544.5494.621. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500SE +/- 1.67, N = 3SE +/- 3.33, N = 360460560612831. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030000M60000M90000M120000M150000MSE +/- 78065236.25, N = 3SE +/- 166666666.67, N = 3714469000007141136666771436200000151433333333

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090246810SE +/- 0.004, N = 3SE +/- 0.024, N = 36.2596.2586.2562.9531. (CXX) g++ options: -O3 -march=native -fopenmp

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.71331.42662.13992.85323.5665SE +/- 0.00, N = 3SE +/- 0.00, N = 33.172.242.241.50MIN: 2.21 / MAX: 20.43MIN: 2.21 / MAX: 3.37MIN: 2.21 / MAX: 4.61MIN: 1.47 / MAX: 2.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ParaView

Test: Wavelet Volume - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 3.11, N = 3SE +/- 6.66, N = 3559.05544.98565.601127.10

ParaView

Test: Wavelet Volume - Resolution: 2560 x 1440

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 49.80, N = 3SE +/- 106.54, N = 38944.758719.759049.5418033.64

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902K4K6K8K10KSE +/- 31.15, N = 3SE +/- 17.54, N = 34134.494070.884081.308296.77

ParaView

Test: Wavelet Contour - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 2.99, N = 3SE +/- 1.68, N = 3396.74390.64391.63796.14

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090200400600800100060160060411601. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: GPURTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409048121620SE +/- 0.08, N = 3SE +/- 0.20, N = 39.189.209.0117.39MIN: 3.53 / MAX: 10.86MIN: 3.03 / MAX: 10.91MIN: 3.3 / MAX: 10.77MIN: 0.07 / MAX: 21.12

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409030060090012001500686.23685.11681.131300.97

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904080120160200SE +/- 0.06, N = 3SE +/- 0.24, N = 391.4691.9492.03174.52

ParaView

Test: Many Spheres - Resolution: 3840 x 2160

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 6.01, N = 3SE +/- 24.18, N = 39169.649217.239226.8517496.04

ParaView

Test: Many Spheres - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904080120160200SE +/- 0.06, N = 3SE +/- 0.25, N = 393.9194.2892.13174.29

ParaView

Test: Many Spheres - Resolution: 1920 x 1200

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 5.73, N = 3SE +/- 25.11, N = 39414.699451.909236.6317473.09

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40903691215SE +/- 0.09, N = 3SE +/- 0.05, N = 1311.2211.3211.226.05

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.86471.72942.59413.45884.3235SE +/- 0.013, N = 3SE +/- 0.050, N = 33.8433.6943.6272.0551. (CXX) g++ options: -O2 -lOpenCL

ParaView

Test: Many Spheres - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904080120160200SE +/- 0.08, N = 3SE +/- 0.02, N = 393.2093.3393.57173.98

ParaView

Test: Many Spheres - Resolution: 2560 x 1440

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 7.77, N = 3SE +/- 2.22, N = 39343.829356.719380.9317441.98

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 4.72, N = 3SE +/- 9.92, N = 39419.409429.549402.6317448.87

ParaView

Test: Many Spheres - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904080120160200SE +/- 0.05, N = 3SE +/- 0.10, N = 393.9594.0593.79174.04

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Barbershop - Compute: CUDARTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.17, N = 3SE +/- 0.17, N = 391.1391.2990.3649.31

LuxCoreRender

Scene: DLSC - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: GPUNVIDIA RTX 3090NVIDIA 3090RTX 4090510152025SE +/- 0.03, N = 3SE +/- 0.02, N = 311.5711.6121.38MIN: 11.39 / MAX: 11.73MIN: 11.41 / MAX: 11.72MIN: 18.47 / MAX: 21.85

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Fishy Cat - Compute: CUDARTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090510152025SE +/- 0.04, N = 3SE +/- 0.00, N = 322.8622.8722.7812.59

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090306090120150SE +/- 0.36, N = 376.175.575.4133.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1200

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 74.83, N = 3SE +/- 64.74, N = 310151.4810300.0410497.8917621.06

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 4.68, N = 3SE +/- 4.05, N = 3634.47643.75656.121101.32

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409016K32K48K64K80KSE +/- 384.18, N = 9SE +/- 489.60, N = 3458314425244147765991. (CXX) g++ options: -O3

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Classroom - Compute: CUDARTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090510152025SE +/- 0.03, N = 3SE +/- 0.01, N = 322.7622.6922.6413.30

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Barbershop - Compute: NVIDIA OptiXRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40901326395265SE +/- 0.06, N = 3SE +/- 0.02, N = 356.1956.3556.3433.55

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: GPURTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409048121620SE +/- 0.10, N = 3SE +/- 0.04, N = 310.4510.4110.4317.47MIN: 8.53 / MAX: 13.8MIN: 8.47 / MAX: 13.76MIN: 8.54 / MAX: 13.75MIN: 0.39 / MAX: 25.03

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 5.50, N = 3SE +/- 2.67, N = 3679.67689.81696.471111.66

ParaView

Test: Wavelet Volume - Resolution: 1920 x 1080

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40904K8K12K16K20KSE +/- 88.00, N = 3SE +/- 42.71, N = 310874.7111036.9911143.4917786.53

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: GPUNVIDIA RTX 3090NVIDIA 3090RTX 4090510152025SE +/- 0.01, N = 3SE +/- 0.09, N = 311.2011.2218.31MIN: 3.54 / MAX: 13.16MIN: 3.57 / MAX: 13.17MIN: 7.1 / MAX: 23.24

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409048121620SE +/- 0.02, N = 3SE +/- 0.01, N = 317.9618.0017.9611.06

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.76951.5392.30853.0783.8475SE +/- 0.00, N = 3SE +/- 0.00, N = 33.263.253.422.11MIN: 3.23 / MAX: 3.42MIN: 3.22 / MAX: 4.42MIN: 3.24 / MAX: 4.26MIN: 2.09 / MAX: 2.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: Classroom - Compute: NVIDIA OptiXRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409048121620SE +/- 0.02, N = 3SE +/- 0.02, N = 317.4017.4317.9811.13

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.33410.66821.00231.33641.6705SE +/- 0.0055, N = 3SE +/- 0.0019, N = 31.47001.47001.48500.93471. (CXX) g++ options: -rdynamic

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 320.9620.9520.9433.04

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.4680.9361.4041.8722.34SE +/- 0.01, N = 3SE +/- 0.00, N = 32.082.082.081.32MIN: 2.06 / MAX: 2.29MIN: 2.05 / MAX: 3.23MIN: 2.06 / MAX: 2.51MIN: 1.31 / MAX: 21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.44550.8911.33651.7822.2275SE +/- 0.00, N = 3SE +/- 0.00, N = 31.981.971.971.27MIN: 1.94 / MAX: 6.06MIN: 1.94 / MAX: 3.15MIN: 1.94 / MAX: 2.26MIN: 1.25 / MAX: 2.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.43650.8731.30951.7462.1825SE +/- 0.00, N = 3SE +/- 0.00, N = 31.941.801.801.25MIN: 1.78 / MAX: 2.28MIN: 1.76 / MAX: 3MIN: 1.77 / MAX: 2.69MIN: 1.23 / MAX: 2.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090714212835SE +/- 0.06, N = 3SE +/- 0.00, N = 328.9529.0628.8918.73

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: BMW27 - Compute: NVIDIA OptiXRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090246810SE +/- 0.01, N = 3SE +/- 0.04, N = 156.946.916.924.48

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.57151.1431.71452.2862.8575SE +/- 0.01, N = 3SE +/- 0.00, N = 32.542.532.531.64MIN: 2.5 / MAX: 3.7MIN: 2.49 / MAX: 3.64MIN: 2.51 / MAX: 2.76MIN: 1.61 / MAX: 2.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090246810SE +/- 0.04, N = 3SE +/- 0.00, N = 37.026.727.434.80MIN: 6.34 / MAX: 30.81MIN: 6.3 / MAX: 19.53MIN: 6.36 / MAX: 36.74MIN: 4.73 / MAX: 5.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ParaView

Test: Wavelet Contour - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 1.60, N = 3SE +/- 1.13, N = 3514.93515.78521.09794.76

ParaView

Test: Wavelet Contour - Resolution: 2560 x 1440

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902K4K6K8K10KSE +/- 16.63, N = 3SE +/- 11.72, N = 35366.185374.985430.398282.33

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 0.21, N = 3SE +/- 0.34, N = 3429.08430.35430.28646.061. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.38250.7651.14751.531.9125SE +/- 0.00, N = 3SE +/- 0.01, N = 31.671.671.701.13MIN: 1.64 / MAX: 1.9MIN: 1.64 / MAX: 4.75MIN: 1.65 / MAX: 9.46MIN: 1.11 / MAX: 1.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1200

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 5.24, N = 3SE +/- 0.35, N = 3561.32563.05560.03800.73

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1200

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902K4K6K8K10KSE +/- 54.66, N = 3SE +/- 3.68, N = 35849.685867.725836.228344.58

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: HighRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 2.19, N = 3SE +/- 1.32, N = 3482.0479.0440.0620.3

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.9631.9262.8893.8524.815SE +/- 0.01, N = 3SE +/- 0.00, N = 34.284.224.233.05MIN: 4.15 / MAX: 15.44MIN: 4.13 / MAX: 4.48MIN: 4.16 / MAX: 4.41MIN: 3.02 / MAX: 4.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.03, N = 3SE +/- 0.02, N = 353.4453.5053.6074.73

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40901.28592.57183.85775.14366.4295SE +/- 0.022, N = 3SE +/- 0.005, N = 35.6215.7155.6734.166

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.18, N = 381.581.982.6111.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.23850.4770.71550.9541.1925SE +/- 0.00, N = 3SE +/- 0.01, N = 31.061.031.050.78MIN: 1.02 / MAX: 2.12MIN: 1 / MAX: 2.07MIN: 1.01 / MAX: 2.18MIN: 0.76 / MAX: 1.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: HighRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 4.62, N = 3SE +/- 3.34, N = 3464.9478.3474.6628.5

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: MediumRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 1.98, N = 3SE +/- 2.89, N = 3473.6488.5484.0638.9

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.75651.5132.26953.0263.7825SE +/- 0.004, N = 3SE +/- 0.006, N = 33.3133.3623.3182.506

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902K4K6K8K10KSE +/- 54.90, N = 3SE +/- 16.65, N = 36274.526206.046247.748320.03

ParaView

Test: Wavelet Contour - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 5.27, N = 3SE +/- 1.60, N = 3602.09595.52599.52798.37

Unvanquished

Resolution: 1920 x 1200 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1200 - Effects Quality: UltraRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 5.55, N = 3SE +/- 0.79, N = 3463.0469.8471.1615.4

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40906001200180024003000SE +/- 0.42, N = 3SE +/- 1.73, N = 32100.512101.082101.822787.071. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: MediumRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 3.75, N = 3SE +/- 0.49, N = 3492.5488.5481.2637.5

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: HighRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 2.97, N = 3SE +/- 1.30, N = 3469.0467.6475.7618.8

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: MediumRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 2.17, N = 3SE +/- 6.63, N = 3490.1485.6483.0636.9

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.03780.07560.11340.15120.189SE +/- 0.00047, N = 3SE +/- 0.00009, N = 30.127920.129120.127790.16788

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: MediumRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 3.91, N = 3SE +/- 5.77, N = 3496.1487.3491.9640.1

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40906001200180024003000SE +/- 3.07, N = 3SE +/- 0.34, N = 32246.172240.092245.232939.591. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: HighRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 3.53, N = 3SE +/- 1.30, N = 3463.6471.0472.9607.3

Unvanquished

Resolution: 3840 x 2160 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 3840 x 2160 - Effects Quality: UltraRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 4.41, N = 3SE +/- 1.07, N = 3456.8461.2469.7597.3

Unvanquished

Resolution: 1920 x 1080 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 1920 x 1080 - Effects Quality: UltraRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 1.10, N = 3SE +/- 0.91, N = 3470.8469.5474.5611.3

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Low

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: LowRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.55, N = 3SE +/- 2.14, N = 3655.14659.91645.85830.02MIN: 103 / MAX: 1300MIN: 109 / MAX: 1303MIN: 117 / MAX: 1283MIN: 217 / MAX: 1735

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090306090120150SE +/- 0.32, N = 3SE +/- 0.67, N = 393.892.592.8118.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.12, N = 388.484.486.8107.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.8011.6022.4033.2044.005SE +/- 0.00, N = 3SE +/- 0.00, N = 33.553.553.562.81MIN: 3.52 / MAX: 3.69MIN: 3.52 / MAX: 3.75MIN: 3.53 / MAX: 3.69MIN: 2.79 / MAX: 3.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Xonotic

Resolution: 3840 x 2160 - Effects Quality: High

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: HighRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090150300450600750SE +/- 1.97, N = 3SE +/- 3.30, N = 3556.93567.72566.12701.86MIN: 114 / MAX: 1122MIN: 91 / MAX: 1163MIN: 113 / MAX: 1151MIN: 193 / MAX: 1442

ET: Legacy

Resolution: 1920 x 1200

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 2.46, N = 3SE +/- 0.47, N = 3645.3659.1657.8811.4

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090306090120150SE +/- 0.26, N = 391.990.790.8114.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ET: Legacy

Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 7.88, N = 4SE +/- 6.23, N = 3640.2647.6643.7799.7

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Ultimate

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: UltimateRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090100200300400500SE +/- 1.53, N = 3SE +/- 0.84, N = 3374.43369.92367.11457.50MIN: 65 / MAX: 751MIN: 58 / MAX: 764MIN: 65 / MAX: 749MIN: 72 / MAX: 1128

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130M260M390M520M650MSE +/- 1595019.30, N = 3SE +/- 430679.16, N = 3481322759.6475794831.7472928214.8587587462.01. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090100200300400500SE +/- 0.88, N = 33683643644521. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ET: Legacy

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 3.73, N = 3SE +/- 3.83, N = 3652.3654.5659.8794.0

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409090180270360450SE +/- 0.32, N = 3SE +/- 1.01, N = 3353.89353.33354.46425.88

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090100200300400500SE +/- 0.58, N = 33753713714471. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40903691215SE +/- 0.003, N = 3SE +/- 0.002, N = 39.2729.2929.2917.7601. (CXX) g++ options: -O3

Xonotic

Resolution: 3840 x 2160 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterXonotic 0.8.2Resolution: 3840 x 2160 - Effects Quality: UltraRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 1.15, N = 3SE +/- 0.80, N = 3499.26492.47490.28586.68MIN: 117 / MAX: 913MIN: 93 / MAX: 902MIN: 122 / MAX: 891MIN: 218 / MAX: 1144

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090100200300400500SE +/- 0.33, N = 33773773774411. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090120240360480600SE +/- 0.33, N = 35035015005751. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.94051.8812.82153.7624.7025SE +/- 0.02, N = 3SE +/- 0.00, N = 34.174.184.173.68MIN: 4.12 / MAX: 4.86MIN: 4.12 / MAX: 11.44MIN: 4.13 / MAX: 4.34MIN: 3.66 / MAX: 4.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409090180270360450SE +/- 0.19, N = 3SE +/- 0.17, N = 3364.1364.2362.8411.71. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090160320480640800SE +/- 0.58, N = 36606576377201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.84, N = 3SE +/- 0.09, N = 3794.6794.3795.2887.91. (CC) gcc options: -O2 -flto -lOpenCL

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: GPURTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090816243240SE +/- 0.21, N = 3SE +/- 0.22, N = 332.3332.6032.2535.74MIN: 30.03 / MAX: 34.38MIN: 30.12 / MAX: 34.6MIN: 29.03 / MAX: 34.55MIN: 34.18 / MAX: 37.28

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 0.33, N = 36066056086591. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409050100150200250SE +/- 0.33, N = 32392402372211. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.33, N = 3SE +/- 0.43, N = 3743.1744.7742.8804.31. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.02, N = 3SE +/- 0.19, N = 3813.42813.45813.47870.621. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090170340510680850SE +/- 0.00, N = 3SE +/- 0.58, N = 37227227237711. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40900.4320.8641.2961.7282.16SE +/- 0.01, N = 3SE +/- 0.01, N = 31.911.921.901.87MIN: 1.86 / MAX: 9.04MIN: 1.87 / MAX: 7.42MIN: 1.87 / MAX: 2.6MIN: 1.83 / MAX: 2.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Warsow

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.78, N = 3SE +/- 0.07, N = 3960.8980.7978.6985.4

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1080RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 5.54, N = 3SE +/- 7.67, N = 10984.3975.5984.8971.1

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLRTX 3090NVIDIA RTX 3090NVIDIA 30905K10K15K20K25KSE +/- 98.83, N = 32273922711230291. (CXX) g++ options: -flto -pthread

Warsow

Resolution: 1920 x 1200

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 1200RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 5.60, N = 3SE +/- 4.70, N = 3984.8980.3984.9973.0

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

OpenBenchmarking.orgvrays, More Is BetterChaos Group V-RAY 5Mode: NVIDIA RTX GPURTX 3090NVIDIA RTX 3090NVIDIA 30906001200180024003000SE +/- 11.72, N = 3285628562829

Warsow

Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 2560 x 1440RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40902004006008001000SE +/- 0.35, N = 3SE +/- 5.45, N = 3984.7985.7985.5980.1

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 5Mode: NVIDIA CUDA GPURTX 3090NVIDIA RTX 3090NVIDIA 30905001000150020002500SE +/- 0.33, N = 3211921122115

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPURTX 409090018002700360045004090

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.0Blend File: BMW27 - Compute: CUDARTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 40903691215SE +/- 0.01, N = 3SE +/- 3.94, N = 1511.0411.0211.0910.25

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090510152025SE +/- 1.24, N = 3SE +/- 0.03, N = 315.9617.3119.733.43MIN: 5.77 / MAX: 36.07MIN: 5.63 / MAX: 36.32MIN: 7.36 / MAX: 35.55MIN: 3.34 / MAX: 4.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090246810SE +/- 0.26, N = 3SE +/- 0.03, N = 35.856.075.752.69MIN: 3.71 / MAX: 29.49MIN: 3.71 / MAX: 31.52MIN: 3.72 / MAX: 31MIN: 2.2 / MAX: 4.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 409020406080100SE +/- 0.17, N = 3SE +/- 9.12, N = 368.167.167.593.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Unvanquished

Resolution: 2560 x 1440 - Effects Quality: Ultra

OpenBenchmarking.orgFrames Per Second, More Is BetterUnvanquished 0.52.1Resolution: 2560 x 1440 - Effects Quality: UltraRTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090130260390520650SE +/- 14.16, N = 12SE +/- 0.07, N = 3461.7452.5466.6608.2

ET: Legacy

Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is BetterET: Legacy 2.78Resolution: 3840 x 2160RTX 3090NVIDIA RTX 3090NVIDIA 3090RTX 4090140280420560700SE +/- 6.04, N = 3SE +/- 113.10, N = 6648.4638.7635.5219.3


Phoronix Test Suite v10.8.4