Intel Core i9-13900K testing with an ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS) and NVIDIA GeForce RTX 3090 24GB on EndeavourOS rolling via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2402116-SADD-240207012
RTX 4070 SUPER
NVIDIA RTX 4070 SUPER:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: ASUS NVIDIA GeForce RTX 4070 SUPER 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 4070:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: MSI NVIDIA GeForce RTX 4070 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 4070 TI:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: NVIDIA GeForce RTX 4070 Ti 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 3090:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC1220, Monitor: PI-KVM Video, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.4-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
ArrayFire 3.9
Test: Conjugate Gradient OpenCL
Blender 4.0
Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 5.57 |===========================================
NVIDIA RTX 4070 ....... 6.21 |================================================
NVIDIA RTX 4070 TI .... 5.43 |==========================================
NVIDIA RTX 3090 ....... 6.31 |=================================================
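The result rows in this file share one visible shape: a system label, a run of dots, the measured value, and a bar of "=" characters. A minimal Python sketch for reading such rows follows, assuming that shape holds throughout. The ROW pattern and the parse_row helper are illustrative names, not part of the Phoronix Test Suite.

```python
import re

# Assumed row shape: "<label> <dots> <value> |<bar>". The pattern is inferred
# from the rows in this file; it is not an official Phoronix Test Suite format.
ROW = re.compile(r"^(?P<label>.+?)\s*\.+\s+(?P<value>[0-9.]+)\s+\|=*\s*$")


def parse_row(line):
    """Return (label, value) for a result row, or None for any other line."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return match.group("label").strip(), float(match.group("value"))


if __name__ == "__main__":
    sample = [
        "Blender 4.0",
        "Seconds < Lower Is Better",
        "NVIDIA RTX 4070 SUPER . 5.57 |===========================================",
        "NVIDIA RTX 3090 ....... 6.31 |=================================================",
    ]
    for line in sample:
        print(parse_row(line))
```

Header lines such as the test name or the unit line contain no dot run followed by a number, so the helper returns None for them and they can be skipped.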
Blender 4.0
Blend File: Classroom - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 12.60 |========================================
NVIDIA RTX 4070 ....... 14.86 |===============================================
NVIDIA RTX 4070 TI .... 12.30 |=======================================
NVIDIA RTX 3090 ....... 15.26 |================================================
Blender 4.0
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 9.45 |=========================================
NVIDIA RTX 4070 ....... 11.03 |================================================
NVIDIA RTX 4070 TI .... 9.02 |=======================================
NVIDIA RTX 3090 ....... 10.64 |==============================================
Blender 4.0
Blend File: Barbershop - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 51.30 |==========================================
NVIDIA RTX 4070 ....... 58.44 |================================================
NVIDIA RTX 4070 TI .... 50.73 |==========================================
NVIDIA RTX 3090 ....... 54.30 |=============================================
Blender 4.0
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 14.29 |========================================
NVIDIA RTX 4070 ....... 16.55 |==============================================
NVIDIA RTX 4070 TI .... 13.97 |=======================================
NVIDIA RTX 3090 ....... 17.30 |================================================
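One way to condense the five Blender results above into a single figure per card is a geometric mean of render-time ratios against a baseline. The sketch below assumes the RTX 3090 as that baseline and copies the times from the rows above. The geomean and relative_time helpers are hypothetical names chosen for illustration.

```python
import math

# Render times in seconds, copied from the five Blender 4.0 OptiX results
# above, in order: BMW27, Classroom, Fishy Cat, Barbershop, Pabellon Barcelona.
times = {
    "NVIDIA RTX 4070 SUPER": [5.57, 12.60, 9.45, 51.30, 14.29],
    "NVIDIA RTX 4070": [6.21, 14.86, 11.03, 58.44, 16.55],
    "NVIDIA RTX 4070 TI": [5.43, 12.30, 9.02, 50.73, 13.97],
    "NVIDIA RTX 3090": [6.31, 15.26, 10.64, 54.30, 17.30],
}


def geomean(values):
    """Geometric mean of a sequence of positive numbers."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def relative_time(times, baseline):
    """Geometric mean of per-scene time ratios against the baseline system."""
    base = times[baseline]
    return {
        gpu: geomean([t / b for t, b in zip(scene_times, base)])
        for gpu, scene_times in times.items()
    }


summary = relative_time(times, "NVIDIA RTX 3090")
for gpu, ratio in summary.items():
    # Lower is better: a ratio below 1.0 means faster than the baseline.
    print(f"{gpu}: {ratio:.3f}x the baseline render time")
```

The geometric mean is used here so that the long Barbershop render does not dominate the short BMW27 render, as it would in a plain sum of seconds.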
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 331.8 |============================================
NVIDIA RTX 4070 ....... 330.3 |============================================
NVIDIA RTX 4070 TI .... 333.3 |============================================
NVIDIA RTX 3090 ....... 360.8 |================================================
cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 446.2 |==========================
NVIDIA RTX 4070 ....... 446.3 |==========================
NVIDIA RTX 4070 TI .... 446.3 |==========================
NVIDIA RTX 3090 ....... 825.8 |================================================
cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 407.5 |==========================
NVIDIA RTX 4070 ....... 406.7 |==========================
NVIDIA RTX 4070 TI .... 412.2 |==========================
NVIDIA RTX 3090 ....... 753.8 |================================================
clpeak 1.1.2
OpenCL Test: Integer Compute INT
GIOPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 18170.54 |=========================================
NVIDIA RTX 4070 ....... 14555.19 |=================================
NVIDIA RTX 4070 TI .... 19821.10 |=============================================
NVIDIA RTX 3090 ....... 17923.33 |=========================================
clpeak 1.1.2
OpenCL Test: Single-Precision Float
GFLOPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 35492.69 |=========================================
NVIDIA RTX 4070 ....... 28479.39 |=================================
NVIDIA RTX 4070 TI .... 38691.73 |=============================================
NVIDIA RTX 3090 ....... 34906.79 |=========================================
clpeak 1.1.2
OpenCL Test: Double-Precision Double
GFLOPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 630.11 |============================================
NVIDIA RTX 4070 ....... 515.17 |====================================
NVIDIA RTX 4070 TI .... 667.05 |===============================================
NVIDIA RTX 3090 ....... 642.23 |=============================================
clpeak 1.1.2
OpenCL Test: Global Memory Bandwidth
GBPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 437.65 |=========================
NVIDIA RTX 4070 ....... 437.21 |=========================
NVIDIA RTX 4070 TI .... 437.63 |=========================
NVIDIA RTX 3090 ....... 816.55 |===============================================
FAHBench 2.3.2
Ns Per Day > Higher Is Better
NVIDIA RTX 4070 SUPER . 366.06 |=============================================
NVIDIA RTX 4070 ....... 317.20 |=======================================
NVIDIA RTX 4070 TI .... 382.16 |===============================================
NVIDIA RTX 3090 ....... 343.02 |==========================================
FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 5.912 |=========================================
NVIDIA RTX 4070 ....... 6.906 |================================================
NVIDIA RTX 4070 TI .... 5.226 |====================================
NVIDIA RTX 3090 ....... 5.741 |========================================
GpuOwl 7.2.1
Exponent: 57885161
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER . 869.07 |============================================
NVIDIA RTX 4070 ....... 714.80 |=====================================
NVIDIA RTX 4070 TI .... 919.13 |===============================================
NVIDIA RTX 3090 ....... 866.31 |============================================
GpuOwl 7.2.1
Exponent: 77936867
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER . 646.41 |=============================================
NVIDIA RTX 4070 ....... 530.32 |=====================================
NVIDIA RTX 4070 TI .... 676.59 |===============================================
NVIDIA RTX 3090 ....... 645.99 |=============================================
GpuOwl 7.2.1
Exponent: 332220523
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER . 137.44 |============================================
NVIDIA RTX 4070 ....... 112.61 |====================================
NVIDIA RTX 4070 TI .... 145.84 |===============================================
NVIDIA RTX 3090 ....... 137.32 |============================================
Hashcat 6.2.4
Benchmark: MD5
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 67583033333 |=======================================
NVIDIA RTX 4070 ....... 56147866667 |================================
NVIDIA RTX 4070 TI .... 73312233333 |==========================================
NVIDIA RTX 3090 ....... 67177300000 |======================================
Hashcat 6.2.4
Benchmark: SHA1
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 22132600000 |========================================
NVIDIA RTX 4070 ....... 18202466667 |================================
NVIDIA RTX 4070 TI .... 23532400000 |==========================================
NVIDIA RTX 3090 ....... 21323733333 |======================================
Hashcat 6.2.4
Benchmark: 7-Zip
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 1176467 |===========================================
NVIDIA RTX 4070 ....... 976967 |====================================
NVIDIA RTX 4070 TI .... 1262633 |==============================================
NVIDIA RTX 3090 ....... 1056000 |======================================
Hashcat 6.2.4
Benchmark: SHA-512
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 3232733333 |========================================
NVIDIA RTX 4070 ....... 2673300000 |=================================
NVIDIA RTX 4070 TI .... 3462500000 |===========================================
NVIDIA RTX 3090 ....... 3081866667 |======================================
Hashcat 6.2.4
Benchmark: TrueCrypt RIPEMD160 + XTS
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 802967 |============================================
NVIDIA RTX 4070 ....... 660967 |====================================
NVIDIA RTX 4070 TI .... 858600 |===============================================
NVIDIA RTX 3090 ....... 797833 |============================================
IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Bedroom
M samples/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 19.80 |=============================================
NVIDIA RTX 4070 ....... 18.20 |==========================================
NVIDIA RTX 4070 TI .... 20.26 |==============================================
NVIDIA RTX 3090 ....... 20.96 |================================================
IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Supercar
M samples/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 52.81 |===============================================
NVIDIA RTX 4070 ....... 48.52 |===========================================
NVIDIA RTX 4070 TI .... 53.59 |================================================
NVIDIA RTX 3090 ....... 52.01 |===============================================
LeelaChessZero 0.30
Backend: OpenCL
Nodes Per Second > Higher Is Better
Libplacebo 5.229.1
FPS > Higher Is Better
Libplacebo 5.229.1
Test: deband_heavy
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 2186.70 |============================================
NVIDIA RTX 4070 ....... 1847.98 |=====================================
NVIDIA RTX 4070 ....... 1844.08 |=====================================
NVIDIA RTX 4070 ....... 1843.26 |=====================================
NVIDIA RTX 4070 TI .... 2306.67 |==============================================
NVIDIA RTX 4070 TI .... 2306.56 |==============================================
NVIDIA RTX 3090 ....... 2017.75 |========================================
NVIDIA RTX 3090 ....... 2015.93 |========================================
NVIDIA RTX 3090 ....... 2024.61 |========================================
NVIDIA RTX 3090 ....... 2020.16 |========================================
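Several systems appear more than once in the Libplacebo results, which may reflect repeated runs stored under the same label. Under that assumption, a short Python sketch can collapse the deband_heavy rows above into one mean per label. The mean_by_label helper is a hypothetical name, and averaging is only one reasonable way to combine the rows.

```python
from collections import defaultdict
from statistics import mean

# (label, FPS) pairs copied from the deband_heavy rows above.
rows = [
    ("NVIDIA RTX 4070 SUPER", 2186.70),
    ("NVIDIA RTX 4070", 1847.98),
    ("NVIDIA RTX 4070", 1844.08),
    ("NVIDIA RTX 4070", 1843.26),
    ("NVIDIA RTX 4070 TI", 2306.67),
    ("NVIDIA RTX 4070 TI", 2306.56),
    ("NVIDIA RTX 3090", 2017.75),
    ("NVIDIA RTX 3090", 2015.93),
    ("NVIDIA RTX 3090", 2024.61),
    ("NVIDIA RTX 3090", 2020.16),
]


def mean_by_label(rows):
    """Group values by label and return the arithmetic mean of each group."""
    groups = defaultdict(list)
    for label, value in rows:
        groups[label].append(value)
    return {label: mean(values) for label, values in groups.items()}


averages = mean_by_label(rows)
for label, fps in averages.items():
    print(f"{label}: {fps:.2f} FPS")
```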
Libplacebo 5.229.1
Test: polar_nocompute
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 2327.55 |============================================
NVIDIA RTX 4070 ....... 1972.78 |=====================================
NVIDIA RTX 4070 ....... 1969.19 |=====================================
NVIDIA RTX 4070 ....... 1968.37 |=====================================
NVIDIA RTX 4070 TI .... 2461.23 |==============================================
NVIDIA RTX 4070 TI .... 2459.03 |==============================================
NVIDIA RTX 3090 ....... 2119.89 |========================================
NVIDIA RTX 3090 ....... 2116.50 |========================================
NVIDIA RTX 3090 ....... 2126.31 |========================================
NVIDIA RTX 3090 ....... 2116.79 |========================================
Libplacebo 5.229.1
Test: hdr_peakdetect
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 3292.37 |==============================
NVIDIA RTX 4070 ....... 3310.02 |==============================
NVIDIA RTX 4070 ....... 3452.43 |===============================
NVIDIA RTX 4070 ....... 3329.26 |==============================
NVIDIA RTX 4070 TI .... 3544.60 |================================
NVIDIA RTX 4070 TI .... 3475.06 |===============================
NVIDIA RTX 3090 ....... 4997.08 |=============================================
NVIDIA RTX 3090 ....... 5104.10 |==============================================
NVIDIA RTX 3090 ....... 4969.74 |=============================================
NVIDIA RTX 3090 ....... 5055.88 |==============================================
Libplacebo 5.229.1
Test: hdr_lut
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 3905.98 |=============================================
NVIDIA RTX 4070 ....... 3927.11 |=============================================
NVIDIA RTX 4070 ....... 3940.40 |==============================================
NVIDIA RTX 4070 ....... 3946.90 |==============================================
NVIDIA RTX 4070 TI .... 3971.61 |==============================================
NVIDIA RTX 4070 TI .... 3976.04 |==============================================
NVIDIA RTX 3090 ....... 3313.26 |======================================
NVIDIA RTX 3090 ....... 3376.85 |=======================================
NVIDIA RTX 3090 ....... 3333.77 |=======================================
NVIDIA RTX 3090 ....... 3369.88 |=======================================
Libplacebo 5.229.1
Test: av1_grain_lap
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 4171.00 |==============================================
NVIDIA RTX 4070 ....... 4103.40 |=============================================
NVIDIA RTX 4070 ....... 4126.40 |==============================================
NVIDIA RTX 4070 ....... 4152.41 |==============================================
NVIDIA RTX 4070 TI .... 4140.87 |==============================================
NVIDIA RTX 4070 TI .... 4143.96 |==============================================
NVIDIA RTX 3090 ....... 4126.89 |==============================================
NVIDIA RTX 3090 ....... 4096.48 |=============================================
NVIDIA RTX 3090 ....... 4120.27 |=============================================
NVIDIA RTX 3090 ....... 4100.36 |=============================================
LuxCoreRender 2.6
Scene: DLSC - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 13.59 |===============================================
NVIDIA RTX 4070 ....... 11.74 |========================================
NVIDIA RTX 4070 TI .... 13.95 |================================================
NVIDIA RTX 3090 ....... 12.99 |=============================================
LuxCoreRender 2.6
Scene: Danish Mood - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 10.56 |==============================================
NVIDIA RTX 4070 ....... 8.89 |=======================================
NVIDIA RTX 4070 TI .... 10.99 |================================================
NVIDIA RTX 3090 ....... 10.20 |=============================================
LuxCoreRender 2.6
Scene: Orange Juice - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 11.72 |==============================================
NVIDIA RTX 4070 ....... 10.40 |=========================================
NVIDIA RTX 4070 TI .... 11.89 |===============================================
NVIDIA RTX 3090 ....... 12.14 |================================================
LuxCoreRender 2.6
Scene: LuxCore Benchmark - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 12.82 |===============================================
NVIDIA RTX 4070 ....... 10.92 |========================================
NVIDIA RTX 4070 TI .... 13.23 |================================================
NVIDIA RTX 3090 ....... 13.12 |================================================
LuxCoreRender 2.6
Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 27.67 |========================================
NVIDIA RTX 4070 ....... 23.26 |==================================
NVIDIA RTX 4070 TI .... 27.71 |========================================
NVIDIA RTX 3090 ....... 33.29 |================================================
MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 587219538.2 |========================================
NVIDIA RTX 4070 ....... 516770131.2 |===================================
NVIDIA RTX 4070 TI .... 619106132.5 |==========================================
NVIDIA RTX 3090 ....... 484098913.8 |=================================
NAMD CUDA 2.14
ATPase Simulation - 327,506 Atoms
days/ns < Lower Is Better
NVIDIA RTX 4070 SUPER . 0.06791 |=============================
NVIDIA RTX 4070 ....... 0.07498 |================================
NVIDIA RTX 4070 TI .... 0.06788 |=============================
NVIDIA RTX 3090 ....... 0.10822 |==============================================
NCNN 20230517
Target: Vulkan GPU
ms < Lower Is Better
NCNN 20230517
Target: Vulkan GPU - Model: mobilenet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 8.62 |==================================
NVIDIA RTX 4070 ....... 10.14 |========================================
NVIDIA RTX 4070 ....... 7.20 |=============================
NVIDIA RTX 4070 TI .... 8.43 |==================================
NVIDIA RTX 4070 TI .... 7.45 |==============================
NVIDIA RTX 3090 ....... 12.07 |================================================
NVIDIA RTX 3090 ....... 6.92 |============================
NVIDIA RTX 3090 ....... 7.27 |=============================
NCNN 20230517
Target: Vulkan GPU - Model: mobilenet-v2
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 3.03 |================================
NVIDIA RTX 4070 ....... 4.69 |=================================================
NVIDIA RTX 4070 ....... 2.48 |==========================
NVIDIA RTX 4070 TI .... 2.43 |=========================
NVIDIA RTX 4070 TI .... 2.54 |===========================
NVIDIA RTX 3090 ....... 2.65 |============================
NVIDIA RTX 3090 ....... 2.67 |============================
NVIDIA RTX 3090 ....... 2.34 |========================
NCNN 20230517
Target: Vulkan GPU - Model: mobilenet-v3
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 2.25 |=============
NVIDIA RTX 4070 ....... 8.71 |=================================================
NVIDIA RTX 4070 ....... 2.15 |============
NVIDIA RTX 4070 TI .... 2.09 |============
NVIDIA RTX 3090 ....... 3.19 |==================
NVIDIA RTX 3090 ....... 2.20 |============
NVIDIA RTX 3090 ....... 2.21 |============
NCNN 20230517
Target: Vulkan GPU - Model: shufflenet-v2
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 2.31 |===========================
NVIDIA RTX 4070 ....... 2.11 |=========================
NVIDIA RTX 4070 ....... 2.08 |========================
NVIDIA RTX 4070 TI .... 2.03 |========================
NVIDIA RTX 4070 TI .... 2.01 |========================
NVIDIA RTX 3090 ....... 4.17 |=================================================
NVIDIA RTX 3090 ....... 2.09 |=========================
NVIDIA RTX 3090 ....... 2.04 |========================
NCNN 20230517
Target: Vulkan GPU - Model: mnasnet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 3.85 |==============================================
NVIDIA RTX 4070 ....... 2.22 |==========================
NVIDIA RTX 4070 ....... 2.24 |===========================
NVIDIA RTX 4070 TI .... 2.30 |===========================
NVIDIA RTX 4070 TI .... 4.14 |=================================================
NVIDIA RTX 3090 ....... 2.24 |===========================
NVIDIA RTX 3090 ....... 2.30 |===========================
NVIDIA RTX 3090 ....... 2.16 |==========================
NCNN 20230517
Target: Vulkan GPU - Model: efficientnet-b0
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 5.07 |==================
NVIDIA RTX 4070 ....... 3.46 |============
NVIDIA RTX 4070 ....... 3.59 |============
NVIDIA RTX 4070 TI .... 3.49 |============
NVIDIA RTX 4070 TI .... 3.46 |============
NVIDIA RTX 3090 ....... 13.87 |================================================
NVIDIA RTX 3090 ....... 3.54 |============
NVIDIA RTX 3090 ....... 3.34 |============
NCNN 20230517
Target: Vulkan GPU - Model: blazeface
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 0.84 |===============================================
NVIDIA RTX 4070 ....... 0.84 |===============================================
NVIDIA RTX 4070 TI .... 0.81 |==============================================
NVIDIA RTX 4070 TI .... 0.82 |==============================================
NVIDIA RTX 3090 ....... 0.86 |================================================
NVIDIA RTX 3090 ....... 0.84 |===============================================
NVIDIA RTX 3090 ....... 0.87 |=================================================
NCNN 20230517
Target: Vulkan GPU - Model: googlenet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 11.04 |================================================
NVIDIA RTX 4070 ....... 6.06 |==========================
NVIDIA RTX 4070 TI .... 5.87 |==========================
NVIDIA RTX 4070 TI .... 7.37 |================================
NVIDIA RTX 3090 ....... 7.49 |=================================
NVIDIA RTX 3090 ....... 6.11 |===========================
NVIDIA RTX 3090 ....... 6.14 |===========================
NCNN 20230517
Target: Vulkan GPU - Model: vgg16
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 117.81 |======================================
NVIDIA RTX 4070 ....... 54.54 |==================
NVIDIA RTX 4070 ....... 45.52 |===============
NVIDIA RTX 4070 TI .... 32.05 |==========
NVIDIA RTX 4070 TI .... 34.49 |===========
NVIDIA RTX 3090 ....... 145.72 |===============================================
NVIDIA RTX 3090 ....... 24.45 |========
NVIDIA RTX 3090 ....... 17.88 |======
NCNN 20230517
Target: Vulkan GPU - Model: resnet18
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 8.97 |=========================
NVIDIA RTX 4070 ....... 8.58 |========================
NVIDIA RTX 4070 ....... 5.11 |==============
NVIDIA RTX 4070 TI .... 5.47 |===============
NVIDIA RTX 4070 TI .... 7.74 |=====================
NVIDIA RTX 3090 ....... 17.41 |================================================
NVIDIA RTX 3090 ....... 8.94 |=========================
NVIDIA RTX 3090 ....... 4.12 |===========
NCNN 20230517
Target: Vulkan GPU - Model: alexnet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 16.17 |================================================
NVIDIA RTX 4070 ....... 9.33 |============================
NVIDIA RTX 4070 ....... 5.78 |=================
NVIDIA RTX 4070 TI .... 3.74 |===========
NVIDIA RTX 4070 TI .... 6.07 |==================
NVIDIA RTX 3090 ....... 3.69 |===========
NVIDIA RTX 3090 ....... 6.20 |==================
NVIDIA RTX 3090 ....... 3.60 |===========
NCNN 20230517
Target: Vulkan GPU - Model: resnet50
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 46.26 |================================================
NVIDIA RTX 4070 ....... 8.24 |=========
NVIDIA RTX 4070 ....... 8.72 |=========
NVIDIA RTX 4070 TI .... 14.32 |===============
NVIDIA RTX 4070 TI .... 12.25 |=============
NVIDIA RTX 3090 ....... 27.77 |=============================
NVIDIA RTX 3090 ....... 8.20 |=========
NVIDIA RTX 3090 ....... 12.70 |=============
NCNN 20230517
Target: Vulkan GPU - Model: yolov4-tiny
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 63.82 |================================================
NVIDIA RTX 4070 ....... 25.11 |===================
NVIDIA RTX 4070 ....... 20.74 |================
NVIDIA RTX 4070 TI .... 16.47 |============
NVIDIA RTX 4070 TI .... 16.37 |============
NVIDIA RTX 3090 ....... 26.85 |====================
NVIDIA RTX 3090 ....... 13.31 |==========
NVIDIA RTX 3090 ....... 11.29 |========
NCNN 20230517
Target: Vulkan GPU - Model: squeezenet_ssd
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 6.86 |=================================================
NVIDIA RTX 4070 ....... 5.27 |======================================
NVIDIA RTX 4070 ....... 5.18 |=====================================
NVIDIA RTX 4070 TI .... 5.36 |======================================
NVIDIA RTX 4070 TI .... 6.13 |============================================
NVIDIA RTX 3090 ....... 6.63 |===============================================
NVIDIA RTX 3090 ....... 5.20 |=====================================
NVIDIA RTX 3090 ....... 4.90 |===================================
NCNN 20230517
Target: Vulkan GPU - Model: regnety_400m
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 11.11 |================================================
NVIDIA RTX 4070 ....... 6.50 |============================
NVIDIA RTX 4070 ....... 6.21 |===========================
NVIDIA RTX 4070 TI .... 5.97 |==========================
NVIDIA RTX 4070 TI .... 5.89 |=========================
NVIDIA RTX 3090 ....... 8.06 |===================================
NVIDIA RTX 3090 ....... 6.47 |============================
NVIDIA RTX 3090 ....... 6.73 |=============================
NCNN 20230517
Target: Vulkan GPU - Model: vision_transformer
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 844.61 |===============================================
NVIDIA RTX 4070 ....... 281.56 |================
NVIDIA RTX 4070 ....... 382.82 |=====================
NVIDIA RTX 4070 TI .... 390.18 |======================
NVIDIA RTX 4070 TI .... 497.66 |============================
NVIDIA RTX 3090 ....... 663.24 |=====================================
NVIDIA RTX 3090 ....... 327.82 |==================
NVIDIA RTX 3090 ....... 354.57 |====================
NCNN 20230517
Target: Vulkan GPU - Model: FastestDet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 2.86 |======================
NVIDIA RTX 4070 ....... 2.34 |==================
NVIDIA RTX 4070 ....... 2.67 |=====================
NVIDIA RTX 4070 TI .... 2.84 |======================
NVIDIA RTX 4070 TI .... 3.04 |=======================
NVIDIA RTX 3090 ....... 6.38 |=================================================
NVIDIA RTX 3090 ....... 2.50 |===================
NVIDIA RTX 3090 ....... 2.65 |====================
NeatBench 5
Acceleration: GPU
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER . 4070 |=================================================
NVIDIA RTX 4070 ....... 4070 |=================================================
NVIDIA RTX 4070 TI .... 4070 |=================================================
NVIDIA RTX 3090 ....... 3090 |=====================================
OctaneBench 2020.1
Total Score
Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 720.97 |==============================================
NVIDIA RTX 4070 ....... 648.00 |=========================================
NVIDIA RTX 4070 TI .... 735.94 |===============================================
NVIDIA RTX 3090 ....... 674.25 |===========================================
PlaidML
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
Examples Per Second > Higher Is Better
ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP64 Compute
TFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 0.621 |=============================================
NVIDIA RTX 4070 ....... 0.510 |=====================================
NVIDIA RTX 4070 TI .... 0.660 |================================================
NVIDIA RTX 3090 ....... 0.637 |==============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP32 Compute
TFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 38.59 |=============================================
NVIDIA RTX 4070 ....... 31.77 |=====================================
NVIDIA RTX 4070 TI .... 40.91 |================================================
NVIDIA RTX 3090 ....... 39.40 |==============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT64 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 4.214 |==============================================
NVIDIA RTX 4070 ....... 3.443 |=====================================
NVIDIA RTX 4070 TI .... 4.420 |================================================
NVIDIA RTX 3090 ....... 3.135 |==================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT32 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 19.89 |=============================================
NVIDIA RTX 4070 ....... 16.38 |=====================================
NVIDIA RTX 4070 TI .... 21.05 |================================================
NVIDIA RTX 3090 ....... 20.03 |==============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT16 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 17.17 |=============================================
NVIDIA RTX 4070 ....... 14.28 |=====================================
NVIDIA RTX 4070 TI .... 18.28 |================================================
NVIDIA RTX 3090 ....... 17.00 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT8 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 14.31 |============================================
NVIDIA RTX 4070 ....... 12.12 |=====================================
NVIDIA RTX 4070 TI .... 15.73 |================================================
NVIDIA RTX 3090 ....... 13.73 |==========================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Read
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 464.86 |=========================
NVIDIA RTX 4070 ....... 465.18 |=========================
NVIDIA RTX 4070 TI .... 465.07 |=========================
NVIDIA RTX 3090 ....... 864.11 |===============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Write
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 455.01 |========================
NVIDIA RTX 4070 ....... 459.43 |========================
NVIDIA RTX 4070 TI .... 457.17 |========================
NVIDIA RTX 3090 ....... 887.31 |===============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 557.73 |===============================================
NVIDIA RTX 4070 ....... 546.76 |==============================================
NVIDIA RTX 4070 TI .... 535.39 |=============================================
NVIDIA RTX 3090 ....... 525.12 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 201.94 |===============================================
NVIDIA RTX 4070 ....... 198.18 |==============================================
NVIDIA RTX 4070 TI .... 201.19 |===============================================
NVIDIA RTX 3090 ....... 197.12 |==============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 509.45 |===============================================
NVIDIA RTX 4070 ....... 458.39 |==========================================
NVIDIA RTX 4070 TI .... 502.92 |==============================================
NVIDIA RTX 3090 ....... 419.76 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 501.50 |===============================================
NVIDIA RTX 4070 ....... 459.94 |===========================================
NVIDIA RTX 4070 TI .... 505.55 |===============================================
NVIDIA RTX 3090 ....... 420.29 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 507.45 |===============================================
NVIDIA RTX 4070 ....... 458.36 |==========================================
NVIDIA RTX 4070 TI .... 505.62 |===============================================
NVIDIA RTX 3090 ....... 419.03 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 195.40 |===============================================
NVIDIA RTX 4070 ....... 187.26 |=============================================
NVIDIA RTX 4070 TI .... 194.29 |===============================================
NVIDIA RTX 3090 ....... 164.14 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 504.67 |===============================================
NVIDIA RTX 4070 ....... 459.93 |===========================================
NVIDIA RTX 3090 ....... 416.89 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 195.39 |==============================================
NVIDIA RTX 4070 ....... 187.69 |============================================
NVIDIA RTX 4070 TI .... 198.82 |===============================================
NVIDIA RTX 3090 ....... 163.74 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 504.27 |===============================================
NVIDIA RTX 4070 ....... 459.27 |===========================================
NVIDIA RTX 4070 TI .... 504.66 |===============================================
NVIDIA RTX 3090 ....... 416.20 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 196.07 |===============================================
NVIDIA RTX 4070 ....... 186.63 |=============================================
NVIDIA RTX 4070 TI .... 197.02 |===============================================
NVIDIA RTX 3090 ....... 164.14 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 194.58 |===============================================
NVIDIA RTX 4070 ....... 187.27 |=============================================
NVIDIA RTX 4070 TI .... 195.86 |===============================================
NVIDIA RTX 3090 ....... 161.01 |=======================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 195.30 |===============================================
NVIDIA RTX 4070 ....... 187.51 |=============================================
NVIDIA RTX 4070 TI .... 194.87 |===============================================
NVIDIA RTX 3090 ....... 164.35 |========================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 106.37 |==============================================
NVIDIA RTX 4070 ....... 107.59 |===============================================
NVIDIA RTX 4070 TI .... 108.59 |===============================================
NVIDIA RTX 3090 ....... 105.55 |==============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 ....... 103.68 |===============================================
NVIDIA RTX 4070 TI .... 103.45 |===============================================
NVIDIA RTX 3090 ....... 98.11 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 102.60 |===============================================
NVIDIA RTX 4070 ....... 102.90 |===============================================
NVIDIA RTX 4070 TI .... 96.50 |============================================
NVIDIA RTX 3090 ....... 99.05 |=============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 102.60 |===============================================
NVIDIA RTX 4070 ....... 101.55 |==============================================
NVIDIA RTX 4070 TI .... 103.20 |===============================================
NVIDIA RTX 3090 ....... 99.84 |=============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 103.17 |===============================================
NVIDIA RTX 4070 ....... 101.24 |==============================================
NVIDIA RTX 4070 TI .... 103.24 |===============================================
NVIDIA RTX 3090 ....... 99.43 |=============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 103.57 |===============================================
NVIDIA RTX 4070 ....... 101.43 |==============================================
NVIDIA RTX 4070 TI .... 103.50 |===============================================
NVIDIA RTX 3090 ....... 99.25 |=============================================
RealSR-NCNN 20200818
Scale: 4x - TAA: No
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 6.323 |===========================================
NVIDIA RTX 4070 ....... 7.092 |================================================
NVIDIA RTX 4070 TI .... 5.962 |========================================
NVIDIA RTX 3090 ....... 5.556 |======================================
RealSR-NCNN 20200818
Scale: 4x - TAA: Yes
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 34.89 |=======================================
NVIDIA RTX 4070 ....... 42.85 |================================================
NVIDIA RTX 4070 TI .... 33.63 |======================================
NVIDIA RTX 3090 ....... 30.31 |==================================
Rodinia 3.1
Test: OpenCL Particle Filter
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 3.480 |=========================================
NVIDIA RTX 4070 ....... 4.098 |================================================
NVIDIA RTX 4070 TI .... 3.291 |=======================================
NVIDIA RTX 3090 ....... 3.844 |=============================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 1.35 |================================================
NVIDIA RTX 4070 ....... 1.36 |================================================
NVIDIA RTX 4070 TI .... 1.38 |=================================================
NVIDIA RTX 3090 ....... 1.38 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 13.92 |=============================================
NVIDIA RTX 4070 ....... 14.04 |==============================================
NVIDIA RTX 4070 TI .... 14.79 |================================================
NVIDIA RTX 3090 ....... 14.45 |===============================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 1.48 |================================================
NVIDIA RTX 4070 ....... 1.50 |=================================================
NVIDIA RTX 4070 TI .... 1.49 |=================================================
NVIDIA RTX 3090 ....... 1.49 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 1.50 |=================================================
NVIDIA RTX 4070 ....... 1.50 |=================================================
NVIDIA RTX 4070 TI .... 1.50 |=================================================
NVIDIA RTX 3090 ....... 1.50 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 ....... 1.50 |=================================================
NVIDIA RTX 4070 TI .... 1.50 |=================================================
NVIDIA RTX 3090 ....... 1.51 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 31.59 |===============================================
NVIDIA RTX 4070 ....... 31.45 |===============================================
NVIDIA RTX 4070 TI .... 31.70 |================================================
NVIDIA RTX 3090 ....... 31.98 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 256 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 TI .... 1.50 |=================================================
NVIDIA RTX 3090 ....... 1.51 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 33.40 |================================================
NVIDIA RTX 4070 ....... 33.32 |================================================
NVIDIA RTX 4070 TI .... 33.29 |================================================
NVIDIA RTX 3090 ....... 33.53 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 512 - Model: VGG-16
images/sec > Higher Is Better
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 33.97 |================================================
NVIDIA RTX 4070 ....... 33.93 |================================================
NVIDIA RTX 4070 TI .... 34.06 |================================================
NVIDIA RTX 3090 ....... 33.93 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 12.62 |===============================================
NVIDIA RTX 4070 ....... 12.78 |================================================
NVIDIA RTX 4070 TI .... 12.79 |================================================
NVIDIA RTX 3090 ....... 12.82 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 4.35 |=================================================
NVIDIA RTX 4070 ....... 4.34 |=================================================
NVIDIA RTX 4070 TI .... 4.32 |=================================================
NVIDIA RTX 3090 ....... 4.35 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 256 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 34.16 |===============================================
NVIDIA RTX 4070 TI .... 34.61 |================================================
NVIDIA RTX 3090 ....... 34.46 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 512 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 35.10 |===============================================
NVIDIA RTX 4070 ....... 35.21 |================================================
NVIDIA RTX 4070 TI .... 35.44 |================================================
NVIDIA RTX 3090 ....... 35.58 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 15.67 |================================================
NVIDIA RTX 4070 ....... 15.66 |================================================
NVIDIA RTX 4070 TI .... 15.69 |================================================
NVIDIA RTX 3090 ....... 15.68 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 5.46 |=================================================
NVIDIA RTX 4070 ....... 5.49 |=================================================
NVIDIA RTX 4070 TI .... 5.46 |=================================================
NVIDIA RTX 3090 ....... 5.49 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 15.61 |===============================================
NVIDIA RTX 4070 ....... 15.63 |===============================================
NVIDIA RTX 4070 TI .... 15.81 |================================================
NVIDIA RTX 3090 ....... 15.67 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 5.51 |================================================
NVIDIA RTX 4070 ....... 5.55 |=================================================
NVIDIA RTX 4070 TI .... 5.50 |================================================
NVIDIA RTX 3090 ....... 5.57 |=================================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 15.52 |================================================
NVIDIA RTX 4070 ....... 15.54 |================================================
NVIDIA RTX 4070 TI .... 15.50 |================================================
NVIDIA RTX 3090 ....... 15.63 |================================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER . 5.55 |=================================================
NVIDIA RTX 4070 ....... 5.55 |=================================================
NVIDIA RTX 4070 TI .... 5.53 |=================================================
NVIDIA RTX 3090 ....... 5.57 |=================================================
ViennaCL 1.7.1
Test: CPU BLAS - sCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 132 |==================================================
NVIDIA RTX 4070 ....... 131 |==================================================
NVIDIA RTX 4070 TI .... 132 |==================================================
NVIDIA RTX 3090 ....... 132 |==================================================
ViennaCL 1.7.1
Test: CPU BLAS - sAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 156 |==================================================
NVIDIA RTX 4070 ....... 153 |=================================================
NVIDIA RTX 4070 TI .... 156 |==================================================
NVIDIA RTX 3090 ....... 154 |=================================================
ViennaCL 1.7.1
Test: CPU BLAS - sDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 165.0 |===============================================
NVIDIA RTX 4070 ....... 166.0 |===============================================
NVIDIA RTX 4070 TI .... 168.0 |================================================
NVIDIA RTX 3090 ....... 132.1 |======================================
ViennaCL 1.7.1
Test: CPU BLAS - dCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 70.8 |=================================================
NVIDIA RTX 4070 ....... 71.0 |=================================================
NVIDIA RTX 4070 TI .... 71.3 |=================================================
NVIDIA RTX 3090 ....... 70.2 |================================================
ViennaCL 1.7.1
Test: CPU BLAS - dAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 87.2 |=================================================
NVIDIA RTX 4070 ....... 86.8 |=================================================
NVIDIA RTX 4070 TI .... 87.3 |=================================================
NVIDIA RTX 3090 ....... 86.2 |================================================
ViennaCL 1.7.1
Test: CPU BLAS - dDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 96.8 |=================================================
NVIDIA RTX 4070 ....... 96.7 |=================================================
NVIDIA RTX 4070 TI .... 96.4 |=================================================
NVIDIA RTX 3090 ....... 95.2 |================================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-N
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 102 |==================================================
NVIDIA RTX 4070 ....... 103 |==================================================
NVIDIA RTX 4070 TI .... 103 |==================================================
NVIDIA RTX 3090 ....... 103 |==================================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-T
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 109.0 |================================================
NVIDIA RTX 4070 ....... 109.0 |================================================
NVIDIA RTX 4070 TI .... 102.7 |=============================================
NVIDIA RTX 3090 ....... 110.0 |================================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 119 |=================================================
NVIDIA RTX 4070 ....... 122 |==================================================
NVIDIA RTX 4070 TI .... 117 |================================================
NVIDIA RTX 3090 ....... 113 |==============================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 117 |================================================
NVIDIA RTX 4070 ....... 122 |==================================================
NVIDIA RTX 4070 TI .... 118 |================================================
NVIDIA RTX 3090 ....... 119 |=================================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 115 |==============================================
NVIDIA RTX 4070 ....... 121 |================================================
NVIDIA RTX 4070 TI .... 125 |==================================================
NVIDIA RTX 3090 ....... 121 |================================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 122 |=================================================
NVIDIA RTX 4070 ....... 118 |================================================
NVIDIA RTX 4070 TI .... 124 |==================================================
NVIDIA RTX 3090 ....... 113 |==============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 334 |==============================================
NVIDIA RTX 4070 ....... 330 |=============================================
NVIDIA RTX 4070 TI .... 336 |==============================================
NVIDIA RTX 3090 ....... 363 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 392 |=======================================
NVIDIA RTX 4070 ....... 389 |=======================================
NVIDIA RTX 4070 TI .... 393 |=======================================
NVIDIA RTX 3090 ....... 498 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 370 |=================================================
NVIDIA RTX 4070 ....... 362 |================================================
NVIDIA RTX 4070 TI .... 365 |=================================================
NVIDIA RTX 3090 ....... 376 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 423 |===================================
NVIDIA RTX 4070 ....... 423 |===================================
NVIDIA RTX 4070 TI .... 424 |===================================
NVIDIA RTX 3090 ....... 605 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 437 |==============================
NVIDIA RTX 4070 ....... 455 |===============================
NVIDIA RTX 4070 TI .... 437 |==============================
NVIDIA RTX 3090 ....... 724 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 458 |===================================
NVIDIA RTX 4070 ....... 456 |===================================
NVIDIA RTX 4070 TI .... 457 |===================================
NVIDIA RTX 3090 ....... 659 |==================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-N
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 210 |==================================================
NVIDIA RTX 4070 ....... 209 |==================================================
NVIDIA RTX 4070 TI .... 211 |==================================================
NVIDIA RTX 3090 ....... 187 |============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-T
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 389 |==================================================
NVIDIA RTX 4070 ....... 387 |=================================================
NVIDIA RTX 4070 TI .... 391 |==================================================
NVIDIA RTX 3090 ....... 374 |================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 577 |================================================
NVIDIA RTX 4070 ....... 473 |=======================================
NVIDIA RTX 4070 TI .... 604 |==================================================
NVIDIA RTX 3090 ....... 592 |=================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 584 |================================================
NVIDIA RTX 4070 ....... 477 |=======================================
NVIDIA RTX 4070 TI .... 612 |==================================================
NVIDIA RTX 3090 ....... 595 |=================================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 599 |===============================================
NVIDIA RTX 4070 ....... 494 |=======================================
NVIDIA RTX 4070 TI .... 634 |==================================================
NVIDIA RTX 3090 ....... 594 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER . 613 |===============================================
NVIDIA RTX 4070 ....... 502 |=======================================
NVIDIA RTX 4070 TI .... 648 |==================================================
NVIDIA RTX 3090 ....... 593 |==============================================
VkFFT 1.2.31
Test: FFT + iFFT R2C / C2R
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 54794 |===============================================
NVIDIA RTX 4070 ....... 47097 |=========================================
NVIDIA RTX 4070 TI .... 55446 |================================================
NVIDIA RTX 3090 ....... 48418 |==========================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in half precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 131705 |=======================
NVIDIA RTX 4070 ....... 137762 |========================
NVIDIA RTX 4070 TI .... 136210 |=======================
NVIDIA RTX 3090 ....... 273221 |===============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C Bluestein in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 15166 |================================================
NVIDIA RTX 4070 ....... 13714 |===========================================
NVIDIA RTX 4070 TI .... 15125 |================================================
NVIDIA RTX 3090 ....... 14205 |=============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in double precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 24317 |======================================
NVIDIA RTX 4070 ....... 22390 |===================================
NVIDIA RTX 4070 TI .... 25431 |=======================================
NVIDIA RTX 3090 ....... 30912 |================================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 73929 |========================
NVIDIA RTX 4070 ....... 77774 |==========================
NVIDIA RTX 4070 TI .... 73942 |========================
NVIDIA RTX 3090 ....... 141876 |===============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C multidimensional in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 50299 |===============================================
NVIDIA RTX 4070 ....... 47212 |============================================
NVIDIA RTX 4070 TI .... 51528 |================================================
NVIDIA RTX 3090 ....... 50856 |===============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C Bluestein benchmark in double precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 4451 |===============================================
NVIDIA RTX 4070 ....... 3886 |=========================================
NVIDIA RTX 4070 TI .... 4647 |=================================================
NVIDIA RTX 3090 ....... 4195 |============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER . 75078 |========================
NVIDIA RTX 4070 ....... 79057 |==========================
NVIDIA RTX 4070 TI .... 75141 |========================
NVIDIA RTX 3090 ....... 144311 |===============================================
vkpeak 20230730
fp32-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 20263.13 |===================================================
NVIDIA RTX 3090 . 20319.80 |===================================================
NVIDIA RTX 3090 . 20317.63 |===================================================
NVIDIA RTX 3090 . 20353.95 |===================================================
vkpeak 20230730
fp32-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 26563.72 |===================================================
NVIDIA RTX 3090 . 26630.59 |===================================================
NVIDIA RTX 3090 . 26767.21 |===================================================
NVIDIA RTX 3090 . 26699.66 |===================================================
vkpeak 20230730
fp16-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 20080.47 |===================================================
NVIDIA RTX 3090 . 20113.01 |===================================================
NVIDIA RTX 3090 . 20134.06 |===================================================
NVIDIA RTX 3090 . 20151.44 |===================================================
vkpeak 20230730
fp16-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 39771.97 |===================================================
NVIDIA RTX 3090 . 39835.21 |===================================================
NVIDIA RTX 3090 . 39746.91 |===================================================
NVIDIA RTX 3090 . 39860.80 |===================================================
vkpeak 20230730
fp64-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 638.70 |=====================================================
NVIDIA RTX 3090 . 638.75 |=====================================================
NVIDIA RTX 3090 . 638.77 |=====================================================
NVIDIA RTX 3090 . 638.84 |=====================================================
vkpeak 20230730
fp64-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 . 638.72 |=====================================================
NVIDIA RTX 3090 . 638.77 |=====================================================
NVIDIA RTX 3090 . 639.52 |=====================================================
NVIDIA RTX 3090 . 638.74 |=====================================================
vkpeak 20230730
int32-scalar
GIOPS > Higher Is Better
NVIDIA RTX 3090 . 20280.33 |===================================================
NVIDIA RTX 3090 . 20290.30 |===================================================
NVIDIA RTX 3090 . 20315.10 |===================================================
NVIDIA RTX 3090 . 20295.27 |===================================================
vkpeak 20230730
int32-vec4
GIOPS > Higher Is Better
NVIDIA RTX 3090 . 19996.92 |===================================================
NVIDIA RTX 3090 . 20017.06 |===================================================
NVIDIA RTX 3090 . 20005.52 |===================================================
NVIDIA RTX 3090 . 20009.73 |===================================================
vkpeak 20230730
int16-scalar
GIOPS > Higher Is Better
NVIDIA RTX 3090 . 13259.97 |===================================================
NVIDIA RTX 3090 . 13273.53 |===================================================
NVIDIA RTX 3090 . 13225.17 |===================================================
NVIDIA RTX 3090 . 13264.91 |===================================================
vkpeak 20230730
int16-vec4
GIOPS > Higher Is Better
NVIDIA RTX 3090 . 16331.16 |===================================================
NVIDIA RTX 3090 . 16338.23 |===================================================
NVIDIA RTX 3090 . 16302.58 |===================================================
NVIDIA RTX 3090 . 16329.72 |===================================================
VkResample 1.0
Upscale: 2x - Precision: Double
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 339.59 |======================================
NVIDIA RTX 4070 ....... 415.16 |===============================================
NVIDIA RTX 4070 TI .... 322.06 |====================================
NVIDIA RTX 3090 ....... 333.64 |======================================
VkResample 1.0
Upscale: 2x - Precision: Single
ms < Lower Is Better
NVIDIA RTX 4070 SUPER . 18.49 |================================================
NVIDIA RTX 4070 ....... 18.02 |===============================================
NVIDIA RTX 4070 TI .... 18.46 |================================================
NVIDIA RTX 3090 ....... 10.32 |===========================
Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: No
Seconds < Lower Is Better
Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: Yes
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER . 2.855 |===========================================
NVIDIA RTX 4070 ....... 3.168 |===============================================
NVIDIA RTX 4070 TI .... 2.854 |===========================================
NVIDIA RTX 3090 ....... 3.202 |================================================