Intel Core i9-13900K testing with a ASUS TUF GAMING Z790-PRO WIFI (1630 BIOS) and ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GB on EndeavourOS rolling via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2402174-SADD-240211636
RTX 4070 SUPER
Intel Core i9-13900K testing with a ASUS TUF GAMING Z790-PRO WIFI (1630 BIOS) and ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GB on EndeavourOS rolling via the Phoronix Test Suite.
NVIDIA RTX 4070 SUPER:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: ASUS NVIDIA GeForce RTX 4070 SUPER 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 4070:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: MSI NVIDIA GeForce RTX 4070 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 4070 TI:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: NVIDIA GeForce RTX 4070 Ti 12GB, Audio: Realtek ALC1220, Monitor: ARZOPA, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.1-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 3090:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1401 BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001, Graphics: NVIDIA GeForce RTX 3090 24GB, Audio: Realtek ALC1220, Monitor: PI-KVM Video, Network: Intel I226-V + Intel Device 7a70
OS: EndeavourOS rolling, Kernel: 6.7.4-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
NVIDIA RTX 4070 TI SUPER:
Processor: Intel Core i9-13900K @ 5.50GHz (24 Cores / 32 Threads), Motherboard: ASUS TUF GAMING Z790-PRO WIFI (1630 BIOS), Chipset: Intel Raptor Lake-S PCH, Memory: 32GB, Disk: 4001GB Seagate ZP4000GP304001 + 0GB CD-ROM Drive, Graphics: ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GB, Audio: Realtek ALC1220, Monitor: PI-KVM Video, Network: Intel I226-V + Intel Raptor Lake-S PCH CNVi WiFi
OS: EndeavourOS rolling, Kernel: 6.7.4-arch1-1 (x86_64), Desktop: KDE Plasma 5.27.10, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 2.1 AMD-APP (3602.0) + OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.1 20230801 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080
ArrayFire 3.9
Test: Conjugate Gradient OpenCL
Blender 4.0
Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 5.57 |=========================================
NVIDIA RTX 4070 .......... 6.21 |=============================================
NVIDIA RTX 4070 TI ....... 5.43 |========================================
NVIDIA RTX 3090 .......... 6.31 |==============================================
NVIDIA RTX 4070 TI SUPER . 5.04 |=====================================
Blender 4.0
Blend File: Classroom - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 12.60 |=====================================
NVIDIA RTX 4070 .......... 14.86 |============================================
NVIDIA RTX 4070 TI ....... 12.30 |====================================
NVIDIA RTX 3090 .......... 15.26 |=============================================
NVIDIA RTX 4070 TI SUPER . 11.20 |=================================
Blender 4.0
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 9.45 |=======================================
NVIDIA RTX 4070 .......... 11.03 |=============================================
NVIDIA RTX 4070 TI ....... 9.02 |=====================================
NVIDIA RTX 3090 .......... 10.64 |===========================================
NVIDIA RTX 4070 TI SUPER . 8.32 |==================================
Blender 4.0
Blend File: Barbershop - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 51.30 |========================================
NVIDIA RTX 4070 .......... 58.44 |=============================================
NVIDIA RTX 4070 TI ....... 50.73 |=======================================
NVIDIA RTX 3090 .......... 54.30 |==========================================
NVIDIA RTX 4070 TI SUPER . 44.49 |==================================
Blender 4.0
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 14.29 |=====================================
NVIDIA RTX 4070 .......... 16.55 |===========================================
NVIDIA RTX 4070 TI ....... 13.97 |====================================
NVIDIA RTX 3090 .......... 17.30 |=============================================
NVIDIA RTX 4070 TI SUPER . 12.56 |=================================
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 331.8 |========================================
NVIDIA RTX 4070 .......... 330.3 |========================================
NVIDIA RTX 4070 TI ....... 333.3 |========================================
NVIDIA RTX 3090 .......... 360.8 |============================================
NVIDIA RTX 4070 TI SUPER . 370.7 |=============================================
cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 446.2 |========================
NVIDIA RTX 4070 .......... 446.3 |========================
NVIDIA RTX 4070 TI ....... 446.3 |========================
NVIDIA RTX 3090 .......... 825.8 |=============================================
NVIDIA RTX 4070 TI SUPER . 595.2 |================================
cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 407.5 |========================
NVIDIA RTX 4070 .......... 406.7 |========================
NVIDIA RTX 4070 TI ....... 412.2 |=========================
NVIDIA RTX 3090 .......... 753.8 |=============================================
NVIDIA RTX 4070 TI SUPER . 551.9 |=================================
clpeak 1.1.2
OpenCL Test: Integer Compute INT
GIOPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 18170.54 |==================================
NVIDIA RTX 4070 .......... 14555.19 |============================
NVIDIA RTX 4070 TI ....... 19821.10 |======================================
NVIDIA RTX 3090 .......... 17923.33 |==================================
NVIDIA RTX 4070 TI SUPER . 22171.25 |==========================================
clpeak 1.1.2
OpenCL Test: Single-Precision Float
GFLOPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 35492.69 |==================================
NVIDIA RTX 4070 .......... 28479.39 |============================
NVIDIA RTX 4070 TI ....... 38691.73 |======================================
NVIDIA RTX 3090 .......... 34906.79 |==================================
NVIDIA RTX 4070 TI SUPER . 43244.79 |==========================================
clpeak 1.1.2
OpenCL Test: Double-Precision Double
GFLOPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 630.11 |=====================================
NVIDIA RTX 4070 .......... 515.17 |==============================
NVIDIA RTX 4070 TI ....... 667.05 |=======================================
NVIDIA RTX 3090 .......... 642.23 |======================================
NVIDIA RTX 4070 TI SUPER . 750.36 |============================================
clpeak 1.1.2
OpenCL Test: Global Memory Bandwidth
GBPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 437.65 |========================
NVIDIA RTX 4070 .......... 437.21 |========================
NVIDIA RTX 4070 TI ....... 437.63 |========================
NVIDIA RTX 3090 .......... 816.55 |============================================
NVIDIA RTX 4070 TI SUPER . 582.84 |===============================
FAHBench 2.3.2
Ns Per Day > Higher Is Better
NVIDIA RTX 4070 SUPER .... 366.06 |=========================================
NVIDIA RTX 4070 .......... 317.20 |===================================
NVIDIA RTX 4070 TI ....... 382.16 |===========================================
NVIDIA RTX 3090 .......... 343.02 |======================================
NVIDIA RTX 4070 TI SUPER . 394.74 |============================================
FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 5.912 |=======================================
NVIDIA RTX 4070 .......... 6.906 |=============================================
NVIDIA RTX 4070 TI ....... 5.226 |==================================
NVIDIA RTX 3090 .......... 5.741 |=====================================
NVIDIA RTX 4070 TI SUPER . 0.501 |===
GpuOwl 7.2.1
Exponent: 57885161
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER .... 869.07 |====================================
NVIDIA RTX 4070 .......... 714.80 |==============================
NVIDIA RTX 4070 TI ....... 919.13 |=======================================
NVIDIA RTX 3090 .......... 866.31 |====================================
NVIDIA RTX 4070 TI SUPER . 1025.99 |===========================================
GpuOwl 7.2.1
Exponent: 77936867
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER .... 646.41 |=====================================
NVIDIA RTX 4070 .......... 530.32 |===============================
NVIDIA RTX 4070 TI ....... 676.59 |=======================================
NVIDIA RTX 3090 .......... 645.99 |=====================================
NVIDIA RTX 4070 TI SUPER . 761.61 |============================================
GpuOwl 7.2.1
Exponent: 332220523
Iterations / Second > Higher Is Better
NVIDIA RTX 4070 SUPER .... 137.44 |=====================================
NVIDIA RTX 4070 .......... 112.61 |==============================
NVIDIA RTX 4070 TI ....... 145.84 |=======================================
NVIDIA RTX 3090 .......... 137.32 |=====================================
NVIDIA RTX 4070 TI SUPER . 163.41 |============================================
Hashcat 6.2.4
Benchmark: MD5
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 67583033333 |================================
NVIDIA RTX 4070 .......... 56147866667 |===========================
NVIDIA RTX 4070 TI ....... 73312233333 |===================================
NVIDIA RTX 3090 .......... 67177300000 |================================
NVIDIA RTX 4070 TI SUPER . 82004966667 |=======================================
Hashcat 6.2.4
Benchmark: SHA1
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 22132600000 |=================================
NVIDIA RTX 4070 .......... 18202466667 |===========================
NVIDIA RTX 4070 TI ....... 23532400000 |===================================
NVIDIA RTX 3090 .......... 21323733333 |================================
NVIDIA RTX 4070 TI SUPER . 26388600000 |=======================================
Hashcat 6.2.4
Benchmark: 7-Zip
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 1176467 |====================================
NVIDIA RTX 4070 .......... 976967 |==============================
NVIDIA RTX 4070 TI ....... 1262633 |======================================
NVIDIA RTX 3090 .......... 1056000 |================================
NVIDIA RTX 4070 TI SUPER . 1420700 |===========================================
Hashcat 6.2.4
Benchmark: SHA-512
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 3232733333 |=================================
NVIDIA RTX 4070 .......... 2673300000 |============================
NVIDIA RTX 4070 TI ....... 3462500000 |====================================
NVIDIA RTX 3090 .......... 3081866667 |================================
NVIDIA RTX 4070 TI SUPER . 3887033333 |========================================
Hashcat 6.2.4
Benchmark: TrueCrypt RIPEMD160 + XTS
H/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 802967 |=====================================
NVIDIA RTX 4070 .......... 660967 |==============================
NVIDIA RTX 4070 TI ....... 858600 |=======================================
NVIDIA RTX 3090 .......... 797833 |=====================================
NVIDIA RTX 4070 TI SUPER . 961733 |============================================
IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Bedroom
M samples/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 19.80 |====================================
NVIDIA RTX 4070 .......... 18.20 |=================================
NVIDIA RTX 4070 TI ....... 20.26 |=====================================
NVIDIA RTX 3090 .......... 20.96 |======================================
NVIDIA RTX 4070 TI SUPER . 24.57 |=============================================
IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Supercar
M samples/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 52.81 |=======================================
NVIDIA RTX 4070 .......... 48.52 |====================================
NVIDIA RTX 4070 TI ....... 53.59 |=======================================
NVIDIA RTX 3090 .......... 52.01 |======================================
NVIDIA RTX 4070 TI SUPER . 61.34 |=============================================
LeelaChessZero 0.30
Backend: OpenCL
Nodes Per Second > Higher Is Better
Libplacebo 5.229.1
FPS > Higher Is Better
Libplacebo 5.229.1
Test: deband_heavy
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 2186.70 |======================================
NVIDIA RTX 4070 .......... 1847.98 |================================
NVIDIA RTX 4070 .......... 1844.08 |================================
NVIDIA RTX 4070 .......... 1843.26 |================================
NVIDIA RTX 4070 TI ....... 2306.67 |========================================
NVIDIA RTX 4070 TI ....... 2306.56 |========================================
NVIDIA RTX 3090 .......... 2017.75 |===================================
NVIDIA RTX 3090 .......... 2015.93 |===================================
NVIDIA RTX 3090 .......... 2024.61 |===================================
NVIDIA RTX 3090 .......... 2020.16 |===================================
NVIDIA RTX 4070 TI SUPER . 2493.29 |===========================================
NVIDIA RTX 4070 TI SUPER . 2495.92 |===========================================
Libplacebo 5.229.1
Test: polar_nocompute
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 2327.55 |======================================
NVIDIA RTX 4070 .......... 1972.78 |================================
NVIDIA RTX 4070 .......... 1969.19 |================================
NVIDIA RTX 4070 .......... 1968.37 |================================
NVIDIA RTX 4070 TI ....... 2461.23 |========================================
NVIDIA RTX 4070 TI ....... 2459.03 |========================================
NVIDIA RTX 3090 .......... 2119.89 |==================================
NVIDIA RTX 3090 .......... 2116.50 |==================================
NVIDIA RTX 3090 .......... 2126.31 |==================================
NVIDIA RTX 3090 .......... 2116.79 |==================================
NVIDIA RTX 4070 TI SUPER . 2646.70 |===========================================
NVIDIA RTX 4070 TI SUPER . 2653.03 |===========================================
Libplacebo 5.229.1
Test: hdr_peakdetect
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 3292.37 |============================
NVIDIA RTX 4070 .......... 3310.02 |============================
NVIDIA RTX 4070 .......... 3452.43 |=============================
NVIDIA RTX 4070 .......... 3329.26 |============================
NVIDIA RTX 4070 TI ....... 3544.60 |==============================
NVIDIA RTX 4070 TI ....... 3475.06 |=============================
NVIDIA RTX 3090 .......... 4997.08 |==========================================
NVIDIA RTX 3090 .......... 5104.10 |===========================================
NVIDIA RTX 3090 .......... 4969.74 |==========================================
NVIDIA RTX 3090 .......... 5055.88 |===========================================
NVIDIA RTX 4070 TI SUPER . 3931.57 |=================================
NVIDIA RTX 4070 TI SUPER . 3913.34 |=================================
Libplacebo 5.229.1
Test: hdr_lut
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 3905.98 |==========================================
NVIDIA RTX 4070 .......... 3927.11 |==========================================
NVIDIA RTX 4070 .......... 3940.40 |===========================================
NVIDIA RTX 4070 .......... 3946.90 |===========================================
NVIDIA RTX 4070 TI ....... 3971.61 |===========================================
NVIDIA RTX 4070 TI ....... 3976.04 |===========================================
NVIDIA RTX 3090 .......... 3313.26 |====================================
NVIDIA RTX 3090 .......... 3376.85 |=====================================
NVIDIA RTX 3090 .......... 3333.77 |====================================
NVIDIA RTX 3090 .......... 3369.88 |====================================
NVIDIA RTX 4070 TI SUPER . 3845.51 |==========================================
NVIDIA RTX 4070 TI SUPER . 3822.16 |=========================================
Libplacebo 5.229.1
Test: av1_grain_lap
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 4171.00 |===========================================
NVIDIA RTX 4070 .......... 4103.40 |==========================================
NVIDIA RTX 4070 .......... 4126.40 |===========================================
NVIDIA RTX 4070 .......... 4152.41 |===========================================
NVIDIA RTX 4070 TI ....... 4140.87 |===========================================
NVIDIA RTX 4070 TI ....... 4143.96 |===========================================
NVIDIA RTX 3090 .......... 4126.89 |===========================================
NVIDIA RTX 3090 .......... 4096.48 |==========================================
NVIDIA RTX 3090 .......... 4120.27 |==========================================
NVIDIA RTX 3090 .......... 4100.36 |==========================================
NVIDIA RTX 4070 TI SUPER . 4057.41 |==========================================
NVIDIA RTX 4070 TI SUPER . 4044.72 |==========================================
LuxCoreRender 2.6
Scene: DLSC - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 13.59 |======================================
NVIDIA RTX 4070 .......... 11.74 |=================================
NVIDIA RTX 4070 TI ....... 13.95 |=======================================
NVIDIA RTX 3090 .......... 12.99 |====================================
NVIDIA RTX 4070 TI SUPER . 16.23 |=============================================
LuxCoreRender 2.6
Scene: Danish Mood - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 10.56 |======================================
NVIDIA RTX 4070 .......... 8.89 |================================
NVIDIA RTX 4070 TI ....... 10.99 |========================================
NVIDIA RTX 3090 .......... 10.20 |=====================================
NVIDIA RTX 4070 TI SUPER . 12.42 |=============================================
LuxCoreRender 2.6
Scene: Orange Juice - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 11.72 |=======================================
NVIDIA RTX 4070 .......... 10.40 |==================================
NVIDIA RTX 4070 TI ....... 11.89 |=======================================
NVIDIA RTX 3090 .......... 12.14 |========================================
NVIDIA RTX 4070 TI SUPER . 13.64 |=============================================
LuxCoreRender 2.6
Scene: LuxCore Benchmark - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 12.82 |=======================================
NVIDIA RTX 4070 .......... 10.92 |==================================
NVIDIA RTX 4070 TI ....... 13.23 |=========================================
NVIDIA RTX 3090 .......... 13.12 |========================================
NVIDIA RTX 4070 TI SUPER . 14.61 |=============================================
LuxCoreRender 2.6
Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 27.67 |=====================================
NVIDIA RTX 4070 .......... 23.26 |===============================
NVIDIA RTX 4070 TI ....... 27.71 |=====================================
NVIDIA RTX 3090 .......... 33.29 |=============================================
NVIDIA RTX 4070 TI SUPER . 31.86 |===========================================
MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 587219538.2 |===================================
NVIDIA RTX 4070 .......... 516770131.2 |===============================
NVIDIA RTX 4070 TI ....... 619106132.5 |=====================================
NVIDIA RTX 3090 .......... 484098913.8 |=============================
NVIDIA RTX 4070 TI SUPER . 656484783.7 |=======================================
NAMD CUDA 2.14
ATPase Simulation - 327,506 Atoms
days/ns < Lower Is Better
NVIDIA RTX 4070 SUPER .... 0.06791 |===========================
NVIDIA RTX 4070 .......... 0.07498 |==============================
NVIDIA RTX 4070 TI ....... 0.06788 |===========================
NVIDIA RTX 3090 .......... 0.10822 |===========================================
NVIDIA RTX 4070 TI SUPER . 0.07715 |===============================
NCNN 20230517
Target: Vulkan GPU
ms < Lower Is Better
NCNN 20230517
Target: Vulkan GPU - Model: mobilenet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 8.62 |================================
NVIDIA RTX 4070 .......... 10.14 |======================================
NVIDIA RTX 4070 .......... 7.20 |===========================
NVIDIA RTX 4070 TI ....... 8.43 |===============================
NVIDIA RTX 4070 TI ....... 7.45 |============================
NVIDIA RTX 3090 .......... 12.07 |=============================================
NVIDIA RTX 3090 .......... 6.92 |==========================
NVIDIA RTX 3090 .......... 7.27 |===========================
NVIDIA RTX 4070 TI SUPER . 6.28 |=======================
NVIDIA RTX 4070 TI SUPER . 7.48 |============================
NCNN 20230517
Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 3.03 |==============================
NVIDIA RTX 4070 .......... 4.69 |==============================================
NVIDIA RTX 4070 .......... 2.48 |========================
NVIDIA RTX 4070 TI ....... 2.43 |========================
NVIDIA RTX 4070 TI ....... 2.54 |=========================
NVIDIA RTX 3090 .......... 2.65 |==========================
NVIDIA RTX 3090 .......... 2.67 |==========================
NVIDIA RTX 3090 .......... 2.34 |=======================
NVIDIA RTX 4070 TI SUPER . 2.42 |========================
NVIDIA RTX 4070 TI SUPER . 2.70 |==========================
NCNN 20230517
Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 2.25 |============
NVIDIA RTX 4070 .......... 8.71 |==============================================
NVIDIA RTX 4070 .......... 2.15 |===========
NVIDIA RTX 4070 TI ....... 2.09 |===========
NVIDIA RTX 3090 .......... 3.19 |=================
NVIDIA RTX 3090 .......... 2.20 |============
NVIDIA RTX 3090 .......... 2.21 |============
NVIDIA RTX 4070 TI SUPER . 1.87 |==========
NVIDIA RTX 4070 TI SUPER . 2.16 |===========
NCNN 20230517
Target: Vulkan GPU - Model: shufflenet-v2
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 2.31 |=========================
NVIDIA RTX 4070 .......... 2.11 |=======================
NVIDIA RTX 4070 .......... 2.08 |=======================
NVIDIA RTX 4070 TI ....... 2.03 |======================
NVIDIA RTX 4070 TI ....... 2.01 |======================
NVIDIA RTX 3090 .......... 4.17 |==============================================
NVIDIA RTX 3090 .......... 2.09 |=======================
NVIDIA RTX 3090 .......... 2.04 |=======================
NVIDIA RTX 4070 TI SUPER . 2.05 |=======================
NVIDIA RTX 4070 TI SUPER . 2.13 |=======================
NCNN 20230517
Target: Vulkan GPU - Model: mnasnet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 3.85 |===========================================
NVIDIA RTX 4070 .......... 2.22 |=========================
NVIDIA RTX 4070 .......... 2.24 |=========================
NVIDIA RTX 4070 TI ....... 2.30 |==========================
NVIDIA RTX 4070 TI ....... 4.14 |==============================================
NVIDIA RTX 3090 .......... 2.24 |=========================
NVIDIA RTX 3090 .......... 2.30 |==========================
NVIDIA RTX 3090 .......... 2.16 |========================
NVIDIA RTX 4070 TI SUPER . 2.21 |=========================
NVIDIA RTX 4070 TI SUPER . 2.26 |=========================
NCNN 20230517
Target: Vulkan GPU - Model: efficientnet-b0
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 5.07 |================
NVIDIA RTX 4070 .......... 3.46 |===========
NVIDIA RTX 4070 .......... 3.59 |============
NVIDIA RTX 4070 TI ....... 3.49 |===========
NVIDIA RTX 4070 TI ....... 3.46 |===========
NVIDIA RTX 3090 .......... 13.87 |=============================================
NVIDIA RTX 3090 .......... 3.54 |===========
NVIDIA RTX 3090 .......... 3.34 |===========
NVIDIA RTX 4070 TI SUPER . 3.36 |===========
NVIDIA RTX 4070 TI SUPER . 3.48 |===========
NCNN 20230517
Target: Vulkan GPU - Model: blazeface
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 0.84 |============================================
NVIDIA RTX 4070 .......... 0.84 |============================================
NVIDIA RTX 4070 TI ....... 0.81 |==========================================
NVIDIA RTX 4070 TI ....... 0.82 |===========================================
NVIDIA RTX 3090 .......... 0.86 |=============================================
NVIDIA RTX 3090 .......... 0.84 |============================================
NVIDIA RTX 3090 .......... 0.87 |=============================================
NVIDIA RTX 4070 TI SUPER . 0.86 |=============================================
NVIDIA RTX 4070 TI SUPER . 0.88 |==============================================
NCNN 20230517
Target: Vulkan GPU - Model: googlenet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 11.04 |=============================================
NVIDIA RTX 4070 .......... 6.06 |=========================
NVIDIA RTX 4070 TI ....... 5.87 |========================
NVIDIA RTX 4070 TI ....... 7.37 |==============================
NVIDIA RTX 3090 .......... 7.49 |===============================
NVIDIA RTX 3090 .......... 6.11 |=========================
NVIDIA RTX 3090 .......... 6.14 |=========================
NVIDIA RTX 4070 TI SUPER . 6.25 |=========================
NVIDIA RTX 4070 TI SUPER . 6.46 |==========================
NCNN 20230517
Target: Vulkan GPU - Model: vgg16
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 117.81 |====================================
NVIDIA RTX 4070 .......... 54.54 |================
NVIDIA RTX 4070 .......... 45.52 |==============
NVIDIA RTX 4070 TI ....... 32.05 |==========
NVIDIA RTX 4070 TI ....... 34.49 |==========
NVIDIA RTX 3090 .......... 145.72 |============================================
NVIDIA RTX 3090 .......... 24.45 |=======
NVIDIA RTX 3090 .......... 17.88 |=====
NVIDIA RTX 4070 TI SUPER . 21.76 |=======
NVIDIA RTX 4070 TI SUPER . 24.85 |========
NCNN 20230517
Target: Vulkan GPU - Model: resnet18
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 8.97 |=======================
NVIDIA RTX 4070 .......... 8.58 |======================
NVIDIA RTX 4070 .......... 5.11 |=============
NVIDIA RTX 4070 TI ....... 5.47 |==============
NVIDIA RTX 4070 TI ....... 7.74 |====================
NVIDIA RTX 3090 .......... 17.41 |=============================================
NVIDIA RTX 3090 .......... 8.94 |=======================
NVIDIA RTX 3090 .......... 4.12 |===========
NVIDIA RTX 4070 TI SUPER . 4.64 |============
NVIDIA RTX 4070 TI SUPER . 7.58 |====================
NCNN 20230517
Target: Vulkan GPU - Model: alexnet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 16.17 |=============================================
NVIDIA RTX 4070 .......... 9.33 |==========================
NVIDIA RTX 4070 .......... 5.78 |================
NVIDIA RTX 4070 TI ....... 3.74 |==========
NVIDIA RTX 4070 TI ....... 6.07 |=================
NVIDIA RTX 3090 .......... 3.69 |==========
NVIDIA RTX 3090 .......... 6.20 |=================
NVIDIA RTX 3090 .......... 3.60 |==========
NVIDIA RTX 4070 TI SUPER . 4.38 |============
NVIDIA RTX 4070 TI SUPER . 4.41 |============
NCNN 20230517
Target: Vulkan GPU - Model: resnet50
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 46.26 |=============================================
NVIDIA RTX 4070 .......... 8.24 |========
NVIDIA RTX 4070 .......... 8.72 |========
NVIDIA RTX 4070 TI ....... 14.32 |==============
NVIDIA RTX 4070 TI ....... 12.25 |============
NVIDIA RTX 3090 .......... 27.77 |===========================
NVIDIA RTX 3090 .......... 8.20 |========
NVIDIA RTX 3090 .......... 12.70 |============
NVIDIA RTX 4070 TI SUPER . 8.58 |========
NVIDIA RTX 4070 TI SUPER . 8.79 |=========
NCNN 20230517
Target: Vulkan GPU - Model: yolov4-tiny
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 63.82 |=============================================
NVIDIA RTX 4070 .......... 25.11 |==================
NVIDIA RTX 4070 .......... 20.74 |===============
NVIDIA RTX 4070 TI ....... 16.47 |============
NVIDIA RTX 4070 TI ....... 16.37 |============
NVIDIA RTX 3090 .......... 26.85 |===================
NVIDIA RTX 3090 .......... 13.31 |=========
NVIDIA RTX 3090 .......... 11.29 |========
NVIDIA RTX 4070 TI SUPER . 14.26 |==========
NVIDIA RTX 4070 TI SUPER . 17.20 |============
NCNN 20230517
Target: Vulkan GPU - Model: squeezenet_ssd
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 6.86 |==============================================
NVIDIA RTX 4070 .......... 5.27 |===================================
NVIDIA RTX 4070 .......... 5.18 |===================================
NVIDIA RTX 4070 TI ....... 5.36 |====================================
NVIDIA RTX 4070 TI ....... 6.13 |=========================================
NVIDIA RTX 3090 .......... 6.63 |============================================
NVIDIA RTX 3090 .......... 5.20 |===================================
NVIDIA RTX 3090 .......... 4.90 |=================================
NVIDIA RTX 4070 TI SUPER . 5.11 |==================================
NVIDIA RTX 4070 TI SUPER . 5.19 |===================================
NCNN 20230517
Target: Vulkan GPU - Model: regnety_400m
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 11.11 |=============================================
NVIDIA RTX 4070 .......... 6.50 |==========================
NVIDIA RTX 4070 .......... 6.21 |=========================
NVIDIA RTX 4070 TI ....... 5.97 |========================
NVIDIA RTX 4070 TI ....... 5.89 |========================
NVIDIA RTX 3090 .......... 8.06 |=================================
NVIDIA RTX 3090 .......... 6.47 |==========================
NVIDIA RTX 3090 .......... 6.73 |===========================
NVIDIA RTX 4070 TI SUPER . 6.77 |===========================
NVIDIA RTX 4070 TI SUPER . 6.59 |===========================
NCNN 20230517
Target: Vulkan GPU - Model: vision_transformer
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 844.61 |============================================
NVIDIA RTX 4070 .......... 281.56 |===============
NVIDIA RTX 4070 .......... 382.82 |====================
NVIDIA RTX 4070 TI ....... 390.18 |====================
NVIDIA RTX 4070 TI ....... 497.66 |==========================
NVIDIA RTX 3090 .......... 663.24 |===================================
NVIDIA RTX 3090 .......... 327.82 |=================
NVIDIA RTX 3090 .......... 354.57 |==================
NVIDIA RTX 4070 TI SUPER . 571.53 |==============================
NVIDIA RTX 4070 TI SUPER . 312.10 |================
NCNN 20230517
Target: Vulkan GPU - Model: FastestDet
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 2.86 |=====================
NVIDIA RTX 4070 .......... 2.34 |=================
NVIDIA RTX 4070 .......... 2.67 |===================
NVIDIA RTX 4070 TI ....... 2.84 |====================
NVIDIA RTX 4070 TI ....... 3.04 |======================
NVIDIA RTX 3090 .......... 6.38 |==============================================
NVIDIA RTX 3090 .......... 2.50 |==================
NVIDIA RTX 3090 .......... 2.65 |===================
NVIDIA RTX 4070 TI SUPER . 2.54 |==================
NVIDIA RTX 4070 TI SUPER . 2.55 |==================
NeatBench 5
Acceleration: GPU
FPS > Higher Is Better
NVIDIA RTX 4070 SUPER .... 4070.0 |============================================
NVIDIA RTX 4070 .......... 4070.0 |============================================
NVIDIA RTX 4070 TI ....... 4070.0 |============================================
NVIDIA RTX 3090 .......... 3090.0 |=================================
NVIDIA RTX 4070 TI SUPER . 2084.1 |=======================
OctaneBench 2020.1
Total Score
Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 720.97 |====================================
NVIDIA RTX 4070 .......... 648.00 |=================================
NVIDIA RTX 4070 TI ....... 735.94 |=====================================
NVIDIA RTX 3090 .......... 674.25 |==================================
NVIDIA RTX 4070 TI SUPER . 876.44 |============================================
PlaidML
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better
PlaidML
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
Examples Per Second > Higher Is Better
ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP64 Compute
TFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 0.621 |======================================
NVIDIA RTX 4070 .......... 0.510 |===============================
NVIDIA RTX 4070 TI ....... 0.660 |========================================
NVIDIA RTX 3090 .......... 0.637 |=======================================
NVIDIA RTX 4070 TI SUPER . 0.743 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP32 Compute
TFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 38.59 |======================================
NVIDIA RTX 4070 .......... 31.77 |===============================
NVIDIA RTX 4070 TI ....... 40.91 |========================================
NVIDIA RTX 3090 .......... 39.40 |=======================================
NVIDIA RTX 4070 TI SUPER . 45.95 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT64 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 4.214 |===========================================
NVIDIA RTX 4070 .......... 3.443 |===================================
NVIDIA RTX 4070 TI ....... 4.420 |=============================================
NVIDIA RTX 3090 .......... 3.135 |================================
NVIDIA RTX 4070 TI SUPER . 4.414 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT32 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 19.89 |======================================
NVIDIA RTX 4070 .......... 16.38 |===============================
NVIDIA RTX 4070 TI ....... 21.05 |========================================
NVIDIA RTX 3090 .......... 20.03 |======================================
NVIDIA RTX 4070 TI SUPER . 23.66 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT16 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 17.17 |======================================
NVIDIA RTX 4070 .......... 14.28 |===============================
NVIDIA RTX 4070 TI ....... 18.28 |========================================
NVIDIA RTX 3090 .......... 17.00 |=====================================
NVIDIA RTX 4070 TI SUPER . 20.50 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT8 Compute
TIOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 14.31 |=====================================
NVIDIA RTX 4070 .......... 12.12 |===============================
NVIDIA RTX 4070 TI ....... 15.73 |========================================
NVIDIA RTX 3090 .......... 13.73 |===================================
NVIDIA RTX 4070 TI SUPER . 17.62 |=============================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Read
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 464.86 |========================
NVIDIA RTX 4070 .......... 465.18 |========================
NVIDIA RTX 4070 TI ....... 465.07 |========================
NVIDIA RTX 3090 .......... 864.11 |============================================
NVIDIA RTX 4070 TI SUPER . 619.03 |================================
ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Write
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 455.01 |=======================
NVIDIA RTX 4070 .......... 459.43 |=======================
NVIDIA RTX 4070 TI ....... 457.17 |=======================
NVIDIA RTX 3090 .......... 887.31 |============================================
NVIDIA RTX 4070 TI SUPER . 608.94 |==============================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 557.73 |============================================
NVIDIA RTX 4070 .......... 546.76 |===========================================
NVIDIA RTX 4070 TI ....... 535.39 |==========================================
NVIDIA RTX 3090 .......... 525.12 |=========================================
NVIDIA RTX 4070 TI SUPER . 558.82 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 201.94 |============================================
NVIDIA RTX 4070 .......... 198.18 |===========================================
NVIDIA RTX 4070 TI ....... 201.19 |============================================
NVIDIA RTX 3090 .......... 197.12 |===========================================
NVIDIA RTX 4070 TI SUPER . 200.46 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 509.45 |==========================================
NVIDIA RTX 4070 .......... 458.39 |======================================
NVIDIA RTX 4070 TI ....... 502.92 |==========================================
NVIDIA RTX 3090 .......... 419.76 |===================================
NVIDIA RTX 4070 TI SUPER . 531.96 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 501.50 |=========================================
NVIDIA RTX 4070 .......... 459.94 |======================================
NVIDIA RTX 4070 TI ....... 505.55 |==========================================
NVIDIA RTX 3090 .......... 420.29 |===================================
NVIDIA RTX 4070 TI SUPER . 532.77 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 507.45 |==========================================
NVIDIA RTX 4070 .......... 458.36 |======================================
NVIDIA RTX 4070 TI ....... 505.62 |==========================================
NVIDIA RTX 3090 .......... 419.03 |===================================
NVIDIA RTX 4070 TI SUPER . 527.82 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 195.40 |===========================================
NVIDIA RTX 4070 .......... 187.26 |=========================================
NVIDIA RTX 4070 TI ....... 194.29 |===========================================
NVIDIA RTX 3090 .......... 164.14 |====================================
NVIDIA RTX 4070 TI SUPER . 198.58 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 504.67 |==========================================
NVIDIA RTX 4070 .......... 459.93 |======================================
NVIDIA RTX 3090 .......... 416.89 |===================================
NVIDIA RTX 4070 TI SUPER . 529.14 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 195.39 |===========================================
NVIDIA RTX 4070 .......... 187.69 |==========================================
NVIDIA RTX 4070 TI ....... 198.82 |============================================
NVIDIA RTX 3090 .......... 163.74 |====================================
NVIDIA RTX 4070 TI SUPER . 197.82 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 504.27 |==========================================
NVIDIA RTX 4070 .......... 459.27 |======================================
NVIDIA RTX 4070 TI ....... 504.66 |==========================================
NVIDIA RTX 3090 .......... 416.20 |===================================
NVIDIA RTX 4070 TI SUPER . 529.49 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 196.07 |============================================
NVIDIA RTX 4070 .......... 186.63 |==========================================
NVIDIA RTX 4070 TI ....... 197.02 |============================================
NVIDIA RTX 3090 .......... 164.14 |=====================================
NVIDIA RTX 4070 TI SUPER . 196.50 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 194.58 |===========================================
NVIDIA RTX 4070 .......... 187.27 |=========================================
NVIDIA RTX 4070 TI ....... 195.86 |===========================================
NVIDIA RTX 3090 .......... 161.01 |====================================
NVIDIA RTX 4070 TI SUPER . 198.70 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 195.30 |===========================================
NVIDIA RTX 4070 .......... 187.51 |==========================================
NVIDIA RTX 4070 TI ....... 194.87 |===========================================
NVIDIA RTX 3090 .......... 164.35 |=====================================
NVIDIA RTX 4070 TI SUPER . 198.01 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 106.37 |===========================================
NVIDIA RTX 4070 .......... 107.59 |============================================
NVIDIA RTX 4070 TI ....... 108.59 |============================================
NVIDIA RTX 3090 .......... 105.55 |===========================================
NVIDIA RTX 4070 TI SUPER . 105.86 |===========================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 .......... 103.68 |============================================
NVIDIA RTX 4070 TI ....... 103.45 |============================================
NVIDIA RTX 3090 .......... 98.11 |==========================================
NVIDIA RTX 4070 TI SUPER . 103.66 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 102.60 |============================================
NVIDIA RTX 4070 .......... 102.90 |============================================
NVIDIA RTX 4070 TI ....... 96.50 |=========================================
NVIDIA RTX 3090 .......... 99.05 |==========================================
NVIDIA RTX 4070 TI SUPER . 102.83 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 102.60 |============================================
NVIDIA RTX 4070 .......... 101.55 |===========================================
NVIDIA RTX 4070 TI ....... 103.20 |============================================
NVIDIA RTX 3090 .......... 99.84 |==========================================
NVIDIA RTX 4070 TI SUPER . 103.49 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 103.17 |============================================
NVIDIA RTX 4070 .......... 101.24 |===========================================
NVIDIA RTX 4070 TI ....... 103.24 |============================================
NVIDIA RTX 3090 .......... 99.43 |==========================================
NVIDIA RTX 4070 TI SUPER . 102.83 |============================================
PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 103.57 |============================================
NVIDIA RTX 4070 .......... 101.43 |===========================================
NVIDIA RTX 4070 TI ....... 103.50 |============================================
NVIDIA RTX 3090 .......... 99.25 |==========================================
NVIDIA RTX 4070 TI SUPER . 103.53 |============================================
RealSR-NCNN 20200818
Scale: 4x - TAA: No
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 6.323 |========================================
NVIDIA RTX 4070 .......... 7.092 |=============================================
NVIDIA RTX 4070 TI ....... 5.962 |======================================
NVIDIA RTX 3090 .......... 5.556 |===================================
NVIDIA RTX 4070 TI SUPER . 5.633 |====================================
RealSR-NCNN 20200818
Scale: 4x - TAA: Yes
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 34.89 |=====================================
NVIDIA RTX 4070 .......... 42.85 |=============================================
NVIDIA RTX 4070 TI ....... 33.63 |===================================
NVIDIA RTX 3090 .......... 30.31 |================================
NVIDIA RTX 4070 TI SUPER . 30.72 |================================
Rodinia 3.1
Test: OpenCL Particle Filter
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 3.480 |======================================
NVIDIA RTX 4070 .......... 4.098 |=============================================
NVIDIA RTX 4070 TI ....... 3.291 |====================================
NVIDIA RTX 3090 .......... 3.844 |==========================================
NVIDIA RTX 4070 TI SUPER . 2.973 |=================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 1.35 |=============================================
NVIDIA RTX 4070 .......... 1.36 |=============================================
NVIDIA RTX 4070 TI ....... 1.38 |==============================================
NVIDIA RTX 3090 .......... 1.38 |==============================================
NVIDIA RTX 4070 TI SUPER . 1.32 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 13.92 |==========================================
NVIDIA RTX 4070 .......... 14.04 |===========================================
NVIDIA RTX 4070 TI ....... 14.79 |=============================================
NVIDIA RTX 3090 .......... 14.45 |============================================
NVIDIA RTX 4070 TI SUPER . 12.26 |=====================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 1.48 |=============================================
NVIDIA RTX 4070 .......... 1.50 |==============================================
NVIDIA RTX 4070 TI ....... 1.49 |==============================================
NVIDIA RTX 3090 .......... 1.49 |==============================================
NVIDIA RTX 4070 TI SUPER . 1.45 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 1.50 |==============================================
NVIDIA RTX 4070 .......... 1.50 |==============================================
NVIDIA RTX 4070 TI ....... 1.50 |==============================================
NVIDIA RTX 3090 .......... 1.50 |==============================================
NVIDIA RTX 4070 TI SUPER . 1.46 |=============================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 .......... 1.50 |==============================================
NVIDIA RTX 4070 TI ....... 1.50 |==============================================
NVIDIA RTX 3090 .......... 1.51 |==============================================
NVIDIA RTX 4070 TI SUPER . 1.46 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 31.59 |============================================
NVIDIA RTX 4070 .......... 31.45 |============================================
NVIDIA RTX 4070 TI ....... 31.70 |=============================================
NVIDIA RTX 3090 .......... 31.98 |=============================================
NVIDIA RTX 4070 TI SUPER . 31.10 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 256 - Model: VGG-16
images/sec > Higher Is Better
NVIDIA RTX 4070 TI ....... 1.50 |==============================================
NVIDIA RTX 3090 .......... 1.51 |==============================================
NVIDIA RTX 4070 TI SUPER . 1.47 |=============================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 33.40 |=============================================
NVIDIA RTX 4070 .......... 33.32 |=============================================
NVIDIA RTX 4070 TI ....... 33.29 |=============================================
NVIDIA RTX 3090 .......... 33.53 |=============================================
NVIDIA RTX 4070 TI SUPER . 32.88 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 512 - Model: VGG-16
images/sec > Higher Is Better
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 33.97 |=============================================
NVIDIA RTX 4070 .......... 33.93 |=============================================
NVIDIA RTX 4070 TI ....... 34.06 |=============================================
NVIDIA RTX 3090 .......... 33.93 |=============================================
NVIDIA RTX 4070 TI SUPER . 33.55 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 12.62 |============================================
NVIDIA RTX 4070 .......... 12.78 |=============================================
NVIDIA RTX 4070 TI ....... 12.79 |=============================================
NVIDIA RTX 3090 .......... 12.82 |=============================================
NVIDIA RTX 4070 TI SUPER . 12.24 |===========================================
TensorFlow 2.12
Device: GPU - Batch Size: 1 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 4.35 |==============================================
NVIDIA RTX 4070 .......... 4.34 |==============================================
NVIDIA RTX 4070 TI ....... 4.32 |==============================================
NVIDIA RTX 3090 .......... 4.35 |==============================================
NVIDIA RTX 4070 TI SUPER . 4.14 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 256 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 34.16 |============================================
NVIDIA RTX 4070 TI ....... 34.61 |=============================================
NVIDIA RTX 3090 .......... 34.46 |=============================================
NVIDIA RTX 4070 TI SUPER . 33.95 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 512 - Model: AlexNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 35.10 |============================================
NVIDIA RTX 4070 .......... 35.21 |=============================================
NVIDIA RTX 4070 TI ....... 35.44 |=============================================
NVIDIA RTX 3090 .......... 35.58 |=============================================
NVIDIA RTX 4070 TI SUPER . 35.02 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 15.67 |=============================================
NVIDIA RTX 4070 .......... 15.66 |=============================================
NVIDIA RTX 4070 TI ....... 15.69 |=============================================
NVIDIA RTX 3090 .......... 15.68 |=============================================
NVIDIA RTX 4070 TI SUPER . 15.29 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 16 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 5.46 |==============================================
NVIDIA RTX 4070 .......... 5.49 |==============================================
NVIDIA RTX 4070 TI ....... 5.46 |==============================================
NVIDIA RTX 3090 .......... 5.49 |==============================================
NVIDIA RTX 4070 TI SUPER . 5.32 |=============================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 15.61 |============================================
NVIDIA RTX 4070 .......... 15.63 |============================================
NVIDIA RTX 4070 TI ....... 15.81 |=============================================
NVIDIA RTX 3090 .......... 15.67 |=============================================
NVIDIA RTX 4070 TI SUPER . 15.11 |===========================================
TensorFlow 2.12
Device: GPU - Batch Size: 32 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 5.51 |==============================================
NVIDIA RTX 4070 .......... 5.55 |==============================================
NVIDIA RTX 4070 TI ....... 5.50 |=============================================
NVIDIA RTX 3090 .......... 5.57 |==============================================
NVIDIA RTX 4070 TI SUPER . 5.35 |============================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: GoogLeNet
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 15.52 |=============================================
NVIDIA RTX 4070 .......... 15.54 |=============================================
NVIDIA RTX 4070 TI ....... 15.50 |=============================================
NVIDIA RTX 3090 .......... 15.63 |=============================================
NVIDIA RTX 4070 TI SUPER . 15.00 |===========================================
TensorFlow 2.12
Device: GPU - Batch Size: 64 - Model: ResNet-50
images/sec > Higher Is Better
NVIDIA RTX 4070 SUPER .... 5.55 |==============================================
NVIDIA RTX 4070 .......... 5.55 |==============================================
NVIDIA RTX 4070 TI ....... 5.53 |==============================================
NVIDIA RTX 3090 .......... 5.57 |==============================================
NVIDIA RTX 4070 TI SUPER . 5.33 |============================================
ViennaCL 1.7.1
Test: CPU BLAS - sCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 132 |===============================================
NVIDIA RTX 4070 .......... 131 |===============================================
NVIDIA RTX 4070 TI ....... 132 |===============================================
NVIDIA RTX 3090 .......... 132 |===============================================
NVIDIA RTX 4070 TI SUPER . 107 |======================================
ViennaCL 1.7.1
Test: CPU BLAS - sAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 156 |===============================================
NVIDIA RTX 4070 .......... 153 |==============================================
NVIDIA RTX 4070 TI ....... 156 |===============================================
NVIDIA RTX 3090 .......... 154 |==============================================
NVIDIA RTX 4070 TI SUPER . 120 |====================================
ViennaCL 1.7.1
Test: CPU BLAS - sDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 165.0 |============================================
NVIDIA RTX 4070 .......... 166.0 |============================================
NVIDIA RTX 4070 TI ....... 168.0 |=============================================
NVIDIA RTX 3090 .......... 132.1 |===================================
NVIDIA RTX 4070 TI SUPER . 129.0 |===================================
ViennaCL 1.7.1
Test: CPU BLAS - dCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 70.8 |==============================================
NVIDIA RTX 4070 .......... 71.0 |==============================================
NVIDIA RTX 4070 TI ....... 71.3 |==============================================
NVIDIA RTX 3090 .......... 70.2 |=============================================
NVIDIA RTX 4070 TI SUPER . 52.7 |==================================
ViennaCL 1.7.1
Test: CPU BLAS - dAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 87.2 |==============================================
NVIDIA RTX 4070 .......... 86.8 |==============================================
NVIDIA RTX 4070 TI ....... 87.3 |==============================================
NVIDIA RTX 3090 .......... 86.2 |=============================================
NVIDIA RTX 4070 TI SUPER . 64.3 |==================================
ViennaCL 1.7.1
Test: CPU BLAS - dDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 96.8 |==============================================
NVIDIA RTX 4070 .......... 96.7 |==============================================
NVIDIA RTX 4070 TI ....... 96.4 |==============================================
NVIDIA RTX 3090 .......... 95.2 |=============================================
NVIDIA RTX 4070 TI SUPER . 70.8 |==================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-N
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 102.0 |=============================================
NVIDIA RTX 4070 .......... 103.0 |=============================================
NVIDIA RTX 4070 TI ....... 103.0 |=============================================
NVIDIA RTX 3090 .......... 103.0 |=============================================
NVIDIA RTX 4070 TI SUPER . 78.5 |==================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-T
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 109.0 |=============================================
NVIDIA RTX 4070 .......... 109.0 |=============================================
NVIDIA RTX 4070 TI ....... 102.7 |==========================================
NVIDIA RTX 3090 .......... 110.0 |=============================================
NVIDIA RTX 4070 TI SUPER . 82.6 |==================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 119 |==============================================
NVIDIA RTX 4070 .......... 122 |===============================================
NVIDIA RTX 4070 TI ....... 117 |=============================================
NVIDIA RTX 3090 .......... 113 |============================================
NVIDIA RTX 4070 TI SUPER . 122 |===============================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 117 |=============================================
NVIDIA RTX 4070 .......... 122 |===============================================
NVIDIA RTX 4070 TI ....... 118 |=============================================
NVIDIA RTX 3090 .......... 119 |==============================================
NVIDIA RTX 4070 TI SUPER . 119 |==============================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 115 |===========================================
NVIDIA RTX 4070 .......... 121 |=============================================
NVIDIA RTX 4070 TI ....... 125 |===============================================
NVIDIA RTX 3090 .......... 121 |=============================================
NVIDIA RTX 4070 TI SUPER . 120 |=============================================
ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 122 |==============================================
NVIDIA RTX 4070 .......... 118 |=============================================
NVIDIA RTX 4070 TI ....... 124 |===============================================
NVIDIA RTX 3090 .......... 113 |===========================================
NVIDIA RTX 4070 TI SUPER . 117 |============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 334 |==========================================
NVIDIA RTX 4070 .......... 330 |==========================================
NVIDIA RTX 4070 TI ....... 336 |==========================================
NVIDIA RTX 3090 .......... 363 |==============================================
NVIDIA RTX 4070 TI SUPER . 373 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 392 |=====================================
NVIDIA RTX 4070 .......... 389 |=====================================
NVIDIA RTX 4070 TI ....... 393 |=====================================
NVIDIA RTX 3090 .......... 498 |===============================================
NVIDIA RTX 4070 TI SUPER . 469 |============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - sDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 370 |==========================================
NVIDIA RTX 4070 .......... 362 |=========================================
NVIDIA RTX 4070 TI ....... 365 |==========================================
NVIDIA RTX 3090 .......... 376 |===========================================
NVIDIA RTX 4070 TI SUPER . 410 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dCOPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 423 |=================================
NVIDIA RTX 4070 .......... 423 |=================================
NVIDIA RTX 4070 TI ....... 424 |=================================
NVIDIA RTX 3090 .......... 605 |===============================================
NVIDIA RTX 4070 TI SUPER . 512 |========================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dAXPY
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 437 |============================
NVIDIA RTX 4070 .......... 455 |==============================
NVIDIA RTX 4070 TI ....... 437 |============================
NVIDIA RTX 3090 .......... 724 |===============================================
NVIDIA RTX 4070 TI SUPER . 585 |======================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dDOT
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 458 |=================================
NVIDIA RTX 4070 .......... 456 |=================================
NVIDIA RTX 4070 TI ....... 457 |=================================
NVIDIA RTX 3090 .......... 659 |===============================================
NVIDIA RTX 4070 TI SUPER . 575 |=========================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-N
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 210 |=============================================
NVIDIA RTX 4070 .......... 209 |=============================================
NVIDIA RTX 4070 TI ....... 211 |=============================================
NVIDIA RTX 3090 .......... 187 |========================================
NVIDIA RTX 4070 TI SUPER . 218 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-T
GB/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 389 |===========================================
NVIDIA RTX 4070 .......... 387 |===========================================
NVIDIA RTX 4070 TI ....... 391 |===========================================
NVIDIA RTX 3090 .......... 374 |=========================================
NVIDIA RTX 4070 TI SUPER . 424 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 577 |========================================
NVIDIA RTX 4070 .......... 473 |=================================
NVIDIA RTX 4070 TI ....... 604 |==========================================
NVIDIA RTX 3090 .......... 592 |=========================================
NVIDIA RTX 4070 TI SUPER . 681 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 584 |========================================
NVIDIA RTX 4070 .......... 477 |=================================
NVIDIA RTX 4070 TI ....... 612 |==========================================
NVIDIA RTX 3090 .......... 595 |=========================================
NVIDIA RTX 4070 TI SUPER . 689 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 599 |=======================================
NVIDIA RTX 4070 .......... 494 |=================================
NVIDIA RTX 4070 TI ....... 634 |==========================================
NVIDIA RTX 3090 .......... 594 |=======================================
NVIDIA RTX 4070 TI SUPER . 714 |===============================================
ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
NVIDIA RTX 4070 SUPER .... 613 |=======================================
NVIDIA RTX 4070 .......... 502 |================================
NVIDIA RTX 4070 TI ....... 648 |==========================================
NVIDIA RTX 3090 .......... 593 |======================================
NVIDIA RTX 4070 TI SUPER . 731 |===============================================
VkFFT 1.2.31
Test: FFT + iFFT R2C / C2R
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 54794 |==========================================
NVIDIA RTX 4070 .......... 47097 |====================================
NVIDIA RTX 4070 TI ....... 55446 |==========================================
NVIDIA RTX 3090 .......... 48418 |=====================================
NVIDIA RTX 4070 TI SUPER . 59378 |=============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in half precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 131705 |=====================
NVIDIA RTX 4070 .......... 137762 |======================
NVIDIA RTX 4070 TI ....... 136210 |======================
NVIDIA RTX 3090 .......... 273221 |============================================
NVIDIA RTX 4070 TI SUPER . 143992 |=======================
VkFFT 1.2.31
Test: FFT + iFFT C2C Bluestein in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 15166 |==========================================
NVIDIA RTX 4070 .......... 13714 |======================================
NVIDIA RTX 4070 TI ....... 15125 |==========================================
NVIDIA RTX 3090 .......... 14205 |========================================
NVIDIA RTX 4070 TI SUPER . 16141 |=============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in double precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 24317 |===================================
NVIDIA RTX 4070 .......... 22390 |=================================
NVIDIA RTX 4070 TI ....... 25431 |=====================================
NVIDIA RTX 3090 .......... 30912 |=============================================
NVIDIA RTX 4070 TI SUPER . 27947 |=========================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 73929 |=======================
NVIDIA RTX 4070 .......... 77774 |========================
NVIDIA RTX 4070 TI ....... 73942 |=======================
NVIDIA RTX 3090 .......... 141876 |============================================
NVIDIA RTX 4070 TI SUPER . 104003 |================================
VkFFT 1.2.31
Test: FFT + iFFT C2C multidimensional in single precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 50299 |======================================
NVIDIA RTX 4070 .......... 47212 |====================================
NVIDIA RTX 4070 TI ....... 51528 |=======================================
NVIDIA RTX 3090 .......... 50856 |======================================
NVIDIA RTX 4070 TI SUPER . 59790 |=============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C Bluestein benchmark in double precision
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 4451 |=========================================
NVIDIA RTX 4070 .......... 3886 |===================================
NVIDIA RTX 4070 TI ....... 4647 |==========================================
NVIDIA RTX 3090 .......... 4195 |======================================
NVIDIA RTX 4070 TI SUPER . 5047 |==============================================
VkFFT 1.2.31
Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
Benchmark Score > Higher Is Better
NVIDIA RTX 4070 SUPER .... 75078 |=======================
NVIDIA RTX 4070 .......... 79057 |========================
NVIDIA RTX 4070 TI ....... 75141 |=======================
NVIDIA RTX 3090 .......... 144311 |============================================
NVIDIA RTX 4070 TI SUPER . 105549 |================================
vkpeak 20230730
GFLOPS > Higher Is Better
vkpeak 20230730
fp32-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 20263.13 |====================================
NVIDIA RTX 3090 .......... 20319.80 |====================================
NVIDIA RTX 3090 .......... 20317.63 |====================================
NVIDIA RTX 3090 .......... 20353.95 |====================================
NVIDIA RTX 4070 TI SUPER . 23883.53 |==========================================
NVIDIA RTX 4070 TI SUPER . 23920.67 |==========================================
vkpeak 20230730
fp32-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 26563.72 |===================================
NVIDIA RTX 3090 .......... 26630.59 |===================================
NVIDIA RTX 3090 .......... 26767.21 |====================================
NVIDIA RTX 3090 .......... 26699.66 |===================================
NVIDIA RTX 4070 TI SUPER . 31591.71 |==========================================
NVIDIA RTX 4070 TI SUPER . 31635.47 |==========================================
vkpeak 20230730
fp16-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 20080.47 |===================================
NVIDIA RTX 3090 .......... 20113.01 |===================================
NVIDIA RTX 3090 .......... 20134.06 |===================================
NVIDIA RTX 3090 .......... 20151.44 |===================================
NVIDIA RTX 4070 TI SUPER . 23825.05 |==========================================
NVIDIA RTX 4070 TI SUPER . 23894.70 |==========================================
vkpeak 20230730
fp16-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 39771.97 |===================================
NVIDIA RTX 3090 .......... 39835.21 |===================================
NVIDIA RTX 3090 .......... 39746.91 |===================================
NVIDIA RTX 3090 .......... 39860.80 |===================================
NVIDIA RTX 4070 TI SUPER . 47192.56 |==========================================
NVIDIA RTX 4070 TI SUPER . 47340.52 |==========================================
vkpeak 20230730
fp64-scalar
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 638.70 |=====================================
NVIDIA RTX 3090 .......... 638.75 |=====================================
NVIDIA RTX 3090 .......... 638.77 |=====================================
NVIDIA RTX 3090 .......... 638.84 |=====================================
NVIDIA RTX 4070 TI SUPER . 750.47 |============================================
NVIDIA RTX 4070 TI SUPER . 750.49 |============================================
vkpeak 20230730
fp64-vec4
GFLOPS > Higher Is Better
NVIDIA RTX 3090 .......... 638.72 |=====================================
NVIDIA RTX 3090 .......... 638.77 |=====================================
NVIDIA RTX 3090 .......... 639.52 |=====================================
NVIDIA RTX 3090 .......... 638.74 |=====================================
NVIDIA RTX 4070 TI SUPER . 749.76 |============================================
NVIDIA RTX 4070 TI SUPER . 750.68 |============================================
vkpeak 20230730
int32-scalar
GIOPS > Higher Is Better
NVIDIA RTX 3090 .......... 20280.33 |====================================
NVIDIA RTX 3090 .......... 20290.30 |====================================
NVIDIA RTX 3090 .......... 20315.10 |====================================
NVIDIA RTX 3090 .......... 20295.27 |====================================
NVIDIA RTX 4070 TI SUPER . 23874.85 |==========================================
NVIDIA RTX 4070 TI SUPER . 23888.02 |==========================================
vkpeak 20230730
int32-vec4
GIOPS > Higher Is Better
NVIDIA RTX 3090 .......... 19996.92 |===================================
NVIDIA RTX 3090 .......... 20017.06 |===================================
NVIDIA RTX 3090 .......... 20005.52 |===================================
NVIDIA RTX 3090 .......... 20009.73 |===================================
NVIDIA RTX 4070 TI SUPER . 23733.30 |==========================================
NVIDIA RTX 4070 TI SUPER . 23768.27 |==========================================
vkpeak 20230730
int16-scalar
GIOPS > Higher Is Better
NVIDIA RTX 3090 .......... 13259.97 |===================================
NVIDIA RTX 3090 .......... 13273.53 |===================================
NVIDIA RTX 3090 .......... 13225.17 |===================================
NVIDIA RTX 3090 .......... 13264.91 |===================================
NVIDIA RTX 4070 TI SUPER . 15859.37 |==========================================
NVIDIA RTX 4070 TI SUPER . 15901.32 |==========================================
vkpeak 20230730
int16-vec4
GIOPS > Higher Is Better
NVIDIA RTX 3090 .......... 16331.16 |================================
NVIDIA RTX 3090 .......... 16338.23 |================================
NVIDIA RTX 3090 .......... 16302.58 |================================
NVIDIA RTX 3090 .......... 16329.72 |================================
NVIDIA RTX 4070 TI SUPER . 21124.09 |==========================================
NVIDIA RTX 4070 TI SUPER . 21156.99 |==========================================
VkResample 1.0
Upscale: 2x - Precision: Double
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 339.59 |====================================
NVIDIA RTX 4070 .......... 415.16 |============================================
NVIDIA RTX 4070 TI ....... 322.06 |==================================
NVIDIA RTX 3090 .......... 333.64 |===================================
NVIDIA RTX 4070 TI SUPER . 285.99 |==============================
VkResample 1.0
Upscale: 2x - Precision: Single
ms < Lower Is Better
NVIDIA RTX 4070 SUPER .... 18.49 |=============================================
NVIDIA RTX 4070 .......... 18.02 |============================================
NVIDIA RTX 4070 TI ....... 18.46 |=============================================
NVIDIA RTX 3090 .......... 10.32 |=========================
NVIDIA RTX 4070 TI SUPER . 13.36 |=================================
Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: No
Seconds < Lower Is Better
Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: Yes
Seconds < Lower Is Better
NVIDIA RTX 4070 SUPER .... 2.855 |========================================
NVIDIA RTX 4070 .......... 3.168 |=============================================
NVIDIA RTX 4070 TI ....... 2.854 |========================================
NVIDIA RTX 3090 .......... 3.202 |=============================================
NVIDIA RTX 4070 TI SUPER . 2.660 |=====================================