bbbbbb

AMD Ryzen 7 3700X 8-Core testing with an ASUS TUF GAMING X570-PLUS (WI-FI) (5003 BIOS) and Gigabyte NVIDIA GeForce GTX 1060 6GB on RockyLinux 9.4 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408054-NE-BBBBBB06306&grt .
bbbbbb: system details
System: Gigabyte NVIDIA GeForce GTX 1060

  Processor: AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads)
  Motherboard: ASUS TUF GAMING X570-PLUS (WI-FI) (5003 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 32GB
  Disk: 1000GB Samsung SSD 970 EVO 1TB + 960GB ADATA SU650 + 4001GB Seagate ST4000DM000-1F21 + 240GB INTEL SSDSC2BW24 + 16GB USB DISK 3.0
  Graphics: Gigabyte NVIDIA GeForce GTX 1060 6GB
  Audio: NVIDIA GP106 HD Audio
  Network: Realtek RTL8111/8168/8211/8411 + Intel Wi-Fi 5
  OS: RockyLinux 9.4
  Kernel: 6.1.102-1.el9.elrepo.x86_64 (x86_64)
  Desktop: KDE Plasma 5.27.11
  Display Server: X Server 1.20.11
  Display Driver: NVIDIA 560.28.03
  OpenGL: 4.6.0
  OpenCL: OpenCL 3.0 CUDA 12.6.32
  Compiler: GCC 11.4.1 20231218
  File-System: xfs
  Screen Resolution: 1440x880

- Transparent Huge Pages: always
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0x8701030
- BAR1 / Visible vRAM Size: 256 MiB
- vBIOS Version: 86.06.45.00.9e
- GPU Compute Cores: 1280
- Python 3.9.18
- SELinux
  + gather_data_sampling: Not affected
  + itlb_multihit: Not affected
  + l1tf: Not affected
  + mds: Not affected
  + meltdown: Not affected
  + mmio_stale_data: Not affected
  + reg_file_data_sampling: Not affected
  + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection
  + spec_rstack_overflow: Mitigation of safe RET
  + spec_store_bypass: Mitigation of SSB disabled via prctl
  + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
  + srbds: Not affected
  + tsx_async_abort: Not affected
bbbbbb: results overview
System: Gigabyte NVIDIA GeForce GTX 1060

  blender: BMW27 - NVIDIA OptiX = 89.63
  blender: Junkshop - NVIDIA OptiX = 128.87
  blender: Classroom - NVIDIA OptiX = 184.75
  blender: Fishy Cat - NVIDIA OptiX = 176.99
  blender: Barbershop - NVIDIA OptiX = 745.28
  blender: Pabellon Barcelona - NVIDIA OptiX = 450.97
  clpeak: Integer Compute INT = 1569.94
  clpeak: Single-Precision Float = 4561.78
  clpeak: Double-Precision Double = 152.87
  clpeak: Global Memory Bandwidth = 145.38
  fahbench: = 108.4919
  hashcat: MD5 = 12914566667
  hashcat: SHA1 = 4539333333
  hashcat: 7-Zip = 188967
  hashcat: SHA-512 = 480733333
  hashcat: TrueCrypt RIPEMD160 + XTS = 135633
  indigobench: OpenCL GPU - Bedroom = 3.445
  indigobench: OpenCL GPU - Supercar = 9.827
  luxcorerender: DLSC - GPU = 1.14
  luxcorerender: Danish Mood - GPU = 0.73
  luxcorerender: Orange Juice - GPU = 1.41
  luxcorerender: LuxCore Benchmark - GPU = 0.89
  luxcorerender: Rainbow Colors and Prism - GPU = 5.96
  namd-cuda: ATPase Simulation - 327,506 Atoms = 0.39580
  ncnn: Vulkan GPU - mobilenet = 23.66
  ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 = 7.36
  ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 = 6.10
  ncnn: Vulkan GPU - shufflenet-v2 = 6.67
  ncnn: Vulkan GPU - mnasnet = 6.12
  ncnn: Vulkan GPU - efficientnet-b0 = 11.64
  ncnn: Vulkan GPU - blazeface = 1.99
  ncnn: Vulkan GPU - googlenet = 19.84
  ncnn: Vulkan GPU - vgg16 = 67.71
  ncnn: Vulkan GPU - resnet18 = 13.30
  ncnn: Vulkan GPU - alexnet = 12.53
  ncnn: Vulkan GPU - resnet50 = 30.26
  ncnn: Vulkan GPU - yolov4-tiny = 35.24
  ncnn: Vulkan GPU - squeezenet_ssd = 17.00
  ncnn: Vulkan GPU - regnety_400m = 16.14
  ncnn: Vulkan GPU - vision_transformer = 145.30
  ncnn: Vulkan GPU - FastestDet = 6.97
  neatbench: GPU = 1060
  octanebench: Total Score = 82.389071
  realsr-ncnn: 4x - No = 21.282
  realsr-ncnn: 4x - Yes = 142.854
  viennacl: CPU BLAS - sCOPY = 18.2
  viennacl: CPU BLAS - sAXPY = 27.0
  viennacl: CPU BLAS - sDOT = 25.5
  viennacl: CPU BLAS - dCOPY = 18.9
  viennacl: CPU BLAS - dAXPY = 29.0
  viennacl: CPU BLAS - dDOT = 26.9
  viennacl: CPU BLAS - dGEMV-N = 28.8
  viennacl: CPU BLAS - dGEMV-T = 28.6
  viennacl: CPU BLAS - dGEMM-NN = 38.8
  viennacl: CPU BLAS - dGEMM-NT = 37.0
  viennacl: CPU BLAS - dGEMM-TN = 42.0
  viennacl: CPU BLAS - dGEMM-TT = 39.7
  vkfft: FFT + iFFT R2C / C2R = 14157
  vkfft: FFT + iFFT C2C 1D batched in half precision = 40331
  vkfft: FFT + iFFT C2C Bluestein in single precision = 4430
  vkfft: FFT + iFFT C2C 1D batched in double precision = 7166
  vkfft: FFT + iFFT C2C 1D batched in single precision = 24645
  vkfft: FFT + iFFT C2C multidimensional in single precision = 14111
  vkfft: FFT + iFFT C2C Bluestein benchmark in double precision = 1469
  vkfft: FFT + iFFT C2C 1D batched in single precision, no reshuffling = 25624
  vkpeak: fp32-scalar = 4810.60
  vkpeak: fp32-vec4 = 4640.91
  vkpeak: fp64-scalar = 154.49
  vkpeak: fp64-vec4 = 154.51
  vkpeak: int32-scalar = 1620.74
  vkpeak: int32-vec4 = 1496.67
  vkresample: 2x - Double = 500.001
  vkresample: 2x - Single = 60.612
  waifu2x-ncnn: 2x - 3 - No = 2.390
  waifu2x-ncnn: 2x - 3 - Yes = 9.477
Blender 4.2 (Seconds, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Blend File: BMW27 - Compute: NVIDIA OptiX: 89.63 (SE +/- 0.48, N = 3)
  Blend File: Junkshop - Compute: NVIDIA OptiX: 128.87 (SE +/- 0.60, N = 3)
  Blend File: Classroom - Compute: NVIDIA OptiX: 184.75 (SE +/- 0.05, N = 3)
  Blend File: Fishy Cat - Compute: NVIDIA OptiX: 176.99 (SE +/- 0.02, N = 3)
  Blend File: Barbershop - Compute: NVIDIA OptiX: 745.28 (SE +/- 0.23, N = 3)
  Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX: 450.97 (SE +/- 0.05, N = 3)
clpeak 1.1.2 (More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  OpenCL Test: Integer Compute INT: 1569.94 GIOPS (SE +/- 22.93, N = 15)
  OpenCL Test: Single-Precision Float: 4561.78 GFLOPS (SE +/- 71.27, N = 12)
  OpenCL Test: Double-Precision Double: 152.87 GFLOPS (SE +/- 1.68, N = 3)
  OpenCL Test: Global Memory Bandwidth: 145.38 GBPS (SE +/- 0.53, N = 3)
  1. (CXX) g++ options: -O3 -lpthread -ldl
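The SE figures are easier to compare across tests with different units when expressed relative to the mean. The following is a minimal sketch, not part of the exported result: the helper name `relative_standard_error` and the dictionary layout are assumptions made for illustration, and the (mean, SE) pairs are copied from the clpeak results above.

```python
def relative_standard_error(mean: float, standard_error: float) -> float:
    """Return the standard error as a percentage of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return abs(standard_error / mean) * 100.0


# (mean, SE) pairs copied from the clpeak results; the names are the test labels.
clpeak_results = {
    "Integer Compute INT": (1569.94, 22.93),
    "Single-Precision Float": (4561.78, 71.27),
    "Double-Precision Double": (152.87, 1.68),
    "Global Memory Bandwidth": (145.38, 0.53),
}

# Hypothetical summary: relative standard error per test, in percent.
rse = {
    name: relative_standard_error(mean, se)
    for name, (mean, se) in clpeak_results.items()
}

for name, pct in rse.items():
    print(f"{name}: {pct:.2f}% relative SE")
```

The two tests with the largest relative SE in this sketch are also the ones reported with N = 15 and N = 12 rather than N = 3.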
FAHBench 2.3.2 (Ns Per Day, More Is Better), Gigabyte NVIDIA GeForce GTX 1060: 108.49 (SE +/- 0.05, N = 3)
Hashcat 6.2.4 (H/s, More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Benchmark: MD5: 12914566667 (SE +/- 491030.66, N = 3)
  Benchmark: SHA1: 4539333333 (SE +/- 2188099.12, N = 3)
  Benchmark: 7-Zip: 188967 (SE +/- 233.33, N = 3)
  Benchmark: SHA-512: 480733333 (SE +/- 1683580.84, N = 3)
  Benchmark: TrueCrypt RIPEMD160 + XTS: 135633 (SE +/- 1685.56, N = 3)
IndigoBench 4.4 (M samples/s, More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Acceleration: OpenCL GPU - Scene: Bedroom: 3.445 (SE +/- 0.000, N = 3)
  Acceleration: OpenCL GPU - Scene: Supercar: 9.827 (SE +/- 0.002, N = 3)
LuxCoreRender 2.6 (M samples/sec, More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Scene: DLSC - Acceleration: GPU: 1.14 (SE +/- 0.00, N = 3; MIN: 0.92 / MAX: 1.18)
  Scene: Danish Mood - Acceleration: GPU: 0.73 (SE +/- 0.01, N = 5; MIN: 0.16 / MAX: 0.97)
  Scene: Orange Juice - Acceleration: GPU: 1.41 (SE +/- 0.00, N = 3; MIN: 0.17 / MAX: 1.63)
  Scene: LuxCore Benchmark - Acceleration: GPU: 0.89 (SE +/- 0.00, N = 3; MIN: 0.18 / MAX: 1.15)
  Scene: Rainbow Colors and Prism - Acceleration: GPU: 5.96 (SE +/- 0.02, N = 3; MIN: 4.91 / MAX: 6.38)
NAMD CUDA 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060: 0.39580 (SE +/- 0.00141, N = 3)
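NAMD CUDA reports days/ns (fewer is better), whereas FAHBench in the same result file reports ns per day (more is better). As an illustrative sketch only, the reciprocal below converts the NAMD figure into ns/day; the function name `days_per_ns_to_ns_per_day` is hypothetical. The two benchmarks simulate different systems, so the converted value shares a unit with FAHBench's but is not a like-for-like comparison.

```python
def days_per_ns_to_ns_per_day(days_per_ns: float) -> float:
    """Convert a days-per-nanosecond figure into nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days_per_ns must be positive")
    return 1.0 / days_per_ns


# Value copied from the NAMD CUDA ATPase result.
namd_days_per_ns = 0.39580
ns_per_day = days_per_ns_to_ns_per_day(namd_days_per_ns)

print(f"NAMD CUDA ATPase: {ns_per_day:.3f} ns/day")
```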
NCNN 20230517 (ms, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Target: Vulkan GPU - Model: mobilenet: 23.66 (SE +/- 0.30, N = 3; MIN: 13.78 / MAX: 209.63)
  Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2: 7.36 (SE +/- 0.10, N = 3; MIN: 4.01 / MAX: 194.12)
  Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3: 6.10 (SE +/- 0.24, N = 3; MIN: 3.12 / MAX: 185.85)
  Target: Vulkan GPU - Model: shufflenet-v2: 6.67 (SE +/- 0.27, N = 3; MIN: 3.67 / MAX: 174.47)
  Target: Vulkan GPU - Model: mnasnet: 6.12 (SE +/- 0.13, N = 3; MIN: 3.37 / MAX: 174.56)
  Target: Vulkan GPU - Model: efficientnet-b0: 11.64 (SE +/- 0.36, N = 3; MIN: 6.34 / MAX: 212.21)
  Target: Vulkan GPU - Model: blazeface: 1.99 (SE +/- 0.01, N = 3; MIN: 1.2 / MAX: 164)
  Target: Vulkan GPU - Model: googlenet: 19.84 (SE +/- 0.26, N = 3; MIN: 11.33 / MAX: 206.57)
  Target: Vulkan GPU - Model: vgg16: 67.71 (SE +/- 0.33, N = 3; MIN: 43.02 / MAX: 321.9)
  Target: Vulkan GPU - Model: resnet18: 13.30 (SE +/- 0.18, N = 3; MIN: 7.78 / MAX: 179.29)
  Target: Vulkan GPU - Model: alexnet: 12.53 (SE +/- 0.24, N = 3; MIN: 7.3 / MAX: 212.62)
  Target: Vulkan GPU - Model: resnet50: 30.26 (SE +/- 0.16, N = 3; MIN: 18.08 / MAX: 201.39)
  Target: Vulkan GPU - Model: yolov4-tiny: 35.24 (SE +/- 0.23, N = 3; MIN: 21.65 / MAX: 217.94)
  Target: Vulkan GPU - Model: squeezenet_ssd: 17.00 (SE +/- 0.18, N = 3; MIN: 9.81 / MAX: 227.63)
  Target: Vulkan GPU - Model: regnety_400m: 16.14 (SE +/- 0.29, N = 3; MIN: 8.93 / MAX: 203.78)
  Target: Vulkan GPU - Model: vision_transformer: 145.30 (SE +/- 0.29, N = 3; MIN: 92.75 / MAX: 356.44)
  Target: Vulkan GPU - Model: FastestDet: 6.97 (SE +/- 0.13, N = 3; MIN: 3.75 / MAX: 176.61)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NeatBench 5, Acceleration: GPU (FPS, More Is Better), Gigabyte NVIDIA GeForce GTX 1060: 1060 (SE +/- 0.00, N = 3)
OctaneBench 2020.1, Total Score (Score, More Is Better), Gigabyte NVIDIA GeForce GTX 1060: 82.39
RealSR-NCNN 20200818 (Seconds, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Scale: 4x - TAA: No: 21.28 (SE +/- 0.02, N = 3)
  Scale: 4x - TAA: Yes: 142.85 (SE +/- 0.11, N = 3)
ViennaCL 1.7.1 (More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Test: CPU BLAS - sCOPY: 18.2 GB/s (SE +/- 0.23, N = 15)
  Test: CPU BLAS - sAXPY: 27.0 GB/s (SE +/- 0.28, N = 15)
  Test: CPU BLAS - sDOT: 25.5 GB/s (SE +/- 0.29, N = 15)
  Test: CPU BLAS - dCOPY: 18.9 GB/s (SE +/- 0.24, N = 15)
  Test: CPU BLAS - dAXPY: 29.0 GB/s (SE +/- 0.33, N = 15)
  Test: CPU BLAS - dDOT: 26.9 GB/s (SE +/- 0.43, N = 15)
  Test: CPU BLAS - dGEMV-N: 28.8 GB/s (SE +/- 0.24, N = 15)
  Test: CPU BLAS - dGEMV-T: 28.6 GB/s (SE +/- 0.39, N = 15)
  Test: CPU BLAS - dGEMM-NN: 38.8 GFLOPs/s (SE +/- 0.18, N = 15)
  Test: CPU BLAS - dGEMM-NT: 37.0 GFLOPs/s (SE +/- 0.28, N = 15)
  Test: CPU BLAS - dGEMM-TN: 42.0 GFLOPs/s (SE +/- 0.32, N = 15)
  Test: CPU BLAS - dGEMM-TT: 39.7 GFLOPs/s (SE +/- 0.22, N = 15)
  1. (CXX) g++ options: -fopenmp -O3 -rdynamic
VkFFT 1.3.4 (Benchmark Score, More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Test: FFT + iFFT R2C / C2R: 14157 (SE +/- 9.54, N = 3)
  Test: FFT + iFFT C2C 1D batched in half precision: 40331 (SE +/- 12.42, N = 3)
  Test: FFT + iFFT C2C Bluestein in single precision: 4430 (SE +/- 5.84, N = 3)
  Test: FFT + iFFT C2C 1D batched in double precision: 7166 (SE +/- 15.60, N = 3)
  Test: FFT + iFFT C2C 1D batched in single precision: 24645 (SE +/- 2.85, N = 3)
  Test: FFT + iFFT C2C multidimensional in single precision: 14111 (SE +/- 22.45, N = 3)
  Test: FFT + iFFT C2C Bluestein benchmark in double precision: 1469 (SE +/- 1.86, N = 3)
  Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling: 25624 (SE +/- 5.57, N = 3)
  1. (CXX) g++ options: -O3
vkpeak 20230730 (More Is Better), Gigabyte NVIDIA GeForce GTX 1060
  fp32-scalar: 4810.60 GFLOPS (SE +/- 20.31, N = 3)
  fp32-vec4: 4640.91 GFLOPS (SE +/- 13.20, N = 3)
  fp64-scalar: 154.49 GFLOPS (SE +/- 0.48, N = 3)
  fp64-vec4: 154.51 GFLOPS (SE +/- 0.46, N = 3)
  int32-scalar: 1620.74 GIOPS (SE +/- 4.21, N = 3)
  int32-vec4: 1496.67 GIOPS (SE +/- 1.46, N = 3)
VkResample 1.0 (ms, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Upscale: 2x - Precision: Double: 500.00 (SE +/- 0.00, N = 3)
  Upscale: 2x - Precision: Single: 60.61 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options: -O3
Waifu2x-NCNN Vulkan 20200818 (Seconds, Fewer Is Better), Gigabyte NVIDIA GeForce GTX 1060
  Scale: 2x - Denoise: 3 - TAA: No: 2.390 (SE +/- 0.019, N = 15)
  Scale: 2x - Denoise: 3 - TAA: Yes: 9.477 (SE +/- 0.003, N = 3)
Phoronix Test Suite v10.8.5