5950X-RTX4080 AMD Ryzen 9 5950X 16-Core testing with a ASUS TUF GAMING X570-PLUS (WI-FI) (5003 BIOS) and MSI NVIDIA GeForce RTX 4080 16GB on Gentoo 2.14 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2312043-NE-5950XRTX448&grw .
5950X-RTX4080 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX MSI NVIDIA GeForce RTX 4080 AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS TUF GAMING X570-PLUS (WI-FI) (4403 BIOS) AMD Starship/Matisse 64GB 2000GB Samsung SSD 990 PRO with Heatsink 2TB + 1000GB Samsung SSD 990 PRO 1TB + 8002GB Elements 25A3 + 2000GB EZRZ-00Z5HB0 MSI NVIDIA GeForce RTX 4080 16GB Creative CA0110-IBG XG320U Realtek RTL8111/8168/8411 + Intel-AC 9260 Gentoo 2.14 6.5.0-gentoo-x86_64 (x86_64) KDE Plasma 5.27.7 X Server 1.21.1.8 NVIDIA 535.104.05 GCC 12.3.1 20230526 + Clang 16.0.6 + LLVM 16.0.6 xfs 1920x1080 AMD Ryzen 9 5950X 16-Core @ 4.00GHz (16 Cores / 32 Threads) ASUS TUF GAMING X570-PLUS (WI-FI) (5003 BIOS) 2000GB Samsung SSD 990 PRO with Heatsink 2TB + 1000GB Samsung SSD 990 PRO 1TB + 2000GB EZRZ-00Z5HB0 + 8002GB Elements 25A3 6.6.0-gentoo-x86_64 (x86_64) X Server 1.21.1.9 NVIDIA GCC 13.2.1 20230826 + Clang 16.0.6 + LLVM 16.0.6 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/12 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/12 --disable-cet --disable-fixed-point --disable-libada --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-libvtv --disable-systemtap --disable-valgrind-annotations --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/12/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/12/man --with-gcc-major-version-only --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/12/python --without-isl --without-zstd - MSI NVIDIA GeForce RTX 4080: --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/13 --build=x86_64-pc-linux-gnu --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/13 --disable-cet --disable-dependency-tracking --disable-fixed-point --disable-libada --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-libvtv --disable-silent-rules --disable-systemtap --disable-valgrind-annotations --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-languages=c,c++,fortran --enable-libgomp --enable-libstdcxx-time --enable-lto --enable-multilib --enable-nls --enable-obsolete --enable-secureplt --enable-shared --enable-targets=all --enable-threads=posix --host=x86_64-pc-linux-gnu --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/13/include --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/13/man --with-gcc-major-version-only --with-multilib-list=m32,m64 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/13/python --without-isl --without-zstd Processor Details - AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016 - MSI NVIDIA GeForce RTX 4080: Scaling Governor: acpi-cpufreq schedutil (Boost: Disabled) - CPU Microcode: 0xa20102b Security Details - AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - MSI NVIDIA GeForce RTX 4080: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected Graphics Details - MSI NVIDIA GeForce RTX 4080: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 95.03.2b.00.63 Python Details - MSI NVIDIA GeForce RTX 4080: Python 3.11.6
5950X-RTX4080 stress-ng: Hash stress-ng: MMAP stress-ng: NUMA stress-ng: Pipe stress-ng: Poll stress-ng: Zlib stress-ng: Futex stress-ng: MEMFD stress-ng: Mutex stress-ng: Atomic stress-ng: Crypto stress-ng: Malloc stress-ng: Cloning stress-ng: Forking stress-ng: Pthread stress-ng: AVL Tree stress-ng: IO_uring stress-ng: SENDFILE stress-ng: CPU Cache stress-ng: CPU Stress stress-ng: Semaphores stress-ng: Matrix Math stress-ng: Vector Math stress-ng: AVX-512 VNNI stress-ng: Function Call stress-ng: x86_64 RdRand stress-ng: Floating Point stress-ng: Matrix 3D Math stress-ng: Memory Copying stress-ng: Vector Shuffle stress-ng: Mixed Scheduler stress-ng: Socket Activity stress-ng: Wide Vector Math stress-ng: Context Switching stress-ng: Fused Multiply-Add stress-ng: Vector Floating Point stress-ng: Glibc C String Functions stress-ng: Glibc Qsort Data Sorting stress-ng: System V Message Passing shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - FastestDet rodinia: OpenCL Particle Filter blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Barbershop - NVIDIA OptiX blender: Pabellon Barcelona - NVIDIA OptiX neatbench: GPU luxcorerender: DLSC - GPU luxcorerender: Danish Mood - GPU luxcorerender: Orange Juice - GPU luxcorerender: LuxCore Benchmark - GPU luxcorerender: Rainbow Colors and Prism - GPU fahbench: hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS financebench: Black-Scholes OpenCL cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX MSI NVIDIA GeForce RTX 4080 3885253.64 278.33 557.82 11206671.23 1797689.35 1627.76 3407094.81 437.58 5460828.05 199.10 45508.84 11369932.39 1473.33 32626.21 129241.53 144.92 10546.14 354422.22 2267861.96 44692.23 22330215.55 74055.33 102091.22 797839.84 13075.24 5964185.24 6009.83 1513.65 5974.68 11961.21 11422.92 4842.26 516608.09 4339638.47 41301304.64 39790.83 8933007.81 414.14 13352148.72 426.463 20.1727 1836.99 58.6058 608.240 16670.9 53352.7 26.2787 27.1157 2930.42 11.98 4.54 3.90 4.51 4.04 5.75 1.72 12.24 45.47 8.11 7.11 18.23 21.19 10.43 10.30 58.03 5.06 2.818 4.98 10.53 8.30 43.79 11.71 4080 15.4 12.34 12.30 13.77 32.80 394.1096 93006300000 29664566667 1667533 3786333333 1127767 4.490 377.2 626.5 592.8 24381.19 47636.58 836.12 613.69 80.7 121 122.3 29.8 44.6 49.5 58.2 65.3 83.6 82.7 90.1 87.7 378 482 405 536 607 588 219 425 743 765 799 817 OpenBenchmarking.org
Stress-NG Test: Hash OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Hash AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 800K 1600K 2400K 3200K 4000K SE +/- 17636.63, N = 3 3885253.64 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: MMAP OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MMAP AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 60 120 180 240 300 SE +/- 0.32, N = 3 278.33 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: NUMA OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: NUMA AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 120 240 360 480 600 SE +/- 2.25, N = 3 557.82 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Pipe OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pipe AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 2M 4M 6M 8M 10M SE +/- 49512.35, N = 3 11206671.23 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Poll OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Poll AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 400K 800K 1200K 1600K 2000K SE +/- 3642.08, N = 3 1797689.35 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Zlib OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Zlib AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 300 600 900 1200 1500 SE +/- 0.56, N = 3 1627.76 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Futex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Futex AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 700K 1400K 2100K 2800K 3500K SE +/- 36058.76, N = 3 3407094.81 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: MEMFD OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MEMFD AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 90 180 270 360 450 SE +/- 6.90, N = 15 437.58 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Mutex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mutex AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 1.2M 2.4M 3.6M 4.8M 6M SE +/- 7438.59, N = 3 5460828.05 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Atomic OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Atomic AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 40 80 120 160 200 SE +/- 0.15, N = 3 199.10 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Crypto OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Crypto AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 10K 20K 30K 40K 50K SE +/- 121.22, N = 3 45508.84 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Malloc OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Malloc AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 2M 4M 6M 8M 10M SE +/- 53298.36, N = 3 11369932.39 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Cloning OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Cloning AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 300 600 900 1200 1500 SE +/- 1.54, N = 3 1473.33 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Forking OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Forking AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 7K 14K 21K 28K 35K SE +/- 206.37, N = 3 32626.21 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Pthread OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pthread AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 30K 60K 90K 120K 150K SE +/- 481.27, N = 3 129241.53 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: AVL Tree OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVL Tree AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 30 60 90 120 150 SE +/- 0.08, N = 3 144.92 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: IO_uring OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: IO_uring AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 2K 4K 6K 8K 10K SE +/- 152.02, N = 15 10546.14 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: SENDFILE AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 80K 160K 240K 320K 400K SE +/- 1075.08, N = 3 354422.22 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: CPU Cache OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Cache AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 500K 1000K 1500K 2000K 2500K SE +/- 22280.96, N = 3 2267861.96 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: CPU Stress OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Stress AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 10K 20K 30K 40K 50K SE +/- 69.04, N = 3 44692.23 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Semaphores AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 5M 10M 15M 20M 25M SE +/- 261796.09, N = 4 22330215.55 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Matrix Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix Math AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 16K 32K 48K 64K 80K SE +/- 316.60, N = 3 74055.33 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Math AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 20K 40K 60K 80K 100K SE +/- 54.29, N = 3 102091.22 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: AVX-512 VNNI OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVX-512 VNNI AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 200K 400K 600K 800K 1000K SE +/- 562.17, N = 3 797839.84 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Function Call OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Function Call AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 3K 6K 9K 12K 15K SE +/- 16.93, N = 3 13075.24 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: x86_64 RdRand OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: x86_64 RdRand AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 4699.59, N = 3 5964185.24 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Floating Point AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 1300 2600 3900 5200 6500 SE +/- 6.53, N = 3 6009.83 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Matrix 3D Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix 3D Math AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 300 600 900 1200 1500 SE +/- 1.28, N = 3 1513.65 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Memory Copying OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Memory Copying AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 1300 2600 3900 5200 6500 SE +/- 6.38, N = 3 5974.68 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Vector Shuffle OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Shuffle AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 3K 6K 9K 12K 15K SE +/- 17.43, N = 3 11961.21 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Mixed Scheduler OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mixed Scheduler AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 2K 4K 6K 8K 10K SE +/- 76.21, N = 3 11422.92 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Socket Activity AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 1000 2000 3000 4000 5000 SE +/- 25.15, N = 3 4842.26 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Wide Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Wide Vector Math AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 110K 220K 330K 440K 550K SE +/- 729.30, N = 3 516608.09 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Context Switching AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 900K 1800K 2700K 3600K 4500K SE +/- 19596.85, N = 3 4339638.47 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Fused Multiply-Add OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Fused Multiply-Add AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 9M 18M 27M 36M 45M SE +/- 17071.04, N = 3 41301304.64 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Vector Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Floating Point AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 9K 18K 27K 36K 45K SE +/- 40.92, N = 3 39790.83 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Glibc C String Functions OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc C String Functions AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 2M 4M 6M 8M 10M SE +/- 42515.03, N = 3 8933007.81 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: Glibc Qsort Data Sorting OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc Qsort Data Sorting AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 90 180 270 360 450 SE +/- 0.44, N = 3 414.14 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: System V Message Passing AMD Ryzen 9 5950X 16-Core - MSI NVIDIA GeForce RTX 3M 6M 9M 12M 15M SE +/- 15812.61, N = 3 13352148.72 1. (CC) gcc options: -O2 -std=gnu99 -lbsd -lc
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D MSI NVIDIA GeForce RTX 4080 90 180 270 360 450 SE +/- 0.21, N = 3 426.46 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad MSI NVIDIA GeForce RTX 4080 5 10 15 20 25 SE +/- 0.02, N = 3 20.17 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP MSI NVIDIA GeForce RTX 4080 400 800 1200 1600 2000 SE +/- 3.11, N = 3 1836.99 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash MSI NVIDIA GeForce RTX 4080 13 26 39 52 65 SE +/- 0.51, N = 3 58.61 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction MSI NVIDIA GeForce RTX 4080 130 260 390 520 650 SE +/- 0.05, N = 3 608.24 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N MSI NVIDIA GeForce RTX 4080 4K 8K 12K 16K 20K SE +/- 105.53, N = 15 16670.9 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops MSI NVIDIA GeForce RTX 4080 11K 22K 33K 44K 55K SE +/- 1.82, N = 3 53352.7 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download MSI NVIDIA GeForce RTX 4080 6 12 18 24 30 SE +/- 0.02, N = 3 26.28 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback MSI NVIDIA GeForce RTX 4080 6 12 18 24 30 SE +/- 0.01, N = 3 27.12 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth MSI NVIDIA GeForce RTX 4080 600 1200 1800 2400 3000 SE +/- 2.07, N = 3 2930.42 1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.06, N = 3 11.98 MIN: 11.71 / MAX: 18.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 MSI NVIDIA GeForce RTX 4080 1.0215 2.043 3.0645 4.086 5.1075 SE +/- 0.00, N = 3 4.54 MIN: 4.41 / MAX: 10.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 MSI NVIDIA GeForce RTX 4080 0.8775 1.755 2.6325 3.51 4.3875 SE +/- 0.01, N = 3 3.90 MIN: 3.81 / MAX: 9.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 MSI NVIDIA GeForce RTX 4080 1.0148 2.0296 3.0444 4.0592 5.074 SE +/- 0.01, N = 3 4.51 MIN: 4.45 / MAX: 9.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet MSI NVIDIA GeForce RTX 4080 0.909 1.818 2.727 3.636 4.545 SE +/- 0.01, N = 3 4.04 MIN: 3.93 / MAX: 10.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 MSI NVIDIA GeForce RTX 4080 1.2938 2.5876 3.8814 5.1752 6.469 SE +/- 0.02, N = 3 5.75 MIN: 5.65 / MAX: 11.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface MSI NVIDIA GeForce RTX 4080 0.387 0.774 1.161 1.548 1.935 SE +/- 0.01, N = 3 1.72 MIN: 1.69 / MAX: 7.84 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.25, N = 3 12.24 MIN: 11.36 / MAX: 235.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 MSI NVIDIA GeForce RTX 4080 10 20 30 40 50 SE +/- 0.30, N = 3 45.47 MIN: 44.02 / MAX: 351.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 MSI NVIDIA GeForce RTX 4080 2 4 6 8 10 SE +/- 0.02, N = 3 8.11 MIN: 7.95 / MAX: 14.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet MSI NVIDIA GeForce RTX 4080 2 4 6 8 10 SE +/- 0.01, N = 3 7.11 MIN: 6.93 / MAX: 13.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 MSI NVIDIA GeForce RTX 4080 4 8 12 16 20 SE +/- 0.42, N = 3 18.23 MIN: 16.93 / MAX: 226.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny MSI NVIDIA GeForce RTX 4080 5 10 15 20 25 SE +/- 0.02, N = 3 21.19 MIN: 20.76 / MAX: 35.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.06, N = 3 10.43 MIN: 10.14 / MAX: 16.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.02, N = 3 10.30 MIN: 10.14 / MAX: 15.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer MSI NVIDIA GeForce RTX 4080 13 26 39 52 65 SE +/- 0.20, N = 3 58.03 MIN: 57.16 / MAX: 80.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet MSI NVIDIA GeForce RTX 4080 1.1385 2.277 3.4155 4.554 5.6925 SE +/- 0.24, N = 3 5.06 MIN: 4.43 / MAX: 10.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter MSI NVIDIA GeForce RTX 4080 0.6341 1.2682 1.9023 2.5364 3.1705 SE +/- 0.021, N = 13 2.818 1. (CXX) g++ options: -O2 -lOpenCL
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA OptiX MSI NVIDIA GeForce RTX 4080 1.1205 2.241 3.3615 4.482 5.6025 SE +/- 0.10, N = 14 4.98
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Classroom - Compute: NVIDIA OptiX MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.02, N = 3 10.53
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA OptiX MSI NVIDIA GeForce RTX 4080 2 4 6 8 10 SE +/- 0.10, N = 13 8.30
Blender Blend File: Barbershop - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Barbershop - Compute: NVIDIA OptiX MSI NVIDIA GeForce RTX 4080 10 20 30 40 50 SE +/- 0.05, N = 3 43.79
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.01, N = 3 11.71
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU MSI NVIDIA GeForce RTX 4080 900 1800 2700 3600 4500 SE +/- 0.00, N = 3 4080
LuxCoreRender Scene: DLSC - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU MSI NVIDIA GeForce RTX 4080 4 8 12 16 20 SE +/- 0.00, N = 3 15.4 MIN: 15 / MAX: 15.56
LuxCoreRender Scene: Danish Mood - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.07, N = 3 12.34 MIN: 4.48 / MAX: 14.75
LuxCoreRender Scene: Orange Juice - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU MSI NVIDIA GeForce RTX 4080 3 6 9 12 15 SE +/- 0.06, N = 3 12.30 MIN: 9 / MAX: 16.09
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU MSI NVIDIA GeForce RTX 4080 4 8 12 16 20 SE +/- 0.02, N = 3 13.77 MIN: 3.82 / MAX: 16.66
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: GPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU MSI NVIDIA GeForce RTX 4080 8 16 24 32 40 SE +/- 0.39, N = 3 32.80 MIN: 30.13 / MAX: 36.09
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 MSI NVIDIA GeForce RTX 4080 90 180 270 360 450 SE +/- 0.63, N = 3 394.11
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: MD5 MSI NVIDIA GeForce RTX 4080 20000M 40000M 60000M 80000M 100000M SE +/- 38403515.46, N = 3 93006300000
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 MSI NVIDIA GeForce RTX 4080 6000M 12000M 18000M 24000M 30000M SE +/- 352766.84, N = 3 29664566667
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: 7-Zip MSI NVIDIA GeForce RTX 4080 400K 800K 1200K 1600K 2000K SE +/- 417.67, N = 3 1667533
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 MSI NVIDIA GeForce RTX 4080 800M 1600M 2400M 3200M 4000M SE +/- 517472.49, N = 3 3786333333
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS MSI NVIDIA GeForce RTX 4080 200K 400K 600K 800K 1000K SE +/- 8358.69, N = 3 1127767
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL MSI NVIDIA GeForce RTX 4080 1.0103 2.0206 3.0309 4.0412 5.0515 SE +/- 0.003, N = 3 4.490 1. (CXX) g++ options: -O3 -march=native -fopenmp
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy MSI NVIDIA GeForce RTX 4080 80 160 240 320 400 SE +/- 0.03, N = 3 377.2 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read MSI NVIDIA GeForce RTX 4080 140 280 420 560 700 SE +/- 0.00, N = 3 626.5 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write MSI NVIDIA GeForce RTX 4080 130 260 390 520 650 SE +/- 1.25, N = 3 592.8 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak 1.1.2 OpenCL Test: Integer Compute INT MSI NVIDIA GeForce RTX 4080 5K 10K 15K 20K 25K SE +/- 5.08, N = 3 24381.19 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Single-Precision Float MSI NVIDIA GeForce RTX 4080 10K 20K 30K 40K 50K SE +/- 0.15, N = 3 47636.58 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak 1.1.2 OpenCL Test: Double-Precision Double MSI NVIDIA GeForce RTX 4080 200 400 600 800 1000 SE +/- 2.36, N = 3 836.12 1. (CXX) g++ options: -O3
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth MSI NVIDIA GeForce RTX 4080 130 260 390 520 650 SE +/- 0.01, N = 3 613.69 1. (CXX) g++ options: -O3
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY MSI NVIDIA GeForce RTX 4080 20 40 60 80 100 SE +/- 0.19, N = 3 80.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY MSI NVIDIA GeForce RTX 4080 30 60 90 120 150 SE +/- 0.58, N = 3 121 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT MSI NVIDIA GeForce RTX 4080 30 60 90 120 150 SE +/- 11.20, N = 3 122.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY MSI NVIDIA GeForce RTX 4080 7 14 21 28 35 SE +/- 0.07, N = 3 29.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY MSI NVIDIA GeForce RTX 4080 10 20 30 40 50 SE +/- 0.15, N = 3 44.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT MSI NVIDIA GeForce RTX 4080 11 22 33 44 55 SE +/- 0.06, N = 3 49.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N MSI NVIDIA GeForce RTX 4080 13 26 39 52 65 SE +/- 0.31, N = 3 58.2 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T MSI NVIDIA GeForce RTX 4080 15 30 45 60 75 SE +/- 0.12, N = 3 65.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN MSI NVIDIA GeForce RTX 4080 20 40 60 80 100 SE +/- 0.60, N = 3 83.6 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT MSI NVIDIA GeForce RTX 4080 20 40 60 80 100 SE +/- 0.85, N = 3 82.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN MSI NVIDIA GeForce RTX 4080 20 40 60 80 100 SE +/- 0.21, N = 3 90.1 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT MSI NVIDIA GeForce RTX 4080 20 40 60 80 100 SE +/- 0.80, N = 3 87.7 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY MSI NVIDIA GeForce RTX 4080 80 160 240 320 400 SE +/- 0.33, N = 3 378 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY MSI NVIDIA GeForce RTX 4080 100 200 300 400 500 SE +/- 0.00, N = 3 482 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT MSI NVIDIA GeForce RTX 4080 90 180 270 360 450 SE +/- 0.33, N = 3 405 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY MSI NVIDIA GeForce RTX 4080 120 240 360 480 600 SE +/- 0.33, N = 3 536 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY MSI NVIDIA GeForce RTX 4080 130 260 390 520 650 SE +/- 0.00, N = 3 607 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT MSI NVIDIA GeForce RTX 4080 130 260 390 520 650 SE +/- 0.00, N = 3 588 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N MSI NVIDIA GeForce RTX 4080 50 100 150 200 250 SE +/- 0.00, N = 3 219 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T MSI NVIDIA GeForce RTX 4080 90 180 270 360 450 SE +/- 0.33, N = 3 425 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN MSI NVIDIA GeForce RTX 4080 160 320 480 640 800 SE +/- 0.00, N = 3 743 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT MSI NVIDIA GeForce RTX 4080 160 320 480 640 800 SE +/- 0.00, N = 3 765 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN MSI NVIDIA GeForce RTX 4080 200 400 600 800 1000 SE +/- 0.00, N = 3 799 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT MSI NVIDIA GeForce RTX 4080 200 400 600 800 1000 SE +/- 0.00, N = 3 817 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.5