brw-7000-nv3090-tun-2

AMD Ryzen 7 7800X3D 8-Core testing with an ASUS ROG STRIX X670E-I GAMING WIFI (1616 BIOS) and NVIDIA GeForce RTX 3090 24GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2307035-NE-BRW7000NV23&grs.

brw-7000-nv3090-tun-2

System: GPU tests on 7800X3D with 3090

Processor: AMD Ryzen 7 7800X3D 8-Core @ 4.20GHz (8 Cores / 16 Threads)
Motherboard: ASUS ROG STRIX X670E-I GAMING WIFI (1616 BIOS)
Chipset: AMD Device 14d8
Memory: 62GB
Disk: 2 x 4001GB Western Digital WD_BLACK SN850X 4000GB + 8002GB Samsung SSD 870 + 128GB Flash Drive FIT
Graphics: NVIDIA GeForce RTX 3090 24GB
Audio: NVIDIA GA102 HD Audio
Monitor: HP Z27
Network: Intel I225-V + MEDIATEK Device 0616
OS: Ubuntu 22.04
Kernel: 5.19.0-46-generic (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.4
Display Driver: NVIDIA 530.30.02
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.1.68
Vulkan: 1.3.236
Compiler: GCC 11.3.0 + CUDA 12.1
File-System: ext4
Screen Resolution: 3840x2160

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
- GLAMOR - BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 94.02.32.00.02
- GPU Compute Cores: 10496
- Python 3.10.6
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

brw-7000-nv3090-tun-2

Result summary for: GPU tests on 7800X3D with 3090

v-ray: NVIDIA CUDA GPU = 2024
v-ray: NVIDIA RTX GPU = 2741
neatbench: GPU = 3090
mandelgpu: GPU = 558514981.5
indigobench: OpenCL GPU - Supercar = 51.054
indigobench: OpenCL GPU - Bedroom = 20.548
blender: Pabellon Barcelona - NVIDIA OptiX = 16.19
blender: Barbershop - NVIDIA OptiX = 53.11
blender: Fishy Cat - NVIDIA OptiX = 10.67
blender: Classroom - NVIDIA OptiX = 14.17
blender: BMW27 - NVIDIA OptiX = 6.06
ncnn: Vulkan GPU - vision_transformer = 167.45
ncnn: Vulkan GPU - regnety_400m = 1.86
ncnn: Vulkan GPU - yolov4-tiny = 6.49
ncnn: Vulkan GPU - googlenet = 4.14
ncnn: Vulkan GPU - blazeface = 0.84
caffe: GoogleNet - NVIDIA CUDA - 1000 = 28117.4
caffe: GoogleNet - NVIDIA CUDA - 200 = 5577.04
caffe: GoogleNet - NVIDIA CUDA - 100 = 2850.49
caffe: AlexNet - NVIDIA CUDA - 1000 = 7517.58
caffe: AlexNet - NVIDIA CUDA - 200 = 1454.17
caffe: AlexNet - NVIDIA CUDA - 100 = 686.336
gromacs: NVIDIA CUDA GPU - water_GMX50_bare = 22.832
viennacl: OpenCL BLAS - dGEMM-TT = 575
viennacl: OpenCL BLAS - dGEMM-NT = 575
viennacl: OpenCL BLAS - dGEMM-NN = 575
viennacl: OpenCL BLAS - dGEMV-T = 364
viennacl: OpenCL BLAS - dGEMV-N = 183
viennacl: OpenCL BLAS - dDOT = 634
viennacl: OpenCL BLAS - dAXPY = 705
viennacl: OpenCL BLAS - dCOPY = 588
viennacl: OpenCL BLAS - sDOT = 363
viennacl: OpenCL BLAS - sAXPY = 486
viennacl: OpenCL BLAS - sCOPY = 354
viennacl: CPU BLAS - dGEMM-TT = 51.0
viennacl: CPU BLAS - dGEMM-TN = 53.6
viennacl: CPU BLAS - dGEMM-NT = 48.3
viennacl: CPU BLAS - dGEMM-NN = 50.3
viennacl: CPU BLAS - dGEMV-T = 140
viennacl: CPU BLAS - dGEMV-N = 171
viennacl: CPU BLAS - dDOT = 95.7
viennacl: CPU BLAS - dAXPY = 122
viennacl: CPU BLAS - dCOPY = 77.8
viennacl: CPU BLAS - sDOT = 171
viennacl: CPU BLAS - sAXPY = 766
viennacl: CPU BLAS - sCOPY = 516
financebench: Black-Scholes OpenCL = 5.938
luxcorerender: Rainbow Colors and Prism - GPU = 34.38
luxcorerender: LuxCore Benchmark - GPU = 11.93
luxcorerender: Orange Juice - GPU = 12.29
luxcorerender: Danish Mood - GPU = 9.38
luxcorerender: DLSC - GPU = 14.05
arrayfire: Conjugate Gradient OpenCL = 1.625
rodinia: OpenCL Particle Filter = 4.025
clpeak: Global Memory Bandwidth = 801.24
clpeak: Double-Precision Double = 626.17
clpeak: Single-Precision Float = 34054.40
clpeak: Integer Compute INT = 17355.85
fahbench: = 318.4467
octanebench: Total Score = 664.393943
vkresample: 2x - Single = 9.499
vkresample: 2x - Double = 123.425
namd-cuda: ATPase Simulation - 327,506 Atoms = 0.07795
cl-mem: Write = 726.2
cl-mem: Read = 810.2
cl-mem: Copy = 352.5
shoc: OpenCL - Texture Read Bandwidth = 2172.63
shoc: OpenCL - Bus Speed Readback = 26.4027
shoc: OpenCL - Bus Speed Download = 26.8947
shoc: OpenCL - Max SP Flops = 39191.8
shoc: OpenCL - GEMM SGEMM_N = 7874.68
shoc: OpenCL - Reduction = 391.752
shoc: OpenCL - MD5 Hash = 43.3052
shoc: OpenCL - FFT SP = 2353.73
shoc: OpenCL - Triad = 25.8026
shoc: OpenCL - S3D = 426.888
mixbench: NVIDIA CUDA - Single Precision = 34024.39
mixbench: NVIDIA CUDA - Half Precision = 34767.29
mixbench: OpenCL - Single Precision = 35904.68
mixbench: NVIDIA CUDA - Integer = 15272.55
mixbench: OpenCL - Integer = 19219.36
hashcat: TrueCrypt RIPEMD160 + XTS = 807433
hashcat: SHA-512 = 3133500000
hashcat: 7-Zip = 1079567
hashcat: SHA1 = 21672600000
hashcat: MD5 = 68165866667
vkfft: = 49513
waifu2x-ncnn: 2x - 3 - Yes = 3.251
realsr-ncnn: 4x - Yes = 30.008
realsr-ncnn: 4x - No = 5.738
vkpeak: int16-vec4 = 16746.84
vkpeak: int16-scalar = 13322.91
vkpeak: int32-vec4 = 20099.14
vkpeak: int32-scalar = 20193.59
vkpeak: fp64-vec4 = 635.25
vkpeak: fp64-scalar = 635.28
vkpeak: fp16-vec4 = 40025.36
vkpeak: fp16-scalar = 20263.80
vkpeak: fp32-vec4 = 26887.04
vkpeak: fp32-scalar = 20333.61
ncnn: Vulkan GPU - FastestDet = 2.10
ncnn: Vulkan GPU - squeezenet_ssd = 15.37
ncnn: Vulkan GPU - resnet50 = 2.56
ncnn: Vulkan GPU - alexnet = 1.15
ncnn: Vulkan GPU - resnet18 = 2.31
ncnn: Vulkan GPU - vgg16 = 3.45
ncnn: Vulkan GPU - efficientnet-b0 = 4.04
ncnn: Vulkan GPU - mnasnet = 1.19
ncnn: Vulkan GPU - shufflenet-v2 = 1.45
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 = 2.13
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 = 1.42
ncnn: Vulkan GPU - mobilenet = 3.19
mixbench: NVIDIA CUDA - Double Precision = 476.86
mixbench: OpenCL - Double Precision = 496.08
waifu2x-ncnn: 2x - 3 - No = (no value recorded)

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

Chaos Group V-RAY 5.02. vpaths, More Is Better. Result: 2024 (SE +/- 0.88, N = 3).

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

Chaos Group V-RAY 5.02. vrays, More Is Better. Result: 2741 (SE +/- 3.38, N = 3).

NeatBench

Acceleration: GPU

NeatBench 5. FPS, More Is Better. Result: 3090 (SE +/- 0.00, N = 3).

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1. Samples/sec, More Is Better. Result: 558514981.5 (SE +/- 1684953.85, N = 3).
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
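The SE figures are reported in each test's own unit, so a value such as 1684953.85 looks large until it is set against the mean. The following is a minimal sketch, not part of the Phoronix Test Suite output, that expresses the standard error as a percentage of the reported result. The function name `relative_se_percent` is hypothetical; the numbers are copied from the results above.

```python
# (mean, standard error) pairs copied from the result lines in this export.
results = {
    "V-RAY, NVIDIA CUDA GPU (vpaths)": (2024, 0.88),
    "V-RAY, NVIDIA RTX GPU (vrays)": (2741, 3.38),
    "MandelGPU (Samples/sec)": (558514981.5, 1684953.85),
}


def relative_se_percent(mean, se):
    """Return the standard error as a percentage of the reported mean."""
    return 100.0 * se / mean


for name, (mean, se) in results.items():
    print(f"{name}: SE is {relative_se_percent(mean, se):.2f}% of the mean")
```

Read this way, all three runs show a spread well under one percent of the result.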

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

IndigoBench 4.4. M samples/s, More Is Better. Result: 51.05 (SE +/- 0.03, N = 3).

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

IndigoBench 4.4. M samples/s, More Is Better. Result: 20.55 (SE +/- 0.01, N = 3).

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 3.6. Seconds, Fewer Is Better. Result: 16.19 (SE +/- 0.01, N = 3).

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 3.6. Seconds, Fewer Is Better. Result: 53.11 (SE +/- 0.08, N = 3).

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 3.6. Seconds, Fewer Is Better. Result: 10.67 (SE +/- 0.08, N = 10).

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 3.6. Seconds, Fewer Is Better. Result: 14.17 (SE +/- 0.03, N = 3).

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 3.6. Seconds, Fewer Is Better. Result: 6.06 (SE +/- 0.06, N = 14).

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20220729. ms, Fewer Is Better. Result: 167.45 (SE +/- 0.71, N = 15). MIN: 134.45 / MAX: 824.96.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20220729. ms, Fewer Is Better. Result: 1.86 (SE +/- 0.02, N = 15). MIN: 1.53 / MAX: 13.39.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20220729. ms, Fewer Is Better. Result: 6.49 (SE +/- 0.07, N = 15). MIN: 4.49 / MAX: 48.41.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20220729. ms, Fewer Is Better. Result: 4.14 (SE +/- 0.04, N = 15). MIN: 1.78 / MAX: 35.6.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20220729. ms, Fewer Is Better. Result: 0.84 (SE +/- 0.01, N = 15). MIN: 0.67 / MAX: 8.82.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 28117.4 (SE +/- 35.02, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 5577.04 (SE +/- 32.14, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 2850.49 (SE +/- 8.96, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 7517.58 (SE +/- 94.97, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 1454.17 (SE +/- 18.33, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

Caffe 2020-02-13. Milli-Seconds, Fewer Is Better. Result: 686.34 (SE +/- 1.97, N = 3).
(CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
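The six Caffe results use different iteration counts, which makes them awkward to compare directly. The sketch below assumes each reported value is the total time for the stated number of iterations (an assumption, though the near-linear scaling between the 100, 200 and 1000 iteration runs is consistent with it) and divides through to get milliseconds per iteration. The helper name `ms_per_iteration` is hypothetical.

```python
# Total milliseconds copied from the Caffe results above,
# keyed by (model, iteration count).
caffe_total_ms = {
    ("GoogleNet", 1000): 28117.4,
    ("GoogleNet", 200): 5577.04,
    ("GoogleNet", 100): 2850.49,
    ("AlexNet", 1000): 7517.58,
    ("AlexNet", 200): 1454.17,
    ("AlexNet", 100): 686.34,
}


def ms_per_iteration(total_ms, iterations):
    """Average time per iteration, assuming total_ms covers all iterations."""
    return total_ms / iterations


for (model, iterations), total_ms in caffe_total_ms.items():
    per_iter = ms_per_iteration(total_ms, iterations)
    print(f"{model}, {iterations} iterations: {per_iter:.2f} ms/iteration")
```

Under that assumption GoogleNet lands at roughly 28 ms per iteration and AlexNet at roughly 7 ms per iteration across all three iteration counts.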

GROMACS

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

GROMACS 2023. Ns Per Day, More Is Better. Result: 22.83 (SE +/- 0.05, N = 3).
(CXX) g++ options: -O3

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 575 (SE +/- 0.00, N = 2).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 575 (SE +/- 1.00, N = 2).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 575 (SE +/- 1.45, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1. GB/s, More Is Better. Result: 364 (SE +/- 0.00, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1. GB/s, More Is Better. Result: 183 (SE +/- 0.00, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1. GB/s, More Is Better. Result: 634 (SE +/- 1.20, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 705 (SE +/- 1.76, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 588 (SE +/- 0.88, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1. GB/s, More Is Better. Result: 363 (SE +/- 0.33, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 486 (SE +/- 0.58, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 354 (SE +/- 1.00, N = 3).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 51.0 (SE +/- 0.21, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 53.6 (SE +/- 0.10, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 48.3 (SE +/- 0.15, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1. GFLOPs/s, More Is Better. Result: 50.3 (SE +/- 0.19, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1. GB/s, More Is Better. Result: 140 (SE +/- 1.28, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1. GB/s, More Is Better. Result: 171 (SE +/- 1.42, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1. GB/s, More Is Better. Result: 95.7 (SE +/- 0.63, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 122 (SE +/- 0.56, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 77.8 (SE +/- 1.02, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1. GB/s, More Is Better. Result: 171 (SE +/- 1.76, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 766 (SE +/- 6.08, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1. GB/s, More Is Better. Result: 516 (SE +/- 5.79, N = 15).
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-07-25. ms, Fewer Is Better. Result: 5.938 (SE +/- 0.018, N = 3).
(CXX) g++ options: -O3 -march=native -fopenmp

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: GPU

LuxCoreRender 2.6. M samples/sec, More Is Better. Result: 34.38 (SE +/- 0.05, N = 3). MIN: 30.56 / MAX: 37.57.

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: GPU

LuxCoreRender 2.6. M samples/sec, More Is Better. Result: 11.93 (SE +/- 0.02, N = 3). MIN: 3.32 / MAX: 14.39.

LuxCoreRender

Scene: Orange Juice - Acceleration: GPU

LuxCoreRender 2.6. M samples/sec, More Is Better. Result: 12.29 (SE +/- 0.03, N = 3). MIN: 10.06 / MAX: 16.69.

LuxCoreRender

Scene: Danish Mood - Acceleration: GPU

LuxCoreRender 2.6. M samples/sec, More Is Better. Result: 9.38 (SE +/- 0.10, N = 4). MIN: 2.93 / MAX: 11.45.

LuxCoreRender

Scene: DLSC - Acceleration: GPU

LuxCoreRender 2.6. M samples/sec, More Is Better. Result: 14.05 (SE +/- 0.02, N = 3). MIN: 13.48 / MAX: 14.61.

ArrayFire

Test: Conjugate Gradient OpenCL

ArrayFire 3.7. ms, Fewer Is Better. Result: 1.625 (SE +/- 0.004, N = 3).
(CXX) g++ options: -rdynamic

Rodinia

Test: OpenCL Particle Filter

Rodinia 3.1. Seconds, Fewer Is Better. Result: 4.025 (SE +/- 0.029, N = 12).
(CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2. GBPS, More Is Better. Result: 801.24 (SE +/- 2.05, N = 3).
(CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Double

clpeak 1.1.2. GFLOPS, More Is Better. Result: 626.17 (SE +/- 0.43, N = 3).
(CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Float

clpeak 1.1.2. GFLOPS, More Is Better. Result: 34054.40 (SE +/- 181.26, N = 3).
(CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute INT

clpeak 1.1.2. GIOPS, More Is Better. Result: 17355.85 (SE +/- 164.14, N = 3).
(CXX) g++ options: -O3
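The clpeak compute results span almost two orders of magnitude between precisions. As a rough illustration only, the sketch below computes how many times faster the single-precision and integer results are than the double-precision result, using the three clpeak figures above. The dictionary keys and the function name `throughput_ratio` are hypothetical labels chosen for this example.

```python
# clpeak compute results copied from the sections above
# (GFLOPS for the floating-point tests, GIOPS for the integer test).
clpeak = {
    "double": 626.17,
    "single": 34054.40,
    "integer": 17355.85,
}


def throughput_ratio(numerator, denominator):
    """How many times larger one throughput figure is than another."""
    return numerator / denominator


single_vs_double = throughput_ratio(clpeak["single"], clpeak["double"])
integer_vs_double = throughput_ratio(clpeak["integer"], clpeak["double"])
print(f"single / double:  {single_vs_double:.1f}x")
print(f"integer / double: {integer_vs_double:.1f}x")
```

On these numbers the single-precision result is roughly 54 times the double-precision result.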

FAHBench

FAHBench 2.3.2. Ns Per Day, More Is Better. Result: 318.45 (SE +/- 0.35, N = 3).

OctaneBench

Total Score

OctaneBench 2020.1. Score, More Is Better. Result: 664.39.

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0. ms, Fewer Is Better. Result: 9.499 (SE +/- 0.012, N = 3).
(CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0. ms, Fewer Is Better. Result: 123.43 (SE +/- 0.13, N = 3).
(CXX) g++ options: -O3

NAMD CUDA

ATPase Simulation - 327,506 Atoms

NAMD CUDA 2.14. days/ns, Fewer Is Better. Result: 0.07795 (SE +/- 0.00034, N = 3).
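NAMD reports days per simulated nanosecond, whereas the GROMACS and FAHBench results in this export use the reciprocal unit, Ns Per Day. A small sketch of the conversion follows; the function name `ns_per_day` is hypothetical. The converted figure only changes the unit and does not make the three workloads comparable, since each simulates a different system.

```python
def ns_per_day(days_per_ns):
    """Convert a days/ns figure into its reciprocal, ns/day."""
    if days_per_ns <= 0:
        raise ValueError("days_per_ns must be positive")
    return 1.0 / days_per_ns


# NAMD CUDA result from this export: 0.07795 days/ns.
namd_days_per_ns = 0.07795
print(f"NAMD CUDA: {ns_per_day(namd_days_per_ns):.2f} ns/day")
```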

cl-mem

Benchmark: Write

cl-mem 2017-01-13. GB/s, More Is Better. Result: 726.2 (SE +/- 0.80, N = 3).
(CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

cl-mem 2017-01-13. GB/s, More Is Better. Result: 810.2 (SE +/- 0.82, N = 3).
(CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

cl-mem 2017-01-13. GB/s, More Is Better. Result: 352.5 (SE +/- 0.26, N = 3).
(CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17. GB/s, More Is Better. Result: 2172.63 (SE +/- 4.85, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17. GB/s, More Is Better. Result: 26.40 (SE +/- 0.00, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17. GB/s, More Is Better. Result: 26.89 (SE +/- 0.00, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17. GFLOPS, More Is Better. Result: 39191.8 (SE +/- 355.87, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17. GFLOPS, More Is Better. Result: 7874.68 (SE +/- 17.44, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17. GB/s, More Is Better. Result: 391.75 (SE +/- 0.07, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17. GHash/s, More Is Better. Result: 43.31 (SE +/- 0.12, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17. GFLOPS, More Is Better. Result: 2353.73 (SE +/- 2.45, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17. GB/s, More Is Better. Result: 25.80 (SE +/- 0.04, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17. GFLOPS, More Is Better. Result: 426.89 (SE +/- 0.68, N = 3).
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Mixbench

Backend: NVIDIA CUDA - Benchmark: Single Precision

Mixbench 2020-06-23. GFLOPS, More Is Better. Result: 34024.39 (SE +/- 501.94, N = 15).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Half Precision

Mixbench 2020-06-23. GFLOPS, More Is Better. Result: 34767.29 (SE +/- 0.00, N = 3).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Single Precision

Mixbench 2020-06-23. GFLOPS, More Is Better. Result: 35904.68 (SE +/- 551.07, N = 15).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Integer

Mixbench 2020-06-23. GIOPS, More Is Better. Result: 15272.55 (SE +/- 4.63, N = 3).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Integer

Mixbench 2020-06-23. GIOPS, More Is Better. Result: 19219.36 (SE +/- 290.36, N = 15).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.2.4. H/s, More Is Better. Result: 807433 (SE +/- 617.34, N = 3).

Hashcat

Benchmark: SHA-512

Hashcat 6.2.4. H/s, More Is Better. Result: 3133500000 (SE +/- 2542308.66, N = 3).

Hashcat

Benchmark: 7-Zip

Hashcat 6.2.4. H/s, More Is Better. Result: 1079567 (SE +/- 260.34, N = 3).

Hashcat

Benchmark: SHA1

Hashcat 6.2.4. H/s, More Is Better. Result: 21672600000 (SE +/- 19890282.38, N = 3).

Hashcat

Benchmark: MD5

Hashcat 6.2.4. H/s, More Is Better. Result: 68165866667 (SE +/- 32963734.28, N = 3).
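Hashcat reports raw hashes per second, which produces eleven-digit figures, while the SHOC MD5 Hash result in this export is given in GHash/s. The following sketch rescales the Hashcat figures to the same giga prefix for readability. The function name `to_giga` is hypothetical, and the two MD5 numbers come from different implementations, so the rescaling does not make them a like-for-like comparison.

```python
# Hashcat results copied from the sections above, in H/s.
hashcat_hs = {
    "SHA-512": 3133500000,
    "SHA1": 21672600000,
    "MD5": 68165866667,
}


def to_giga(hashes_per_second):
    """Rescale H/s to GH/s (1 GH/s = 1e9 H/s)."""
    return hashes_per_second / 1e9


for name, rate in hashcat_hs.items():
    print(f"Hashcat {name}: {to_giga(rate):.2f} GH/s")
```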

VkFFT

VkFFT 1.1.1. Benchmark Score, More Is Better. Result: 49513 (SE +/- 651.37, N = 9).
(CXX) g++ options: -O3

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818. Seconds, Fewer Is Better. Result: 3.251 (SE +/- 0.004, N = 3).

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818. Seconds, Fewer Is Better. Result: 30.01 (SE +/- 0.05, N = 3).

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818. Seconds, Fewer Is Better. Result: 5.738 (SE +/- 0.072, N = 3).

vkpeak

int16-vec4

vkpeak 20210424. GIOPS, More Is Better. Result: 16746.84 (SE +/- 8.51, N = 3).

vkpeak

int16-scalar

vkpeak 20210424. GIOPS, More Is Better. Result: 13322.91 (SE +/- 34.38, N = 3).

vkpeak

int32-vec4

vkpeak 20210424. GIOPS, More Is Better. Result: 20099.14 (SE +/- 52.35, N = 3).

vkpeak

int32-scalar

vkpeak 20210424. GIOPS, More Is Better. Result: 20193.59 (SE +/- 52.00, N = 3).

vkpeak

fp64-vec4

vkpeak 20210424. GFLOPS, More Is Better. Result: 635.25 (SE +/- 1.66, N = 3).

vkpeak

fp64-scalar

vkpeak 20210424. GFLOPS, More Is Better. Result: 635.28 (SE +/- 1.51, N = 3).

vkpeak

fp16-vec4

vkpeak 20210424. GFLOPS, More Is Better. Result: 40025.36 (SE +/- 102.99, N = 3).

vkpeak

fp16-scalar

vkpeak 20210424. GFLOPS, More Is Better. Result: 20263.80 (SE +/- 96.24, N = 3).

vkpeak

fp32-vec4

vkpeak 20210424. GFLOPS, More Is Better. Result: 26887.04 (SE +/- 123.22, N = 3).

vkpeak

fp32-scalar

vkpeak 20210424. GFLOPS, More Is Better. Result: 20333.61 (SE +/- 89.35, N = 3).

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20220729. ms, Fewer Is Better. Result: 2.10 (SE +/- 0.05, N = 14). MIN: 1.32 / MAX: 20.04.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20220729. ms, Fewer Is Better. Result: 15.37 (SE +/- 0.87, N = 15). MIN: 2.81 / MAX: 69.37.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20220729. ms, Fewer Is Better. Result: 2.56 (SE +/- 0.06, N = 14). MIN: 1.57 / MAX: 21.09.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20220729. ms, Fewer Is Better. Result: 1.15 (SE +/- 0.03, N = 15). MIN: 0.91 / MAX: 16.13.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20220729. ms, Fewer Is Better. Result: 2.31 (SE +/- 0.13, N = 15). MIN: 0.96 / MAX: 24.59.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20220729. ms, Fewer Is Better. Result: 3.45 (SE +/- 0.27, N = 14). MIN: 1.41 / MAX: 33.17.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20220729. ms, Fewer Is Better. Result: 4.04 (SE +/- 0.09, N = 15). MIN: 2.09 / MAX: 27.51.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20220729. ms, Fewer Is Better. Result: 1.19 (SE +/- 0.02, N = 15). MIN: 0.96 / MAX: 16.74.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20220729. ms, Fewer Is Better. Result: 1.45 (SE +/- 0.03, N = 15). MIN: 1.17 / MAX: 16.3.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20220729. ms, Fewer Is Better. Result: 2.13 (SE +/- 0.04, N = 15). MIN: 1.23 / MAX: 16.44.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20220729. ms, Fewer Is Better. Result: 1.42 (SE +/- 0.08, N = 15). MIN: 0.92 / MAX: 14.29.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20220729. ms, Fewer Is Better. Result: 3.19 (SE +/- 0.06, N = 15). MIN: 2.52 / MAX: 36.32.
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Mixbench

Backend: NVIDIA CUDA - Benchmark: Double Precision

Mixbench 2020-06-23. GFLOPS, More Is Better. Result: 476.86 (SE +/- 9.22, N = 15).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: OpenCL - Benchmark: Double Precision

Mixbench 2020-06-23. GFLOPS, More Is Better. Result: 496.08 (SE +/- 8.82, N = 15).
(CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2


Phoronix Test Suite v10.8.5