NGC

AMD Ryzen 9 3950X 16-Core testing with an ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and Zotac NVIDIA GeForce GTX 1070 Ti 8GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2009179-FI-NGC40795300&sor&grw.

System Details

Configurations tested: GTX 980, GTX 970, GTX 980 Ti, GTX 1070 Ti

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16GB
Disk: 2000GB Corsair Force MP600 + 2000GB
Monitor: DELL P2415Q
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04
Kernel: 5.4.0-47-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: NVIDIA 450.66
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.0.228 + OpenCL 2.0 AMD-APP (3182.0)
Vulkan: 1.2.133
Compiler: GCC 9.3.0 + CUDA 11.0
File-System: ext4
Screen Resolution: 3840x2160

Graphics
- GTX 980: NVIDIA GeForce GTX 980 4GB (1126/3505MHz)
- GTX 970: eVGA NVIDIA GeForce GTX 970 4GB (1163/3505MHz)
- GTX 980 Ti: NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz)
- GTX 1070 Ti: Zotac NVIDIA GeForce GTX 1070 Ti 8GB (139/4006MHz)

Audio
- GTX 980, GTX 970: NVIDIA GM204 HD Audio
- GTX 980 Ti: NVIDIA GM200 HD Audio
- GTX 1070 Ti: NVIDIA GP104 HD Audio

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- GTX 980: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
- GTX 970: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013
- GTX 980 Ti: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
- GTX 1070 Ti: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013

OpenCL Details
- GTX 980: GPU Compute Cores: 2048
- GTX 970: GPU Compute Cores: 1664
- GTX 980 Ti: GPU Compute Cores: 2816
- GTX 1070 Ti: GPU Compute Cores: 2432

Python Details
- Python 3.8.2

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Result Overview

A dash (-) marks a test with no result for that configuration.

Test                                             GTX 980      GTX 970      GTX 980 Ti   GTX 1070 Ti
glmark2: 800 x 600                               -            11325        12886        -
glmark2: 1024 x 768                              -            9148         11254        -
glmark2: 1280 x 1024                             -            7007         8967         -
glmark2: 1600 x 1200                             -            5688         7472         -
glmark2: 1920 x 1080                             -            5601         7363         -
glmark2: 1920 x 1200                             -            5185         6916         -
glmark2: 2560 x 1440                             -            3805         5293         -
glmark2: 3840 x 2160                             -            2057         2933         -
plaidml: No - Training - Mobilenet - OpenCL      80.43        73.01        89.44        104.88
plaidml: No - Inference - IMDB LSTM - OpenCL     228.97       202.94       266.29       387.56
plaidml: No - Inference - Mobilenet - OpenCL     810.31       717.50       1030.27      875.35
plaidml: Yes - Inference - Mobilenet - OpenCL    1053.40      909.50       1281.97      1607.15
plaidml: No - Inference - DenseNet 201 - OpenCL  82.06        73.69        99.51        88.37
gromacs-gpu: Water Benchmark                     3.288        3.001        3.881        0.958
lczero: OpenCL                                   6275         4862         8291         4631
rodinia: OpenCL Particle Filter                  11.729       13.049       10.095       11.525
arrayfire: Conjugate Gradient OpenCL             4.883        5.403        3.623        5.115
blender: BMW27 - CUDA                            165.20       173.01       149.21       200.04
blender: Classroom - CUDA                        421.77       444.58       354.53       880.20
blender: Fishy Cat - CUDA                        328.65       351.06       292.20       931.55
blender: Barbershop - CUDA                       1415.17      1540.95      1193.54      3430.14
blender: BMW27 - NVIDIA OptiX                    151.49       175.44       123.07       460.10
blender: Classroom - NVIDIA OptiX                423.50       481.80       342.76       2435.50
blender: Fishy Cat - NVIDIA OptiX                319.37       369.54       258.49       -
blender: Pabellon Barcelona - CUDA               918.07       1012.28      780.37       -
blender: Pabellon Barcelona - NVIDIA OptiX       858.55       1031.90      690.29       -
neatbench: GPU                                   17.2         15.4         -            -
fahbench:                                        101.8257     90.9439      115.4876     -
namd-cuda: ATPase Simulation - 327,506 Atoms     0.32163      0.35160      0.28182      0.33292
octanebench: Total Score                         110.594254   96.072862    143.092739   24.871588
redshift:                                        1002         1119         764          2530
financebench: Black-Scholes OpenCL               12.019       14.198       10.061       79.121999
cl-mem: Copy                                     145.0        125.8        217.5        107.8
cl-mem: Read                                     165.4        144.6        266.0        178.6
cl-mem: Write                                    157          134.0        244.2        194.4
clpeak: Integer Compute INT                      1303.29      1145.39      1615.04      -
clpeak: Single-Precision Float                   4448.81      3877.42      5556.00      -
clpeak: Double-Precision Double                  159.64       137.81       196.28       -
clpeak: Global Memory Bandwidth                  165.23       144.51       264.34       -
mandelgpu: GPU                                   126002111.2  112885853.7  152063380.5  -
viennacl: OpenCL LU Factorization                63.1567      58.6527      64.9328      40.1134

GLmark2

Resolution: 800 x 600

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 12886
GTX 970: 11325

GLmark2

Resolution: 1024 x 768

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 11254
GTX 970: 9148

GLmark2

Resolution: 1280 x 1024

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 8967
GTX 970: 7007

GLmark2

Resolution: 1600 x 1200

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 7472
GTX 970: 5688

GLmark2

Resolution: 1920 x 1080

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 7363
GTX 970: 5601

GLmark2

Resolution: 1920 x 1200

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 6916
GTX 970: 5185

GLmark2

Resolution: 2560 x 1440

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 5293
GTX 970: 3805

GLmark2

Resolution: 3840 x 2160

GLmark2 2020.04 (Score, More Is Better)
GTX 980 Ti: 2933
GTX 970: 2057

PlaidML

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

PlaidML (FPS, More Is Better)
GTX 1070 Ti: 104.88 (SE +/- 0.03, N = 3)
GTX 980 Ti: 89.44 (SE +/- 0.12, N = 3)
GTX 980: 80.43 (SE +/- 0.02, N = 3)
GTX 970: 73.01 (SE +/- 0.03, N = 3)
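
The result pages mix "More Is Better" and "Fewer Is Better" metrics, so comparing cards across tests is easier after normalizing each result against a baseline. The following is a minimal sketch, not part of the Phoronix Test Suite: the function name `relative_to_baseline` and the choice of the GTX 970 as baseline are assumptions made for illustration. The input values are the PlaidML Mobilenet training means reported above.

```python
# Means copied from the PlaidML "FP16: No - Training - Mobilenet" result (FPS).
plaidml_training_fps = {
    "GTX 1070 Ti": 104.88,
    "GTX 980 Ti": 89.44,
    "GTX 980": 80.43,
    "GTX 970": 73.01,
}


def relative_to_baseline(results, baseline, higher_is_better=True):
    """Return each result as a speedup factor over the baseline entry.

    For "More Is Better" metrics the factor is value / baseline.
    For "Fewer Is Better" metrics (seconds, ms) it is baseline / value,
    so a factor above 1.0 always means "faster than the baseline".
    """
    base = results[baseline]
    if higher_is_better:
        return {name: value / base for name, value in results.items()}
    return {name: base / value for name, value in results.items()}


# Hypothetical baseline choice: the GTX 970, the slowest card in this test.
speedups = relative_to_baseline(plaidml_training_fps, baseline="GTX 970")
for name, factor in speedups.items():
    print(f"{name}: {factor:.2f}x")
```

For time-based results such as the Blender renders, pass `higher_is_better=False` so that shorter times map to larger factors.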

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

PlaidML (FPS, More Is Better)
GTX 1070 Ti: 387.56 (SE +/- 0.12, N = 3)
GTX 980 Ti: 266.29 (SE +/- 0.11, N = 3)
GTX 980: 228.97 (SE +/- 0.16, N = 3)
GTX 970: 202.94 (SE +/- 0.01, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML (FPS, More Is Better)
GTX 980 Ti: 1030.27 (SE +/- 1.62, N = 3)
GTX 1070 Ti: 875.35 (SE +/- 59.39, N = 12)
GTX 980: 810.31 (SE +/- 0.50, N = 3)
GTX 970: 717.50 (SE +/- 0.32, N = 3)

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

PlaidML (FPS, More Is Better)
GTX 1070 Ti: 1607.15 (SE +/- 2.45, N = 3)
GTX 980 Ti: 1281.97 (SE +/- 0.94, N = 3)
GTX 980: 1053.40 (SE +/- 1.60, N = 3)
GTX 970: 909.50 (SE +/- 1.17, N = 3)

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

PlaidML (FPS, More Is Better)
GTX 980 Ti: 99.51 (SE +/- 0.05, N = 3)
GTX 1070 Ti: 88.37 (SE +/- 6.24, N = 12)
GTX 980: 82.06 (SE +/- 0.14, N = 3)
GTX 970: 73.69 (SE +/- 0.01, N = 3)

GROMACS

Water Benchmark

GROMACS 2020.3 (Ns Per Day, More Is Better)
GTX 980 Ti: 3.881 (SE +/- 0.005, N = 3)
GTX 980: 3.288 (SE +/- 0.007, N = 3)
GTX 970: 3.001 (SE +/- 0.000, N = 3)
GTX 1070 Ti: 0.958 (SE +/- 0.235, N = 9)
(CXX) g++ options: -O3 -lpthread -ldl -lrt -lm

LeelaChessZero

Backend: OpenCL

LeelaChessZero 0.26 (Nodes Per Second, More Is Better)
GTX 980 Ti: 8291 (SE +/- 4.91, N = 3)
GTX 980: 6275 (SE +/- 37.68, N = 3)
GTX 970: 4862 (SE +/- 41.16, N = 3)
GTX 1070 Ti: 4631 (SE +/- 984.05, N = 6)
(CXX) g++ options: -flto -pthread

Rodinia

Test: OpenCL Particle Filter

Rodinia 3.1 (Seconds, Fewer Is Better)
GTX 980 Ti: 10.10 (SE +/- 0.15, N = 4)
GTX 1070 Ti: 11.53 (SE +/- 1.42, N = 12)
GTX 980: 11.73 (SE +/- 0.14, N = 3)
GTX 970: 13.05 (SE +/- 0.10, N = 3)
(CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ArrayFire

Test: Conjugate Gradient OpenCL

ArrayFire 3.7 (ms, Fewer Is Better)
GTX 980 Ti: 3.623 (SE +/- 0.004, N = 3)
GTX 980: 4.883 (SE +/- 0.003, N = 3)
GTX 1070 Ti: 5.115 (SE +/- 0.062, N = 3)
GTX 970: 5.403 (SE +/- 0.004, N = 3)
(CXX) g++ options: -rdynamic

Blender

Blend File: BMW27 - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 149.21 (SE +/- 0.04, N = 3)
GTX 980: 165.20 (SE +/- 0.23, N = 3)
GTX 970: 173.01 (SE +/- 0.07, N = 3)
GTX 1070 Ti: 200.04 (SE +/- 7.36, N = 9)

Blender

Blend File: Classroom - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 354.53 (SE +/- 0.14, N = 3)
GTX 980: 421.77 (SE +/- 0.35, N = 3)
GTX 970: 444.58 (SE +/- 0.05, N = 3)
GTX 1070 Ti: 880.20 (SE +/- 92.48, N = 9)
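
Each result is reported as a mean with a standard error (SE) over N runs, and some entries carry a much larger SE and run count than the rest. One way to spot those entries is the relative standard error, SE divided by the mean. The sketch below applies it to the Blender Classroom CUDA numbers above. The 5% cutoff and the helper names are assumptions chosen for illustration, not a Phoronix Test Suite setting.

```python
# (mean seconds, standard error) copied from the Blender Classroom CUDA result.
classroom_cuda = {
    "GTX 980 Ti": (354.53, 0.14),
    "GTX 980": (421.77, 0.35),
    "GTX 970": (444.58, 0.05),
    "GTX 1070 Ti": (880.20, 92.48),
}

# Assumed cutoff for calling a result "noisy"; pick whatever suits the analysis.
RSE_THRESHOLD_PCT = 5.0


def relative_standard_error(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def flag_noisy(results, threshold_pct=RSE_THRESHOLD_PCT):
    """Return the names whose relative standard error exceeds the threshold."""
    return [
        name
        for name, (mean, se) in results.items()
        if relative_standard_error(mean, se) > threshold_pct
    ]


noisy = flag_noisy(classroom_cuda)
print(noisy)
```

A flagged entry only indicates high run-to-run variance; it does not say what caused it.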

Blender

Blend File: Fishy Cat - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 292.20 (SE +/- 0.21, N = 3)
GTX 980: 328.65 (SE +/- 0.13, N = 3)
GTX 970: 351.06 (SE +/- 0.06, N = 3)
GTX 1070 Ti: 931.55 (SE +/- 107.89, N = 9)

Blender

Blend File: Barbershop - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 1193.54
GTX 980: 1415.17
GTX 970: 1540.95
GTX 1070 Ti: 3430.14
Standard errors are listed for only three of the four results, in this order: SE +/- 0.91, N = 3; SE +/- 0.23, N = 3; SE +/- 0.20, N = 3. The export does not indicate which result lacks one.

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 123.07 (SE +/- 0.03, N = 3)
GTX 980: 151.49 (SE +/- 0.13, N = 3)
GTX 970: 175.44 (SE +/- 0.16, N = 3)
GTX 1070 Ti: 460.10 (SE +/- 40.73, N = 9)

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 342.76 (SE +/- 0.05, N = 3)
GTX 980: 423.50 (SE +/- 0.15, N = 3)
GTX 970: 481.80 (SE +/- 0.10, N = 3)
GTX 1070 Ti: 2435.50 (SE +/- 241.03, N = 6)

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 258.49 (SE +/- 0.09, N = 3)
GTX 980: 319.37 (SE +/- 0.12, N = 3)
GTX 970: 369.54 (SE +/- 0.03, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 780.37 (SE +/- 0.09, N = 3)
GTX 980: 918.07 (SE +/- 0.13, N = 3)
GTX 970: 1012.28 (SE +/- 12.17, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 2.90 (Seconds, Fewer Is Better)
GTX 980 Ti: 690.29 (SE +/- 0.36, N = 3)
GTX 980: 858.55 (SE +/- 0.33, N = 3)
GTX 970: 1031.90 (SE +/- 0.90, N = 3)

NeatBench

Acceleration: GPU

NeatBench 5 (FPS, More Is Better)
GTX 980: 17.2 (SE +/- 0.12, N = 3)
GTX 970: 15.4 (SE +/- 0.00, N = 3)

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
GTX 980 Ti: 115.49 (SE +/- 0.05, N = 3)
GTX 980: 101.83 (SE +/- 0.17, N = 3)
GTX 970: 90.94 (SE +/- 0.02, N = 3)

NAMD CUDA

ATPase Simulation - 327,506 Atoms

NAMD CUDA 2.14 (days/ns, Fewer Is Better)
GTX 980 Ti: 0.28182 (SE +/- 0.00249, N = 15)
GTX 980: 0.32163 (SE +/- 0.00070, N = 3)
GTX 1070 Ti: 0.33292 (SE +/- 0.01554, N = 15)
GTX 970: 0.35160 (SE +/- 0.00210, N = 3)

OctaneBench

Total Score

OctaneBench 4.00c (Score, More Is Better)
GTX 980 Ti: 143.09
GTX 980: 110.59
GTX 970: 96.07
GTX 1070 Ti: 24.87

RedShift Demo

RedShift Demo 3.0 (Seconds, Fewer Is Better)
GTX 980 Ti: 764 (SE +/- 2.40, N = 3)
GTX 980: 1002 (SE +/- 1.76, N = 3)
GTX 970: 1119 (SE +/- 2.19, N = 3)
GTX 1070 Ti: 2530 (SE +/- 371.60, N = 6)

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-06-06 (ms, Fewer Is Better)
GTX 980 Ti: 10.06 (SE +/- 0.01, N = 3)
GTX 980: 12.02 (SE +/- 0.00, N = 3)
GTX 970: 14.20 (SE +/- 0.00, N = 3)
GTX 1070 Ti: 79.12 (SE +/- 0.44, N = 3)
(CXX) g++ options: -O3 -lOpenCL

cl-mem

Benchmark: Copy

cl-mem 2017-01-13 (GB/s, More Is Better)
GTX 980 Ti: 217.5 (SE +/- 0.23, N = 3)
GTX 980: 145.0 (SE +/- 0.07, N = 3)
GTX 970: 125.8 (SE +/- 0.03, N = 3)
GTX 1070 Ti: 107.8 (SE +/- 13.47, N = 12)
(CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

cl-mem 2017-01-13 (GB/s, More Is Better)
GTX 980 Ti: 266.0 (SE +/- 0.15, N = 3)
GTX 1070 Ti: 178.6 (SE +/- 12.40, N = 12)
GTX 980: 165.4 (SE +/- 0.06, N = 3)
GTX 970: 144.6 (SE +/- 0.03, N = 3)
(CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

cl-mem 2017-01-13 (GB/s, More Is Better)
GTX 980 Ti: 244.2
GTX 1070 Ti: 194.4
GTX 980: 157.0
GTX 970: 134.0
Standard errors are listed for only three of the four results, in this order: SE +/- 0.06, N = 3; SE +/- 0.92, N = 3; SE +/- 0.03, N = 3. The export does not indicate which result lacks one.
(CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

clpeak (GIOPS, More Is Better)
GTX 980 Ti: 1615.04 (SE +/- 11.49, N = 3)
GTX 980: 1303.29 (SE +/- 3.33, N = 3)
GTX 970: 1145.39 (SE +/- 14.81, N = 5)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Single-Precision Float

clpeak (GFLOPS, More Is Better)
GTX 980 Ti: 5556.00 (SE +/- 20.25, N = 3)
GTX 980: 4448.81 (SE +/- 67.71, N = 3)
GTX 970: 3877.42 (SE +/- 43.66, N = 15)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

clpeak (GFLOPS, More Is Better)
GTX 980 Ti: 196.28 (SE +/- 0.03, N = 3)
GTX 980: 159.64 (SE +/- 0.02, N = 3)
GTX 970: 137.81 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak (GBPS, More Is Better)
GTX 980 Ti: 264.34 (SE +/- 0.41, N = 3)
GTX 980: 165.23 (SE +/- 0.10, N = 3)
GTX 970: 144.51 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -rdynamic -lOpenCL

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 (Samples/sec, More Is Better)
GTX 980 Ti: 152063380.5 (SE +/- 367945.59, N = 3)
GTX 980: 126002111.2 (SE +/- 172505.04, N = 3)
GTX 970: 112885853.7 (SE +/- 116092.47, N = 3)
(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

ViennaCL

OpenCL LU Factorization

ViennaCL 1.4.2 (GFLOPS, More Is Better)
GTX 980 Ti: 64.93 (SE +/- 0.45, N = 3)
GTX 980: 63.16 (SE +/- 0.21, N = 3)
GTX 970: 58.65 (SE +/- 0.19, N = 3)
GTX 1070 Ti: 40.11 (SE +/- 5.70, N = 15)
(CXX) g++ options: -rdynamic -lOpenCL


Phoronix Test Suite v10.8.4