3950X Wednesday

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and ASUS NVIDIA GeForce GTX 1660 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2009300-PTS-3950XWED66
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Bioinformatics 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C/C++ Compiler Tests 4 Tests
CPU Massive 5 Tests
Database Test Suite 2 Tests
Fortran Tests 2 Tests
HPC - High Performance Computing 8 Tests
Machine Learning 3 Tests
Molecular Dynamics 2 Tests
NVIDIA GPU Compute 4 Tests
Python Tests 2 Tests
Scientific Computing 5 Tests
Server 2 Tests
Single-Threaded 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
1
September 30 2020
  4 Hours, 6 Minutes
2
September 30 2020
  3 Hours, 29 Minutes
3
September 30 2020
  3 Hours, 47 Minutes
Invert Hiding All Results Option
  3 Hours, 47 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


3950X WednesdayProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)NVIDIA TU116 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-48-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160ASUS NVIDIA GeForce GTX 1660 6GB (405/405MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013OpenCL Details- GPU Compute Cores: 1408Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

123Result OverviewPhoronix Test Suite100%102%103%105%106%Hierarchical INTegrationBYTE Unix BenchmarkDolfynApache CouchDBFFTETimed MAFFT AlignmentGROMACSMlpack BenchmarkLeelaChessZeroTimed HMMer SearchKeyDBVkFFTCaffe

3950X Wednesdayvkfft: lczero: BLASlczero: OpenCLdolfyn: Computational Fluid Dynamicsffte: N=256, 3D Complex FFT Routinehmmer: Pfam Database Searchmafft: Multiple Sequence Alignment - LSU RNAbyte: Dhrystone 2couchdb: 100 - 1000 - 24keydb: gromacs: Water Benchmarkcaffe: AlexNet - CPU - 100caffe: AlexNet - CPU - 200caffe: AlexNet - CPU - 1000caffe: GoogleNet - CPU - 100caffe: GoogleNet - CPU - 200caffe: GoogleNet - CPU - 1000caffe: AlexNet - NVIDIA CUDA - 100caffe: AlexNet - NVIDIA CUDA - 200caffe: AlexNet - NVIDIA CUDA - 1000caffe: GoogleNet - NVIDIA CUDA - 100caffe: GoogleNet - NVIDIA CUDA - 200caffe: GoogleNet - NVIDIA CUDA - 1000hint: FLOATmlpack: scikit_icamlpack: scikit_qdamlpack: scikit_svmmlpack: scikit_linearridgeregression12313770376552415.84338706.37691228596.4209.43943619437.198.696618019.571.2455184110342451958512806525799212765202318.134620.6523157.25868.8011728.658778.3382793511.6563154.0766.2718.671.9513760381550215.64638211.98282727495.6709.30945887832.598.925616286.881.2405159410326551517612791625681912807102317.654617.6723139.55831.5111706.958563.1402390670.7088053.2768.6318.561.9913765387545515.34737849.57268538195.9369.25144196437.5101.402618339.771.2285147510339551737712798025723112854232317.774620.9623122.55868.5111716.158666.6406002766.7334953.9766.1918.761.98OpenBenchmarking.org

VkFFT

VkFFT is a Fast Fourier Transform (FFT) Library that is GPU accelerated by means of the Vulkan API. The VkFFT benchmark runs FFT performance differences of many different sizes before returning an overall benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 2020-09-291233K6K9K12K15KSE +/- 18.76, N = 3SE +/- 28.35, N = 3SE +/- 21.13, N = 3137701376013765
OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 2020-09-291232K4K6K8K10KMin: 13738 / Avg: 13770.33 / Max: 13803Min: 13718 / Avg: 13760 / Max: 13814Min: 13727 / Avg: 13765 / Max: 13800

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS12380160240320400SE +/- 3.64, N = 9SE +/- 2.08, N = 3SE +/- 4.26, N = 33763813871. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS12370140210280350Min: 364 / Avg: 376.44 / Max: 395Min: 378 / Avg: 381 / Max: 385Min: 381 / Avg: 386.67 / Max: 3951. (CXX) g++ options: -flto -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCL12312002400360048006000SE +/- 31.17, N = 3SE +/- 48.62, N = 3SE +/- 20.07, N = 35524550254551. (CXX) g++ options: -flto -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCL12310002000300040005000Min: 5492 / Avg: 5523.67 / Max: 5586Min: 5442 / Avg: 5501.67 / Max: 5598Min: 5415 / Avg: 5455 / Max: 54781. (CXX) g++ options: -flto -pthread

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12348121620SE +/- 0.00, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 315.8415.6515.35
OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12348121620Min: 15.83 / Avg: 15.84 / Max: 15.85Min: 15.33 / Avg: 15.65 / Max: 15.81Min: 15.2 / Avg: 15.35 / Max: 15.52

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1238K16K24K32K40KSE +/- 38.19, N = 3SE +/- 30.46, N = 3SE +/- 59.41, N = 338706.3838211.9837849.571. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1237K14K21K28K35KMin: 38630.13 / Avg: 38706.38 / Max: 38748.42Min: 38152.28 / Avg: 38211.98 / Max: 38252.33Min: 37732.9 / Avg: 37849.57 / Max: 37927.351. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100SE +/- 0.27, N = 3SE +/- 0.21, N = 3SE +/- 0.41, N = 396.4295.6795.941. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100Min: 96.03 / Avg: 96.42 / Max: 96.95Min: 95.26 / Avg: 95.67 / Max: 95.94Min: 95.27 / Avg: 95.94 / Max: 96.691. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.037, N = 3SE +/- 0.013, N = 3SE +/- 0.088, N = 39.4399.3099.2511. (CC) gcc options: -std=c99 -O3 -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215Min: 9.38 / Avg: 9.44 / Max: 9.51Min: 9.29 / Avg: 9.31 / Max: 9.33Min: 9.09 / Avg: 9.25 / Max: 9.391. (CC) gcc options: -std=c99 -O3 -lm -lpthread

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 212310M20M30M40M50MSE +/- 473134.27, N = 3SE +/- 632372.12, N = 3SE +/- 492041.08, N = 1243619437.145887832.544196437.5
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 21238M16M24M32M40MMin: 42686984.7 / Avg: 43619437.07 / Max: 44225189.4Min: 44643419.1 / Avg: 45887832.5 / Max: 46705640.7Min: 41350455.7 / Avg: 44196437.51 / Max: 46720919.2

Apache CouchDB

This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.1.1Bulk Size: 100 - Inserts: 1000 - Rounds: 2412320406080100SE +/- 1.13, N = 3SE +/- 0.54, N = 3SE +/- 1.32, N = 398.7098.93101.401. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD
OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.1.1Bulk Size: 100 - Inserts: 1000 - Rounds: 2412320406080100Min: 96.46 / Avg: 98.7 / Max: 100.12Min: 98.26 / Avg: 98.93 / Max: 99.99Min: 99.64 / Avg: 101.4 / Max: 103.971. (CXX) g++ options: -std=c++14 -lmozjs-68 -lm -lerl_interface -lei -fPIC -MMD

KeyDB

A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123130K260K390K520K650KSE +/- 1841.98, N = 3SE +/- 575.49, N = 3SE +/- 1123.47, N = 3618019.57616286.88618339.771. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123110K220K330K440K550KMin: 616096.35 / Avg: 618019.57 / Max: 621702.31Min: 615656.04 / Avg: 616286.88 / Max: 617436.02Min: 616392.33 / Avg: 618339.77 / Max: 620284.151. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.28010.56020.84031.12041.4005SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 31.2451.2401.2281. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark123246810Min: 1.24 / Avg: 1.24 / Max: 1.25Min: 1.24 / Avg: 1.24 / Max: 1.24Min: 1.23 / Avg: 1.23 / Max: 1.231. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 10012311K22K33K44K55KSE +/- 258.97, N = 3SE +/- 59.94, N = 3SE +/- 176.24, N = 35184151594514751. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001239K18K27K36K45KMin: 51396 / Avg: 51841 / Max: 52293Min: 51501 / Avg: 51594 / Max: 51706Min: 51290 / Avg: 51474.67 / Max: 518271. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012320K40K60K80K100KSE +/- 55.01, N = 3SE +/- 321.72, N = 3SE +/- 364.50, N = 31034241032651033951. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012320K40K60K80K100KMin: 103339 / Avg: 103424 / Max: 103527Min: 102757 / Avg: 103265 / Max: 103861Min: 103030 / Avg: 103395 / Max: 1041241. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1000123110K220K330K440K550KSE +/- 1748.88, N = 3SE +/- 606.69, N = 3SE +/- 549.28, N = 35195855151765173771. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 100012390K180K270K360K450KMin: 517257 / Avg: 519585.33 / Max: 523010Min: 514371 / Avg: 515176.33 / Max: 516365Min: 516300 / Avg: 517376.67 / Max: 5181041. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012330K60K90K120K150KSE +/- 388.76, N = 3SE +/- 226.90, N = 3SE +/- 213.26, N = 31280651279161279801. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KMin: 127295 / Avg: 128064.67 / Max: 128545Min: 127523 / Avg: 127915.67 / Max: 128309Min: 127634 / Avg: 127980 / Max: 1283691. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012360K120K180K240K300KSE +/- 241.89, N = 3SE +/- 618.83, N = 3SE +/- 601.46, N = 32579922568192572311. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KMin: 257511 / Avg: 257992.33 / Max: 258275Min: 255632 / Avg: 256819 / Max: 257716Min: 256055 / Avg: 257230.67 / Max: 2580391. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 1000123300K600K900K1200K1500KSE +/- 1280.00, N = 3SE +/- 3059.50, N = 3SE +/- 1395.78, N = 31276520128071012854231. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 1000123200K400K600K800K1000KMin: 1275240 / Avg: 1276520 / Max: 1279080Min: 1276670 / Avg: 1280710 / Max: 1286710Min: 1283450 / Avg: 1285423.33 / Max: 12881201. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1001235001000150020002500SE +/- 4.07, N = 3SE +/- 2.67, N = 3SE +/- 3.57, N = 32318.132317.652317.771. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100123400800120016002000Min: 2311.59 / Avg: 2318.13 / Max: 2325.6Min: 2314.43 / Avg: 2317.65 / Max: 2322.95Min: 2311.22 / Avg: 2317.77 / Max: 2323.51. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 20012310002000300040005000SE +/- 4.37, N = 3SE +/- 3.68, N = 3SE +/- 2.36, N = 34620.654617.674620.961. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 2001238001600240032004000Min: 4612.96 / Avg: 4620.65 / Max: 4628.09Min: 4613.14 / Avg: 4617.67 / Max: 4624.97Min: 4617.16 / Avg: 4620.96 / Max: 4625.271. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 10001235K10K15K20K25KSE +/- 19.01, N = 3SE +/- 24.02, N = 3SE +/- 41.98, N = 323157.223139.523122.51. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 10001234K8K12K16K20KMin: 23121.4 / Avg: 23157.17 / Max: 23186.2Min: 23091.5 / Avg: 23139.5 / Max: 23165.2Min: 23046.2 / Avg: 23122.53 / Max: 231911. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 10012313002600390052006500SE +/- 3.35, N = 3SE +/- 3.21, N = 3SE +/- 4.23, N = 35868.805831.515868.511. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 10012310002000300040005000Min: 5863.24 / Avg: 5868.8 / Max: 5874.83Min: 5826.88 / Avg: 5831.51 / Max: 5837.67Min: 5861.63 / Avg: 5868.51 / Max: 5876.211. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 2001233K6K9K12K15KSE +/- 13.69, N = 3SE +/- 6.42, N = 3SE +/- 4.03, N = 311728.611706.911716.11. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 2001232K4K6K8K10KMin: 11707.2 / Avg: 11728.6 / Max: 11754.1Min: 11695 / Avg: 11706.93 / Max: 11717Min: 11709.6 / Avg: 11716.13 / Max: 11723.51. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100012313K26K39K52K65KSE +/- 36.83, N = 3SE +/- 117.09, N = 3SE +/- 67.70, N = 358778.358563.158666.61. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100012310K20K30K40K50KMin: 58705 / Avg: 58778.27 / Max: 58821.5Min: 58334.6 / Avg: 58563.13 / Max: 58721.7Min: 58589.1 / Avg: 58666.6 / Max: 58801.51. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT12390M180M270M360M450MSE +/- 644301.63, N = 3SE +/- 3712843.15, N = 3SE +/- 3487903.54, N = 3382793511.66402390670.71406002766.731. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT12370M140M210M280M350MMin: 382028368.86 / Avg: 382793511.66 / Max: 384074020.21Min: 398355906.63 / Avg: 402390670.71 / Max: 409806767.59Min: 399050852.07 / Avg: 406002766.73 / Max: 409978299.091. (CC) gcc options: -O3 -march=native -lm

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_ica1231224364860SE +/- 0.19, N = 3SE +/- 0.52, N = 3SE +/- 0.50, N = 354.0753.2753.97
OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_ica1231122334455Min: 53.7 / Avg: 54.07 / Max: 54.35Min: 52.31 / Avg: 53.27 / Max: 54.09Min: 53.14 / Avg: 53.97 / Max: 54.88

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qda1231530456075SE +/- 0.67, N = 3SE +/- 0.41, N = 3SE +/- 0.81, N = 366.2768.6366.19
OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qda1231326395265Min: 64.93 / Avg: 66.27 / Max: 67.02Min: 68.22 / Avg: 68.63 / Max: 69.44Min: 64.73 / Avg: 66.19 / Max: 67.53

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svm123510152025SE +/- 0.19, N = 3SE +/- 0.24, N = 4SE +/- 0.29, N = 318.6718.5618.76
OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svm123510152025Min: 18.47 / Avg: 18.67 / Max: 19.05Min: 18.02 / Avg: 18.56 / Max: 19.18Min: 18.19 / Avg: 18.76 / Max: 19.05

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregression1230.44780.89561.34341.79122.239SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 31.951.991.98
OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregression123246810Min: 1.92 / Avg: 1.95 / Max: 1.97Min: 1.92 / Avg: 1.99 / Max: 2.03Min: 1.95 / Avg: 1.98 / Max: 2.01