AMD Ryzen 9 3950X Ubuntu Linux

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2009247-FI-AMDRYZEN924
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
CPU Massive 4 Tests
Creator Workloads 4 Tests
HPC - High Performance Computing 5 Tests
Imaging 2 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 3 Tests
Multi-Core 3 Tests
NVIDIA GPU Compute 16 Tests
OpenCL 5 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
Run 1
September 23 2020
  3 Hours, 47 Minutes
Run 2
September 24 2020
  3 Hours, 43 Minutes
Run 3
September 24 2020
  3 Hours, 44 Minutes
Invert Hiding All Results Option
  3 Hours, 45 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD Ryzen 9 3950X Ubuntu Linux - Phoronix Test Suite

AMD Ryzen 9 3950X Ubuntu Linux

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce RTX 2080 Ti 11GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2009247-FI-AMDRYZEN924&grs&rdt&rro.

AMD Ryzen 9 3950X Ubuntu LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRun 1Run 2Run 3AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-47-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.228 + OpenCL 2.0 AMD-APP (3182.0)1.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013OpenCL Details- GPU Compute Cores: 4352Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD Ryzen 9 3950X Ubuntu Linuxopencv: Object Detectionperf-bench: Syscall Basicosbench: Create Filesviennacl: OpenCL LU Factorizationperf-bench: Epoll Waitosbench: Create Processeswebp: Quality 100, Highest Compressionperf-bench: Memset 1MBmixbench: NVIDIA CUDA - Integerwebp: Quality 100, Losslessperf-bench: Futex Lock-Piwebp: Defaultinfluxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000perf-bench: Memcpy 1MBosbench: Memory Allocationswebp: Quality 100, Lossless, Highest Compressionosbench: Launch Programsmandelgpu: GPUperf-bench: Sched Pipeespeak: Text-To-Speech Synthesismpv: Big Buck Bunny Sunflower 1080p - Software Onlylibraw: Post-Processing Benchmarkhashcat: SHA1arrayfire: Conjugate Gradient OpenCLoctanebench: Total Scoreclpeak: Double-Precision Doublehashcat: SHA-512hashcat: MD5hashcat: TrueCrypt RIPEMD160 + XTSmixbench: NVIDIA CUDA - Half Precisionclpeak: Integer Compute INTwebp: Quality 100rodinia: OpenCL Particle Filterplaidml: No - Inference - IMDB LSTM - OpenCLmpv: Big Buck Bunny Sunflower 4K - Software Onlyclpeak: Single-Precision Floatfahbench: plaidml: No - Inference - DenseNet 201 - OpenCLblender: Pabellon Barcelona - NVIDIA OptiXplaidml: No - Inference - Mobilenet - OpenCLcl-mem: Writeplaidml: Yes - Inference - Mobilenet - OpenCLblender: BMW27 - NVIDIA OptiXhashcat: 7-Zipblender: Classroom - CUDAperf-bench: Futex Hashblender: Barbershop - NVIDIA OptiXredshift: blender: Classroom - NVIDIA OptiXlczero: OpenCLnamd-cuda: ATPase Simulation - 327,506 Atomscl-mem: Copyblender: Fishy Cat - CUDAblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - CUDAfinancebench: Black-Scholes OpenCLplaidml: No - Training - Mobilenet - OpenCLblender: BMW27 - CUDAcl-mem: Readclpeak: Global Memory Bandwidthblender: Barbershop - CUDAopencv: Features 2Dosbench: Create Threadsmixbench: NVIDIA CUDA - Single Precisionmixbench: NVIDIA CUDA - Double PrecisionRun 1Run 2Run 3373912157002011.18751179.40303301228.9471946.71873.23964114651.0115.5054511.4561503297.61349119.41534587.915.34063065.83635032.33737.233829450731366.039238826.7791236.9935.30179628333331.676308.84294522.5424697000005655400000065096732630.5213318.752.2254.485749.41372.2913379.77287.4210213.43103.932409.86447.72750.1820.16880967151.945042694896.8324773.37116930.17955325.373.1533.10292.146.030187.7440.73545.4507.56538.7214967613.88820014126.10440.80364732290005810.59133275.70653415928.0531256.81173.36974614263.8215.8764541.4241530852.51372772.21562422.115.36276364.73024732.87937.083626455807158.039706326.6051249.0034.98178227000001.671310.026199521.1324518666675613713333364636732596.2713258.652.2124.459751.39374.3913387.08288.8771212.51104.282421.41449.82763.0520.25877133152.475021169893.0324873.33116730.17951325.273.2733.05292.456.037187.6340.77545.4507.87538.6014783813.91299613791.73419.44348952273544510.79124876.82593404928.1866396.91271.41835414510.1115.4664431.4261534746.91376801.61563580.115.09465965.16472532.84737.697156448875915.039153626.4911242.6835.05178121000001.662307.443811518.2124504666675614560000064660032409.0813234.842.2134.466747.07372.4813313.50288.9235212.36104.432414.96447.72756.9220.24879667152.605028714893.6424873.58116570.18004324.573.1733.05292.506.033187.8440.73544.9507.91538.8114820614.06844515130.47428.03OpenBenchmarking.org

OpenCV

Test: Object Detection

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Object DetectionRun 3Run 2Run 18K16K24K32K40KSE +/- 588.74, N = 3SE +/- 484.41, N = 3SE +/- 382.04, N = 33489536473373911. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

perf-bench

Benchmark: Syscall Basic

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall BasicRun 3Run 2Run 15M10M15M20M25MSE +/- 164860.89, N = 3SE +/- 98896.75, N = 3SE +/- 81316.08, N = 32273544522900058215700201. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Create Files

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create FilesRun 3Run 2Run 13691215SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 310.7910.5911.191. (CC) gcc options: -lm

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationRun 3Run 2Run 120406080100SE +/- 1.21, N = 3SE +/- 0.98, N = 3SE +/- 1.01, N = 376.8375.7179.401. (CXX) g++ options: -rdynamic -lOpenCL

perf-bench

Benchmark: Epoll Wait

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll WaitRun 3Run 2Run 17K14K21K28K35KSE +/- 438.57, N = 4SE +/- 308.84, N = 3SE +/- 204.50, N = 33404934159330121. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Create Processes

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ProcessesRun 3Run 2Run 1714212835SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 0.34, N = 328.1928.0528.951. (CC) gcc options: -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionRun 3Run 2Run 1246810SE +/- 0.091, N = 3SE +/- 0.071, N = 3SE +/- 0.032, N = 36.9126.8116.7181. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

perf-bench

Benchmark: Memset 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBRun 3Run 2Run 11632486480SE +/- 1.13, N = 3SE +/- 1.08, N = 4SE +/- 0.94, N = 471.4273.3773.241. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Mixbench

Backend: NVIDIA CUDA - Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerRun 3Run 2Run 13K6K9K12K15KSE +/- 28.60, N = 3SE +/- 207.34, N = 15SE +/- 20.03, N = 314510.1114263.8214651.011. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessRun 3Run 2Run 148121620SE +/- 0.23, N = 3SE +/- 0.11, N = 3SE +/- 0.19, N = 515.4715.8815.511. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

perf-bench

Benchmark: Futex Lock-Pi

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-PiRun 3Run 2Run 1100200300400500SE +/- 5.86, N = 3SE +/- 3.89, N = 15SE +/- 2.08, N = 34434544511. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultRun 3Run 2Run 10.32760.65520.98281.31041.638SE +/- 0.019, N = 3SE +/- 0.024, N = 3SE +/- 0.024, N = 31.4261.4241.4561. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

InfluxDB

Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2016.10, N = 3SE +/- 1988.25, N = 3SE +/- 691.55, N = 31534746.91530852.51503297.6

InfluxDB

Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 2246.17, N = 3SE +/- 2002.03, N = 3SE +/- 2562.57, N = 31376801.61372772.21349119.4

InfluxDB

Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000Run 3Run 2Run 1300K600K900K1200K1500KSE +/- 929.80, N = 3SE +/- 2819.29, N = 3SE +/- 2777.14, N = 31563580.11562422.11534587.9

perf-bench

Benchmark: Memcpy 1MB

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBRun 3Run 2Run 148121620SE +/- 0.18, N = 5SE +/- 0.10, N = 3SE +/- 0.20, N = 315.0915.3615.341. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

OSBench

Test: Memory Allocations

OpenBenchmarking.orgNs Per Event, Fewer Is BetterOSBenchTest: Memory AllocationsRun 3Run 2Run 11530456075SE +/- 0.43, N = 3SE +/- 0.11, N = 3SE +/- 0.84, N = 365.1664.7365.841. (CC) gcc options: -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionRun 3Run 2Run 1816243240SE +/- 0.16, N = 3SE +/- 0.30, N = 3SE +/- 0.39, N = 332.8532.8832.341. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OSBench

Test: Launch Programs

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Launch ProgramsRun 3Run 2Run 1918273645SE +/- 0.19, N = 3SE +/- 0.25, N = 3SE +/- 0.28, N = 337.7037.0837.231. (CC) gcc options: -lm

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURun 3Run 2Run 1100M200M300M400M500MSE +/- 1749343.37, N = 3SE +/- 5831583.77, N = 3SE +/- 6814468.33, N = 3448875915.0455807158.0450731366.01. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

perf-bench

Benchmark: Sched Pipe

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched PipeRun 3Run 2Run 190K180K270K360K450KSE +/- 3250.27, N = 3SE +/- 4833.02, N = 3SE +/- 3949.47, N = 33915363970633923881. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisRun 3Run 2Run 1612182430SE +/- 0.26, N = 4SE +/- 0.09, N = 4SE +/- 0.08, N = 426.4926.6126.781. (CC) gcc options: -O2 -std=c99

MPV

Video Input: Big Buck Bunny Sunflower 1080p - Decode: Software Only

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 1080p - Decode: Software OnlyRun 3Run 2Run 130060090012001500SE +/- 0.55, N = 3SE +/- 3.00, N = 3SE +/- 2.06, N = 31242.681249.001236.99MIN: 824.09 / MAX: 1672.6MIN: 823.81 / MAX: 1678.43MIN: 818.67 / MAX: 1669.11. mpv 0.32.0

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkRun 3Run 2Run 1816243240SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.02, N = 335.0534.9835.301. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1Run 3Run 2Run 14000M8000M12000M16000M20000MSE +/- 18200000.00, N = 3SE +/- 28850361.06, N = 3SE +/- 28555054.04, N = 3178121000001782270000017962833333

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRun 3Run 2Run 10.37710.75421.13131.50841.8855SE +/- 0.006, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 31.6621.6711.6761. (CXX) g++ options: -rdynamic

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreRun 3Run 2Run 170140210280350307.44310.03308.84

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleRun 3Run 2Run 1110220330440550SE +/- 1.44, N = 3SE +/- 0.32, N = 3SE +/- 1.65, N = 3518.21521.13522.541. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-512Run 3Run 2Run 1500M1000M1500M2000M2500MSE +/- 2796624.95, N = 3SE +/- 3868390.42, N = 3SE +/- 3523256.07, N = 3245046666724518666672469700000

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5Run 3Run 2Run 112000M24000M36000M48000M60000MSE +/- 62943175.43, N = 3SE +/- 48887501.24, N = 3SE +/- 78954438.34, N = 3561456000005613713333356554000000

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTSRun 3Run 2Run 1140K280K420K560K700KSE +/- 346.41, N = 3SE +/- 433.33, N = 3SE +/- 648.93, N = 3646600646367650967

Mixbench

Backend: NVIDIA CUDA - Benchmark: Half Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionRun 3Run 2Run 17K14K21K28K35KSE +/- 29.77, N = 3SE +/- 18.28, N = 3SE +/- 6.67, N = 332409.0832596.2732630.521. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTRun 3Run 2Run 13K6K9K12K15KSE +/- 159.10, N = 6SE +/- 137.42, N = 15SE +/- 159.82, N = 1513234.8413258.6513318.751. (CXX) g++ options: -O3 -rdynamic -lOpenCL

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Run 3Run 2Run 10.50061.00121.50182.00242.503SE +/- 0.037, N = 3SE +/- 0.026, N = 3SE +/- 0.026, N = 32.2132.2122.2251. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRun 3Run 2Run 11.00912.01823.02734.03645.0455SE +/- 0.017, N = 3SE +/- 0.016, N = 3SE +/- 0.027, N = 34.4664.4594.4851. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

PlaidML

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCLRun 3Run 2Run 1160320480640800SE +/- 1.84, N = 3SE +/- 3.71, N = 3SE +/- 2.47, N = 3747.07751.39749.41

MPV

Video Input: Big Buck Bunny Sunflower 4K - Decode: Software Only

OpenBenchmarking.orgFPS, More Is BetterMPVVideo Input: Big Buck Bunny Sunflower 4K - Decode: Software OnlyRun 3Run 2Run 180160240320400SE +/- 1.32, N = 3SE +/- 0.92, N = 3SE +/- 0.73, N = 3372.48374.39372.29MIN: 286.84 / MAX: 432.21MIN: 288.04 / MAX: 445.07MIN: 286.67 / MAX: 441.691. mpv 0.32.0

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatRun 3Run 2Run 13K6K9K12K15KSE +/- 178.75, N = 15SE +/- 184.15, N = 15SE +/- 169.22, N = 1513313.5013387.0813379.771. (CXX) g++ options: -O3 -rdynamic -lOpenCL

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2Run 3Run 2Run 160120180240300SE +/- 0.37, N = 3SE +/- 0.56, N = 3SE +/- 0.87, N = 3288.92288.88287.42

PlaidML

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCLRun 3Run 2Run 150100150200250SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3212.36212.51213.43

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRun 3Run 2Run 120406080100SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 3104.43104.28103.93

PlaidML

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 15001000150020002500SE +/- 3.60, N = 3SE +/- 2.19, N = 3SE +/- 3.69, N = 32414.962421.412409.86

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRun 3Run 2Run 1100200300400500SE +/- 1.25, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3447.7449.8447.71. (CC) gcc options: -O2 -flto -lOpenCL

PlaidML

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 16001200180024003000SE +/- 0.87, N = 3SE +/- 2.19, N = 3SE +/- 2.56, N = 32756.922763.052750.18

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiXRun 3Run 2Run 1510152025SE +/- 0.26, N = 3SE +/- 0.30, N = 3SE +/- 0.20, N = 320.2420.2520.16

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-ZipRun 3Run 2Run 1200K400K600K800K1000KSE +/- 1848.72, N = 3SE +/- 1278.45, N = 3SE +/- 1637.41, N = 3879667877133880967

Blender

Blend File: Classroom - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CUDARun 3Run 2Run 1306090120150SE +/- 0.39, N = 3SE +/- 0.24, N = 3SE +/- 0.01, N = 3152.60152.47151.94

perf-bench

Benchmark: Futex Hash

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex HashRun 3Run 2Run 11.1M2.2M3.3M4.4M5.5MSE +/- 6643.91, N = 3SE +/- 7280.95, N = 3SE +/- 7897.00, N = 35028714502116950426941. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lz -llzma -lnuma

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiXRun 3Run 2Run 12004006008001000SE +/- 1.68, N = 3SE +/- 0.32, N = 3SE +/- 0.44, N = 3893.64893.03896.83

RedShift Demo

OpenBenchmarking.orgSeconds, Fewer Is BetterRedShift Demo 3.0Run 3Run 2Run 150100150200250SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3248248247

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiXRun 3Run 2Run 11632486480SE +/- 0.18, N = 3SE +/- 0.28, N = 3SE +/- 0.25, N = 373.5873.3373.37

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: OpenCLRun 3Run 2Run 13K6K9K12K15KSE +/- 41.53, N = 3SE +/- 69.91, N = 3SE +/- 82.15, N = 31165711673116931. (CXX) g++ options: -flto -pthread

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRun 3Run 2Run 10.04050.0810.12150.1620.2025SE +/- 0.00039, N = 3SE +/- 0.00020, N = 3SE +/- 0.00010, N = 30.180040.179510.17955

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRun 3Run 2Run 170140210280350SE +/- 0.50, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 3324.5325.2325.31. (CC) gcc options: -O2 -flto -lOpenCL

Blender

Blend File: Fishy Cat - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CUDARun 3Run 2Run 11632486480SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 373.1773.2773.15

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiXRun 3Run 2Run 1816243240SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 333.0533.0533.10

Blender

Blend File: Pabellon Barcelona - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CUDARun 3Run 2Run 160120180240300SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3292.50292.45292.14

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLRun 3Run 2Run 1246810SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 36.0336.0376.0301. (CXX) g++ options: -O3 -lOpenCL

PlaidML

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Training - Network: Mobilenet - Device: OpenCLRun 3Run 2Run 14080120160200SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3187.84187.63187.74

Blender

Blend File: BMW27 - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CUDARun 3Run 2Run 1918273645SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 340.7340.7740.73

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRun 3Run 2Run 1120240360480600SE +/- 1.08, N = 3SE +/- 0.27, N = 3SE +/- 0.32, N = 3544.9545.4545.41. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthRun 3Run 2Run 1110220330440550SE +/- 0.44, N = 3SE +/- 0.76, N = 3SE +/- 0.68, N = 3507.91507.87507.561. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Blender

Blend File: Barbershop - Compute: CUDA

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CUDARun 3Run 2Run 1120240360480600SE +/- 0.06, N = 3SE +/- 0.40, N = 3SE +/- 0.16, N = 3538.81538.60538.72

OpenCV

Test: Features 2D

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.4Test: Features 2DRun 3Run 2Run 130K60K90K120K150KSE +/- 3470.33, N = 12SE +/- 2777.13, N = 9SE +/- 1997.08, N = 121482061478381496761. (CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OSBench

Test: Create Threads

OpenBenchmarking.orgus Per Event, Fewer Is BetterOSBenchTest: Create ThreadsRun 3Run 2Run 148121620SE +/- 0.30, N = 15SE +/- 0.29, N = 15SE +/- 0.26, N = 1514.0713.9113.891. (CC) gcc options: -lm

Mixbench

Backend: NVIDIA CUDA - Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionRun 3Run 2Run 13K6K9K12K15KSE +/- 510.15, N = 15SE +/- 646.88, N = 15SE +/- 646.92, N = 1515130.4713791.7314126.101. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Backend: NVIDIA CUDA - Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionRun 3Run 2Run 1100200300400500SE +/- 4.74, N = 15SE +/- 7.42, N = 15SE +/- 0.03, N = 3428.03419.44440.801. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2


Phoronix Test Suite v10.8.4