g-64cpu-236mem-8v100-24ssd

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102154-HA-G64CPU23620
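The comparison command above can also be assembled from a script. The sketch below only builds and prints the command line; it assumes `phoronix-test-suite` is installed and on the PATH, and the helper name `build_compare_command` is hypothetical, not part of the Phoronix Test Suite.

```python
import shlex

# Result identifier quoted on this page.
RESULT_ID = "2102154-HA-G64CPU23620"


def build_compare_command(result_id: str) -> list:
    """Return the argument list for benchmarking against a public result file."""
    return ["phoronix-test-suite", "benchmark", result_id]


if __name__ == "__main__":
    # Print the shell-quoted command. To actually launch it, pass the list
    # to subprocess.run(...); the full run on this system took about 17 hours.
    print(shlex.join(build_compare_command(RESULT_ID)))
```

Keeping the command as a list avoids shell quoting problems if the identifier ever comes from user input.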

Result Identifier: g-64cpu-236mem-8v100-24ssd
Date Run: February 12 2021
Test Duration: 16 Hours, 59 Minutes

G-64cpu-236mem-8v100-24ssd Benchmarks

Processor: Intel Xeon (32 Cores / 64 Threads)
Motherboard: Google Compute Engine n1-standard-64
Memory: 236GB
Disk: 24 x 403GB nvme_card + 137GB PersistentDisk
Graphics: Tesla V100-SXM2-16GB
OS: Ubuntu 20.04
Kernel: 5.4.0-1036-gcp (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.9
Display Driver: NVIDIA
OpenCL: OpenCL 1.2 CUDA 11.2.109
Vulkan: 1.2.155
Compiler: GCC 9.3.0 + CUDA 10.1
File-System: ext4
System Layer: KVM

System Logs
- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- NONE / relatime,rw,stripe=3072 / raid0 nvme0n24[23] nvme0n23[22] nvme0n22[21] nvme0n21[20] nvme0n20[19] nvme0n19[18] nvme0n18[17] nvme0n17[16] nvme0n16[15] nvme0n15[14] nvme0n14[13] nvme0n13[12] nvme0n12[11] nvme0n11[10] nvme0n10[9] nvme0n9[8] nvme0n8[7] nvme0n7[6] nvme0n6[5] nvme0n5[4] nvme0n4[3] nvme0n3[2] nvme0n2[1] nvme0n1[0] Block Size: 4096
- CPU Microcode: 0x1
- Python 3.8.5
- itlb_multihit: Not affected + l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
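The disk entry in the system logs (raid0 over 24 NVMe devices, `stripe=3072`, block size 4096) implies a particular stripe geometry. The sketch below works it out under two assumptions that the page itself does not state: that the ext4 `stripe=` mount option counts filesystem blocks, and that the full stripe is split evenly across the 24 array members.

```python
# Values taken from the system logs on this page.
FS_BLOCK_BYTES = 4096   # "Block Size: 4096"
STRIPE_BLOCKS = 3072    # "stripe=3072" mount option
RAID_MEMBERS = 24       # nvme0n1 .. nvme0n24 in the raid0 array
MEMBER_SIZE_GB = 403    # "24 x 403GB nvme_card"


def stripe_geometry(stripe_blocks: int, block_bytes: int, members: int):
    """Return (full stripe size, per-member share) in bytes.

    Assumes stripe_blocks is expressed in filesystem blocks and that the
    stripe is divided evenly across all members.
    """
    full_stripe_bytes = stripe_blocks * block_bytes
    per_member_bytes = full_stripe_bytes // members
    return full_stripe_bytes, per_member_bytes


full_stripe, per_member = stripe_geometry(STRIPE_BLOCKS, FS_BLOCK_BYTES, RAID_MEMBERS)
print(f"full stripe: {full_stripe // 2**20} MiB")
print(f"per member:  {per_member // 2**10} KiB")
print(f"raw NVMe capacity: {RAID_MEMBERS * MEMBER_SIZE_GB} GB")
```

Under those assumptions the full stripe is 12 MiB and each device's share is 512 KiB, which is useful context when reading the 2MB and 4KB block-size results further down.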

Result overview: g-64cpu-236mem-8v100-24ssd

ior: 1024MB - Default Test Directory | 1011.01
ior: 512MB - Default Test Directory | 695.04
ior: 256MB - Default Test Directory | 697.07
sqlite: 128 | 1187.721
sqlite: 64 | 1017.768
sqlite: 32 | 873.515
dbench: 1 Clients | 38.1117
dbench: 12 Clients | 193.816
sqlite: 8 | 676.155
tinymembench: Standard Memset | 10175.3
tinymembench: Standard Memcpy | 4572.6
ior: 64MB - Default Test Directory | 705.84
sqlite: 1 | 252.091
fs-mark: 5000 Files, 1MB Size, 4 Threads | 88.8
build-linux-kernel: Time To Compile | 41.010
asmfish: 1024 Hash Memory, 26 Depth | 63431219
ior: 32MB - Default Test Directory | 674.73
povray: Trace Time | 27.676
blender: Barbershop - CPU-Only | 435.16
fahbench: | 246.3405
cachebench: Write Cache | 21095.608949
cachebench: Read Cache | 2530.006435
ncnn: Vulkan GPU - regnety_400m | 71.05
ncnn: Vulkan GPU - squeezenet_ssd | 27.30
ncnn: Vulkan GPU - yolov4-tiny | 35.53
ncnn: Vulkan GPU - resnet50 | 28.37
ncnn: Vulkan GPU - alexnet | 9.74
ncnn: Vulkan GPU - resnet18 | 14.72
ncnn: Vulkan GPU - vgg16 | 40.01
ncnn: Vulkan GPU - googlenet | 23.44
ncnn: Vulkan GPU - blazeface | 5.16
ncnn: Vulkan GPU - efficientnet-b0 | 13.02
ncnn: Vulkan GPU - mnasnet | 9.95
ncnn: Vulkan GPU - shufflenet-v2 | 10.43
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 9.82
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 10.83
ncnn: Vulkan GPU - mobilenet | 25.38
ior: 16MB - Default Test Directory | 530.98
ramspeed: Average - Floating Point | 17675.00
ramspeed: Triad - Floating Point | 22284.60
ramspeed: Scale - Floating Point | 16118.61
ramspeed: Copy - Floating Point | 15098.94
ramspeed: Add - Floating Point | 17619.41
ramspeed: Copy - Integer | 17240.62
ramspeed: Scale - Integer | 23625.23
ramspeed: Triad - Integer | 22497.87
ramspeed: Average - Integer | 21168.16
ramspeed: Add - Integer | 21256.13
stockfish: Total Time | 60103975
plaidml: No - Inference - DenseNet 201 - OpenCL | 245.33
kvazaar: Bosphorus 4K - Slow | 8.87
postmark: Disk Transaction Performance | 3731
compress-7zip: Compress Speed Test | 122335
kvazaar: Bosphorus 4K - Medium | 9.07
ior: 8MB - Default Test Directory | 434.39
fs-mark: 4000 Files, 32 Sub Dirs, 1MB Size | 62.6
hashcat: MD5 | 252486693750
hashcat: SHA1 | 80142256250
namd: ATPase Simulation - 327,506 Atoms | 0.75360
ior: 4MB - Default Test Directory | 338.95
x265: Bosphorus 4K | 18.97
fio: Seq Read - Linux AIO - No - Yes - 4KB - Default Test Directory (IOPS) | 138333
fio: Seq Read - Linux AIO - No - Yes - 4KB - Default Test Directory (MB/s) | 540
fio: Rand Read - Linux AIO - No - Yes - 4KB - Default Test Directory (IOPS) | 170667
fio: Rand Read - Linux AIO - No - Yes - 4KB - Default Test Directory (MB/s) | 666
fio: Rand Read - Linux AIO - No - Yes - 2MB - Default Test Directory (IOPS) | 2342
fio: Rand Read - Linux AIO - No - Yes - 2MB - Default Test Directory (MB/s) | 4691
fio: Seq Read - Linux AIO - No - Yes - 2MB - Default Test Directory (IOPS) | 2386
fio: Seq Read - Linux AIO - No - Yes - 2MB - Default Test Directory (MB/s) | 4780
plaidml: No - Inference - IMDB LSTM - OpenCL | 687.80
fio: Seq Write - Linux AIO - No - Yes - 2MB - Default Test Directory (IOPS) | 2123
fio: Seq Write - Linux AIO - No - Yes - 2MB - Default Test Directory (MB/s) | 4254
fio: Rand Write - Linux AIO - No - Yes - 2MB - Default Test Directory (IOPS) | 2105
fio: Rand Write - Linux AIO - No - Yes - 2MB - Default Test Directory (MB/s) | 4218
fio: Seq Write - Linux AIO - No - Yes - 4KB - Default Test Directory (IOPS) | 223333
fio: Seq Write - Linux AIO - No - Yes - 4KB - Default Test Directory (MB/s) | 872
fio: Rand Write - Linux AIO - No - Yes - 4KB - Default Test Directory (IOPS) | 181000
fio: Rand Write - Linux AIO - No - Yes - 4KB - Default Test Directory (MB/s) | 707

Remaining overview entries, in order, followed by their values as one unseparated run:

t-test1: 1
kvazaar: Bosphorus 4K - Very Fast
mbw: Memory Copy, Fixed Block Size - 1024 MiB
ior: 2MB - Default Test Directory
clpeak: Integer Compute INT
plaidml: Yes - Inference - Mobilenet - OpenCL
clpeak: Single-Precision Float
hashcat: SHA-512
plaidml: No - Inference - Mobilenet - OpenCL
kvazaar: Bosphorus 1080p - Slow
hashcat: TrueCrypt RIPEMD160 + XTS
kvazaar: Bosphorus 1080p - Medium
mbw: Memory Copy - 1024 MiB
viennacl: OpenCL LU Factorization
openssl: RSA 4096-bit Performance
hashcat: 7-Zip
rodinia: OpenCL Particle Filter
kvazaar: Bosphorus 4K - Ultra Fast
stream: Copy
fs-mark: 1000 Files, 1MB Size
rodinia: OpenMP CFD Solver
clpeak: Double-Precision Double
rodinia: OpenMP LavaMD
x265: Bosphorus 1080p
t-test1: 2
sysbench: CPU
kvazaar: Bosphorus 1080p - Very Fast
arrayfire: Conjugate Gradient OpenCL
ctx-clock: Context Switch Time
kvazaar: Bosphorus 1080p - Ultra Fast
clpeak: Global Memory Bandwidth
x264: H.264 Video Encoding
cl-mem: Copy
cl-mem: Write
cl-mem: Read
financebench: Black-Scholes OpenCL
fs-mark: 1000 Files, 1MB Size, No Sync/FSync
stream: Add
stream: Triad
stream: Scale

29.40521.003819.110273.9115214.393386.6815314.19192313000002899.5624.82537823325.725073.86447.06124232.371365674.51633.2395515.761.711.5107746.3212.81048.0610.08050977.865960.772.315793111.25767.65130.55281.7750.9789.51.3531378.693753.095092.881387.1

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 1024MB - Disk Target: Default Test Directory: 1011.01 (SE +/- 2.80, N = 3; MIN: 827.77 / MAX: 1150.66)
Block Size: 512MB - Disk Target: Default Test Directory: 695.04 (SE +/- 1.00, N = 3; MIN: 653.13 / MAX: 802)
Block Size: 256MB - Disk Target: Default Test Directory: 697.07 (SE +/- 1.08, N = 3; MIN: 645.32 / MAX: 820.62)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.

SQLite 3.30.1 (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Threads / Copies: 128: 1187.72 (SE +/- 2.55, N = 3)
Threads / Copies: 64: 1017.77 (SE +/- 0.75, N = 3)
Threads / Copies: 32: 873.52 (SE +/- 2.52, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread
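To make "time to perform a pre-defined number of insertions on an indexed database" concrete, here is a minimal sketch of that kind of measurement using Python's standard `sqlite3` module. It is not the test profile's actual workload: the table layout, the row count, and the use of an in-memory database inside a single transaction are all assumptions chosen to keep the example self-contained, so its timings are not comparable to the figures above.

```python
import sqlite3
import time


def time_indexed_inserts(row_count: int):
    """Insert row_count rows into an indexed table; return (seconds, rows stored)."""
    conn = sqlite3.connect(":memory:")  # assumption: in-memory, no disk I/O
    conn.execute("CREATE TABLE samples (id INTEGER PRIMARY KEY, k INTEGER, v TEXT)")
    conn.execute("CREATE INDEX idx_samples_k ON samples (k)")

    rows = ((i % 97, f"row-{i}") for i in range(row_count))
    start = time.perf_counter()
    with conn:  # commit all inserts as one transaction
        conn.executemany("INSERT INTO samples (k, v) VALUES (?, ?)", rows)
    elapsed = time.perf_counter() - start

    stored = conn.execute("SELECT COUNT(*) FROM samples").fetchone()[0]
    conn.close()
    return elapsed, stored


if __name__ == "__main__":
    seconds, stored = time_indexed_inserts(100_000)
    print(f"inserted {stored} rows in {seconds:.3f} s")
```

An on-disk database with one transaction per insert would be far slower, since the commit path would then dominate the elapsed time rather than index maintenance.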

Dbench

Dbench is a benchmark designed by the Samba project as a free alternative to netbench, but dbench contains only file-system calls for testing the disk performance. Learn more via the OpenBenchmarking.org test page.

Dbench 4.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
1 Clients: 38.11 (SE +/- 0.03, N = 3)
12 Clients: 193.82 (SE +/- 0.87, N = 3)
1. (CC) gcc options: -lpopt -O2

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.

SQLite 3.30.1 (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Threads / Copies: 8: 676.16 (SE +/- 4.66, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

Tinymembench

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Tinymembench 2018-05-28 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Standard Memset: 10175.3 (SE +/- 80.78, N = 3)
Standard Memcpy: 4572.6 (SE +/- 24.33, N = 3)
1. (CC) gcc options: -O2 -lm

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 64MB - Disk Target: Default Test Directory: 705.84 (SE +/- 1.56, N = 3; MIN: 586.8 / MAX: 1136.51)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database. Learn more via the OpenBenchmarking.org test page.

SQLite 3.30.1 (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Threads / Copies: 1: 252.09 (SE +/- 0.90, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

FS-Mark

FS_Mark is designed to test a system's file-system performance. Learn more via the OpenBenchmarking.org test page.

FS-Mark 3.3 (Files/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Test: 5000 Files, 1MB Size, 4 Threads: 88.8 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -static

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 4.18 (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Time To Compile: 41.01 (SE +/- 0.27, N = 14)

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 (Nodes/second, More Is Better), g-64cpu-236mem-8v100-24ssd
1024 Hash Memory, 26 Depth: 63431219 (SE +/- 509920.83, N = 3)

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 32MB - Disk Target: Default Test Directory: 674.73 (SE +/- 2.72, N = 3; MIN: 542.77 / MAX: 1131.19)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Trace Time: 27.68 (SE +/- 0.63, N = 15)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.79a (Seconds, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Blend File: Barbershop - Compute: CPU-Only: 435.16

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

FAHBench 2.3.2 (Ns Per Day, More Is Better), g-64cpu-236mem-8v100-24ssd
246.34 (SE +/- 1.29, N = 3)

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test memory and cache bandwidth performance. Learn more via the OpenBenchmarking.org test page.

CacheBench (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Write Cache: 21095.61 (SE +/- 50.80, N = 3; MIN: 18753.79 / MAX: 22789.53)
Read Cache: 2530.01 (SE +/- 7.44, N = 3; MIN: 2480.01 / MAX: 2567.23)
1. (CC) gcc options: -lrt

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 (ms, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
Target: Vulkan GPU - Model: regnety_400m: 71.05 (SE +/- 1.04, N = 4; MIN: 64.38 / MAX: 88.12)
Target: Vulkan GPU - Model: squeezenet_ssd: 27.30 (SE +/- 0.08, N = 4; MIN: 26.04 / MAX: 42.42)
Target: Vulkan GPU - Model: yolov4-tiny: 35.53 (SE +/- 0.41, N = 4; MIN: 33.93 / MAX: 43.18)
Target: Vulkan GPU - Model: resnet50: 28.37 (SE +/- 0.48, N = 4; MIN: 26.77 / MAX: 40.64)
Target: Vulkan GPU - Model: alexnet: 9.74 (SE +/- 0.53, N = 4; MIN: 8.46 / MAX: 14.23)
Target: Vulkan GPU - Model: resnet18: 14.72 (SE +/- 0.40, N = 4; MIN: 13.75 / MAX: 16.57)
Target: Vulkan GPU - Model: vgg16: 40.01 (SE +/- 0.21, N = 4; MIN: 38.5 / MAX: 45.26)
Target: Vulkan GPU - Model: googlenet: 23.44 (SE +/- 0.46, N = 4; MIN: 21.96 / MAX: 25.74)
Target: Vulkan GPU - Model: blazeface: 5.16 (SE +/- 0.04, N = 4; MIN: 4.89 / MAX: 5.76)
Target: Vulkan GPU - Model: efficientnet-b0: 13.02 (SE +/- 0.11, N = 4; MIN: 12.37 / MAX: 18.38)
Target: Vulkan GPU - Model: mnasnet: 9.95 (SE +/- 0.06, N = 4; MIN: 9.52 / MAX: 11.05)
Target: Vulkan GPU - Model: shufflenet-v2: 10.43 (SE +/- 0.13, N = 4; MIN: 9.65 / MAX: 12.38)
Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3: 9.82 (SE +/- 0.05, N = 4; MIN: 9.18 / MAX: 23.63)
Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2: 10.83 (SE +/- 0.10, N = 4; MIN: 10.26 / MAX: 20.59)
Target: Vulkan GPU - Model: mobilenet: 25.38 (SE +/- 0.30, N = 4; MIN: 23.95 / MAX: 28.76)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 16MB - Disk Target: Default Test Directory: 530.98 (SE +/- 5.52, N = 3; MIN: 454.13 / MAX: 1084.38)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

RAMspeed SMP 3.5.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Type: Average - Benchmark: Floating Point: 17675.00 (SE +/- 22.22, N = 3)
Type: Triad - Benchmark: Floating Point: 22284.60 (SE +/- 6.15, N = 3)
Type: Scale - Benchmark: Floating Point: 16118.61 (SE +/- 20.73, N = 3)
Type: Copy - Benchmark: Floating Point: 15098.94 (SE +/- 24.26, N = 3)
Type: Add - Benchmark: Floating Point: 17619.41 (SE +/- 18.26, N = 3)
Type: Copy - Benchmark: Integer: 17240.62 (SE +/- 8.25, N = 3)
Type: Scale - Benchmark: Integer: 23625.23 (SE +/- 9.84, N = 3)
Type: Triad - Benchmark: Integer: 22497.87 (SE +/- 37.17, N = 3)
Type: Average - Benchmark: Integer: 21168.16 (SE +/- 31.19, N = 3)
Type: Add - Benchmark: Integer: 21256.13 (SE +/- 28.93, N = 3)
1. (CC) gcc options: -O3 -march=native

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 9 (Nodes Per Second, More Is Better), g-64cpu-236mem-8v100-24ssd
Total Time: 60103975 (SE +/- 88076.33, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML (FPS, More Is Better), g-64cpu-236mem-8v100-24ssd
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL: 245.33 (SE +/- 0.36, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 (Frames Per Second, More Is Better), g-64cpu-236mem-8v100-24ssd
Video Input: Bosphorus 4K - Video Preset: Slow: 8.87 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PostMark

This is a test of NetApp's PostMark benchmark designed to simulate small-file testing similar to the tasks endured by web and mail servers. This test profile will set PostMark to perform 25,000 transactions with 500 files simultaneously with the file sizes ranging between 5 and 512 kilobytes. Learn more via the OpenBenchmarking.org test page.

PostMark 1.51 (TPS, More Is Better), g-64cpu-236mem-8v100-24ssd
Disk Transaction Performance: 3731
1. (CC) gcc options: -O3

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 16.02 (MIPS, More Is Better), g-64cpu-236mem-8v100-24ssd
Compress Speed Test: 122335 (SE +/- 92.71, N = 3)
1. (CXX) g++ options: -pipe -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 (Frames Per Second, More Is Better), g-64cpu-236mem-8v100-24ssd
Video Input: Bosphorus 4K - Video Preset: Medium: 9.07 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 8MB - Disk Target: Default Test Directory: 434.39 (SE +/- 3.93, N = 3; MIN: 361.53 / MAX: 905.53)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

FS-Mark

FS_Mark is designed to test a system's file-system performance. Learn more via the OpenBenchmarking.org test page.

FS-Mark 3.3 (Files/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Test: 4000 Files, 32 Sub Dirs, 1MB Size: 62.6 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -static

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1 (H/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Benchmark: MD5: 252486693750 (SE +/- 50697561856.81, N = 16)
Benchmark: SHA1: 80142256250 (SE +/- 16100700259.24, N = 16)
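The "SE +/-" figures are easier to compare across tests when expressed relative to the mean. The short sketch below does that for the two Hashcat results and, as a contrast, the IOR 1024MB result from earlier on this page. The helper name is hypothetical, and the interpretation of "SE" as the standard error of the mean over N runs is an assumption about how the page computes it.

```python
def relative_standard_error(mean: float, standard_error: float) -> float:
    """Return the standard error as a fraction of the mean."""
    return standard_error / mean


# (mean, SE) pairs copied from the result graphs on this page.
RESULTS = {
    "Hashcat MD5 (H/s)": (252486693750, 50697561856.81),
    "Hashcat SHA1 (H/s)": (80142256250, 16100700259.24),
    "IOR 1024MB (MB/s)": (1011.01, 2.80),
}

for name, (mean, se) in RESULTS.items():
    print(f"{name}: {relative_standard_error(mean, se):.2%}")
```

Both Hashcat results come out at roughly 20% of the mean, against well under 1% for IOR, which suggests the Hashcat runs varied a great deal from one to the next and is consistent with the run count being extended to N = 16.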

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.13b1 (days/ns, Fewer Is Better), g-64cpu-236mem-8v100-24ssd
ATPase Simulation - 327,506 Atoms: 0.75360 (SE +/- 0.00061, N = 3)
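NAMD reports days per simulated nanosecond, whereas FAHBench earlier on this page reports nanoseconds per day. The two units are reciprocals, so the NAMD figure can be converted for a more intuitive reading. This is only a unit conversion; the two benchmarks simulate different systems, so the results are still not directly comparable.

```python
def days_per_ns_to_ns_per_day(days_per_ns: float) -> float:
    """Convert a days/ns figure (lower is better) to ns/day (higher is better)."""
    if days_per_ns <= 0:
        raise ValueError("days_per_ns must be positive")
    return 1.0 / days_per_ns


NAMD_DAYS_PER_NS = 0.75360  # ATPase Simulation result from this page
print(f"{days_per_ns_to_ns_per_day(NAMD_DAYS_PER_NS):.3f} ns/day")
```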

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0 (MB/s, More Is Better), g-64cpu-236mem-8v100-24ssd
Block Size: 4MB - Disk Target: Default Test Directory: 338.95 (SE +/- 3.26, N = 3; MIN: 269.97 / MAX: 914.51)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4 (Frames Per Second, More Is Better), g-64cpu-236mem-8v100-24ssd
Video Input: Bosphorus 4K: 18.97 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Flexible IO Tester

FIO, the Flexible I/O Tester, is an advanced Linux disk benchmark supporting multiple I/O engines and a wealth of options. FIO was written by Jens Axboe for testing of the Linux I/O subsystem and schedulers. Learn more via the OpenBenchmarking.org test page.

Flexible IO Tester 3.25 (More Is Better), g-64cpu-236mem-8v100-24ssd
Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory: 138333 IOPS (SE +/- 666.67, N = 3)
Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory: 540 MB/s (SE +/- 3.21, N = 3)
Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory: 170667 IOPS (SE +/- 881.92, N = 3)
Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory: 666 MB/s (SE +/- 3.06, N = 3)
Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory: 2342 IOPS (SE +/- 7.51, N = 3)
Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory: 4691 MB/s (SE +/- 15.04, N = 3)
Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory: 2386 IOPS (SE +/- 9.82, N = 3)
Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory: 4780 MB/s (SE +/- 19.97, N = 3)
1. (CC) gcc options: -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
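Each FIO configuration above is reported twice, once as IOPS and once as MB/s. The two figures are linked by the block size, and the sketch below checks that relationship for the read results. It assumes the "4KB" and "2MB" labels are binary units (4096 and 2097152 bytes) and that the reported "MB/s" is effectively MiB/s; neither is stated on the page.

```python
MIB = 1024 * 1024


def iops_to_mib_per_s(iops: float, block_bytes: int) -> float:
    """Throughput implied by an IOPS figure at a fixed block size."""
    return iops * block_bytes / MIB


# (label, reported IOPS, block size in bytes, reported MB/s) from this page.
REPORTED_READS = [
    ("Sequential Read, 4KB", 138333, 4 * 1024, 540),
    ("Random Read, 4KB", 170667, 4 * 1024, 666),
    ("Random Read, 2MB", 2342, 2 * MIB, 4691),
    ("Sequential Read, 2MB", 2386, 2 * MIB, 4780),
]

for label, iops, block_bytes, reported in REPORTED_READS:
    derived = iops_to_mib_per_s(iops, block_bytes)
    print(f"{label}: derived {derived:.1f} MiB/s, reported {reported} MB/s")
```

Under those assumptions the derived and reported throughputs agree to well under 1%, with the small gaps plausibly due to rounding and to each figure being averaged over three runs. It also makes the trade-off visible: 4KB reads deliver far more operations per second but only about an eighth of the bandwidth of 2MB reads.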

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML (FPS, More Is Better), g-64cpu-236mem-8v100-24ssd
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL: 687.80 (SE +/- 1.07, N = 3)

Flexible IO Tester

FIO, the Flexible I/O Tester, is an advanced Linux disk benchmark supporting multiple I/O engines and a wealth of options. FIO was written by Jens Axboe for testing of the Linux I/O subsystem and schedulers. Learn more via the OpenBenchmarking.org test page.

Flexible IO Tester 3.25, Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory (IOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 2123 (SE +/- 4.84, N = 3)

Flexible IO Tester 3.25, Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 4254 (SE +/- 10.17, N = 3)

Flexible IO Tester 3.25, Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory (IOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 2105 (SE +/- 4.10, N = 3)

Flexible IO Tester 3.25, Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 4218 (SE +/- 7.84, N = 3)

Flexible IO Tester 3.25, Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (IOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 223333 (SE +/- 1452.97, N = 3)

Flexible IO Tester 3.25, Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 872 (SE +/- 4.93, N = 3)

Flexible IO Tester 3.25, Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (IOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 181000 (SE +/- 1527.53, N = 3)

Flexible IO Tester 3.25, Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 707 (SE +/- 6.66, N = 3)

1. (CC) gcc options (all Flexible IO Tester results): -rdynamic -ll -lnuma -lrt -lz -lpthread -lm -ldl -laio -std=gnu99 -ffast-math -include -O3 -fcommon -U_FORTIFY_SOURCE -march=native
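
The IOPS and MB/s figures for each write test describe the same runs, so they can be cross-checked as bandwidth = IOPS x block size. The sketch below does that for the four write results above. It assumes the reported "MB/s" is counted in units of 2^20 bytes and that "2MB" and "4KB" mean 2 x 2^20 and 4096 bytes; the helper name bandwidth_mib_s is illustrative and is not part of fio or the Phoronix Test Suite.

```python
# Cross-check fio IOPS against the reported bandwidth.
# Assumption: "MB/s" in the results is counted in units of 2**20 bytes.
MIB = 2**20


def bandwidth_mib_s(iops, block_bytes):
    """Bandwidth in MiB/s implied by an IOPS figure at a fixed block size."""
    return iops * block_bytes / MIB


# (label, IOPS, block size in bytes, reported MB/s), copied from the results
results = [
    ("seq write 2MB", 2123, 2 * MIB, 4254),
    ("rand write 2MB", 2105, 2 * MIB, 4218),
    ("seq write 4KB", 223333, 4096, 872),
    ("rand write 4KB", 181000, 4096, 707),
]

for label, iops, block, reported in results:
    implied = bandwidth_mib_s(iops, block)
    diff_pct = 100 * (implied - reported) / reported
    print(f"{label}: implied {implied:.0f} vs reported {reported} ({diff_pct:+.2f}%)")
```

Under these assumptions the implied and reported bandwidths agree to within a fraction of a percent, which is the kind of gap expected when each figure is averaged separately over three runs.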

t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.

t-test1 2017-01-13, Threads: 1 (Seconds, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 29.41 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -pthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 21.00 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

MBW 2018-09-08, Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB (MiB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 3819.11 (SE +/- 6.05, N = 3)
1. (CC) gcc options: -O3 -march=native

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0, Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 273.91 (SE +/- 2.52, N = 3; MIN: 170.64 / MAX: 628.8)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 15214.39 (SE +/- 147.60, N = 15)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 3386.68 (SE +/- 1.89, N = 3)

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 15314.19 (SE +/- 177.16, N = 15)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1, Benchmark: SHA-512 (H/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 19231300000 (SE +/- 4752192.48, N = 3)

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL (FPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 2899.56 (SE +/- 2.58, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 24.82 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1, Benchmark: TrueCrypt RIPEMD160 + XTS (H/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 5378233 (SE +/- 592.55, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 25.72 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

MBW 2018-09-08, Test: Memory Copy - Array Size: 1024 MiB (MiB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 5073.86 (SE +/- 9.62, N = 3)
1. (CC) gcc options: -O3 -march=native

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2, OpenCL LU Factorization (GFLOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 47.06 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -rdynamic -lOpenCL

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 4232.3 (SE +/- 7.91, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.1.1, Benchmark: 7-Zip (H/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 7136567 (SE +/- 12247.49, N = 3)

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenCL Particle Filter (Seconds, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 4.516 (SE +/- 0.036, N = 13)
1. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 33.23 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Stream

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Stream 2013-01-17, Type: Copy (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 95515.7 (SE +/- 487.72, N = 5)
1. (CC) gcc options: -O3 -march=native -fopenmp
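
The "SE +/-" and "N" figures attached to each result can be turned into a rough run-to-run spread. The following is a minimal sketch, assuming SE is the standard error of the mean over N runs (SE = s / sqrt(N)). The function name spread is illustrative, and the inputs are the Stream Copy figures above.

```python
import math


def spread(mean, se, n):
    """Return (sample standard deviation, relative standard error in %).

    Assumes se is the standard error of the mean: se = sd / sqrt(n).
    """
    sd = se * math.sqrt(n)
    rse_pct = 100 * se / mean
    return sd, rse_pct


# Stream Copy: 95515.7 MB/s, SE +/- 487.72, N = 5
sd, rse_pct = spread(95515.7, 487.72, 5)
print(f"std dev ~ {sd:.1f} MB/s, relative SE ~ {rse_pct:.2f}%")
```

A relative standard error of roughly half a percent suggests the five runs were tightly clustered. The same helper can be applied to any other result that reports SE and N.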

FS-Mark

FS_Mark is designed to test a system's file-system performance. Learn more via the OpenBenchmarking.org test page.

FS-Mark 3.3, Test: 1000 Files, 1MB Size (Files/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 61.7 (SE +/- 0.37, N = 3)
1. (CC) gcc options: -static

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 11.51 (SE +/- 0.14, N = 4)
1. (CXX) g++ options: -O2 -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 7746.32 (SE +/- 71.34, N = 7)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 12.81 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 48.06 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.

t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 10.08 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread

Sysbench

This is a benchmark of Sysbench with CPU and memory sub-tests. Learn more via the OpenBenchmarking.org test page.

Sysbench 2018-07-28, Test: CPU (Events Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 50977.87 (SE +/- 22.56, N = 3)
1. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=haswell -rdynamic -ldl -laio -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 60.77 (SE +/- 0.13, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ArrayFire

ArrayFire is a GPU and CPU numeric processing library. This test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

ArrayFire 3.7, Test: Conjugate Gradient OpenCL (ms, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 2.315 (SE +/- 0.016, N = 3)
1. (CXX) g++ options: -rdynamic

ctx_clock

Ctx_clock is a simple test program to measure the context switch time in clock cycles. Learn more via the OpenBenchmarking.org test page.

ctx_clock, Context Switch Time (Clocks, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 793

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 111.25 (SE +/- 0.37, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
g-64cpu-236mem-8v100-24ssd: 767.65 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
g-64cpu-236mem-8v100-24ssd: 130.55 (SE +/- 0.92, N = 3)
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13, Benchmark: Copy (GB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 281.7 (SE +/- 0.07, N = 3)

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 750.9 (SE +/- 0.92, N = 3)

cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 789.5 (SE +/- 0.27, N = 3)

1. (CC) gcc options (all cl-mem results): -O2 -flto -lOpenCL

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with the Analytic European Option engine, the QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds (fixed-rate bond with flat forward curve), and Repo (securities repurchase agreement). FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25, Benchmark: Black-Scholes OpenCL (ms, Fewer Is Better)
g-64cpu-236mem-8v100-24ssd: 1.353 (SE +/- 0.018, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

FS-Mark

FS_Mark is designed to test a system's file-system performance. Learn more via the OpenBenchmarking.org test page.

FS-Mark 3.3, Test: 1000 Files, 1MB Size, No Sync/FSync (Files/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 1378.6 (SE +/- 3.38, N = 3)
1. (CC) gcc options: -static

Stream

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

Stream 2013-01-17, Type: Add (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 93753.0 (SE +/- 135.42, N = 5)

Stream 2013-01-17, Type: Triad (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 95092.8 (SE +/- 174.67, N = 5)

Stream 2013-01-17, Type: Scale (MB/s, More Is Better)
g-64cpu-236mem-8v100-24ssd: 81387.1 (SE +/- 190.64, N = 5)

1. (CC) gcc options (all Stream results): -O3 -march=native -fopenmp
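
For readers unfamiliar with the four Stream result types (Copy, Scale, Add, Triad), the sketch below shows what each kernel computes and how a MB/s figure follows from bytes moved and elapsed time. It is an illustration only, not the benchmark's compiled C/OpenMP code. The names stream_kernels, BYTES_PER_ELEMENT and mb_per_s are hypothetical, and the byte counts assume 8-byte doubles with 1 MB = 10^6 bytes.

```python
def stream_kernels(n=1000, q=3.0):
    """Run the four STREAM-style kernels once on small Python lists."""
    a = [1.0] * n
    b = [2.0] * n
    c = [0.0] * n
    c = [a[i] for i in range(n)]             # Copy:  c = a
    b = [q * c[i] for i in range(n)]         # Scale: b = q * c
    c = [a[i] + b[i] for i in range(n)]      # Add:   c = a + b
    a = [b[i] + q * c[i] for i in range(n)]  # Triad: a = b + q * c
    return a, b, c


# Bytes read plus written per array element, assuming 8-byte doubles:
# Copy and Scale touch two arrays, Add and Triad touch three.
BYTES_PER_ELEMENT = {"Copy": 16, "Scale": 16, "Add": 24, "Triad": 24}


def mb_per_s(kernel, n, seconds):
    """Bandwidth in MB/s (10**6 bytes) for one pass over n elements."""
    return 1e-6 * BYTES_PER_ELEMENT[kernel] * n / seconds


a, b, c = stream_kernels()
print(a[0], b[0], c[0])
print(mb_per_s("Triad", 10_000_000, 0.0025))
```

Because Add and Triad move three arrays per element while Copy and Scale move two, the four figures are directly comparable as bandwidth even though the kernels do different amounts of arithmetic.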

118 Results Shown

IOR:
  1024MB - Default Test Directory
  512MB - Default Test Directory
  256MB - Default Test Directory
SQLite:
  128
  64
  32
Dbench:
  1 Clients
  12 Clients
SQLite
Tinymembench:
  Standard Memset
  Standard Memcpy
IOR
SQLite
FS-Mark
Timed Linux Kernel Compilation
asmFish
IOR
POV-Ray
Blender
FAHBench
CacheBench:
  Write Cache
  Read Cache
NCNN:
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
IOR
RAMspeed SMP:
  Average - Floating Point
  Triad - Floating Point
  Scale - Floating Point
  Copy - Floating Point
  Add - Floating Point
  Copy - Integer
  Scale - Integer
  Triad - Integer
  Average - Integer
  Add - Integer
Stockfish
PlaidML
Kvazaar
PostMark
7-Zip Compression
Kvazaar
IOR
FS-Mark
Hashcat:
  MD5
  SHA1
NAMD
IOR
x265
Flexible IO Tester:
  Seq Read - Linux AIO - No - Yes - 4KB - Default Test Directory:
    IOPS
    MB/s
  Rand Read - Linux AIO - No - Yes - 4KB - Default Test Directory:
    IOPS
    MB/s
  Rand Read - Linux AIO - No - Yes - 2MB - Default Test Directory:
    IOPS
    MB/s
  Seq Read - Linux AIO - No - Yes - 2MB - Default Test Directory:
    IOPS
    MB/s
PlaidML
Flexible IO Tester:
  Seq Write - Linux AIO - No - Yes - 2MB - Default Test Directory:
    IOPS
    MB/s
  Rand Write - Linux AIO - No - Yes - 2MB - Default Test Directory:
    IOPS
    MB/s
  Seq Write - Linux AIO - No - Yes - 4KB - Default Test Directory:
    IOPS
    MB/s
  Rand Write - Linux AIO - No - Yes - 4KB - Default Test Directory:
    IOPS
    MB/s
t-test1
Kvazaar
MBW
IOR
clpeak
PlaidML
clpeak
Hashcat
PlaidML
Kvazaar
Hashcat
Kvazaar
MBW
ViennaCL
OpenSSL
Hashcat
Rodinia
Kvazaar
Stream
FS-Mark
Rodinia
clpeak
Rodinia
x265
t-test1
Sysbench
Kvazaar
ArrayFire
ctx_clock
Kvazaar
clpeak
x264
cl-mem:
  Copy
  Write
  Read
FinanceBench
FS-Mark
Stream:
  Add
  Triad
  Scale