DL385_workstation_mixed_3trials_nogpu

2 x AMD EPYC 9654 96-Core testing with a HPE ProLiant DL385 Gen11 (1.30 BIOS) and Matrox MGA G200eH3 on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308016-DT9-DL385WOR82
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
DL385 Workstation mixed 3 trials, no gpu
July 31 2023
  1 Day, 5 Hours, 29 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


DL385_workstation_mixed_3trials_nogpuOpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 9654 96-Core (192 Cores / 384 Threads)HPE ProLiant DL385 Gen11 (1.30 BIOS)AMD Device 14a4768GB5759GB MR416i-o Gen11Matrox MGA G200eH3PHL 243V74 x Broadcom NetXtreme BCM5719 PCIeUbuntu 22.045.19.0-50-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.34.5 Mesa 22.2.5-0ubuntu0.1~22.04.3 (LLVM 15.0.7 256 bits)1.3.224GCC 11.3.0ext41680x1050ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen ResolutionDL385_workstation_mixed_3trials_nogpu BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - MQ-DEADLINE / errors=remount-ro,relatime,rw,stripe=48 / Block Size: 4096- CPU Microcode: 0x0- OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.04.1)- Python 3.10.6- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

DL385_workstation_mixed_3trials_nogpuintel-mpi: IMB-MPI1 PingPongminife: Smallamg: openvino: Face Detection FP16 - CPUmt-dgemm: Sustained Floating-Point Ratearrayfire: BLAS CPUtensorflow: CPU - 32 - VGG-16tensorflow: CPU - 32 - ResNet-50fio: Rand Read - Linux AIO - No - Yes - 1MB - 1 - /datafio: Rand Write - Linux AIO - No - Yes - 1MB - 1 - /datafio: Seq Read - Linux AIO - No - Yes - 8MB - 1 - /datafio: Seq Read - Linux AIO - No - Yes - 64MB - 1 - /datafio: Seq Write - Linux AIO - No - Yes - 8MB - 1 - /datafio: Seq Write - Linux AIO - No - Yes - 64MB - 1 - /datagraphics-magick: Rotategraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spaceleveldb: Rand Filldbench: 256compilebench: Compileramspeed: Copy - Integerstream: Scalestream: Triadstream: Addtinymembench: Standard Memcpytinymembench: Standard Memsetcachebench: Read / Modify / Writetjbench: Decompression Throughputffte: N=256, 3D Complex FFT Routinefftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 4096himeno: Poisson Pressure Solverfio: Rand Read - Linux AIO - No - Yes - 1MB - 1 - /datafio: Rand Write - Linux AIO - No - Yes - 1MB - 1 - /datafio: Seq Read - Linux AIO - No - Yes - 8MB - 1 - /datafio: Seq Read - Linux AIO - No - Yes - 64MB - 1 - /datafio: Seq Write - Linux AIO - No - Yes - 8MB - 1 - /datafio: Seq Write - Linux AIO - No - Yes - 64MB - 1 - /datambw: Memory Copy, Fixed Block Size - 8192 MiBjpegxl: PNG - 100jpegxl: JPEG - 100jpegxl-decode: Allwebp: Defaultwebp2: Defaultlibraw: Post-Processing Benchmarkaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddinggromacs: MPI CPU - water_GMX50_barecassandra: Mixed 1:1rocksdb: Rand Fillcockroach: KV, 50% Reads - 128java-jmh: Throughputmemcached: 1:100etcd: RANGE - 500 - 1000numpy: daphne: OpenMP - Points2Imageinfluxdb: 4 - 10000 - 2,5000,1 - 10000lulesh: namd: ATPase Simulation - 327,506 Atomsleveldb: Rand Filltensorflow-lite: Inception V4tensorflow-lite: Mobilenet Floatpybench: Total For Average Test Timespyperformance: gopyperformance: 2to3pyperformance: chaospyperformance: floatpyperformance: nbodypyperformance: pathlibpyperformance: raytracepyperformance: json_loadspyperformance: crypto_pyaespyperformance: regex_compilepyperformance: python_startuppyperformance: django_templatepyperformance: pickle_pure_pythonrenaissance: Rand Forestopenjpeg: NASA Curiosity Panorama M34etcd: RANGE - 500 - 1000 - Average Latencyncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDettnn: CPU - DenseNetopenvino: Face Detection FP16 - CPUopencv: Coreopencv: Stitchingopencv: Features 2Dopencv: Image Processingopencv: Object Detectiondacapobench: Jythonsqlite: 1sqlite: 32cloverleaf: Lagrangian-Eulerian Hydrodynamicsdolfyn: Computational Fluid Dynamicsneat: incompact3d: X3D-benchmarking input.i3dmafft: Multiple Sequence Alignment - LSU RNAmocassin: Dust 2D tau100.0bork: File Encryption Timeavifenc: 6cython-bench: N-Queenssqlite-speedtest: Timed Time - Size 1,000darktable: Boat - CPU-onlygegl: Rotate 90 Degreesgimp: rotategmic: 2D Function Plotting, 1000 Timesinkscape: SVG Files To PNGoctave-benchmark: rawtherapee: Total Benchmark Timersvg: SVG Files To PNGgpaw: Carbon Nanotubeblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlypyhpc: CPU - Numpy - 1048576 - Equation of Statesunflow: DL385 Workstation mixed 3 trials, no gpu4706.9218886.3226858066792.1168.0504882038.6053.5339.86657977821334169102613062175995118320.81277.492339.5662827.54322972.8369956.3342237.425708.728412.451017.776543221.062285222661.518890896777.6225224451.6658416578.977782.3310670.1710787.508210.838305.489791.4150.740.67250.9818.549.7244.4654624.457469.518.30313731150083342895.4477087527680.062652923.8738623.9321570.387597.881300502.450884.8080.136252040.02484774348997.285018325078.282.111015.135416.888.81147.0936.8326625.48770025.9147.78383.90232.48168.32149.8763.51167.86491.08114.1297.8318.75166.69144.9474.731226.35168.7585.903496.9661034.512498342443371372871669773648133282.004111.726123.8215.61553.252243.2662769.038209.1417.1522.89421.85550.1821.48854.01914.154103.19428.1796.19553.43522.70931.0868.9321.400.190OpenBenchmarking.org

Redis 7.0.12 + memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 PingPongDL385 Workstation mixed 3 trials, no gpu10002000300040005000SE +/- 32.11, N = 34706.92MIN: 3.2 / MAX: 16380.961. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallDL385 Workstation mixed 3 trials, no gpu4K8K12K16K20KSE +/- 3315.45, N = 318886.31. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2DL385 Workstation mixed 3 trials, no gpu500M1000M1500M2000M2500MSE +/- 19231296.75, N = 322685806671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.30, N = 392.111. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateDL385 Workstation mixed 3 trials, no gpu1530456075SE +/- 0.05, N = 368.051. (CC) gcc options: -O3 -march=native -fopenmp

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS CPUDL385 Workstation mixed 3 trials, no gpu400800120016002000SE +/- 41.20, N = 32038.601. (CXX) g++ options: -rdynamic

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: VGG-16DL385 Workstation mixed 3 trials, no gpu1224364860SE +/- 0.33, N = 353.53

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50DL385 Workstation mixed 3 trials, no gpu918273645SE +/- 0.45, N = 339.86

Flexible IO Tester

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Random Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 1MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu14002800420056007000SE +/- 0.50, N = 36579MIN: 6578.32 / MAX: 6579.941. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Random Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 1MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 248.70, N = 37782MIN: 7432.19 / MAX: 8263.331. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu30060090012001500SE +/- 15.71, N = 31334MIN: 1306.7 / MAX: 1361.131. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 64MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 3.97, N = 3169MIN: 161.08 / MAX: 174.651. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2004006008001000SE +/- 9.14, N = 31026MIN: 1013.1 / MAX: 1043.871. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.35Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 64MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 4.15, N = 3130MIN: 124.15 / MAX: 137.851. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: RotateDL385 Workstation mixed 3 trials, no gpu130260390520650SE +/- 0.67, N = 36211. (CC) gcc options: -fopenmp -O2 -ljbig -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: ResizingDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 1.45, N = 3751. (CC) gcc options: -fopenmp -O2 -ljbig -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: Noise-GaussianDL385 Workstation mixed 3 trials, no gpu2004006008001000SE +/- 23.69, N = 39951. (CC) gcc options: -fopenmp -O2 -ljbig -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.38Operation: HWB Color SpaceDL385 Workstation mixed 3 trials, no gpu30060090012001500SE +/- 5.36, N = 311831. (CC) gcc options: -fopenmp -O2 -ljbig -llcms2 -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lzstd -lm -lpthread

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.23Benchmark: Random FillDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.06, N = 320.81. (CXX) g++ options: -fno-exceptions -fno-rtti -O3 -lgmock -lgtest -lsnappy

Dbench

Dbench is a benchmark designed by the Samba project as a free alternative to netbench, but dbench contains only file-system calls for testing the disk performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterDbench 4.0Client Count: 256DL385 Workstation mixed 3 trials, no gpu30060090012001500SE +/- 72.31, N = 31277.491. (CC) gcc options: -lpopt -O2

Compile Bench

Compilebench tries to age a filesystem by simulating some of the disk IO common in creating, compiling, patching, stating and reading kernel trees. It indirectly measures how well filesystems can maintain directory locality as the disk fills up and directories age. This current test is setup to use the makej mode with 10 initial directories Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: CompileDL385 Workstation mixed 3 trials, no gpu5001000150020002500SE +/- 0.37, N = 32339.56

RAMspeed SMP

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: IntegerDL385 Workstation mixed 3 trials, no gpu13K26K39K52K65KSE +/- 241.33, N = 362827.541. (CC) gcc options: -O3 -march=native

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: ScaleDL385 Workstation mixed 3 trials, no gpu70K140K210K280K350KSE +/- 1135.34, N = 3322972.81. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: TriadDL385 Workstation mixed 3 trials, no gpu80K160K240K320K400KSE +/- 1110.57, N = 3369956.31. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: AddDL385 Workstation mixed 3 trials, no gpu70K140K210K280K350KSE +/- 1651.79, N = 3342237.41. (CC) gcc options: -O3 -march=native -fopenmp

Tinymembench

This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemcpyDL385 Workstation mixed 3 trials, no gpu6K12K18K24K30KSE +/- 10.18, N = 325708.71. (CC) gcc options: -O2 -lm

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard MemsetDL385 Workstation mixed 3 trials, no gpu6K12K18K24K30KSE +/- 6.42, N = 328412.41. (CC) gcc options: -O2 -lm

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteDL385 Workstation mixed 3 trials, no gpu11K22K33K44K55KSE +/- 57.27, N = 351017.78MIN: 42895.09 / MAX: 55027.211. (CC) gcc options: -lrt

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression ThroughputDL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 0.14, N = 3221.061. (CC) gcc options: -O3 -rdynamic

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineDL385 Workstation mixed 3 trials, no gpu50K100K150K200K250KSE +/- 2845.97, N = 3222661.521. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096DL385 Workstation mixed 3 trials, no gpu15003000450060007500SE +/- 38.51, N = 36777.61. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096DL385 Workstation mixed 3 trials, no gpu5K10K15K20K25KSE +/- 474.19, N = 3225221. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverDL385 Workstation mixed 3 trials, no gpu10002000300040005000SE +/- 70.28, N = 34451.671. (CC) gcc options: -O3 -mavx2

Flexible IO Tester

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Random Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 1MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu14002800420056007000SE +/- 0.50, N = 36578.97MIN: 6578.33 / MAX: 6579.961. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Random Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 1MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 248.70, N = 37782.33MIN: 7432.24 / MAX: 8263.371. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 125.76, N = 310670.17MIN: 10454.15 / MAX: 10889.751. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Sequential Read - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 64MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 254.85, N = 310787.50MIN: 10310.17 / MAX: 11180.981. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 73.08, N = 38210.83MIN: 8105.21 / MAX: 8351.131. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

OpenBenchmarking.orgMiB/s, More Is BetterFlexible IO Tester 3.35Type: Sequential Write - Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 64MB - Job Count: 1 - Disk Target: /dataDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 265.54, N = 38305.48MIN: 7947.95 / MAX: 8824.341. (CC) gcc options: -rdynamic -ll -lnuma -lz -lm -laio -lpthread -ldl -lcurl -lssl -lcrypto -std=gnu99 -ffast-math -include -O3 -fcommon -march=native

MBW

This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiBDL385 Workstation mixed 3 trials, no gpu2K4K6K8K10KSE +/- 110.53, N = 39791.421. (CC) gcc options: -O3 -march=native

JPEG XL libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: PNG - Quality: 100DL385 Workstation mixed 3 trials, no gpu0.16650.3330.49950.6660.8325SE +/- 0.01, N = 30.741. (CXX) g++ options: -fno-rtti -funwind-tables -O3 -O2 -fPIE -pie -lm -latomic

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: JPEG - Quality: 100DL385 Workstation mixed 3 trials, no gpu0.15080.30160.45240.60320.754SE +/- 0.01, N = 30.671. (CXX) g++ options: -fno-rtti -funwind-tables -O3 -O2 -fPIE -pie -lm -latomic

JPEG XL Decoding libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding libjxl 0.7CPU Threads: AllDL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 1.69, N = 3250.98

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: DefaultDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.04, N = 318.541. (CC) gcc options: -fvisibility=hidden -O2 -lm

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterWebP2 Image Encode 20220823Encode Settings: DefaultDL385 Workstation mixed 3 trials, no gpu3691215SE +/- 0.02, N = 39.721. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -ldl

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkDL385 Workstation mixed 3 trials, no gpu1020304050SE +/- 0.09, N = 344.461. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -llcms2 -lm

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingDL385 Workstation mixed 3 trials, no gpu12K24K36K48K60KSE +/- 6916.47, N = 354624.41. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingDL385 Workstation mixed 3 trials, no gpu12K24K36K48K60KSE +/- 4777.83, N = 357469.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bareDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.04, N = 318.301. (CXX) g++ options: -O3

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Mixed 1:1DL385 Workstation mixed 3 trials, no gpu30K60K90K120K150KSE +/- 6871.68, N = 3137311

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Random FillDL385 Workstation mixed 3 trials, no gpu110K220K330K440K550KSE +/- 5272.38, N = 35008331. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 128DL385 Workstation mixed 3 trials, no gpu9K18K27K36K45KSE +/- 1434.95, N = 342895.4

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughputDL385 Workstation mixed 3 trials, no gpu100000M200000M300000M400000M500000MSE +/- 15549806690.14, N = 3477087527680.06

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100DL385 Workstation mixed 3 trials, no gpu600K1200K1800K2400K3000KSE +/- 3259.29, N = 32652923.871. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

KeyDB

A benchmark of KeyDB as an open-source, multi-threaded fork of the Redis server now developed by Snapchat. The developers self-describe the KeyDB in-memory data structure store as a "faster drop in alternative to Redis". Learn more via the OpenBenchmarking.org test page.

Test: SET - Parallel Connections: 50

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

etcd

Etcd is a distributed, reliable key-value store intended for critical data of a distributed system. Etcd is written in Golang and part of the Cloud Native Computing Foundation (CNCF) and used by Kubernetes, Rook, CoreDNS, and other open-source software. This test profile uses Etcd's built-in benchmark to stress the PUT and RANGE performance of a single node / local system. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests/sec, More Is Betteretcd 3.5.4Test: RANGE - Connections: 500 - Clients: 1000DL385 Workstation mixed 3 trials, no gpu8K16K24K32K40KSE +/- 2129.36, N = 338623.93

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkDL385 Workstation mixed 3 trials, no gpu120240360480600SE +/- 0.66, N = 3570.38

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous Suite 2021.11.02Backend: OpenMP - Kernel: Points2ImageDL385 Workstation mixed 3 trials, no gpu16003200480064008000SE +/- 1035.66, N = 37597.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

DL385 Workstation mixed 3 trials, no gpu: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000DL385 Workstation mixed 3 trials, no gpu300K600K900K1200K1500KSE +/- 6614.65, N = 31300502.4

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3DL385 Workstation mixed 3 trials, no gpu11K22K33K44K55KSE +/- 1589.74, N = 350884.811. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsDL385 Workstation mixed 3 trials, no gpu0.03070.06140.09210.12280.1535SE +/- 0.00164, N = 30.13625

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.23Benchmark: Random FillDL385 Workstation mixed 3 trials, no gpu400800120016002000SE +/- 5.54, N = 32040.021. (CXX) g++ options: -fno-exceptions -fno-rtti -O3 -lgmock -lgtest -lsnappy

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V4DL385 Workstation mixed 3 trials, no gpu200K400K600K800K1000KSE +/- 77413.43, N = 3847743

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet FloatDL385 Workstation mixed 3 trials, no gpu10K20K30K40K50KSE +/- 12080.23, N = 348997.2

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test TimesDL385 Workstation mixed 3 trials, no gpu2004006008001000SE +/- 4.04, N = 3850

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: goDL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 0.33, N = 3183

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to3DL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 0.00, N = 3250

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaosDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.00, N = 378.2

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: floatDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.19, N = 382.1

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: nbodyDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.33, N = 3110

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlibDL385 Workstation mixed 3 trials, no gpu48121620SE +/- 0.03, N = 315.1

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytraceDL385 Workstation mixed 3 trials, no gpu80160240320400SE +/- 0.00, N = 3354

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loadsDL385 Workstation mixed 3 trials, no gpu48121620SE +/- 0.00, N = 316.8

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaesDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.12, N = 388.8

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compileDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 0.00, N = 3114

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startupDL385 Workstation mixed 3 trials, no gpu246810SE +/- 0.00, N = 37.09

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_templateDL385 Workstation mixed 3 trials, no gpu816243240SE +/- 0.00, N = 336.8

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_pythonDL385 Workstation mixed 3 trials, no gpu70140210280350SE +/- 0.58, N = 3326

Renaissance

Renaissance is a suite of benchmarks designed to test the Java JVM from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.14Test: Random ForestDL385 Workstation mixed 3 trials, no gpu140280420560700SE +/- 4.69, N = 3625.4MIN: 548.57 / MAX: 794.71

OpenJPEG

OpenJPEG is an open-source JPEG 2000 codec written in the C programming language. The default input for this test profile is the NASA/JPL-Caltech/MSSS Curiosity panorama 717MB TIFF image file converting to JPEG2000 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenJPEG 2.4Encode: NASA Curiosity Panorama M34DL385 Workstation mixed 3 trials, no gpu20K40K60K80K100KSE +/- 1683.48, N = 3877001. (CXX) g++ options: -rdynamic

etcd

Etcd is a distributed, reliable key-value store intended for critical data of a distributed system. Etcd is written in Golang and part of the Cloud Native Computing Foundation (CNCF) and used by Kubernetes, Rook, CoreDNS, and other open-source software. This test profile uses Etcd's built-in benchmark to stress the PUT and RANGE performance of a single node / local system. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is Betteretcd 3.5.4Test: RANGE - Connections: 500 - Clients: 1000 - Average LatencyDL385 Workstation mixed 3 trials, no gpu612182430SE +/- 1.36, N = 325.9

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenetDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 57.73, N = 3147.78MIN: 49.17 / MAX: 3840.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v2DL385 Workstation mixed 3 trials, no gpu80160240320400SE +/- 94.35, N = 3383.90MIN: 36.24 / MAX: 3384.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v3DL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 49.74, N = 3232.48MIN: 35.02 / MAX: 3404.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v2DL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 58.90, N = 3168.32MIN: 44.03 / MAX: 4118.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 92.92, N = 3149.87MIN: 35.4 / MAX: 3329.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0DL385 Workstation mixed 3 trials, no gpu1428425670SE +/- 12.49, N = 363.51MIN: 49.22 / MAX: 4089.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceDL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 88.61, N = 3167.86MIN: 21.76 / MAX: 2107.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetDL385 Workstation mixed 3 trials, no gpu110220330440550SE +/- 181.26, N = 3491.08MIN: 63.22 / MAX: 5270.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg16DL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 11.19, N = 3114.12MIN: 45.61 / MAX: 1184.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet18DL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 50.57, N = 397.83MIN: 29.33 / MAX: 2078.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnetDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.64, N = 318.75MIN: 16.98 / MAX: 174.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50DL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 79.67, N = 3166.69MIN: 58.83 / MAX: 4771.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tinyDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 57.87, N = 3144.94MIN: 52.66 / MAX: 1960.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssdDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 6.63, N = 374.73MIN: 61.43 / MAX: 3509.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mDL385 Workstation mixed 3 trials, no gpu30060090012001500SE +/- 501.65, N = 31226.35MIN: 216.29 / MAX: 23729.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerDL385 Workstation mixed 3 trials, no gpu4080120160200SE +/- 12.13, N = 3168.75MIN: 141.79 / MAX: 1100.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 23.85, N = 385.90MIN: 51.33 / MAX: 4532.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetDL385 Workstation mixed 3 trials, no gpu8001600240032004000SE +/- 34.11, N = 33496.97MIN: 2546.03 / MAX: 3626.331. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUDL385 Workstation mixed 3 trials, no gpu2004006008001000SE +/- 3.28, N = 31034.51MIN: 457.43 / MAX: 1838.51. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: CoreDL385 Workstation mixed 3 trials, no gpu50K100K150K200K250KSE +/- 20962.23, N = 32498341. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: StitchingDL385 Workstation mixed 3 trials, no gpu50K100K150K200K250KSE +/- 5548.87, N = 32443371. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Features 2DDL385 Workstation mixed 3 trials, no gpu30K60K90K120K150KSE +/- 27000.76, N = 31372871. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Image ProcessingDL385 Workstation mixed 3 trials, no gpu40K80K120K160K200KSE +/- 10461.52, N = 31669771. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Object DetectionDL385 Workstation mixed 3 trials, no gpu8K16K24K32K40KSE +/- 4251.37, N = 3364811. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: JythonDL385 Workstation mixed 3 trials, no gpu7001400210028003500SE +/- 8.45, N = 33328

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 1DL385 Workstation mixed 3 trials, no gpu0.45090.90181.35271.80362.2545SE +/- 0.002, N = 22.0041. (CC) gcc options: -O2 -lreadline -ltermcap -lz -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.41.2Threads / Copies: 32DL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 15.34, N = 3111.731. (CC) gcc options: -O2 -lreadline -ltermcap -lz -lm

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsDL385 Workstation mixed 3 trials, no gpu306090120150SE +/- 50.55, N = 3123.821. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsDL385 Workstation mixed 3 trials, no gpu48121620SE +/- 0.08, N = 315.62

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.3DL385 Workstation mixed 3 trials, no gpu1224364860SE +/- 0.12, N = 353.251. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dDL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 1.08, N = 3243.271. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNADL385 Workstation mixed 3 trials, no gpu3691215SE +/- 0.062, N = 39.0381. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Dust 2D tau100.0DL385 Workstation mixed 3 trials, no gpu50100150200250SE +/- 4.11, N = 3209.141. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Bork File Encrypter

Bork is a small, cross-platform file encryption utility. It is written in Java and designed to be included along with the files it encrypts for long-term storage. This test measures the amount of time it takes to encrypt a sample file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBork File Encrypter 1.4File Encryption TimeDL385 Workstation mixed 3 trials, no gpu246810SE +/- 0.036, N = 37.152

Java Gradle Build

This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.

Gradle Build: Reactor

DL385 Workstation mixed 3 trials, no gpu: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6DL385 Workstation mixed 3 trials, no gpu0.65121.30241.95362.60483.256SE +/- 0.037, N = 32.8941. (CXX) g++ options: -O3 -fPIC -lm

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-QueensDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.10, N = 321.86

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000DL385 Workstation mixed 3 trials, no gpu1122334455SE +/- 0.19, N = 350.181. (CC) gcc options: -O2 -lz

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Boat - Acceleration: CPU-onlyDL385 Workstation mixed 3 trials, no gpu0.33480.66961.00441.33921.674SE +/- 0.005, N = 31.488

GEGL

GEGL is the Generic Graphics Library and is the library/framework used by GIMP and other applications like GNOME Photos. This test profile times how long it takes to complete various GEGL operations on a static set of sample JPEG images. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGEGLOperation: Rotate 90 DegreesDL385 Workstation mixed 3 trials, no gpu1224364860SE +/- 0.08, N = 354.02

GIMP

GIMP is an open-source image manipulaton program. This test profile will use the system-provided GIMP program otherwise on Windows relys upon a pre-packaged Windows binary from upstream GIMP.org. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGIMP 2.10.30Test: rotateDL385 Workstation mixed 3 trials, no gpu48121620SE +/- 0.02, N = 314.15

G'MIC

G'MIC is an open-source framework for image processing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterG'MICTest: 2D Function Plotting, 1000 TimesDL385 Workstation mixed 3 trials, no gpu20406080100SE +/- 0.53, N = 3103.19

Inkscape

Inkscape is an open-source vector graphics editor. This test profile times how long it takes to complete various operations by Inkscape. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterInkscapeOperation: SVG Files To PNGDL385 Workstation mixed 3 trials, no gpu714212835SE +/- 0.08, N = 328.181. Inkscape 1.1.2 (0a00cf5339, 2022-02-04)

GNU Octave Benchmark

This test profile measures how long it takes to complete several reference GNU Octave files via octave-benchmark. GNU Octave is used for numerical computations and is an open-source alternative to MATLAB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGNU Octave Benchmark 6.4.0DL385 Workstation mixed 3 trials, no gpu246810SE +/- 0.003, N = 36.195

RawTherapee

RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark TimeDL385 Workstation mixed 3 trials, no gpu1224364860SE +/- 0.17, N = 353.441. RawTherapee, version 5.8, command line.

librsvg

RSVG/librsvg is an SVG vector graphics library. This test profile times how long it takes to complete various operations by rsvg-convert. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterlibrsvgOperation: SVG Files To PNGDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.02, N = 322.711. rsvg-convert version 2.52.5

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeDL385 Workstation mixed 3 trials, no gpu714212835SE +/- 0.44, N = 331.091. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-OnlyDL385 Workstation mixed 3 trials, no gpu246810SE +/- 0.02, N = 38.93

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-OnlyDL385 Workstation mixed 3 trials, no gpu510152025SE +/- 0.03, N = 321.40

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of State

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of State

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of StateDL385 Workstation mixed 3 trials, no gpu0.04280.08560.12840.17120.214SE +/- 0.001, N = 30.190

Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of State

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of State

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of State

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

DL385 Workstation mixed 3 trials, no gpu: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Exception in thread "main" java.lang.UnsatisfiedLinkError: Can't load library: /usr/lib/jvm/java-11-openjdk-amd64/lib/libawt_xawt.so

129 Results Shown

Intel MPI Benchmarks
miniFE
Algebraic Multi-Grid Benchmark
OpenVINO
ACES DGEMM
ArrayFire
TensorFlow:
  CPU - 32 - VGG-16
  CPU - 32 - ResNet-50
Flexible IO Tester:
  Rand Read - Linux AIO - No - Yes - 1MB - 1 - /data
  Rand Write - Linux AIO - No - Yes - 1MB - 1 - /data
  Seq Read - Linux AIO - No - Yes - 8MB - 1 - /data
  Seq Read - Linux AIO - No - Yes - 64MB - 1 - /data
  Seq Write - Linux AIO - No - Yes - 8MB - 1 - /data
  Seq Write - Linux AIO - No - Yes - 64MB - 1 - /data
GraphicsMagick:
  Rotate
  Resizing
  Noise-Gaussian
  HWB Color Space
LevelDB
Dbench
Compile Bench
RAMspeed SMP
Stream:
  Scale
  Triad
  Add
Tinymembench:
  Standard Memcpy
  Standard Memset
CacheBench
libjpeg-turbo tjbench
FFTE
FFTW:
  Stock - 2D FFT Size 4096
  Float + SSE - 2D FFT Size 4096
Himeno Benchmark
Flexible IO Tester:
  Rand Read - Linux AIO - No - Yes - 1MB - 1 - /data
  Rand Write - Linux AIO - No - Yes - 1MB - 1 - /data
  Seq Read - Linux AIO - No - Yes - 8MB - 1 - /data
  Seq Read - Linux AIO - No - Yes - 64MB - 1 - /data
  Seq Write - Linux AIO - No - Yes - 8MB - 1 - /data
  Seq Write - Linux AIO - No - Yes - 64MB - 1 - /data
MBW
JPEG XL libjxl:
  PNG - 100
  JPEG - 100
JPEG XL Decoding libjxl
WebP Image Encode
WebP2 Image Encode
LibRaw
ASKAP:
  tConvolve MPI - Degridding
  tConvolve MPI - Gridding
GROMACS
Apache Cassandra
RocksDB
CockroachDB
Java JMH
Memcached
etcd
Numpy Benchmark
Darmstadt Automotive Parallel Heterogeneous Suite
InfluxDB
LULESH
NAMD
LevelDB
TensorFlow Lite:
  Inception V4
  Mobilenet Float
PyBench
PyPerformance:
  go
  2to3
  chaos
  float
  nbody
  pathlib
  raytrace
  json_loads
  crypto_pyaes
  regex_compile
  python_startup
  django_template
  pickle_pure_python
Renaissance
OpenJPEG
etcd
NCNN:
  CPU - mobilenet
  CPU-v2-v2 - mobilenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU - shufflenet-v2
  CPU - mnasnet
  CPU - efficientnet-b0
  CPU - blazeface
  CPU - googlenet
  CPU - vgg16
  CPU - resnet18
  CPU - alexnet
  CPU - resnet50
  CPU - yolov4-tiny
  CPU - squeezenet_ssd
  CPU - regnety_400m
  CPU - vision_transformer
  CPU - FastestDet
TNN
OpenVINO
OpenCV:
  Core
  Stitching
  Features 2D
  Image Processing
  Object Detection
DaCapo Benchmark
SQLite:
  1
  32
CloverLeaf
Dolfyn
Nebular Empirical Analysis Tool
Xcompact3d Incompact3d
Timed MAFFT Alignment
Monte Carlo Simulations of Ionised Nebulae
Bork File Encrypter
libavif avifenc
Cython Benchmark
SQLite Speedtest
Darktable
GEGL
GIMP
G'MIC
Inkscape
GNU Octave Benchmark
RawTherapee
librsvg
GPAW
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
PyHPC Benchmarks