Xeon Platinum Ubuntu 19.04

2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and llvmpipe on Ubuntu 19.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102095-HA-XEONPLATI27&grr.

Xeon Platinum Ubuntu 19.04ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionUbuntu 19.042 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)Intel Sky Lake-E DMI3 Registers378GB280GB INTEL SSDPED1D280GAllvmpipe2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbEUbuntu 19.045.0.0-38-generic (x86_64)GNOME Shell 3.32.2X Server 1.20.43.3 Mesa 19.0.8 (LLVM 8.0 256 bits)GCC 8.3.0ext41024x768OpenBenchmarking.org- Transparent Huge Pages: madvise- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c- Python 2.7.16 + Python 3.7.3- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Mitigation of TSX disabled

Xeon Platinum Ubuntu 19.04qe: AUSURF112openvkl: vklBenchmarkopenvkl: vklBenchmarkUnstructuredVolumeyafaray: Total Time For Sample Scenengspice: C2670webp2: Quality 75, Compression Effort 7plaidml: No - Inference - VGG19 - CPUplaidml: No - Inference - ResNet 50 - CPUcaffe: AlexNet - CPU - 200mysqlslap: 256mysqlslap: 512onednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUmysqlslap: 128lammps: 20k Atomstensorflow-lite: Inception ResNet V2luxcorerender: Rainbow Colors and Prismtensorflow-lite: SqueezeNettensorflow-lite: Mobilenet Quanttensorflow-lite: Inception V4rodinia: OpenMP Leukocyteopenvkl: vklBenchmarkStructuredVolumewebp2: Quality 95, Compression Effort 7openfoam: Motorbike 60Mtensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floatcaffe: GoogleNet - CPU - 200hmmer: Pfam Database Searchnumpy: mocassin: Dust 2D tau100.0hpcg: kripke: neat: incompact3d: Cylindersvt-av1: Enc Mode 0 - 1080prodinia: OpenMP HotSpot3Dngspice: C7552appleseed: Material Testeronnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: yolov4 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUnode-web-tooling: mrbayes: Primate Phylogeny Analysisappleseed: Emilypyperformance: python_startupblender: Classroom - CPU-Onlyfinancebench: Bonds OpenMPaskap: Hogbom Clean OpenMPplaidml: No - Inference - VGG16 - CPUpyperformance: raytraceredis: SETv-ray: CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUredis: GETredis: LPOPopenvkl: vklBenchmarkVdbVolumepyperformance: 2to3sqlite-speedtest: Timed Time - Size 1,000indigobench: CPU - Bedroomfinancebench: Repo OpenMPindigobench: CPU - Supercarluxcorerender: DLSCredis: SADDrodinia: OpenMP LavaMDpyperformance: goonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUappleseed: Disney Materialhimeno: Poisson Pressure Solverpgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Onlyaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddinggromacs: Water Benchmarknamd: ATPase Simulation - 327,506 Atomsblender: BMW27 - CPU-Onlyqmcpack: simple-H2Orodinia: OpenMP CFD Solverkvazaar: Bosphorus 4K - Very Fastpyperformance: regex_compilepgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlynpb: EP.Dpgbench: 100 - 250 - Read Write - Average Latencypgbench: 100 - 250 - Read Writepgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 100 - Read Writeopenfoam: Motorbike 30Mquantlib: onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUlibraw: Post-Processing Benchmarkpyperformance: pathlibphpbench: PHP Benchmark Suitex265: Bosphorus 4Ksynthmark: VoiceMark_100pyperformance: pickle_pure_pythonpyperformance: django_templatepyperformance: json_loadstnn: CPU - MobileNet v2pybench: Total For Average Test Timestnn: CPU - SqueezeNet v1.1dolfyn: Computational Fluid Dynamicspyperformance: chaospyperformance: nbodypennant: sedovbigpyperformance: crypto_pyaesamg: onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUarrayfire: Conjugate Gradient CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUkvazaar: Bosphorus 4K - Ultra Fastpyperformance: floatwebp: Quality 100, Losslesssvt-av1: Enc Mode 4 - 1080px265: Bosphorus 1080predis: LPUSHnpb: LU.Conednn: IP Shapes 1D - bf16bf16bf16 - CPUarrayfire: BLAS CPUmafft: Multiple Sequence Alignment - LSU RNAsvt-av1: Enc Mode 8 - 1080pmt-dgemm: Sustained Floating-Point Rateonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUrodinia: OpenMP Streamclusterpennant: leblancbigonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUkvazaar: Bosphorus 1080p - Very Fastwebp2: Quality 100, Compression Effort 5webp: Quality 100, Highest Compressiononednn: Convolution Batch Shapes Auto - u8s8f32 - CPUoidn: Memorialkvazaar: Bosphorus 1080p - Ultra Fastsvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ponednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUwebp: Quality 100lammps: Rhodopsin ProteinUbuntu 19.041417.324561993357223.468163.824133.44323.494.7282612499521934.342576.48859926.65410115587.2767294.457234.6110208362.06475077913248.917168.52104739.153606.7225798212.287344.6318929.03959350257732.958150.9552000.138114.036135.573193.0101151384023176156641010.03110.047157.87455211.3104.2288678.114584615.30628.214681747022.3947419923.5341.33794558.1262201881.922338189.982481276734064.9078.41951894.80468719.9307.361901359.6555.8882402.232271.0907069.3196693994.5041770.38664907012085.010262.15.7740.3659738.3836.7257.26016.881730.1158737574069.105.771434522.2584435221.002296.20.33199138.6118.763317719.28535.51244554.126.1365.9971080329.55522.89210911621.8275210914884320004.583328.4830.39140928.6310319.2408.21234.501486892.54140470.313.708413460.9213.04960.76018.7188910.81479911.9608.4620183.1736068.078.4747.3933.6870335.47122.90222.78258.83267.364.452970.3313902.49027.906OpenBenchmarking.org

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112Ubuntu 19.0430060090012001500SE +/- 60.78, N = 41417.321. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenVKL

Benchmark: vklBenchmark

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkUbuntu 19.04100200300400500SE +/- 4.20, N = 9456MIN: 1 / MAX: 1812

OpenVKL

Benchmark: vklBenchmarkUnstructuredVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkUnstructuredVolumeUbuntu 19.04400K800K1200K1600K2000KSE +/- 2118.45, N = 31993357MIN: 20783 / MAX: 6493629

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneUbuntu 19.0450100150200250SE +/- 16.23, N = 12223.471. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lboost_system -lboost_filesystem -lboost_locale

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670Ubuntu 19.044080120160200SE +/- 1.27, N = 12163.821. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7Ubuntu 19.04306090120150SE +/- 1.20, N = 12133.441. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUUbuntu 19.04612182430SE +/- 0.25, N = 1523.49

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUUbuntu 19.041.0622.1243.1864.2485.31SE +/- 0.01, N = 34.72

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 200Ubuntu 19.0420K40K60K80K100KSE +/- 2840.64, N = 15826121. (CXX) g++ options: -O3 -march=native -fPIC -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

MariaDB

Clients: 256

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 256Ubuntu 19.04110220330440550SE +/- 1.09, N = 34991. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 512

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 512Ubuntu 19.04110220330440550SE +/- 0.18, N = 35211. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUUbuntu 19.042004006008001000SE +/- 5.96, N = 15934.34MIN: 905.461. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.04120240360480600SE +/- 9.03, N = 15576.49MIN: 541.141. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

MariaDB

Clients: 128

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 128Ubuntu 19.04130260390520650SE +/- 2.90, N = 35991. (CXX) g++ options: -O3 -march=native -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsUbuntu 19.04612182430SE +/- 0.03, N = 326.651. (CXX) g++ options: -O3 -march=native -pthread -lm

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Ubuntu 19.04200K400K600K800K1000KSE +/- 17126.87, N = 151011558

LuxCoreRender

Scene: Rainbow Colors and Prism

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: Rainbow Colors and PrismUbuntu 19.04246810SE +/- 0.33, N = 157.27MIN: 4.37 / MAX: 8.24

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetUbuntu 19.0414K28K42K56K70KSE +/- 669.07, N = 1567294.4

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantUbuntu 19.0412K24K36K48K60KSE +/- 727.18, N = 1557234.6

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Ubuntu 19.04200K400K600K800K1000KSE +/- 7251.55, N = 141102083

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteUbuntu 19.041428425670SE +/- 0.71, N = 1362.061. (CXX) g++ options: -O2 -lOpenCL

OpenVKL

Benchmark: vklBenchmarkStructuredVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkStructuredVolumeUbuntu 19.0416M32M48M64M80MSE +/- 1084350.33, N = 1575077913MIN: 1073646 / MAX: 803527920

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7Ubuntu 19.0450100150200250SE +/- 1.25, N = 3248.921. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60MUbuntu 19.044080120160200SE +/- 0.34, N = 3168.521. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileUbuntu 19.0420K40K60K80K100KSE +/- 2444.48, N = 12104739.1

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatUbuntu 19.0411K22K33K44K55KSE +/- 675.73, N = 1253606.7

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 200Ubuntu 19.0450K100K150K200K250KSE +/- 1675.73, N = 32257981. (CXX) g++ options: -O3 -march=native -fPIC -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchUbuntu 19.0450100150200250SE +/- 0.20, N = 3212.291. (CC) gcc options: -O3 -march=native -pthread -lhmmer -leasel -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkUbuntu 19.0470140210280350SE +/- 1.31, N = 3344.63

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0Ubuntu 19.0440801201602001891. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Ubuntu 19.04714212835SE +/- 0.15, N = 329.041. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4Ubuntu 19.0420M40M60M80M100MSE +/- 1474539.43, N = 15935025771. (CXX) g++ options: -O3 -march=native -fopenmp

Nebular Empirical Analysis Tool

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2020-02-29Ubuntu 19.04816243240SE +/- 0.26, N = 1532.961. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

Incompact3D

Input: Cylinder

OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: CylinderUbuntu 19.04306090120150SE +/- 0.61, N = 3150.961. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pUbuntu 19.040.03110.06220.09330.12440.1555SE +/- 0.000, N = 30.1381. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DUbuntu 19.04306090120150SE +/- 1.32, N = 4114.041. (CXX) g++ options: -O2 -lOpenCL

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552Ubuntu 19.04306090120150SE +/- 1.71, N = 3135.571. (CC) gcc options: -O3 -march=native -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Appleseed

Scene: Material Tester

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterUbuntu 19.044080120160200193.01

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUUbuntu 19.04306090120150SE +/- 0.58, N = 31381. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUUbuntu 19.0490180270360450SE +/- 2.57, N = 34021. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUUbuntu 19.0470140210280350SE +/- 3.24, N = 33171. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUUbuntu 19.0413002600390052006500SE +/- 35.31, N = 361561. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUUbuntu 19.0414002800420056007000SE +/- 33.38, N = 364101. (CXX) g++ options: -O3 -march=native -fopenmp -ffunction-sections -fdata-sections -ldl -lrt

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling BenchmarkUbuntu 19.043691215SE +/- 0.12, N = 410.031. Nodejs v10.15.2

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisUbuntu 19.0420406080100SE +/- 0.25, N = 3110.051. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -march=native -lm

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilyUbuntu 19.04306090120150157.87

PyPerformance

Benchmark: python_startup

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startupUbuntu 19.043691215SE +/- 0.00, N = 311.3

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-OnlyUbuntu 19.0420406080100SE +/- 0.38, N = 3104.22

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPUbuntu 19.0420K40K60K80K100KSE +/- 101.32, N = 388678.111. (CXX) g++ options: -O3 -march=native -fopenmp

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPUbuntu 19.04130260390520650SE +/- 17.36, N = 15615.311. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUUbuntu 19.04714212835SE +/- 0.19, N = 328.21

PyPerformance

Benchmark: raytrace

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytraceUbuntu 19.04100200300400500SE +/- 0.33, N = 3468

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETUbuntu 19.04400K800K1200K1600K2000KSE +/- 30273.24, N = 151747022.391. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPUUbuntu 19.0410K20K30K40K50KSE +/- 166.62, N = 347419

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.042004006008001000SE +/- 0.79, N = 3923.53MIN: 910.011. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.3010.6020.9031.2041.505SE +/- 0.00870, N = 151.33794MIN: 1.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUUbuntu 19.04120240360480600SE +/- 5.67, N = 3558.13MIN: 543.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETUbuntu 19.04500K1000K1500K2000K2500KSE +/- 70535.69, N = 152201881.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOPUbuntu 19.04500K1000K1500K2000K2500KSE +/- 68474.33, N = 152338189.981. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

OpenVKL

Benchmark: vklBenchmarkVdbVolume

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 0.9Benchmark: vklBenchmarkVdbVolumeUbuntu 19.045M10M15M20M25MSE +/- 70798.88, N = 324812767MIN: 868934 / MAX: 162479232

PyPerformance

Benchmark: 2to3

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to3Ubuntu 19.0470140210280350340

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Ubuntu 19.041428425670SE +/- 0.15, N = 364.911. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomUbuntu 19.04246810SE +/- 0.031, N = 38.419

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPUbuntu 19.0411K22K33K44K55KSE +/- 498.18, N = 351894.801. (CXX) g++ options: -O3 -march=native -fopenmp

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarUbuntu 19.04510152025SE +/- 0.04, N = 319.93

LuxCoreRender

Scene: DLSC

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCUbuntu 19.04246810SE +/- 0.01, N = 37.36MIN: 7.25 / MAX: 7.66

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDUbuntu 19.04400K800K1200K1600K2000KSE +/- 39203.08, N = 121901359.651. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDUbuntu 19.041326395265SE +/- 0.55, N = 355.891. (CXX) g++ options: -O2 -lOpenCL

PyPerformance

Benchmark: go

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: goUbuntu 19.0450100150200250240

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.040.50231.00461.50692.00922.5115SE +/- 0.05726, N = 152.23227MIN: 2.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.24540.49080.73620.98161.227SE +/- 0.00937, N = 151.09070MIN: 0.751. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Appleseed

Scene: Disney Material

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialUbuntu 19.04153045607569.32

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverUbuntu 19.049001800270036004500SE +/- 0.36, N = 33994.501. (CC) gcc options: -O3 -march=native -mavx2

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyUbuntu 19.040.08690.17380.26070.34760.4345SE +/- 0.004, N = 40.3861. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyUbuntu 19.04140K280K420K560K700KSE +/- 7278.19, N = 46490701. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingUbuntu 19.043K6K9K12K15KSE +/- 78.97, N = 312085.01. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingUbuntu 19.042K4K6K8K10KSE +/- 66.19, N = 310262.11. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkUbuntu 19.041.29922.59843.89765.19686.496SE +/- 0.014, N = 35.7741. (CXX) g++ options: -O3 -march=native -pthread -lrt -lpthread -lm

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsUbuntu 19.040.08230.16460.24690.32920.4115SE +/- 0.00094, N = 30.36597

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-OnlyUbuntu 19.04918273645SE +/- 0.17, N = 338.38

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2OUbuntu 19.04816243240SE +/- 0.07, N = 336.731. (CXX) g++ options: -O3 -march=native -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -fomit-frame-pointer -ffast-math -lm -pthread

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverUbuntu 19.04246810SE +/- 0.090, N = 157.2601. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastUbuntu 19.0448121620SE +/- 0.05, N = 316.881. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

PyPerformance

Benchmark: regex_compile

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compileUbuntu 19.044080120160200173

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyUbuntu 19.040.02590.05180.07770.10360.1295SE +/- 0.000, N = 30.1151. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyUbuntu 19.04200K400K600K800K1000KSE +/- 2996.12, N = 38737571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DUbuntu 19.049001800270036004500SE +/- 18.13, N = 34069.101. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 3.1.3

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyUbuntu 19.041.29852.5973.89555.1946.4925SE +/- 0.037, N = 35.7711. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteUbuntu 19.049K18K27K36K45KSE +/- 269.58, N = 3434521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyUbuntu 19.040.50811.01621.52432.03242.5405SE +/- 0.007, N = 32.2581. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteUbuntu 19.049K18K27K36K45KSE +/- 132.51, N = 3443521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30MUbuntu 19.04510152025SE +/- 0.07, N = 321.001. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21Ubuntu 19.045001000150020002500SE +/- 10.45, N = 32296.21. (CXX) g++ options: -O3 -march=native -rdynamic -lboost_timer -lboost_system -lboost_chrono

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.07470.14940.22410.29880.3735SE +/- 0.002873, N = 80.331991MIN: 0.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkUbuntu 19.04918273645SE +/- 0.05, N = 338.611. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

PyPerformance

Benchmark: pathlib

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlibUbuntu 19.04510152025SE +/- 0.00, N = 318.7

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteUbuntu 19.04140K280K420K560K700KSE +/- 321.84, N = 3633177

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KUbuntu 19.04510152025SE +/- 0.19, N = 319.281. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Ubuntu 19.04120240360480600SE +/- 0.56, N = 3535.511. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

PyPerformance

Benchmark: pickle_pure_python

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_pythonUbuntu 19.04100200300400500445

PyPerformance

Benchmark: django_template

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_templateUbuntu 19.041224364860SE +/- 0.07, N = 354.1

PyPerformance

Benchmark: json_loads

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loadsUbuntu 19.04612182430SE +/- 0.00, N = 326.1

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2Ubuntu 19.0480160240320400SE +/- 1.21, N = 3366.00MIN: 354.49 / MAX: 501.681. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test TimesUbuntu 19.042004006008001000SE +/- 2.52, N = 31080

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1Ubuntu 19.0470140210280350SE +/- 0.06, N = 3329.56MIN: 329.04 / MAX: 331.161. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsUbuntu 19.04510152025SE +/- 0.04, N = 322.89

PyPerformance

Benchmark: chaos

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaosUbuntu 19.0420406080100109

PyPerformance

Benchmark: nbody

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: nbodyUbuntu 19.04306090120150116

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigUbuntu 19.04510152025SE +/- 0.02, N = 321.831. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

PyPerformance

Benchmark: crypto_pyaes

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaesUbuntu 19.0420406080100109

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Ubuntu 19.04300M600M900M1200M1500MSE +/- 6349082.45, N = 314884320001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.041.03122.06243.09364.12485.156SE +/- 0.00382, N = 34.58332MIN: 4.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ArrayFire

Test: Conjugate Gradient CPU

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient CPUUbuntu 19.04246810SE +/- 0.739, N = 158.4831. (CXX) g++ options: -O3 -march=native -rdynamic

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.08810.17620.26430.35240.4405SE +/- 0.000442, N = 30.391409MIN: 0.351. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastUbuntu 19.04714212835SE +/- 0.22, N = 328.631. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

PyPerformance

Benchmark: float

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: floatUbuntu 19.0420406080100103

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessUbuntu 19.04510152025SE +/- 0.01, N = 319.241. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pUbuntu 19.04246810SE +/- 0.016, N = 38.2121. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pUbuntu 19.04816243240SE +/- 0.14, N = 334.501. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHUbuntu 19.04300K600K900K1200K1500KSE +/- 12883.11, N = 31486892.541. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CUbuntu 19.0430K60K90K120K150KSE +/- 278.43, N = 3140470.311. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 3.1.3

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.040.83441.66882.50323.33764.172SE +/- 0.00380, N = 33.70841MIN: 3.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ArrayFire

Test: BLAS CPU

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS CPUUbuntu 19.047001400210028003500SE +/- 4.47, N = 33460.921. (CXX) g++ options: -O3 -march=native -rdynamic

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAUbuntu 19.043691215SE +/- 0.18, N = 313.051. (CC) gcc options: -std=c99 -O3 -lm -lpthread

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pUbuntu 19.041428425670SE +/- 0.48, N = 360.761. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateUbuntu 19.04510152025SE +/- 0.08, N = 318.721. (CC) gcc options: -O3 -march=native -fopenmp

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.040.18330.36660.54990.73320.9165SE +/- 0.001869, N = 30.814799MIN: 0.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterUbuntu 19.043691215SE +/- 0.02, N = 311.961. (CXX) g++ options: -O2 -lOpenCL

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigUbuntu 19.04246810SE +/- 0.013902, N = 38.4620181. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.040.71411.42822.14232.85643.5705SE +/- 0.01090, N = 33.17360MIN: 2.941. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastUbuntu 19.041530456075SE +/- 0.59, N = 368.071. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5Ubuntu 19.04246810SE +/- 0.005, N = 38.4741. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionUbuntu 19.04246810SE +/- 0.006, N = 37.3931. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.82961.65922.48883.31844.148SE +/- 0.00220, N = 33.68703MIN: 3.591. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Intel Open Image Denoise

Scene: Memorial

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.2.0Scene: MemorialUbuntu 19.04816243240SE +/- 0.41, N = 335.47

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastUbuntu 19.04306090120150SE +/- 0.68, N = 3122.901. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: Visual Quality Optimized - Input: Bosphorus 1080pUbuntu 19.0450100150200250SE +/- 2.60, N = 3222.781. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: VMAF Optimized - Input: Bosphorus 1080pUbuntu 19.0460120180240300SE +/- 3.52, N = 3258.831. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.1Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pUbuntu 19.0460120180240300SE +/- 3.58, N = 3267.361. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUUbuntu 19.041.00192.00383.00574.00765.0095SE +/- 0.00670, N = 34.45297MIN: 4.41. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUUbuntu 19.040.07460.14920.22380.29840.373SE +/- 0.002349, N = 30.331390MIN: 0.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Ubuntu 19.040.56031.12061.68092.24122.8015SE +/- 0.004, N = 32.4901. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinUbuntu 19.04714212835SE +/- 0.06, N = 327.911. (CXX) g++ options: -O3 -march=native -pthread -lm


Phoronix Test Suite v10.8.4