CompuLab Airtop 3

Intel Xeon E-2288G testing with a Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2011040-FI-COMPULABA81&grr&rdt.

CompuLab Airtop 3ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution123Intel Xeon E-2288G @ 5.00GHz (8 Cores / 16 Threads)Compulab SBC-ATCFL v1.2 (ATOP3.PRD.0.29.2 BIOS)Intel Cannon Lake PCH64GBSamsung SSD 970 EVO Plus 250GBNVIDIA Quadro RTX 4000 8GB (1005/6500MHz)Intel Cannon Lake PCH cAVSVE228Intel I219-LM + Intel I210Ubuntu 20.105.8.0-26-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9NVIDIA 455.284.6.0OpenCL 1.2 CUDA 11.1.961.2.142GCC 10.2.0ext41920x1080NVIDIA Quadro RTX 4000 8GB (300/405MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 2.3OpenCL Details- GPU Compute Cores: 2304Java Details- OpenJDK Runtime Environment (build 11.0.9+11-Ubuntu-0ubuntu1)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

CompuLab Airtop 3lammps: 20k Atomsblender: Barbershop - NVIDIA OptiXjava-gradle-perf: Reactorblender: Barbershop - CPU-Onlybuild-llvm: Time To Compileblender: Pabellon Barcelona - CPU-Onlyblender: Classroom - CPU-Onlylczero: Eigenincompact3d: Cylinderhint: FLOATlczero: BLASrodinia: OpenMP HotSpot3Dbrl-cad: VGR Performance Metricrodinia: OpenMP LavaMDastcenc: Exhaustiveblender: Fishy Cat - CPU-Onlyrodinia: OpenMP CFD Solvergromacs: Water Benchmarkcaffe: GoogleNet - CPU - 200blender: BMW27 - CPU-Onlyrodinia: OpenMP Leukocytewireguard: tensorflow-lite: Inception V4tensorflow-lite: Inception ResNet V2blender: Pabellon Barcelona - NVIDIA OptiXkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumdav1d: Chimera 1080p 10-bitbyte: Dhrystone 2rodinia: OpenMP Streamclusterblender: Classroom - NVIDIA OptiXcaffe: GoogleNet - CPU - 100namd: ATPase Simulation - 327,506 Atomshmmer: Pfam Database Searchavifenc: 0build-linux-kernel: Time To Compilepgbench: 1 - 250 - Read Write - Average Latencypgbench: 1 - 250 - Read Writecaffe: AlexNet - CPU - 200pgbench: 1 - 1 - Read Write - Average Latencypgbench: 1 - 1 - Read Writeinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000onednn: Deconvolution Batch deconv_1d - f32 - CPUkeydb: onednn: IP Batch 1D - f32 - CPUpyperformance: raytracencnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: CPU - squeezenetrawtherapee: Total Benchmark Timetensorflow-lite: SqueezeNettensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floatavifenc: 2onednn: IP Batch All - u8s8f32 - CPUonednn: IP Batch All - f32 - CPUsunflow: Global Illumination + Image Synthesispyperformance: python_startupblender: Fishy Cat - NVIDIA OptiXrodinia: OpenCL Myocytekvazaar: Bosphorus 4K - Very Fastastcenc: Thoroughx265: Bosphorus 4Kpyperformance: 2to3onednn: IP Batch 1D - u8s8f32 - CPUcaffe: AlexNet - CPU - 100pyperformance: goonednn: Recurrent Neural Network Training - f32 - CPUpgbench: 1 - 250 - Read Only - Average Latencypgbench: 1 - 250 - Read Onlyonednn: Recurrent Neural Network Inference - f32 - CPUlibraw: Post-Processing Benchmarkwebp: Quality 100, Lossless, Highest Compressionkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumpyperformance: floataom-av1: Speed 0 Two-Passpyperformance: django_templatepyperformance: chaospyperformance: crypto_pyaesdacapobench: H2blender: BMW27 - NVIDIA OptiXpyperformance: regex_compilekvazaar: Bosphorus 4K - Ultra Fastaom-av1: Speed 6 Realtimepgbench: 1 - 100 - Read Write - Average Latencypgbench: 1 - 100 - Read Writepgbench: 1 - 50 - Read Write - Average Latencypgbench: 1 - 50 - Read Writepgbench: 1 - 1 - Read Only - Average Latencypgbench: 1 - 1 - Read Onlypgbench: 1 - 100 - Read Only - Average Latencypgbench: 1 - 100 - Read Onlypgbench: 1 - 50 - Read Only - Average Latencypgbench: 1 - 50 - Read Onlyaom-av1: Speed 6 Two-Passdav1d: Summer Nature 4Kpyperformance: pathlibocrmypdf: Processing 60 Page PDF Documentrnnoise: onednn: Deconvolution Batch deconv_1d - u8s8f32 - CPUtesseract-ocr: Time To OCR 7 Imagesopenssl: RSA 4096-bit Performancetnn: CPU - MobileNet v2pyperformance: pickle_pure_pythonpyperformance: json_loadsneatbench: CPUtnn: CPU - SqueezeNet v1.1pyperformance: nbodydav1d: Chimera 1080paom-av1: Speed 4 Two-Passncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU - squeezenetdolfyn: Computational Fluid Dynamicswebp: Quality 100, Losslesskvazaar: Bosphorus 1080p - Very Fastaom-av1: Speed 8 Realtimedacapobench: Tradesoaponednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUlammps: Rhodopsin Proteinkvazaar: Bosphorus 1080p - Ultra Fastx265: Bosphorus 1080pdacapobench: Tradebeansastcenc: Mediumonednn: Deconvolution Batch deconv_3d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUwebp: Quality 100, Highest Compressiondacapobench: Jythondav1d: Summer Nature 1080ponednn: Convolution Batch Shapes Auto - u8s8f32 - CPUastcenc: Fastavifenc: 8avifenc: 10onednn: Deconvolution Batch deconv_3d - f32 - CPUneatbench: GPUffte: N=256, 3D Complex FFT Routinewebp: Quality 100webp: Default1235.8511312.28197.762787.85771.207651.27584.98751378.478658472824990.6232584492.66899977276.215274.72271.1622.6490.820207902186.64140.313160.51332693532954177145.474.154.21114.5548134384.719.563104.111037861.94453100.12399.66998.182419.725599802732.0844801507808.81534293.61535930.84.89651737463.314.4962737428.7028.8014.3815.2067.2214.991.466.693.923.214.065.1119.3016.1061.74922785220101515725015391958.66427.636468.05251.1386.8054.9334.78611.9533.3312.372641.8010340095200321.8030.982254678149.61833.3134.70018.2418.6990.00.3338.685.785.8286427.9013821.8723.54176.12656881.8656110.03334140.3652744630.1732894014.30156.4014.422.02421.5875.7059420.1342431.8286.03533821.310.9269.712103639.972.728.673.772.101.688.363.530.612.671.481.301.671.454.633.6616.83315.61346.7345.9236523.999112.580816.60087.1654.4926908.483.5870117.11836.3853709581.3416.44185.535.0224.7576.6426129.431760.0230383122.0771.3135.9091264.86199.454783.83762.884650.83587.97762377.557241474322870.4901683789.72999832273.155273.46269.6722.7400.815207889186.19138.389158.36032607102946580144.874.174.23116.3648808319.119.631103.421033491.93847100.23699.27398.147424.257591799222.4124291526391.31543312.51557794.24.67392736916.334.1315337529.5828.9114.3315.4966.6015.091.456.683.913.174.075.1319.2216.1061.85422623919950415629915322158.39927.370768.99941.1326.7454.7134.51412.0533.2412.392611.8057139759199326.9600.995251429139.87133.6634.84418.5818.9290.00.3338.985.885.8292527.7613722.4523.44173.12057883.8865960.030332670.3562812910.1692960884.29160.4714.321.06721.5935.5935720.3152467.9287.55933921.210.8269.656103647.742.728.323.742.081.728.323.220.622.651.481.311.681.424.583.6516.71115.60348.2446.2635504.069702.579266.80592.5356.1426458.443.2418917.13976.3863698582.1316.74415.524.9954.7386.4734930.531509.6340982312.0761.3155.9051325.38197.054787.49650.06584.40671605.889669471834576.34719809213.51599792296.269275.59270.29265.6240.821208264186.87214.308159.75332704332957670145.794.154.20113.4148679363.369.374104.691039032.62968100.83299.341449.114557805072.3124421508093.51530241.01539617.54.70756734176.594.1312937729.0729.2014.7215.6167.8215.381.506.854.173.264.205.3119.2316.3862.12422759920126515771915436258.66427.347368.59481.1276.7555.2335.76011.9133.3412.342611.8670640315200324.3351.004249348136.15133.8734.61218.1118.6990.10.3339.286.185.9277527.8613921.9223.36173.23957882.7336040.031325790.3602780190.1712919954.27156.8414.522.09421.5975.7069020.2242474.6284.02533921.210.9269.435103641.982.728.703.762.091.668.423.230.632.651.501.311.681.464.773.6617.02715.69446.7945.7837043.876142.582326.32287.1854.4228018.553.3040217.19676.3823718579.6716.60465.555.0174.7336.6117230.619217.7211940452.0781.318OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1231.32952.6593.98855.3186.6475SE +/- 0.013, N = 3SE +/- 0.005, N = 3SE +/- 0.046, N = 35.8515.9095.9051. (CXX) g++ options: -O3 -pthread -lm

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: NVIDIA OptiX12330060090012001500SE +/- 19.68, N = 4SE +/- 20.32, N = 3SE +/- 22.81, N = 31312.281264.861325.38

Java Gradle Build

Gradle Build: Reactor

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: Reactor1234080120160200SE +/- 2.21, N = 12SE +/- 2.22, N = 12SE +/- 2.36, N = 12197.76199.45197.05

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Barbershop - Compute: CPU-Only1232004006008001000SE +/- 1.21, N = 3SE +/- 0.37, N = 3SE +/- 0.65, N = 3787.85783.83787.49

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile12170340510680850SE +/- 2.53, N = 3SE +/- 1.50, N = 3771.21762.88

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: CPU-Only123140280420560700SE +/- 1.32, N = 3SE +/- 1.12, N = 3SE +/- 0.68, N = 3651.27650.83650.06

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: CPU-Only123130260390520650SE +/- 0.63, N = 3SE +/- 1.27, N = 3SE +/- 0.47, N = 3584.98587.97584.40

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: Eigen123160320480640800SE +/- 8.19, N = 3SE +/- 4.63, N = 3SE +/- 13.55, N = 97517626711. (CXX) g++ options: -flto -pthread

Incompact3D

Input: Cylinder

OpenBenchmarking.orgSeconds, Fewer Is BetterIncompact3D 2020-09-17Input: Cylinder123130260390520650SE +/- 1.16, N = 3SE +/- 1.24, N = 3SE +/- 3.73, N = 3378.48377.56605.891. (F9X) gfortran options: -cpp -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOAT123100M200M300M400M500MSE +/- 493184.05, N = 3SE +/- 93008.17, N = 3SE +/- 1375357.19, N = 3472824990.62474322870.49471834576.351. (CC) gcc options: -O3 -march=native -lm

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.26Backend: BLAS1232004006008001000SE +/- 10.82, N = 38448378091. (CXX) g++ options: -flto -pthread

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3D12350100150200250SE +/- 1.38, N = 3SE +/- 1.37, N = 3SE +/- 13.20, N = 1292.6789.73213.521. (CXX) g++ options: -O2 -lOpenCL

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric12320K40K60K80K100K9997799832997921. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lpthread -ldl -luuid -lm

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMD12360120180240300SE +/- 1.21, N = 3SE +/- 0.19, N = 3SE +/- 3.61, N = 3276.22273.16296.271. (CXX) g++ options: -O2 -lOpenCL

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive12360120180240300SE +/- 0.80, N = 3SE +/- 0.56, N = 3SE +/- 0.84, N = 3274.72273.46275.591. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: CPU-Only12360120180240300SE +/- 0.29, N = 3SE +/- 0.26, N = 3SE +/- 0.29, N = 3271.16269.67270.29

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solver12360120180240300SE +/- 0.23, N = 3SE +/- 0.15, N = 3SE +/- 37.41, N = 722.6522.74265.621. (CXX) g++ options: -O2 -lOpenCL

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.18470.36940.55410.73880.9235SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 30.8200.8150.8211. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 20012340K80K120K160K200KSE +/- 124.50, N = 3SE +/- 420.35, N = 3SE +/- 147.41, N = 32079022078892082641. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: CPU-Only1234080120160200SE +/- 0.68, N = 3SE +/- 1.07, N = 3SE +/- 1.31, N = 3186.64186.19186.87

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocyte12350100150200250SE +/- 1.02, N = 3SE +/- 0.74, N = 3SE +/- 0.84, N = 3140.31138.39214.311. (CXX) g++ options: -O2 -lOpenCL

WireGuard + Linux Networking Stack Stress Test

OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Test1234080120160200SE +/- 0.83, N = 3SE +/- 0.95, N = 3SE +/- 0.79, N = 3160.51158.36159.75

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4123700K1400K2100K2800K3500KSE +/- 1176.00, N = 3SE +/- 1202.26, N = 3SE +/- 369.56, N = 3326935332607103270433

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2123600K1200K1800K2400K3000KSE +/- 93.33, N = 3SE +/- 2042.95, N = 3SE +/- 867.58, N = 3295417729465802957670

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX123306090120150SE +/- 0.92, N = 3SE +/- 0.89, N = 3SE +/- 0.98, N = 3145.47144.87145.79

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.93831.87662.81493.75324.6915SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 34.154.174.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.95181.90362.85543.80724.759SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.214.234.201. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bit123306090120150SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 3114.55116.36113.41MIN: 72.75 / MAX: 275.84MIN: 73.44 / MAX: 275.45MIN: 72.47 / MAX: 269.271. (CC) gcc options: -pthread

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 212310M20M30M40M50MSE +/- 543289.97, N = 3SE +/- 22614.50, N = 3SE +/- 80326.86, N = 348134384.748808319.148679363.3

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamcluster1231530456075SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 7.66, N = 1219.5619.6369.371. (CXX) g++ options: -O2 -lOpenCL

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Classroom - Compute: NVIDIA OptiX12320406080100SE +/- 0.79, N = 3SE +/- 0.75, N = 3SE +/- 0.86, N = 3104.11103.42104.69

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 10012320K40K60K80K100KSE +/- 216.65, N = 3SE +/- 79.24, N = 3SE +/- 179.36, N = 31037861033491039031. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms1230.59171.18341.77512.36682.9585SE +/- 0.00650, N = 3SE +/- 0.00297, N = 3SE +/- 0.04166, N = 31.944531.938472.62968

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.30, N = 3100.12100.24100.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 012320406080100SE +/- 0.38, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 399.6799.2799.341. (CXX) g++ options: -O3 -fPIC

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compile1220406080100SE +/- 0.05, N = 3SE +/- 0.19, N = 398.1898.15

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write - Average Latency123100200300400500SE +/- 7.83, N = 15SE +/- 4.74, N = 15SE +/- 4.44, N = 3419.73424.26449.111. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Write123130260390520650SE +/- 11.83, N = 15SE +/- 6.84, N = 15SE +/- 5.55, N = 35995915571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 20012320K40K60K80K100KSE +/- 78.77, N = 3SE +/- 143.26, N = 3SE +/- 112.10, N = 38027379922805071. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency1230.54271.08541.62812.17082.7135SE +/- 0.025, N = 3SE +/- 0.138, N = 12SE +/- 0.108, N = 122.0842.4122.3121. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write123100200300400500SE +/- 5.79, N = 3SE +/- 22.30, N = 12SE +/- 17.55, N = 124804294421. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

InfluxDB

Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 2205.46, N = 3SE +/- 2194.00, N = 3SE +/- 2673.51, N = 31507808.81526391.31508093.5

InfluxDB

Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 5130.21, N = 3SE +/- 4759.58, N = 3SE +/- 4617.21, N = 31534293.61543312.51530241.0

InfluxDB

Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000123300K600K900K1200K1500KSE +/- 4676.23, N = 3SE +/- 6187.13, N = 3SE +/- 1033.19, N = 31535930.81557794.21539617.5

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU1231.10172.20343.30514.40685.5085SE +/- 0.04290, N = 15SE +/- 0.05834, N = 12SE +/- 0.04582, N = 34.896514.673924.70756MIN: 3.92MIN: 3.78MIN: 4.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

KeyDB

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16123160K320K480K640K800KSE +/- 1649.46, N = 3SE +/- 3580.61, N = 3SE +/- 673.14, N = 3737463.31736916.33734176.591. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

oneDNN

Harness: IP Batch 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: f32 - Engine: CPU1231.01172.02343.03514.04685.0585SE +/- 0.04422, N = 15SE +/- 0.03723, N = 13SE +/- 0.04076, N = 124.496274.131534.13129MIN: 3.83MIN: 3.59MIN: 3.591. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PyPerformance

Benchmark: raytrace

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: raytrace12380160240320400374375377

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tiny123714212835SE +/- 0.04, N = 3SE +/- 0.35, N = 3SE +/- 0.12, N = 328.7029.5829.07MIN: 28.5 / MAX: 29.09MIN: 28.17 / MAX: 140.34MIN: 28.8 / MAX: 29.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50123714212835SE +/- 0.17, N = 3SE +/- 0.43, N = 3SE +/- 0.53, N = 328.8028.9129.20MIN: 27.99 / MAX: 145.54MIN: 27.49 / MAX: 158.03MIN: 27.61 / MAX: 140.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnet12348121620SE +/- 0.11, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 314.3814.3314.72MIN: 14.11 / MAX: 17.2MIN: 14.26 / MAX: 14.55MIN: 14.22 / MAX: 122.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet1812348121620SE +/- 0.20, N = 3SE +/- 0.04, N = 3SE +/- 0.25, N = 315.2015.4915.61MIN: 14.69 / MAX: 15.68MIN: 14.84 / MAX: 15.88MIN: 15.01 / MAX: 51.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg161231530456075SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.37, N = 367.2266.6067.82MIN: 65.96 / MAX: 186.53MIN: 66.46 / MAX: 67.69MIN: 66.24 / MAX: 187.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenet12348121620SE +/- 0.31, N = 3SE +/- 0.26, N = 3SE +/- 0.01, N = 314.9915.0915.38MIN: 14.17 / MAX: 16.1MIN: 14.25 / MAX: 15.64MIN: 15.04 / MAX: 15.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazeface1230.33750.6751.01251.351.6875SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 31.461.451.50MIN: 1.38 / MAX: 1.57MIN: 1.34 / MAX: 1.66MIN: 1.42 / MAX: 1.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b0123246810SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.01, N = 36.696.686.85MIN: 6.36 / MAX: 7.32MIN: 6.34 / MAX: 7.11MIN: 6.72 / MAX: 7.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnet1230.93831.87662.81493.75324.6915SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 33.923.914.17MIN: 3.87 / MAX: 4.28MIN: 3.86 / MAX: 4.25MIN: 3.88 / MAX: 4.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v21230.73351.4672.20052.9343.6675SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 33.213.173.26MIN: 2.98 / MAX: 3.47MIN: 2.99 / MAX: 3.47MIN: 2.99 / MAX: 3.841. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v31230.9451.892.8353.784.725SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.15, N = 34.064.074.20MIN: 4.01 / MAX: 4.32MIN: 4.03 / MAX: 4.44MIN: 4.01 / MAX: 4.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v21231.19482.38963.58444.77925.974SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.14, N = 35.115.135.31MIN: 5.01 / MAX: 5.37MIN: 5.01 / MAX: 5.35MIN: 5.02 / MAX: 8.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet123510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 319.3019.2219.23MIN: 18.89 / MAX: 19.7MIN: 18.82 / MAX: 26.38MIN: 18.97 / MAX: 19.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenet12348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 316.1016.1016.38MIN: 15.74 / MAX: 17.79MIN: 15.94 / MAX: 17.11MIN: 15.97 / MAX: 24.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RawTherapee

Total Benchmark Time

OpenBenchmarking.orgSeconds, Fewer Is BetterRawTherapeeTotal Benchmark Time1231428425670SE +/- 0.13, N = 3SE +/- 0.28, N = 3SE +/- 0.05, N = 361.7561.8562.121. RawTherapee, version 5.8, command line.

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNet12350K100K150K200K250KSE +/- 345.32, N = 3SE +/- 496.71, N = 3SE +/- 627.42, N = 3227852226239227599

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet Mobile12340K80K120K160K200KSE +/- 363.56, N = 3SE +/- 120.29, N = 3SE +/- 1000.76, N = 3201015199504201265

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Quant12330K60K90K120K150KSE +/- 112.51, N = 3SE +/- 261.09, N = 3SE +/- 197.44, N = 3157250156299157719

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet Float12330K60K90K120K150KSE +/- 387.33, N = 3SE +/- 330.84, N = 3SE +/- 188.88, N = 3153919153221154362

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21231326395265SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.17, N = 358.6658.4058.661. (CXX) g++ options: -O3 -fPIC

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU123714212835SE +/- 0.07, N = 3SE +/- 0.24, N = 3SE +/- 0.04, N = 327.6427.3727.35MIN: 25.19MIN: 24.77MIN: 25.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU1231530456075SE +/- 0.38, N = 3SE +/- 0.04, N = 3SE +/- 0.54, N = 368.0569.0068.59MIN: 64.46MIN: 63.06MIN: 62.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.25610.51220.76831.02441.2805SE +/- 0.010, N = 15SE +/- 0.011, N = 15SE +/- 0.011, N = 151.1381.1321.127MIN: 0.97 / MAX: 1.48MIN: 0.94 / MAX: 1.51MIN: 0.95 / MAX: 1.48

PyPerformance

Benchmark: python_startup

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: python_startup123246810SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.806.746.75

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: Fishy Cat - Compute: NVIDIA OptiX1231224364860SE +/- 0.14, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 354.9354.7155.23

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyte123816243240SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.36, N = 834.7934.5135.761. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1233691215SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 311.9512.0511.911. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough123816243240SE +/- 0.40, N = 3SE +/- 0.42, N = 3SE +/- 0.40, N = 633.3333.2433.341. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K1233691215SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 312.3712.3912.341. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

PyPerformance

Benchmark: 2to3

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: 2to312360120180240300264261261

oneDNN

Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU1230.42010.84021.26031.68042.1005SE +/- 0.01882, N = 8SE +/- 0.01584, N = 15SE +/- 0.03006, N = 31.801031.805711.86706MIN: 1.51MIN: 1.49MIN: 1.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1001239K18K27K36K45KSE +/- 134.25, N = 3SE +/- 121.90, N = 3SE +/- 115.70, N = 34009539759403151. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

PyPerformance

Benchmark: go

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: go1234080120160200200199200

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12370140210280350SE +/- 3.32, N = 3SE +/- 2.20, N = 3SE +/- 1.53, N = 3321.80326.96324.34MIN: 304.56MIN: 315.72MIN: 307.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only - Average Latency1230.22590.45180.67770.90361.1295SE +/- 0.003, N = 3SE +/- 0.013, N = 5SE +/- 0.013, N = 50.9820.9951.0041. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 250 - Mode: Read Only12350K100K150K200K250KSE +/- 853.38, N = 3SE +/- 3217.19, N = 5SE +/- 3216.57, N = 52546782514292493481. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123306090120150SE +/- 1.81, N = 3SE +/- 0.87, N = 3SE +/- 1.35, N = 3149.62139.87136.15MIN: 143.31MIN: 136.06MIN: 130.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmark123816243240SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 333.3133.6633.871. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compression123816243240SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 334.7034.8434.611. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123510152025SE +/- 0.01, N = 3SE +/- 0.15, N = 3SE +/- 0.02, N = 318.2418.5818.111. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123510152025SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 318.6918.9218.691. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PyPerformance

Benchmark: float

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: float12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.36, N = 390.090.090.1

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 0 Two-Pass1230.07430.14860.22290.29720.3715SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.330.331. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

PyPerformance

Benchmark: django_template

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: django_template123918273645SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 338.638.939.2

PyPerformance

Benchmark: chaos

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: chaos12320406080100SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 385.785.886.1

PyPerformance

Benchmark: crypto_pyaes

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: crypto_pyaes12320406080100SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 385.885.885.9

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H21236001200180024003000SE +/- 46.69, N = 19SE +/- 39.90, N = 20SE +/- 38.16, N = 4286429252775

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.90Blend File: BMW27 - Compute: NVIDIA OptiX123714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 327.9027.7627.86

PyPerformance

Benchmark: regex_compile

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: regex_compile123306090120150138137139

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123510152025SE +/- 0.10, N = 3SE +/- 0.23, N = 3SE +/- 0.11, N = 321.8722.4521.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Realtime123612182430SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 323.5423.4423.361. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency1234080120160200SE +/- 0.42, N = 3SE +/- 1.60, N = 3SE +/- 1.01, N = 3176.13173.12173.241. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write123120240360480600SE +/- 1.34, N = 3SE +/- 5.30, N = 3SE +/- 3.40, N = 35685785781. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency12320406080100SE +/- 1.15, N = 3SE +/- 0.03, N = 3SE +/- 0.67, N = 381.8783.8982.731. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write123130260390520650SE +/- 8.58, N = 3SE +/- 0.22, N = 3SE +/- 4.93, N = 36115966041. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency1230.0070.0140.0210.0280.035SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0300.0300.0311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only1237K14K21K28K35KSE +/- 77.57, N = 3SE +/- 373.98, N = 3SE +/- 92.20, N = 33341433267325791. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency1230.08210.16420.24630.32840.4105SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 30.3650.3560.3601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only12360K120K180K240K300KSE +/- 1999.95, N = 3SE +/- 3007.11, N = 3SE +/- 753.18, N = 32744632812912780191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency1230.03890.07780.11670.15560.1945SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.1730.1690.1711. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only12360K120K180K240K300KSE +/- 1001.63, N = 3SE +/- 4333.65, N = 3SE +/- 3723.51, N = 32894012960882919951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-Pass1230.96751.9352.90253.874.8375SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.304.294.271. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 4K1234080120160200SE +/- 1.17, N = 3SE +/- 1.07, N = 3SE +/- 0.52, N = 3156.40160.47156.84MIN: 125.77 / MAX: 171.79MIN: 149.07 / MAX: 177.15MIN: 148.02 / MAX: 171.091. (CC) gcc options: -pthread

PyPerformance

Benchmark: pathlib

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pathlib12348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.09, N = 314.414.314.5

OCRMyPDF

Processing 60 Page PDF Document

OpenBenchmarking.orgSeconds, Fewer Is BetterOCRMyPDF 10.3.1+dfsgProcessing 60 Page PDF Document123510152025SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.21, N = 322.0221.0722.09

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28123510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 321.5921.5921.601. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU1231.28412.56823.85235.13646.4205SE +/- 0.03901, N = 3SE +/- 0.01208, N = 3SE +/- 0.05691, N = 35.705945.593575.70690MIN: 4.96MIN: 4.91MIN: 4.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Tesseract OCR

Time To OCR 7 Images

OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.1.1Time To OCR 7 Images123510152025SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 320.1320.3220.22

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance1235001000150020002500SE +/- 27.67, N = 3SE +/- 18.02, N = 3SE +/- 14.41, N = 32431.82467.92474.61. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v212360120180240300SE +/- 1.30, N = 3SE +/- 0.68, N = 3SE +/- 0.57, N = 3286.04287.56284.03MIN: 283.63 / MAX: 356.82MIN: 285.93 / MAX: 307.5MIN: 282.82 / MAX: 325.111. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

PyPerformance

Benchmark: pickle_pure_python

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: pickle_pure_python12370140210280350338339339

PyPerformance

Benchmark: json_loads

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: json_loads123510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 321.321.221.2

NeatBench

Acceleration: CPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: CPU1233691215SE +/- 1.26, N = 16SE +/- 1.23, N = 16SE +/- 1.26, N = 1610.910.810.9

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.112360120180240300SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3269.71269.66269.44MIN: 268.38 / MAX: 284.03MIN: 268.59 / MAX: 282.48MIN: 268.04 / MAX: 281.471. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

PyPerformance

Benchmark: nbody

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.0.0Benchmark: nbody12320406080100103103103

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p123140280420560700SE +/- 2.78, N = 3SE +/- 1.12, N = 3SE +/- 3.61, N = 3639.97647.74641.98MIN: 466.44 / MAX: 934.79MIN: 471.76 / MAX: 969.53MIN: 460.65 / MAX: 940.691. (CC) gcc options: -pthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 4 Two-Pass1230.6121.2241.8362.4483.06SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.722.722.721. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tiny123246810SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.34, N = 38.678.328.70MIN: 8.03 / MAX: 53MIN: 8.01 / MAX: 8.7MIN: 8.04 / MAX: 83.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet501230.84831.69662.54493.39324.2415SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.773.743.76MIN: 3.72 / MAX: 13.18MIN: 3.72 / MAX: 3.76MIN: 3.74 / MAX: 3.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet1230.47250.9451.41751.892.3625SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.102.082.09MIN: 1.85 / MAX: 2.59MIN: 2.04 / MAX: 2.57MIN: 1.82 / MAX: 2.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet181230.3870.7741.1611.5481.935SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 31.681.721.66MIN: 1.64 / MAX: 17.75MIN: 1.64 / MAX: 23.58MIN: 1.64 / MAX: 1.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg16123246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 38.368.328.42MIN: 7.77 / MAX: 22.44MIN: 7.77 / MAX: 22.07MIN: 7.88 / MAX: 18.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenet1230.79431.58862.38293.17723.9715SE +/- 0.31, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.533.223.23MIN: 3.2 / MAX: 21.92MIN: 3.21 / MAX: 3.25MIN: 3.22 / MAX: 3.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazeface1230.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 30.610.620.63MIN: 0.6 / MAX: 0.63MIN: 0.6 / MAX: 0.66MIN: 0.61 / MAX: 2.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b01230.60081.20161.80242.40323.004SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.672.652.65MIN: 2.64 / MAX: 12.58MIN: 2.63 / MAX: 3.85MIN: 2.63 / MAX: 2.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet1230.33750.6751.01251.351.6875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.481.481.50MIN: 1.47 / MAX: 1.72MIN: 1.47 / MAX: 1.53MIN: 1.47 / MAX: 6.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v21230.29480.58960.88441.17921.474SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.301.311.31MIN: 1.29 / MAX: 1.32MIN: 1.29 / MAX: 1.35MIN: 1.3 / MAX: 1.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31230.3780.7561.1341.5121.89SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.671.681.68MIN: 1.66 / MAX: 1.7MIN: 1.66 / MAX: 1.93MIN: 1.66 / MAX: 1.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21230.32850.6570.98551.3141.6425SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 31.451.421.46MIN: 1.41 / MAX: 20.49MIN: 1.41 / MAX: 1.44MIN: 1.41 / MAX: 20.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenet1231.07332.14663.21994.29325.3665SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 34.634.584.77MIN: 4.56 / MAX: 5.02MIN: 4.54 / MAX: 4.65MIN: 4.56 / MAX: 34.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet1230.82351.6472.47053.2944.1175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.663.653.66MIN: 3.6 / MAX: 3.77MIN: 3.6 / MAX: 3.7MIN: 3.61 / MAX: 3.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12348121620SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 316.8316.7117.03

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless12348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 315.6115.6015.691. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast1231122334455SE +/- 0.52, N = 3SE +/- 0.64, N = 5SE +/- 0.02, N = 346.7348.2446.791. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 Realtime1231020304050SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 345.9246.2645.781. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

DaCapo Benchmark

Java Test: Tradesoap

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoap1238001600240032004000SE +/- 18.80, N = 4SE +/- 25.69, N = 4SE +/- 42.18, N = 4365235503704

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.91571.83142.74713.66284.5785SE +/- 0.00919, N = 3SE +/- 0.04507, N = 3SE +/- 0.00529, N = 33.999114.069703.87614MIN: 3.93MIN: 3.86MIN: 3.821. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.5811.1621.7432.3242.905SE +/- 0.02285, N = 3SE +/- 0.03056, N = 3SE +/- 0.02621, N = 32.580812.579262.58232MIN: 2.21MIN: 2.16MIN: 2.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123246810SE +/- 0.086, N = 15SE +/- 0.019, N = 3SE +/- 0.197, N = 126.6006.8056.3221. (CXX) g++ options: -O3 -pthread -lm

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast12320406080100SE +/- 1.07, N = 3SE +/- 0.90, N = 9SE +/- 1.01, N = 387.1692.5387.181. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p1231326395265SE +/- 0.06, N = 3SE +/- 0.96, N = 3SE +/- 0.78, N = 354.4956.1454.421. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

DaCapo Benchmark

Java Test: Tradebeans

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeans1236001200180024003000SE +/- 35.35, N = 5SE +/- 27.27, N = 4SE +/- 21.94, N = 4269026452801

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.488.448.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU1230.80711.61422.42133.22844.0355SE +/- 0.03967, N = 3SE +/- 0.05394, N = 3SE +/- 0.03320, N = 153.587013.241893.30402MIN: 3.47MIN: 3.1MIN: 3.11. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU12348121620SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 417.1217.1417.20MIN: 16.89MIN: 16.92MIN: 16.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compression123246810SE +/- 0.014, N = 3SE +/- 0.012, N = 3SE +/- 0.013, N = 36.3856.3866.3821. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython1238001600240032004000SE +/- 52.97, N = 4SE +/- 52.89, N = 4SE +/- 27.24, N = 4370936983718

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080p123130260390520650SE +/- 1.17, N = 3SE +/- 0.67, N = 3SE +/- 1.29, N = 3581.34582.13579.67MIN: 498.37 / MAX: 635.39MIN: 511.78 / MAX: 635.13MIN: 479.84 / MAX: 634.421. (CC) gcc options: -pthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.27, N = 3SE +/- 0.16, N = 316.4416.7416.60MIN: 16.35MIN: 16.38MIN: 16.351. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1231.24882.49763.74644.99526.244SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.535.525.551. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

libavif avifenc

Encoder Speed: 8

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 81231.132.263.394.525.65SE +/- 0.013, N = 3SE +/- 0.019, N = 3SE +/- 0.017, N = 35.0224.9955.0171. (CXX) g++ options: -O3 -fPIC

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 101231.07032.14063.21094.28125.3515SE +/- 0.009, N = 3SE +/- 0.014, N = 3SE +/- 0.010, N = 34.7574.7384.7331. (CXX) g++ options: -O3 -fPIC

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.02212, N = 3SE +/- 0.00067, N = 3SE +/- 0.09853, N = 46.642616.473496.61172MIN: 6.44MIN: 6.31MIN: 6.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPU123714212835SE +/- 0.06, N = 3SE +/- 0.50, N = 15SE +/- 0.50, N = 1529.430.530.6

FFTE

N=256, 3D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT Routine1237K14K21K28K35KSE +/- 39.81, N = 3SE +/- 21.47, N = 3SE +/- 122.08, N = 331760.0231509.6319217.721. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 1001230.46760.93521.40281.87042.338SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 32.0772.0762.0781. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Default1230.29660.59320.88981.18641.483SE +/- 0.000, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 31.3131.3151.3181. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff


Phoronix Test Suite v10.8.4